CUDA Papers

A collection of research papers and projects utilizing CUDA technology

Tag Archives: GEMM

Benchmarking GPUs to Tune Dense Linear Algebra

http://portal.acm.org/ft_gateway.cfm?id=1413402&type=pdf&doid2=1413370.1413402
http://www2.computer.org/portal/c/document_library/get_file?folderId=97697&name=DLFE-3337.pdf

Abstract

We present performance results for dense linear algebra using the 8-series NVIDIA GPUs. Our GEMM routine runs 60% faster than the vendor implementation and approaches the peak of hardware capabilities. Our LU, QR and Cholesky factorizations achieve up to 80-90% of the peak GEMM rate. Our parallel LU running on two GPUs […]