Exploiting the capabilities of modern GPUs for dense matrix computations
Impact
Scholar |
Other documents of the author: Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S.; Quintana-Ortí, Gregorio
Metadata
Show full item recordcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONThis resource is restricted
http://dx.doi.org/10.1002/cpe.1472 |
Metadata
Title
Exploiting the capabilities of modern GPUs for dense matrix computationsAuthor (s)
Date
2009Publisher
John Wiley & SonsISSN
1532-0626; 1532-0634Type
info:eu-repo/semantics/articlePublisher version
http://onlinelibrary.wiley.com/doi/10.1002/cpe.1472/fullAbstract
We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU ... [+]
We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU computation. We compare single and double precision performance of a modern GPU with unified architecture, and show how iterative refinement with mixed precision can be used to regain full accuracy in the solution of linear systems, exploiting the potential of the processor for single precision arithmetic. Experimental results on a GTX280 using CUBLAS 2.0, the implementation of BLAS for NVIDIA® GPUs with unified architecture, illustrate the performance of the different algorithms and techniques proposed. [-]
Is part of
Concurrency and Computation: Practice and Experience, 21, 18, p. 2457–2477Rights
Copyright © 2009 John Wiley & Sons, Ltd.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
This item appears in the folowing collection(s)
- ICC_Articles [430]