Exploiting the capabilities of modern GPUs for dense matrix computations
Impacte
Scholar |
Altres documents de l'autoria: Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S.; Quintana-Ortí, Gregorio
Metadades
Mostra el registre complet de l'elementcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONAquest recurs és restringit
http://dx.doi.org/10.1002/cpe.1472 |
Metadades
Títol
Exploiting the capabilities of modern GPUs for dense matrix computationsAutoria
Data de publicació
2009Editor
John Wiley & SonsISSN
1532-0626; 1532-0634Tipus de document
info:eu-repo/semantics/articleVersió de l'editorial
http://onlinelibrary.wiley.com/doi/10.1002/cpe.1472/fullParaules clau / Matèries
Resum
We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU ... [+]
We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU computation. We compare single and double precision performance of a modern GPU with unified architecture, and show how iterative refinement with mixed precision can be used to regain full accuracy in the solution of linear systems, exploiting the potential of the processor for single precision arithmetic. Experimental results on a GTX280 using CUBLAS 2.0, the implementation of BLAS for NVIDIA® GPUs with unified architecture, illustrate the performance of the different algorithms and techniques proposed. [-]
Publicat a
Concurrency and Computation: Practice and Experience, 21, 18, p. 2457–2477Drets d'accés
Copyright © 2009 John Wiley & Sons, Ltd.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
Apareix a les col.leccions
- ICC_Articles [417]