Time and energy modeling of a high-performance multi-threaded Cholesky factorization
Impacte
Scholar |
Altres documents de l'autoria: Catalán, Sandra; Igual, Francisco D.; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S.
Metadades
Mostra el registre complet de l'elementcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONAquest recurs és restringit
http://dx.doi.org/10.1007/s11227-016-1654-6 |
Metadades
Títol
Time and energy modeling of a high-performance multi-threaded Cholesky factorizationAutoria
Data de publicació
2016-02-05Editor
SpringerCita bibliogràfica
CATALÁN PALLARÉS, Sandra; IGUAL PEÑA, Francisco Daniel; MAYO, Rafael; RODRÍGUEZ SÁNCHEZ, Rafael; QUINTANA ORTÍ, Enrique S. Time and energy modeling of a high-performance multi-threaded Cholesky factorization. Journal of supercomputing (2016), online, pp. 1-13Tipus de document
info:eu-repo/semantics/articleVersió de l'editorial
http://link.springer.com/article/10.1007/s11227-016-1654-6Paraules clau / Matèries
Resum
We present accurate time and energy piece-wise models of high-performance multi-threaded implementations for the general matrix multiplication, triangular system solve with multiple right-hand sides, and symmetric ... [+]
We present accurate time and energy piece-wise models of high-performance multi-threaded implementations for the general matrix multiplication, triangular system solve with multiple right-hand sides, and symmetric rank-k update. Furthermore, these are then assembled to provide accurate models of the Cholesky factorization built on top of these Level-3 BLAS operations. Our models consider the costs, in terms of time and energy, of the floating-point operations involved in the routines as well as the overhead due to data movements across the levels of the memory hierarchy. The accuracy of the multi-threaded models is tested on an Intel Xeon E5-2620 processor, reporting relative errors for the Cholesky factorization that are, respectively, around 2.4 and 2.9 % on average for time and energy. [-]
Publicat a
Journal of supercomputing (2016), onlineDrets d'accés
http://rightsstatements.org/vocab/CNE/1.0/
info:eu-repo/semantics/restrictedAccess
info:eu-repo/semantics/restrictedAccess
Apareix a les col.leccions
- ICC_Articles [413]