Time and energy modeling of a high-performance multi-threaded Cholesky factorization
Impacto
Scholar |
Otros documentos de la autoría: Catalán, Sandra; Igual, Francisco; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S.
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONEste recurso está restringido
http://dx.doi.org/10.1007/s11227-016-1654-6 |
Metadatos
Título
Time and energy modeling of a high-performance multi-threaded Cholesky factorizationAutoría
Fecha de publicación
2016-02-05Editor
SpringerCita bibliográfica
CATALÁN PALLARÉS, Sandra; IGUAL PEÑA, Francisco Daniel; MAYO, Rafael; RODRÍGUEZ SÁNCHEZ, Rafael; QUINTANA ORTÍ, Enrique S. Time and energy modeling of a high-performance multi-threaded Cholesky factorization. Journal of supercomputing (2016), online, pp. 1-13Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
http://link.springer.com/article/10.1007/s11227-016-1654-6Palabras clave / Materias
Resumen
We present accurate time and energy piece-wise models of high-performance multi-threaded implementations for the general matrix multiplication, triangular system solve with multiple right-hand sides, and symmetric ... [+]
We present accurate time and energy piece-wise models of high-performance multi-threaded implementations for the general matrix multiplication, triangular system solve with multiple right-hand sides, and symmetric rank-k update. Furthermore, these are then assembled to provide accurate models of the Cholesky factorization built on top of these Level-3 BLAS operations. Our models consider the costs, in terms of time and energy, of the floating-point operations involved in the routines as well as the overhead due to data movements across the levels of the memory hierarchy. The accuracy of the multi-threaded models is tested on an Intel Xeon E5-2620 processor, reporting relative errors for the Cholesky factorization that are, respectively, around 2.4 and 2.9 % on average for time and energy. [-]
Publicado en
Journal of supercomputing (2016), onlineDerechos de acceso
http://rightsstatements.org/vocab/CNE/1.0/
info:eu-repo/semantics/restrictedAccess
info:eu-repo/semantics/restrictedAccess
Aparece en las colecciones
- ICC_Articles [423]