Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors
Impacto
Scholar |
Otros documentos de la autoría: Aliaga Estellés, José Ignacio; Anzt, Hartwig; Castillo Catalán, María Isabel; Fernández Fernández, Juan Carlos; León Navarro, Germán; Pérez, Joaquín; Quintana-Orti, Enrique S.
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONEste recurso está restringido
http://dx.doi.org/10.1002/cpe.3341 |
Metadatos
Título
Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processorsAutoría
Fecha de publicación
2015Editor
WileyISSN
1532-0626; 1532-0634Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
http://onlinelibrary.wiley.com/doi/10.1002/cpe.3341/abstractPalabras clave / Materias
Resumen
In this paper, we analyze the interactions occurring in the triangle performance-power-energy for the execu-
tion of a pivotal numerical algorithm, the iterative conjugate gradient (CG) method, on a diverse collection ... [+]
In this paper, we analyze the interactions occurring in the triangle performance-power-energy for the execu-
tion of a pivotal numerical algorithm, the iterative conjugate gradient (CG) method, on a diverse collection of
parallel multithreaded architectures. This analysis is especially timely in a decade where the power wall has
arisen as a major obstacle to build faster processors. Moreover, the CG method has recently been proposed
as a complement to the LINPACK benchmark, as this iterative method is argued to be more archetypical
of the performance of today’s scientific and engineering applications. To gain insights about the benefits of
hands-on optimizations we include runtime and energy efficiency results for both out-of-the-box usage rely-
ing exclusively on compiler optimizations, and implementations manually optimized for target architectures,
that range from general-purpose and digital signal multicore processors to manycore graphics processing
units, all representative of current multithreaded systems. [-]
Publicado en
Concurrency and Computation: Practice and Experience, 2015, vol. 27, núm. 4Derechos de acceso
Published online in Wiley Online Library (wileyonlinelibrary.com). Copyright © 2014 John Wiley & Sons, Ltd.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
Aparece en las colecciones
- ICC_Articles [427]