Exploiting nested task-parallelism in the H-LU factorization
Ver/ Abrir
Impacto
Scholar |
Otros documentos de la autoría: Carratalá-Sáez, Rocío; Christophersen, Sven; Aliaga Estellés, José Ignacio; Beltran Querol, Vicenç; Börm, Steffen; Quintana-Orti, Enrique S.
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7036
comunitat-uji-handle3:10234/8620
comunitat-uji-handle4:
INVESTIGACIONMetadatos
Título
Exploiting nested task-parallelism in the H-LU factorizationAutoría
Fecha de publicación
2019-04Editor
ElsevierCita bibliográfica
CARRATALÁ-SÁEZ, Rocío, et al. Exploiting Nested Task-Parallelism in the H-LU Factorization. Journal of Computational Science, 2019, 33:20-33Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
https://www.sciencedirect.com/science/article/pii/S1877750318301352Versión
info:eu-repo/semantics/submittedVersionPalabras clave / Materias
Resumen
We address the parallelization of the LU factorization of hierarchical matrices (-matrices) arising from boundary element methods. Our approach exploits task-parallelism via the OmpSs programming model and runtime, ... [+]
We address the parallelization of the LU factorization of hierarchical matrices (-matrices) arising from boundary element methods. Our approach exploits task-parallelism via the OmpSs programming model and runtime, which discovers the data-flow parallelism intrinsic to the operation at execution time, via the analysis of data dependencies based on the memory addresses of the tasks’ operands. This is especially challenging for H-matrices, as the structures containing the data vary in dimension during the execution. We tackle this issue by decoupling the data structure from that used to detect dependencies. Furthermore, we leverage the support for weak operands and early release of dependencies, recently introduced in OmpSs-2, to accelerate the execution of parallel codes with nested task-parallelism and fine-grain tasks. As a result, we obtain a significant improvement in the parallel performance with respect to our previous work. [-]
Proyecto de investigación
MINECO (CICYT TIN2014-53495-R) ; FEDER (TIN2017-82972-R) ; Universitat Jaume I (UJI-B2017-46); MECD (FPU program)Derechos de acceso
© 2019 Elsevier B.V. All rights reserved.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
Aparece en las colecciones
- ICC_Articles [430]