Mostrar el registro sencillo del ítem

dc.contributor.authorAliaga Estellés, José Ignacio
dc.contributor.authorDufrechou, Ernesto
dc.contributor.authorEzzatti, Pablo
dc.contributor.authorQuintana-Orti, Enrique S.
dc.date.accessioned2019-06-26T11:05:56Z
dc.date.available2019-06-26T11:05:56Z
dc.date.issued2019
dc.identifier.citationALIAGA, José I., et al. Accelerating the task/data-parallel version of ILUPACK’s BiCG in multi-CPU/GPU configurations. Parallel Computing, 2019.ca_CA
dc.identifier.issn0167-8191
dc.identifier.urihttp://hdl.handle.net/10234/182959
dc.description.abstractILUPACK is a valuable tool for the solution of sparse linear systems via iterative Krylov subspace-based methods. Its relevance for the solution of real problems has motivated several efforts to enhance its performance on parallel machines. In this work we focus on exploiting the task-level parallelism derived from the structure of the BiCG method, in addition to the data-level parallelism of the internal matrix computations, with the goal of boosting the performance of a GPU (graphics processing unit) implementation of this solver. First, we revisit the use of dual-GPU systems to execute independent stages of the BiCG concurrently on both accelerators, while leveraging the extra memory space to improve the data access patterns. In addition, we extend our ideas to compute the BiCG method efficiently in multicore platforms with a single GPU. In this line, we study the possibilities offered by hybrid CPU-GPU computations, as well as a novel synchronization-free sparse triangular linear solver. The experimental results with the new solvers show important acceleration factors with respect to the previous data-parallel CPU and GPU versions.ca_CA
dc.format.extent9 p.ca_CA
dc.language.isoengca_CA
dc.publisherElsevierca_CA
dc.relation.isPartOfParallel Computing, Volume 85, July 2019.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/CNE/1.0/*
dc.subjectSparse linear systemsca_CA
dc.subjectIterative Krylov-subspace methodsca_CA
dc.subjectData parallelismca_CA
dc.subjectILUPACK preconditionerca_CA
dc.subjectGraphics processing units (GPUs)ca_CA
dc.titleAccelerating the task/data-parallel version of ILUPACK’s BiCG in multi-CPU/GPU configurationsca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1016/j.parco.2019.02.005
dc.relation.projectIDTIN2017-82972-R ; TIN2017-82972-Rca_CA
dc.rights.accessRightsinfo:eu-repo/semantics/restrictedAccessca_CA
dc.relation.publisherVersionhttps://www.sciencedirect.com/science/article/pii/S0167819118301777ca_CA
dc.type.versioninfo:eu-repo/semantics/publishedVersionca_CA


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem