Mostrar el registro sencillo del ítem

dc.contributor.authorAliaga Estellés, José Ignacio
dc.contributor.authorAnzt, Hartwig
dc.contributor.authorQuintana-Orti, Enrique S.
dc.contributor.authorTomás Domínguez, Andrés Enrique
dc.date.accessioned2023-10-06T06:56:03Z
dc.date.available2023-10-06T06:56:03Z
dc.date.issued2023-08-04
dc.identifier.citationALIAGA, José I., et al. Sparse matrix‐vector and matrix‐multivector products for the truncated SVD on graphics processors. Concurrency and Computation: Practice and Experience, 2023, p. e7871.ca_CA
dc.identifier.urihttp://hdl.handle.net/10234/204435
dc.description.abstractMany practical algorithms for numerical rank computations implement an iterative procedure that involves repeated multiplications of a vector, or a collection of vectors, with both a sparse matrix A and its transpose. Unfortunately, the realization of these sparse products on current high performance libraries often deliver much lower arithmetic throughput when the matrix involved in the product is transposed. In this work, we propose a hybrid sparse matrix layout, named CSRC, that combines the flexibility of some well-known sparse formats to offer a number of appealing properties: (1) CSRC can be obtained at low cost from the popular CSR (compressed sparse row) format; (2) CSRC has similar storage requirements as CSR; and especially, (3) the implementation of the sparse product kernels delivers high performance for both the direct product and its transposed variant on modern graphics accelerators thanks to a significant reduction of atomic operations compared to a conventional implementation based on CSR. This solution thus renders considerably higher performance when integrated into an iterative algorithm for the truncated singular value decomposition (SVD), such as the randomized SVD or, as demonstrated in the experimental results, the block Golub–Kahan–Lanczos algorithm.ca_CA
dc.description.sponsorShipFunding for open access charge: CRUE-Universitat Jaume I
dc.format.extent12 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherWileyca_CA
dc.rights© 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd.ca_CA
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/ca_CA
dc.subjectgraphics processing unitsca_CA
dc.subjectsingular value decompositionca_CA
dc.subjectsparse matrix-multivector productca_CA
dc.subjectsparse matrix-vector productca_CA
dc.titleSparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processorsca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1002/cpe.7871
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.type.versioninfo:eu-repo/semantics/publishedVersionca_CA
project.funder.nameUS Exascale Computing Projectca_CA
project.funder.nameU.S. Department of Energy Office of Scienceca_CA
project.funder.nameEuropean High-Performance Computing Joint Undertaking (JU)ca_CA
project.funder.nameEuropean Union's Horizon 2020 Research and Innovation Programmeca_CA
project.funder.nameSpanish National Plan for Scientific and Technical Research and Innovation (MCIN/AEI/10.13039/501100011033)ca_CA
project.funder.nameUniversitat Jaume Ica_CA
oaire.awardNumber17-SC-20-SCca_CA
oaire.awardNumber955558 (eFlows4HPC project)ca_CA
oaire.awardNumberPID2020-113656RBca_CA
oaire.awardNumberUJI-B2021-58ca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

© 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd.
Excepto si se señala otra cosa, la licencia del ítem se describe como: © 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd.