Mostrar el registro sencillo del ítem

dc.contributor.authorHeavner, Nathan
dc.contributor.authorIgual, Francisco
dc.contributor.authorQuintana-Ortí, Gregorio
dc.contributor.authorMARTINSSON, GUNNAR
dc.date.accessioned2022-10-06T11:33:01Z
dc.date.available2022-10-06T11:33:01Z
dc.date.issued2022-06
dc.identifier.citationN. Heavner, F. D. Igual, G. Quintana-Ortí, and P. G. Martinsson. 2022. Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures. ACM Trans. Math. Softw. 48, 2, Article 21 (June 2022), 42 pages. https://doi.org/10.1145/3507466ca_CA
dc.identifier.issn0098-3500
dc.identifier.issn1557-7295
dc.identifier.urihttp://hdl.handle.net/10234/200216
dc.description.abstractRandomized singular value decomposition (RSVD) is by now a well-established technique for efficiently computing an approximate singular value decomposition of a matrix. Building on the ideas that underpin RSVD, the recently proposed algorithm “randUTV” computes a full factorization of a given matrix that provides low-rank approximations with near-optimal error. Because the bulk of randUTV is cast in terms of communication-efficient operations such as matrix-matrix multiplication and unpivoted QR factorizations, it is faster than competing rank-revealing factorization methods such as column-pivoted QR in most high-performance computational settings. In this article, optimized randUTV implementations are presented for both shared-memory and distributed-memory computing environments. For shared memory, randUTV is redesigned in terms of an algorithm-by-blocks that, together with a runtime task scheduler, eliminates bottlenecks from data synchronization points to achieve acceleration over the standard blocked algorithm based on a purely fork-join approach. The distributed-memory implementation is based on the ScaLAPACK library. The performance of our new codes compares favorably with competing factorizations available on both shared-memory and distributed-memory architectures.ca_CA
dc.format.extent42 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherAssociation for Computing Machinery (ACM)ca_CA
dc.relation.isPartOfACM Transactions on Mathematical Software (TOMS), 2022, vol. 48, no 2ca_CA
dc.rightsCopyright © ACM, Inc.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/CNE/1.0/ca_CA
dc.subjectmathematics of computingca_CA
dc.subjectcomputations on matricesca_CA
dc.titleAlgorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architecturesca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1145/3507466
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.relation.publisherVersionhttps://dl.acm.org/doi/full/10.1145/3507466ca_CA
dc.type.versioninfo:eu-repo/semantics/submittedVersionca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem