Mostrar el registro sencillo del ítem

dc.contributor.authorIgual, Francisco D.
dc.contributor.authorQuintana-Ortí, Gregorio
dc.contributor.authorVan de Geijn, Robert A.
dc.date.accessioned2014-05-13T12:16:27Z
dc.date.available2014-05-13T12:16:27Z
dc.date.issued2012-03-28
dc.identifier.citationIGUAL, Francisco D.; QUINTANA‐ORTÍ, Gregorio; GEIJN, Robert. Scheduling algorithms‐by‐blocks on small clusters. Concurrency and Computation: Practice and Experience, 2013, vol. 25, no 3, p. 367-384ca_CA
dc.identifier.issn1532-0626
dc.identifier.issn1532-0634
dc.identifier.urihttp://hdl.handle.net/10234/92153
dc.description.abstractThe arrival of multicore architectures has generated an interest in reformulating dense matrix computations as algorithms-by-blocks, where submatrices are units of data and computations with those blocks are units of computation. Rather than directly executing such an algorithm, a directed acyclic graph is generated at runtime that is then scheduled by a runtime system such as SuperMatrix. The benefit is a clear separation of concerns between the library and the heuristics for scheduling. In this paper, we show that this approach can be taken one step further using the same methodology and an ad hoc runtime to map algorithms-by-blocks to small clusters. With no change to the library code, and the application that uses it, the computational power of such small clusters can be utilized. An impressive performance on a number of small clusters is reported. As a proof of the flexibility of the solution, we report performance results on accelerated clusters based on graphics processors. We believe this to be a possible step towards programming many-core architectures, as demonstrated by a port of the solution to Intel's Single-chip Cloud Computer (Intel, Santa Clara, CA, USA). Copyright © 2012 John Wiley & Sons, Ltd.ca_CA
dc.format.extent18 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherWileyca_CA
dc.relation.isPartOfConcurrency and Computation: Practice and Experience, 2013, vol. 25, no 3ca_CA
dc.rightsCopyright © 2012 John Wiley & Sons, Ltd.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/*
dc.subjectmatrix computationsca_CA
dc.subjectnovel parallel architecturesca_CA
dc.subjectautomatic parallelizationca_CA
dc.titleScheduling algorithms-by-blocks on small clustersca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttp://dx.doi.org/10.1002/cpe.2842
dc.rights.accessRightsinfo:eu-repo/semantics/restrictedAccessca_CA
dc.relation.publisherVersionhttp://onlinelibrary.wiley.com/doi/10.1002/cpe.2842/abstractca_CA


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem