Scheduling algorithms-by-blocks on small clusters

Igual, Francisco D.; Quintana-Ortí, Gregorio; Van de Geijn, Robert A.

dc.contributor.author	Igual, Francisco D.
dc.contributor.author	Quintana-Ortí, Gregorio
dc.contributor.author	Van de Geijn, Robert A.
dc.date.accessioned	2014-05-13T12:16:27Z
dc.date.available	2014-05-13T12:16:27Z
dc.date.issued	2012-03-28
dc.identifier.citation	IGUAL, Francisco D.; QUINTANA‐ORTÍ, Gregorio; GEIJN, Robert. Scheduling algorithms‐by‐blocks on small clusters. Concurrency and Computation: Practice and Experience, 2013, vol. 25, no 3, p. 367-384	ca_CA
dc.identifier.issn	1532-0626
dc.identifier.issn	1532-0634
dc.identifier.uri	http://hdl.handle.net/10234/92153
dc.description.abstract	The arrival of multicore architectures has generated an interest in reformulating dense matrix computations as algorithms-by-blocks, where submatrices are units of data and computations with those blocks are units of computation. Rather than directly executing such an algorithm, a directed acyclic graph is generated at runtime that is then scheduled by a runtime system such as SuperMatrix. The benefit is a clear separation of concerns between the library and the heuristics for scheduling. In this paper, we show that this approach can be taken one step further using the same methodology and an ad hoc runtime to map algorithms-by-blocks to small clusters. With no change to the library code, and the application that uses it, the computational power of such small clusters can be utilized. An impressive performance on a number of small clusters is reported. As a proof of the flexibility of the solution, we report performance results on accelerated clusters based on graphics processors. We believe this to be a possible step towards programming many-core architectures, as demonstrated by a port of the solution to Intel's Single-chip Cloud Computer (Intel, Santa Clara, CA, USA). Copyright © 2012 John Wiley & Sons, Ltd.	ca_CA
dc.format.extent	18 p.	ca_CA
dc.format.mimetype	application/pdf	ca_CA
dc.language.iso	eng	ca_CA
dc.publisher	Wiley	ca_CA
dc.relation.isPartOf	Concurrency and Computation: Practice and Experience, 2013, vol. 25, no 3	ca_CA
dc.rights	Copyright © 2012 John Wiley & Sons, Ltd.	ca_CA
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	*
dc.subject	matrix computations	ca_CA
dc.subject	novel parallel architectures	ca_CA
dc.subject	automatic parallelization	ca_CA
dc.title	Scheduling algorithms-by-blocks on small clusters	ca_CA
dc.type	info:eu-repo/semantics/article	ca_CA
dc.identifier.doi	http://dx.doi.org/10.1002/cpe.2842
dc.rights.accessRights	info:eu-repo/semantics/restrictedAccess	ca_CA
dc.relation.publisherVersion	http://onlinelibrary.wiley.com/doi/10.1002/cpe.2842/abstract	ca_CA

Ficheros en el ítem

Ficheros	Tamaño	Formato	Ver
No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

ICC_Articles [414]

Mostrar el registro sencillo del ítem