Mostrar el registro sencillo del ítem

dc.contributor.authorAlonso-Jordá, Pedro
dc.contributor.authorCatalán, Sandra
dc.contributor.authorIgual, Francisco
dc.contributor.authorMayo, Rafael
dc.contributor.authorRodríguez Sánchez, Rafael
dc.contributor.authorQuintana-Orti, Enrique S.
dc.date.accessioned2016-03-16T10:39:47Z
dc.date.available2016-03-16T10:39:47Z
dc.date.issued2015-06
dc.identifier.citationALONSO, Pedro, et al. Time and energy modeling of high–performance Level-3 BLAS on x86 architectures. Simulation Modelling Practice and Theory, 2015, vol. 55, p. 77-94.ca_CA
dc.identifier.urihttp://hdl.handle.net/10234/154006
dc.description.abstractWe present accurate piece-wise models for the time and energy costs of high performance implementations of both the matrix multiplication (gemm) and the triangular system solve with multiple right-hand sides (trsm) on x86 architectures. Our methodology decouples the costs due to the floating-point arithmetic/data movement occurring in the higher levels of the cache hierarchy from those of packing/data transfers between the main memory and the L2/L3 cache. A careful analytical study of the data transfers, in combination with an architecture-specific calibration of the costs per operation, render then the components to assemble piece-wise models for the accurate estimation of gemm and trsm’s performance on x86 processors. Our experimental results on an Intel Xeon E5-2620 processor confirm the accuracy of this approach, which reports relative errors for different shapes of gemm and trsm that are, respectively, around 1.5% and 4.5% on average for both time and energy.ca_CA
dc.format.extent17 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherElsevierca_CA
dc.relation.isPartOfSimulation Modelling Practice and Theory Volume 55, June 2015ca_CA
dc.rightsCopyright © 2015 Elsevier B.V. All rights reserved.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/*
dc.subjectModelingca_CA
dc.subjectHigh performanceca_CA
dc.subjectEnergy consumptionca_CA
dc.subjectMatrix multiplicationca_CA
dc.subjectLinear algebraca_CA
dc.titleTime and energy modeling of high–performance Level-3 BLAS on x86 architecturesca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttp://dx.doi.org/10.1016/j.simpat.2015.04.003
dc.rights.accessRightsinfo:eu-repo/semantics/restrictedAccessca_CA
dc.relation.publisherVersionhttp://www.sciencedirect.com/science/article/pii/S1569190X15000635ca_CA


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem