Mostrar el registro sencillo del ítem

dc.contributor.authorDolz, Manuel F.
dc.contributor.authorMartínez, Héctor
dc.contributor.authorCastelló, Adrián
dc.contributor.authorAlonso-Jordá, Pedro
dc.contributor.authorQuintana-Orti, Enrique S.
dc.date.accessioned2023-05-23T07:16:14Z
dc.date.available2023-05-23T07:16:14Z
dc.date.issued2023-02-12
dc.identifier.citationDolz, M.F., Martínez, H., Castelló, A. et al. Efficient and portable Winograd convolutions for multi-core processors. J Supercomput 79, 10589–10610 (2023). https://doi.org/10.1007/s11227-023-05088-4ca_CA
dc.identifier.issn0920-8542
dc.identifier.issn1573-0484
dc.identifier.urihttp://hdl.handle.net/10234/202584
dc.description.abstractWe take a step forward towards developing high-performance codes for the convolution operator, based on the Winograd algorithm, that are easy to customise for general-purpose processor architectures. In our approach, augmenting the portability of the solution is achieved via the introduction of vector instructions from Intel SSE/AVX2/AVX512 and ARM NEON/SVE to exploit the single-instruction multiple-data capabilities of current processors as well as OpenMP pragmas to exploit multi-threaded parallelism. While this comes at the cost of sacrificing a fraction of the computational performance, our experimental results on three distinct processors, with Intel Xeon Skylake, ARM Cortex A57 and Fujitsu A64FX processors, show that the impact is affordable and still renders a Winograd-based solution that is competitive when compared with the lowering GEMM-based convolution.ca_CA
dc.format.extent22 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherSpringerca_CA
dc.relation.isPartOfThe Journal of Supercomputing (2023) 79:10589–10610ca_CA
dc.relation.uriThe ImageNet dataset used for the current study is publicly available from the web. See https://www.image-net.org/.ca_CA
dc.rights© The Author(s) 2023ca_CA
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/ca_CA
dc.subjectconvolutionca_CA
dc.subjectwinograd minimal filtering algorithmca_CA
dc.subjecthigh performanceca_CA
dc.subjectvector intrinsicsca_CA
dc.subjectSIMD unitsca_CA
dc.subjectmulti-core processorsca_CA
dc.titleEfficient and portable Winograd convolutions for multi-core processorsca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1007/s11227-023-05088-4
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.type.versioninfo:eu-repo/semantics/publishedVersionca_CA
project.funder.nameCRUE-CSICca_CA
project.funder.nameGeneralitat Valencianaca_CA
project.funder.nameJunta de Andalucíaca_CA
oaire.awardNumberPID2020-113656RB-C21/C22ca_CA
oaire.awardNumberMCIN/AEI/10.13039/501100011033ca_CA
oaire.awardNumberCDEIGENT/2018/014ca_CA
oaire.awardNumberPOSTDOC_21_00025ca_CA
oaire.awardNumberFJC2019-039222-Ica_CA
oaire.awardNumberMCIN/AEI/10.13039/501100011033ca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

© The Author(s) 2023
Excepto si se señala otra cosa, la licencia del ítem se describe como: © The Author(s) 2023