Mostrar el registro sencillo del ítem

dc.contributor.authorSan Juan, Pablo
dc.contributor.authorCastelló, Adrián
dc.contributor.authorDolz, Manuel F.
dc.contributor.authorAlonso-Jordá, Pedro
dc.contributor.authorQuintana-Orti, Enrique S.
dc.date.accessioned2020-12-04T12:34:06Z
dc.date.available2020-12-04T12:34:06Z
dc.date.issued2020-10
dc.identifier.citationP. San Juan, A. Castelló, M. F. Dolz, P. Alonso-Jordá and E. S. Quintana-Ortí, "High Performance and Portable Convolution Operators for Multicore Processors," 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Porto, Portugal, 2020, pp. 91-98, doi: 10.1109/SBAC-PAD49847.2020.00023.ca_CA
dc.identifier.issn2643-3001
dc.identifier.urihttp://hdl.handle.net/10234/190734
dc.descriptionPonència presentada a 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) celebrat a Porto, del 9 al 11 de setembre de 2020ca_CA
dc.description.abstractThe considerable impact of Convolutional Neural Networks on many Artificial Intelligence tasks has led to the development of various high performance algorithms for the convolution operator present in this type of networks. One of these approaches leverages the im2col transform followed by a general matrix multiplication (gemm) in order to take advantage of the highly optimized realizations of the gemm kernel in many linear algebra libraries. The main problems of this approach are 1) the large memory workspace required to host the intermediate matrices generated by the im2col transform; and 2) the time to perform the im2col transform, which is not negligible for complex neural networks. This paper presents a portable high performance convolution algorithm based on the BLIS realization of the gemm kernel that avoids the use of the intermediate memory by taking advantage of the BLIS structure. In addition, the proposed algorithm eliminates the cost of the explicit im2col transform, while maintaining the portability and performance of the underlying realization of gemm in BLIS.ca_CA
dc.format.extent17 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherIEEEca_CA
dc.rights©2020 IEEEca_CA
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/*
dc.titleHigh Performance and Portable Convolution Operators for Multicore Processorsca_CA
dc.typeinfo:eu-repo/semantics/conferenceObjectca_CA
dc.identifier.doihttp://dx.doi.org/10.1109/SBAC-PAD49847.2020.00023
dc.relation.projectIDinfo:eu-repo/grantAgreement/MICIU/TIN2017-82972-Rca_CA
dc.relation.projectIDinfo:eu-repo/grantAgreement/GVA/Prometeo-2019/109
dc.relation.projectIDinfo:eu-repo/grantAgreement/GVA/CDEIGENT/2018/014
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.relation.publisherVersionhttps://ieeexplore.ieee.org/document/9235053ca_CA
dc.type.versioninfo:eu-repo/semantics/submittedVersionca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem