Towards portable realizations of Winograd-based convolution with vector intrinsics and OpenMP
Other documents by the authors: Dolz, Manuel F.; Castelló, Adrián; Quintana-Ortí, Enrique S.
Metadata
Research
This resource is restricted.
https://doi.org/10.1109/PDP55904.2022.00015
Title
Towards portable realizations of Winograd-based convolution with vector intrinsics and OpenMP
Publication date
2022
Publisher
IEEE
ISBN
9781665469586
Bibliographic citation
M. F. Dolz, A. Castelló and E. S. Quintana-Ortí, "Towards Portable Realizations of Winograd-based Convolution with Vector Intrinsics and OpenMP," 2022 30th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), 2022, pp. 39-46, doi: 10.1109/PDP55904.2022.00015.
Document type
info:eu-repo/semantics/conferenceObject
Publisher's version
https://ieeexplore.ieee.org/document/9756700
Version
info:eu-repo/semantics/publishedVersion
Abstract
We take a step forward in the direction of developing high performance codes for the convolution, based on the Winograd transformation, that are easy to customize for different processor architectures. In our approach, augmenting the portability of the solution is achieved via the introduction of vector intrinsics to exploit the SIMD (single-instruction multiple-data) capabilities of current processors as well as OpenMP pragmas to exploit multi-thread parallelism. While this comes at the cost of sacrificing a fraction of the computational performance, our experimental results on two distinct processors, with Intel Xeon Skylake and ARM Cortex A57 architectures, show that the impact is affordable, and still renders a Winograd-based solution that is competitive with the general method for the convolution based on the so-called im2col transform followed by a matrix-matrix multiplication.
URL: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9756700&isnumber=9756614
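To make the idea in the abstract concrete, the following is a minimal illustrative sketch in C and not the authors' code. It assumes the simplest one-dimensional Winograd variant, F(2,3), which produces two outputs of a 3-tap filter with four multiplications instead of six. The function names (conv1d_direct, conv1d_winograd_f23, conv_rows_winograd) are hypothetical, and the portable `#pragma omp simd` directive stands in for the architecture-specific vector intrinsics that the paper discusses.

```c
#include <stddef.h>

/* Reference: direct 3-tap sliding-window filter.
   out[i] = in[i]*g[0] + in[i+1]*g[1] + in[i+2]*g[2]; in holds n_out + 2 values. */
void conv1d_direct(const float *in, const float g[3], float *out, size_t n_out)
{
    for (size_t i = 0; i < n_out; i++)
        out[i] = in[i] * g[0] + in[i + 1] * g[1] + in[i + 2] * g[2];
}

/* Winograd F(2,3): each tile of 4 inputs yields 2 outputs with 4 multiplies.
   Assumes n_out is even and that in and out do not overlap. */
void conv1d_winograd_f23(const float *in, const float g[3], float *out, size_t n_out)
{
    /* Filter transform, computed once per filter. */
    const float u0 = g[0];
    const float u1 = 0.5f * (g[0] + g[1] + g[2]);
    const float u2 = 0.5f * (g[0] - g[1] + g[2]);
    const float u3 = g[2];

    /* Tiles are independent, so the loop can be vectorized (assumed stand-in
       for hand-written vector intrinsics). */
    #pragma omp simd
    for (size_t t = 0; t < n_out / 2; t++) {
        const float *d = in + 2 * t;
        /* Input transform and element-wise products. */
        const float m1 = (d[0] - d[2]) * u0;
        const float m2 = (d[1] + d[2]) * u1;
        const float m3 = (d[2] - d[1]) * u2;
        const float m4 = (d[1] - d[3]) * u3;
        /* Output transform. */
        out[2 * t]     = m1 + m2 + m3;
        out[2 * t + 1] = m2 - m3 - m4;
    }
}

/* Apply the filter to a batch of independent rows, one row per loop iteration,
   distributing rows across threads with OpenMP. */
void conv_rows_winograd(const float *in, const float g[3], float *out,
                        size_t rows, size_t n_out)
{
    #pragma omp parallel for
    for (size_t r = 0; r < rows; r++)
        conv1d_winograd_f23(in + r * (n_out + 2), g, out + r * n_out, n_out);
}
```

Compiled without OpenMP support, the pragmas are ignored and the code runs sequentially, which is the portability property the pragma-based approach relies on. Comparing the output of conv_rows_winograd with conv1d_direct on the same rows is a simple way to check the transforms.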
Description
Paper presented at the 2022 30th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)
Published in
Proceedings of the 2022 30th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)
Funding entity
Agencia Estatal de Investigación | Generalitat Valenciana | Ministerio de Ciencia, Innovación y Universidades
Project or grant code
PID2020-113656RB-C21/C22 | Prometeo/2019/109 | FJC2019-039222-I | CDEIGENT/2018/014