Showing items 1-2 of 2
Towards portable realizations of Winograd-based convolution with vector intrinsics and OpenMP
(IEEE, 2022)
We take a step forward in the direction of developing high performance codes for the convolution, based on the Winograd transformation, that are easy to customize for different processor architectures. In our approach, ...
Convolution Operators for Deep Learning Inference on the Fujitsu A64FX Processor
(IEEE, 2022)
The convolution operator is a crucial kernel for many computer vision and signal processing applications that rely on deep learning (DL) technologies. As such, the efficient implementation of this operator has received ...