Listar ICC_Articles por fuente "Journal of Systems Architecture 135 (2023) 102806"
Mostrando ítems 1-1 de 1
-
Reformulating the direct convolution for high-performance deep learning inference on ARM processors
Elsevier (2022-12-20)We present two high-performance implementations of the convolution operator via the direct algorithm that outperform the so-called lowering approach based on the im2col transform plus the gemm kernel on an ARMv8-based ...