Browse by subject "ARMv8 architecture"
Showing items 1-1 of 1
Reformulating the direct convolution for high-performance deep learning inference on ARM processors
Elsevier (2022-12-20)
We present two high-performance implementations of the convolution operator via the direct algorithm that outperform the so-called lowering approach based on the im2col transform plus the gemm kernel on an ARMv8-based ...