Listar por autoría "e5588b15-3793-4ef4-a00b-307cce5aae68"
Mostrando ítems 1-1 de 1
-
Reformulating the direct convolution for high-performance deep learning inference on ARM processors
Barrachina Mir, Sergio; Castelló, Adrián; Dolz, Manuel F.; Low, Tze Meng; Martinez, Hector; Quintana-Orti, Enrique S.; Upasana, Sridhar; Tomás Domínguez, Andrés Enrique Elsevier (2022-12-20)We present two high-performance implementations of the convolution operator via the direct algorithm that outperform the so-called lowering approach based on the im2col transform plus the gemm kernel on an ARMv8-based ...