Listar Departament: Enginyeria i Ciència dels Computadors por autoría "23cae432-3965-44cf-96e3-8fbcbb67ef2a"
Mostrando ítems 1-1 de 1
-
Reformulating the direct convolution for high-performance deep learning inference on ARM processors
Barrachina Mir, Sergio; Castelló, Adrián; Dolz, Manuel F.; Low, Tze Meng; Martinez, Hector; Quintana-Orti, Enrique S.; Upasana, Sridhar; Tomás Domínguez, Andrés Enrique Elsevier (2022-12-20)We present two high-performance implementations of the convolution operator via the direct algorithm that outperform the so-called lowering approach based on the im2col transform plus the gemm kernel on an ARMv8-based ...