Listar ICC_Articles por autoría "4ec16ac9-2cc2-4d0d-b5ee-1fe4f1232a1d"
Mostrando ítems 1-1 de 1
-
Automatic generation of ARM NEON micro‑kernels for matrix multiplication
Alaejos, Guillermo; Martínez, Héctor; Castelló, Adrián; Dolz, Manuel F.; Igual, Francisco; Alonso-Jordá, Pedro; Quintana-Orti, Enrique S. Springer (2024-03-12)General matrix multiplication (gemm) is a fundamental kernel in scientifc computing and current frameworks for deep learning. Modern realisations of gemm are mostly written in C, on top of a small, highly tuned micro-kernel ...