Listar UJI: Investigación por autoría "894a19a1-e965-47da-b710-9a2e6a496ffa"
Mostrando ítems 1-1 de 1
-
Automatic generation of ARM NEON micro‑kernels for matrix multiplication
Alaejos, Guillermo; Martínez, Héctor; Castelló, Adrián; Dolz, Manuel F.; Igual, Francisco; Alonso-Jordá, Pedro; Quintana-Orti, Enrique S. Springer (2024-03-12)General matrix multiplication (gemm) is a fundamental kernel in scientifc computing and current frameworks for deep learning. Modern realisations of gemm are mostly written in C, on top of a small, highly tuned micro-kernel ...