Listar ICC_Articles por autoría "c3ed8f0c-8e41-4f40-85b0-ffc168550e51"
Mostrando ítems 1-1 de 1
-
High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS
Castelló, Adrián; Barrachina Mir, Sergio; Dolz, Manuel F.; Quintana-Orti, Enrique S.; San Juan, Pau; Tomás Domínguez, Andrés Enrique Elsevier (2022-03-22)We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors ...