Browsing ICC_Articles by source "Journal of Systems Architecture. 125 (2022) 102459"
Showing items 1-1 of 1
High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS
Elsevier (2022-03-22)
We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors ...