Listar por tema "Matrix multiplication"
Mostrando ítems 1-3 de 3
-
Analytical Modeling is Enough for High Performance BLIS
ACM (2016-09)We show how the BLAS-like Library Instantiation Software (BLIS) framework, which provides a more detailed layering of the GotoBLAS (now maintained as OpenBLAS) implementation, allows one to analytically determine tuning ... -
Architecture-Aware Con guration and Scheduling of Matrix Multiplication on Asymmetric Multicore Processors
Springer US (2016-09)Asymmetric multicore processors (AMPs) have recently emerged as an appealing technology for severely energy-constrained environments, especially in mobile appliances where heterogeneity in applications is mainstream. ... -
Time and energy modeling of high–performance Level-3 BLAS on x86 architectures
Elsevier (2015-06)We present accurate piece-wise models for the time and energy costs of high performance implementations of both the matrix multiplication (gemm) and the triangular system solve with multiple right-hand sides (trsm) on x86 ...