Listar ICC_Articles por autoría "6c39b495-078d-4a3c-bd95-ca23fbcf5ba8"
Mostrando ítems 1-12 de 12
-
Adapting concurrency throttling and voltage–frequency scaling for dense eigensolvers
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño Álvarez, María Asunción; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Verlag (2015)We analyze power dissipation and energy consumption during the execution of high-performance dense linear algebra kernels on multi-core processors. On top of this analysis, we propose and evaluate several strategies to ... -
Are our dense linear algebra libraries energy-friendly?. Time–power–energy trade-offs in BLAS and LAPACK
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-05)In this paper we conduct a detailed analysis of the sources of power dissipation and energy consumption during the execution of current dense linear algebra kernels on multicore processors, binding these two metrics together ... -
Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer US (2014)We investigate the benefits that an energyaware implementation of the runtime in charge of the concurrent execution of ILUPACK —a sophisticated preconditioned iterative solver for sparse linear systems— produces on the ... -
Characterizing the efficiency of multicore and manycore processors for the solution of sparse linear systems
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-09)We analyze the efficiency of servers equipped with state-of-the-art general-purpose multicore processors as well as platforms based on accelerators such as graphics processing units (GPUs) and the Intel Xeon Phi. Following ... -
Communication in task-parallel ILU-preconditioned CG solversusing MPI + OmpSs
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Flegar, Goran; Bollhöffer, Matthias; Quintana-Orti, Enrique S. Wiley (2017-11-10)We target the parallel solution of sparse linear systems via iterative Krylov subspace–based methods enhanced with incomplete LU (ILU)-type preconditioners on clusters of multicore processors. In order to tackle large-scale ... -
Convolutional neural nets for estimating the run time and energy consumption of the sparse matrix-vector product
Barreda Vayá, Maria; Dolz, Manuel F.; Castaño Álvarez, María Asunción Sage (2020-08-26)Modeling the performance and energy consumption of the sparse matrix-vector product (SpMV) is essential to perform off-line analysis and, for example, choose a target computer architecture that delivers the best ... -
Energy‐aware strategies for task‐parallel sparse linear system solvers
Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño, Asunción Wiley (2018)We present several energy‐aware strategies to improve the energy efficiency of a task‐parallel preconditioned Conjugate Gradient (PCG) iterative solver on a Haswell‐EP Intel Xeon. These techniques leverage the power‐saving ... -
Exploiting Task and Data Parallelism in ILUPACK's Preconditioned CG Solver on NUMA Architectures and Many-core Accelerators
Aliaga Estellés, José Ignacio; Badía Sala, Rosa María; Barreda Vayá, Maria; Bollhöffer, Matthias; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Elsevier (2016-05)We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-processors based on the Intel Xeon Phi and ... -
Iteration-fusing conjugate gradient for sparse linear systems with MPI + OmpSs
Barreda Vayá, Maria; Aliaga Estellés, José Ignacio; Beltran Querol, Vicenç; Casas, Marc Springer (2019-12-10)In this paper, we target the parallel solution of sparse linear systems via iterative Krylov subspace-based method enhanced with a block-Jacobi preconditioner on a cluster of multicore processors. In order to tackle ... -
Performance modeling of the sparse matrix–vector product via convolutional neural networks
Barreda Vayá, Maria; Dolz, Manuel F.; Castaño Álvarez, María Asunción; Alonso-Jordá, Pedro; Quintana-Orti, Enrique S. Springer (2020-02-04)Modeling the execution time of the sparse matrix–vector multiplication (SpMV) on a current CPU architecture is especially complex due to (i) irregular memory accesses; (ii) indirect memory referencing; and (iii) low ... -
Reproducibility of parallel preconditioned conjugate gradient in hybrid programming environments
Iakymchuk, Roman; Barreda Vayá, Maria; Graillat, Stef; Aliaga Estellés, José Ignacio; Quintana-Orti, Enrique S. Sage (2020-06-17)The Preconditioned Conjugate Gradient method is often employed for the solution of linear systems of equations arising in numerical simulations of physical phenomena. While being widely used, the solver is also known for ... -
Reproducibility strategies for parallel Preconditioned Conjugate Gradient
Iakymchuk, Roman; Barreda Vayá, Maria; Wiesenberger, Matthias; Aliaga Estellés, José Ignacio; Quintana-Orti, Enrique S. Elsevier (2020-01-02)The Preconditioned Conjugate Gradient method is often used in numerical simulations. While being widely used, the solver is also known for its lack of accuracy while computing the residual. In this article, we aim at a ...