Listar ICC_Articles por autoría "6c39b495-078d-4a3c-bd95-ca23fbcf5ba8"

Adapting concurrency throttling and voltage–frequency scaling for dense eigensolvers

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño Álvarez, María Asunción; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Verlag (2015)

We analyze power dissipation and energy consumption during the execution of high-performance dense linear algebra kernels on multi-core processors. On top of this analysis, we propose and evaluate several strategies to ...

Are our dense linear algebra libraries energy-friendly?. Time–power–energy trade-offs in BLAS and LAPACK

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-05)

In this paper we conduct a detailed analysis of the sources of power dissipation and energy consumption during the execution of current dense linear algebra kernels on multicore processors, binding these two metrics together ...

Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer US (2014)

We investigate the benefits that an energyaware implementation of the runtime in charge of the concurrent execution of ILUPACK —a sophisticated preconditioned iterative solver for sparse linear systems— produces on the ...

Characterizing the efficiency of multicore and manycore processors for the solution of sparse linear systems

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-09)

We analyze the efficiency of servers equipped with state-of-the-art general-purpose multicore processors as well as platforms based on accelerators such as graphics processing units (GPUs) and the Intel Xeon Phi. Following ...

Communication in task-parallel ILU-preconditioned CG solversusing MPI + OmpSs

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Flegar, Goran; Bollhöffer, Matthias; Quintana-Orti, Enrique S. Wiley (2017-11-10)

We target the parallel solution of sparse linear systems via iterative Krylov subspace–based methods enhanced with incomplete LU (ILU)-type preconditioners on clusters of multicore processors. In order to tackle large-scale ...

Convolutional neural nets for estimating the run time and energy consumption of the sparse matrix-vector product

Barreda Vayá, Maria; Dolz, Manuel F.; Castaño Álvarez, María Asunción Sage (2020-08-26)

Modeling the performance and energy consumption of the sparse matrix-vector product (SpMV) is essential to perform off-line analysis and, for example, choose a target computer architecture that delivers the best ...

Energy‐aware strategies for task‐parallel sparse linear system solvers

Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño, Asunción Wiley (2018)

We present several energy‐aware strategies to improve the energy efficiency of a task‐parallel preconditioned Conjugate Gradient (PCG) iterative solver on a Haswell‐EP Intel Xeon. These techniques leverage the power‐saving ...

Exploiting Task and Data Parallelism in ILUPACK's Preconditioned CG Solver on NUMA Architectures and Many-core Accelerators

Aliaga Estellés, José Ignacio; Badía Sala, Rosa María; Barreda Vayá, Maria; Bollhöffer, Matthias; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Elsevier (2016-05)

We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-processors based on the Intel Xeon Phi and ...

Repositori Universitat Jaume I