• openAccess   A Data-Parallel ILUPACK for Sparse General and Symmetric Indefinite Linear Systems 

      Aliaga Estellés, José Ignacio; Bollhöfer, Matthias; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Springer (2017-05-28)
      The solution of sparse linear systems of large dimension is a critical step in problems that span a diverse range of applications. For this reason, a number of iterative solvers have been developed, among which ILUPACK ...
    • closedAccess   A fast band–Krylov eigensolver for macromolecular functional motion simulation on multicore architectures and graphics processors 

      Aliaga Estellés, José Ignacio; Alonso-Jordá, Pedro; Badía, José; Chacón, Pablo; Davidovic, Davor; López Blanco, José R.; Quintana-Orti, Enrique S. Elsevier (2016-03-15)
      We introduce a new iterative Krylov subspace-based eigensolver for the simulation of macromolecular motions on desktop multithreaded platforms equipped with multicore processors and, possibly, a graphics accelerator (GPU). ...
    • openAccess   A Survey on Malleability Solutions for High-Performance Distributed Computing 

      Aliaga Estellés, José Ignacio; Castillo, Maribel; Iserte, Sergio; Martín Álvarez, Iker; Mayo, Rafael MDPI (2022-05-22)
      Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Performance Computing (HPC) facilities is a cornerstone in the next generation of exascale supercomputers. Process malleability ...
    • closedAccess   Accelerating the task/data-parallel version of ILUPACK’s BiCG in multi-CPU/GPU configurations 

      Aliaga Estellés, José Ignacio; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Elsevier (2019)
      ILUPACK is a valuable tool for the solution of sparse linear systems via iterative Krylov subspace-based methods. Its relevance for the solution of real problems has motivated several efforts to enhance its performance on ...
    • openAccess   Adapting concurrency throttling and voltage–frequency scaling for dense eigensolvers 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño Álvarez, María Asunción; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Verlag (2015)
      We analyze power dissipation and energy consumption during the execution of high-performance dense linear algebra kernels on multi-core processors. On top of this analysis, we propose and evaluate several strategies to ...
    • closedAccess   An efficient GPU version of the preconditioned GMRES method 

      Aliaga Estellés, José Ignacio; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Springer (2019-03)
      In a large number of scientific applications, the solution of sparse linear systems is the stage that concentrates most of the computational effort. This situation has motivated the study and development of several iterative ...
    • openAccess   Aplicación sintética para el estudio de maleabilidad en computación de altas prestaciones 

      Martín Álvarez, Iker; Aliaga Estellés, José Ignacio; Castillo Catalán, María Isabel; Iserte, Sergio; Mayo, Rafael Universitat d'Alacant (2022)
      Hoy en día, la mejora del rendimiento en los grandes clusters de ordenadores recomienda el desarrollo de aplicaciones maleables. Así, durante la ejecución de estas aplicaciones en un trabajo, el sistema de gestión de ...
    • closedAccess   Are our dense linear algebra libraries energy-friendly?. Time–power–energy trade-offs in BLAS and LAPACK 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-05)
      In this paper we conduct a detailed analysis of the sources of power dissipation and energy consumption during the execution of current dense linear algebra kernels on multicore processors, binding these two metrics together ...
    • openAccess   Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer US (2014)
      We investigate the benefits that an energyaware implementation of the runtime in charge of the concurrent execution of ILUPACK —a sophisticated preconditioned iterative solver for sparse linear systems— produces on the ...
    • openAccess   Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs 

      Aliaga Estellés, José Ignacio; Anzt, Hartwig; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique; Tsai, Yuhsiang M. Springer (2021)
      We contribute to the optimization of the sparse matrix-vector product on graphics processing units by introducing a variant of the coordinate sparse matrix layout that compresses the integer representation of the matrix ...
    • openAccess   Characterization of Multicore Architectures using Task-Parallel ILU-type Preconditioned CG Solvers 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Quintana-Orti, Enrique S. (2017-07-05)
      We investigate the eficiency of state-of-the-art multicore processors using a multi-threaded task-parallel implementation of the Conjugate Gradient (CG) method, accelerated with an incomplete LU (ILU) preconditioner. ...
    • openAccess   Characterizing the efficiency of multicore and manycore processors for the solution of sparse linear systems 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-09)
      We analyze the efficiency of servers equipped with state-of-the-art general-purpose multicore processors as well as platforms based on accelerators such as graphics processing units (GPUs) and the Intel Xeon Phi. Following ...
    • openAccess   Communication in task-parallel ILU-preconditioned CG solversusing MPI + OmpSs 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Flegar, Goran; Bollhöffer, Matthias; Quintana-Orti, Enrique S. Wiley (2017-11-10)
      We target the parallel solution of sparse linear systems via iterative Krylov subspace–based methods enhanced with incomplete LU (ILU)-type preconditioners on clusters of multicore processors. In order to tackle large-scale ...
    • openAccess   Compressed basis GMRES on high-performance graphics processing units 

      Aliaga Estellés, José Ignacio; Anzt, Hartwig; Tomás Domínguez, Andrés Enrique; Quintana-Orti, Enrique S.; Grützmacher, Thomas Sage (2022-08-05)
      Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the performance of practical realizations of these methods is ...
    • openAccess   Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units 

      Aliaga Estellés, José Ignacio; Anzt, Hartwig; Grützmacher, Thomas; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique John Wiley and Sons (2021)
      We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and the ...
    • openAccess   Dynamic spawning of MPI processes applied to malleability 

      Martín Álvarez, Iker; Aliaga Estellés, José Ignacio; Castillo Catalán, María Isabel; Iserte, Sergio; Mayo, Rafael SAGE Publications (2023-05-29)
      Malleability allows computing facilities to adapt their workloads through resource management systems to maximize the throughput of the facility and the efficiency of the executed jobs. This technique is based on reconfiguring ...
    • openAccess   Energy‐aware strategies for task‐parallel sparse linear system solvers 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño, Asunción Wiley (2018)
      We present several energy‐aware strategies to improve the energy efficiency of a task‐parallel preconditioned Conjugate Gradient (PCG) iterative solver on a Haswell‐EP Intel Xeon. These techniques leverage the power‐saving ...
    • openAccess   Exploiting nested task-parallelism in the H-LU factorization 

      Carratalá-Sáez, Rocío; Christophersen, Sven; Aliaga Estellés, José Ignacio; Beltran Querol, Vicenç; Börm, Steffen; Quintana-Orti, Enrique S. Elsevier (2019-04)
      We address the parallelization of the LU factorization of hierarchical matrices (-matrices) arising from boundary element methods. Our approach exploits task-parallelism via the OmpSs programming model and runtime, which ...
    • openAccess   Exploiting Task and Data Parallelism in ILUPACK's Preconditioned CG Solver on NUMA Architectures and Many-core Accelerators 

      Aliaga Estellés, José Ignacio; Badía Sala, Rosa María; Barreda Vayá, Maria; Bollhöffer, Matthias; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Elsevier (2016-05)
      We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-processors based on the Intel Xeon Phi and ...
    • closedAccess   Exploiting thread-level parallelism in the iterative solution of sparse linear systems 

      Aliaga Estellés, José Ignacio; Bollhöfer, Matthias; Martín Huertas, Alberto F.; Quintana-Orti, Enrique S. Elsevier (2011)
      We investigate the efficient iterative solution of large-scale sparse linear systems on shared-memory multiprocessors. Our parallel approach is based on a multilevel ILU preconditioner which preserves the mathematical ...