• closedAccess   A factored variant of the Newton iteration for the solution of algebraic Riccati equations via the matrix sign function 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Springer (2013)
      In this paper we introduce a variant of the Newton iteration for the matrix sign function that results in an efficient numerical solver for a certain class of algebraic Riccati equations (AREs). In particular, when the ...
    • closedAccess   A mixed-precision algorithm for the solution of Lyapunov equations on hybrid CPU–GPU platforms 

      Benner, Peter; Ezzatti, Pablo; Kressner, Daniel; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Elsevier (2011)
      We describe a hybrid Lyapunov solver based on the matrix sign function, where the intensive parts of the computation are accelerated using a graphics processor (GPU) while executing the remaining operations on a general-purpose ...
    • closedAccess   DVFS-control techniques for dense linear algebra operations on multi-core processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S. Springer (2012-11)
      This paper analyzes the impact on power consumption of two DVFS-control strategies when applied to the execution of dense linear algebra operations on multi-core processors. The strategies considered here, prototyped as ...
    • openAccess   Efficient model order reduction of large-scale systems on multi-core platforms 

      Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Springer (2011)
      We propose an efficient implementation of the Balanced Truncation (BT) method for model order reduction when the state-space matrix is symmetric (positive definite). Most of the computational effort required by this method ...
    • closedAccess   Energy-efficient execution of dense linear algebra algorithms on multi-core processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Verlag (2013-09)
      This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear algebra operations, from the point of view of both computational performance and energy consumption. The strategies ...
    • openAccess   Exploiting Task and Data Parallelism in ILUPACK's Preconditioned CG Solver on NUMA Architectures and Many-core Accelerators 

      Aliaga Estellés, José Ignacio; Badía Sala, Rosa María; Barreda Vayá, Maria; Bollhöffer, Matthias; Dufrechou, Ernesto; Ezzatti, Pablo; Quintana-Orti, Enrique S. Elsevier (2016-05)
      We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-processors based on the Intel Xeon Phi and ...
    • closedAccess   Extending OpenMP to Survive the Heterogeneous Multi-Core Era 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Bellens, Pieter; Cabrera, Daniel; Durán, Alejandro; Ferrer, Roger; González, Marc; Igual, Francisco; Jiménez González, Daniel; Labarta Mancho, Jesús; Martinell, Luis; Martorell, Xavier; Mayo, Rafael; Pérez, Josep M.; Planas, Judit; Quintana-Orti, Enrique S. Springer US (2010)
      This paper advances the state-of-the-art in programming models for exploiting task-level parallelism on heterogeneous many-core systems, presenting a number of extensions to the OpenMP language inspired by the StarSs ...
    • closedAccess   Increasing data locality and introducing Level-3 BLAS in the Neville elimination 

      Alonso-Jordá, Pedro; Cortina Parajón, Raquel; Quintana-Orti, Enrique S.; Ranilla Pastor, José Elsevier (2011-12-01)
      In this paper we present two new algorithmic variants to compute the Neville elimination, with and without pivoting, which improve data locality and cast most of the computations in terms of high-performance Level 3 BLAS. ...
    • openAccess   Parallelizing dense and banded linear algebra libraries using SMPSs 

      Badía Sala, Rosa María; Herrero, Josep R.; Labarta Mancho, Jesús; Pérez, Josep M.; Quintana-Orti, Enrique S.; Quintana-Ortí, Gregorio John Wiley (2009-12-25)
      The promise of future many-core processors, with hundreds of threads running concurrently, has led the developers of linear algebra libraries to rethink their design in order to extract more parallelism, further exploit ...
    • openAccess   Reducing Power Consumption of the LU Factorization with Partial Pivoting on Multi-Core Processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d'Enginyeria i Ciència dels Computadors, Universitat Jaume I (2011-07)
      In this paper we analyze the trade-off between energy and performance for a data-parallel execution of the LU factorization with partial pivoting on a multi-core processor. To improve power efficiency, we adapt the ...
    • closedAccess   Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón, Alfredo Springer (2016-04-02)
      In 1979, P. Huard presented an efficient variant of the Gauss-Jordan elimination for the solution of linear systems. In particular, this alternative algorithm exhibits the same computational cost as the traditional LU-based ...
    • openAccess   Solving “Large” Dense Matrix Problems on Multi-Core Processors and GPUs 

      Marqués-Andrés, Mercedes; Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S.; Van de Geijn, Robert A. Departament d'Enginyeria i Ciència dels Computadors, Universitat Jaume I (2009-01)
      Few realize that, for large matrices, many dense matrix computations achieve nearly the same performance when the matrices are stored on disk as when they are stored in a very large main memory. Similarly, few realize ...
    • openAccess   The Impact of the Multi-core Revolution on Signal Processing 

      González, Alberto; Belloch, Jose A.; Martínez, Francisco J.; Alonso-Jordá, Pedro; García, Víctor M.; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo; Vidal, Antonio M. Universidad Politécnica de Valencia (2010)
      This paper analyzes the influence of new multi-core and many-core architectures on Signal Processing. The article covers both the architectural design and the programming models of current general-purpose multi-core ...