    • closedAccess   A Parallel Multi-threaded Solver for Symmetric Positive Definite Bordered-Band Linear Systems 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón, Alfredo Springer (2016-04)
      We present a multi-threaded solver for symmetric positive definite linear systems where the coefficient matrix of the problem features a bordered-band non-zero pattern. The algorithms that implement this approach heavily ...
    • closedAccess   A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Cabrera, Daniel; Durán, Alejandro; González, Marc; Igual, Francisco; Jiménez González, Daniel; Labarta Mancho, Jesús; Martorell, Xavier; Mayo, Rafael; Pérez, Josep M.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2009)
      OpenMP has evolved recently towards expressing unstructured parallelism, targeting the parallelization of a broader range of applications in the current multicore era. Homogeneous multicore architectures from major vendors ...
    • openAccess   Accelerating BST Methods for Model Reduction with Graphics Processors 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Springer Berlin Heidelberg (2012)
      Model order reduction of a dynamical linear time-invariant system appears in many scientific and engineering applications. Numerically reliable SVD-based methods for this task require O(n^3) floating-point arithmetic operations, ...
    • openAccess   Accelerating Model Reduction of Large Linear Systems with Graphics Processors 

      Benner, Peter; Ezzatti, Pablo; Kressner, Daniel; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Springer Berlin Heidelberg (2012)
      Model order reduction of a dynamical linear time-invariant system appears in many applications from science and engineering. Numerically reliable SVD-based methods for this task require in general O(n^3) floating-point ...
    • openAccess   An Extension of the StarSs Programming Model for Platforms with Multiple GPUs 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Igual, Francisco; Labarta Mancho, Jesús; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2009)
      While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized ...
    • closedAccess   Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners 

      Aliaga Estellés, José Ignacio; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2012)
      We analyze the energy-performance balance of a task-parallel computation of an ILU-based preconditioner for the solution of sparse linear systems on multi-core processors. In particular, we elaborate a theoretical model ...
    • closedAccess   Parallelization of Multilevel ILU Preconditioners on Distributed-Memory Multiprocessors 

      Aliaga Estellés, José Ignacio; Bollhöfer, Matthias; Martín Huertas, Alberto F.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2012)
      In this paper we investigate the parallelization of the ILUPACK library for the solution of sparse linear systems on distributed-memory multiprocessors. The parallelization approach employs multilevel graph partitioning ...
    • closedAccess   Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón, Alfredo Springer (2016-04-02)
      In 1979, P. Huard presented an efficient variant of the Gauss-Jordan elimination for the solution of linear systems. In particular, this alternative algorithm exhibits the same computational cost as the traditional LU-based ...