• openAccess   Exploiting nested task-parallelism in the H-LU factorization 

      Carratalá-Sáez, Rocío; Christophersen, Sven; Aliaga Estellés, José Ignacio; Beltran Querol, Vicenç; Börm, Steffen; Quintana-Orti, Enrique S. Elsevier (2019-04)
      We address the parallelization of the LU factorization of hierarchical matrices (ℋ-matrices) arising from boundary element methods. Our approach exploits task-parallelism via the OmpSs programming model and runtime, which ...
    • openAccess   Hierarchical approach for deriving a reproducible unblocked LU factorization 

      Iakymchuk, Roman; Graillat, Stef; Defour, David; Quintana-Orti, Enrique S. Sage (2019-03-17)
      We propose a reproducible variant of the unblocked LU factorization for graphics processor units (GPUs). For this purpose, we build upon Level-1/2 BLAS kernels that deliver correctly-rounded and reproducible results for ...
    • openAccess   Reducing Power Consumption of the LU Factorization with Partial Pivoting on Multi-Core Processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2011-07)
      In this paper we analyze the trade-off between energy and performance for a data-parallel execution of the LU factorization with partial pivoting on a multi-core processor. To improve power efficiency, we adapt the ...
    • closedAccess   Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators 

      Benner, Peter; Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón, Alfredo Springer (2016-04-02)
      In 1979, P. Huard presented an efficient variant of the Gauss-Jordan elimination for the solution of linear systems. In particular, this alternative algorithm exhibits the same computational cost as the traditional LU-based ...
    • openAccess   Solving Dense Linear Systems on Graphics Processors 

      Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-02)
      We present several algorithms to compute the solution of a linear system of equations on a GPU, as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU computation. We also show how ...
    • closedAccess   Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors 

      Catalán, Sandra; Herrero, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael Elsevier (2018)
      We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct “asymmetric” multicore scenarios. The first one corresponds to an actual hardware-asymmetric ...
    • closedAccess   Using desktop computers to solve large-scale dense linear algebra problems 

      Quintana-Orti, Enrique S.; Marqués-Andrés, Mercedes; Quintana-Ortí, Gregorio; Van de Geijn, Robert A. Springer Science+Business Media (2011-11)
      We provide experimental evidence that current desktop computers feature enough computational power to solve large-scale dense linear algebra problems. While the high computational cost of the numerical methods for solving ...