• openAccess   Efficient model order reduction of large-scale systems on multi-core platforms 

      Ezzatti, Pablo; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo Springer (2011)
      We propose an efficient implementation of the Balanced Truncation (BT) method for model order reduction when the state-space matrix is symmetric (positive definite). Most of the computational effort required by this method ...
    • openAccess   Evaluation and Tuning of the Level 3 CUBLAS for Graphics Processors 

      Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco D.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-01)
      The increase in performance of the last generations of graphics processors (GPUs) has made this class of platform a coprocessing tool with remarkable success in certain types of operations. In this paper we evaluate the ...
    • openAccess   GLAME@lab: An M-script API for Linear Algebra Operations on Graphics Processors 

      Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco D.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-02)
      We propose two high-level application programming interfaces (APIs) to use a graphics processing unit (GPU) as a coprocessor for dense linear algebra operations. Combined with an extension of the FLAME API and an ...
    • closedAccess   Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors 

      Catalán, Sandra; Herrero Zaragoza, José R.; Igual, Francisco D.; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S.; Adeniyi-Jones, Chris Elsevier (2018-03)
      Dense linear algebra libraries, such as BLAS and LAPACK, provide a relevant collection of numerical tools for many scientific and engineering applications. While there exist high performance implementations of the BLAS ...
    • openAccess   Multithreaded Dense Linear Algebra on Asymmetric Multi-core Processors 

      Catalán Pallarés, Sandra Universitat Jaume I (2018-02-06)
      This dissertation targets two important problems. The first one is the design of low-level DLA kernels for architectures comprising two (or more) classes of cores. The main question we have to address here is how to attain ...
    • openAccess   Out-of-Core Solution of Linear Systems on Graphic Processors 

      Castillo Catalán, María Isabel; Igual, Francisco D.; Mayo, Rafael; Rubio, Rafael; Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S.; Van de Geijn, Robert A. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-05)
      We combine two high-level application programming interfaces to solve large-scale linear systems with the data stored on disk using current graphics processors. The result is a simple yet powerful tool that enables a ...
    • openAccess   Solving Dense Linear Systems on Graphics Processors 

      Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Igual, Francisco D.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-02)
      We present several algorithms to compute the solution of a linear system of equations on a GPU, as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU computation. We also show how ...