• openAccess   Level-3 BLAS on a GPU: Picking the Low Hanging Fruit 

      Quintana-Ortí, Gregorio; Van de Geijn, Robert A. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2009-04)
      The arrival of hardware accelerators has created a new gold rush to be the rst to deliver their promise of high performance for numerical applications. Since they are relatively hard to program, with limited language ...
    • openAccess   Out-of-Core Solution of Linear Systems on Graphic Processors 

      Castillo Catalán, María Isabel; Igual, Francisco; Mayo, Rafael; Rubio, Rafael; Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S.; Van de Geijn, Robert A. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-05)
      We combine two high-level application programming interfaces to solve large-scale linear systems with the data stored on disk using current graphics processors. The result is a simple yet powerful tool that enables a ...
    • openAccess   Solving “Large” Dense Matrix Problems on Multi-Core Processors and GPUs 

      Marqués-Andrés, Mercedes; Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S.; Van de Geijn, Robert A. Departament d' Enginyeria i Ciència dels Computadors, Universitat Jaume I (2009-01)
      Few realize that, for large matrices, many dense matrix computations achieve nearly the same performance when the matrices are stored on disk as when they are stored in a very large main memory. Similarly, few realize ...