• openAccess   Programming matrix algorithms-by-blocks for thread-level parallelism 

      Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S.; Van de Geijn, Robert A.; Van Zee, Field G.; Chan, Ernie Association for Computing Machinery (2009-07)
      With the emergence of thread-level parallelism as the primary means for continued improvement of performance, the programmability issue has reemerged as an obstacle to the use of architectural advances. We argue that ...
    • closedAccess   The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations 

      Igual, Francisco; Chan, Ernie; Quintana-Orti, Enrique S.; Quintana-Ortí, Gregorio; Van de Geijn, Robert A. Elsevier (2012)
      Parallel accelerators are playing an increasingly important role in scientific computing. However, it is perceived that their weakness nowadays is their reduced “programmability” in comparison with traditional general-purpose ...
    • openAccess   The libflame library for dense matrix computations 

      Van Zee, Field G.; Chan, Ernie; Van de Geijn, Robert A.; Quintana-Ortí, Gregorio; Quintana-Orti, Enrique S. IEEE Computer Society (2009-11)
      Researchers from the Formal Linear Algebra Method Environment (Flame) project have developed new methodologies for analyzing, designing, and implementing linear algebra libraries. These solutions, which have culminated in ...