    • closedAccess   A complete and efficient CUDA-sharing solution for HPC clusters 

      Peña Monferrer, Antonio J.; Reaño, Carlos; Silla, Federico; Mayo, Rafael; Quintana-Orti, Enrique S.; Duato, José Elsevier (2014)
      In this paper we detail the key features, architectural design, and implementation of rCUDA, an advanced framework to enable remote and transparent GPGPU acceleration in HPC clusters. rCUDA allows decoupling GPUs from ...
    • openAccess   Architecture-Aware Configuration and Scheduling of Matrix Multiplication on Asymmetric Multicore Processors 

      Catalán, Sandra; Igual, Francisco; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Springer US (2016-09)
      Asymmetric multicore processors (AMPs) have recently emerged as an appealing technology for severely energy-constrained environments, especially in mobile appliances where heterogeneity in applications is mainstream. ...
    • openAccess   Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer US (2014)
      We investigate the benefits that an energy-aware implementation of the runtime in charge of the concurrent execution of ILUPACK, a sophisticated preconditioned iterative solver for sparse linear systems, produces on the ...
    • openAccess   Concurrent and Accurate Short Read Mapping on Multicore Processors 

      Martínez Pérez, Héctor; Tárraga, Joaquín; Medina, Ignacio; Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Dopazo, Joaquín; Quintana-Orti, Enrique S. IEEE (2015-09)
      We introduce a parallel aligner with a work-flow organization for fast and accurate mapping of RNA sequences on servers equipped with multicore processors. Our software, HPG Aligner SA, exploits a suffix array to rapidly ...
    • closedAccess   Evaluating the performance and energy efficiency of the COSMO-ART model system 

      Charles, Joseph; Sawyer, William; Dolz, Manuel F.; Catalán, Sandra Springer Berlin Heidelberg (2015-05)
      In this paper we investigate the energy footprint and performance profiling of COSMO-ART on various HPC platforms. This model is an extension of the operational weather forecast model of the German weather service (DWD), ...
    • closedAccess   Graphics processing unit computing and exploitation of hardware accelerators 

      Amor, Margarita; Doallo, Ramón; Fraguela, Basilio B.; Herrero, Josep R.; Quintana-Orti, Enrique S.; Strzodka, Robert Wiley (2013-01-02)
      This special issue contributes to this promising field with extended and carefully reviewed versions of selected papers from two workshops, namely the 2nd Minisymposium on GPU Computing, which was held as part of the 9th ...
    • closedAccess   High performance computing tools in science and engineering 

      Quintana-Orti, Enrique S.; Vigo Aguiar, Jesús; Ranilla Pastor, José Springer Science+Business Media (2011-11)
      New large-scale problems with growing computational demands continuously arise in many scientific and engineering applications as, e.g., in bioinformatics, computational chemistry, communications or astrophysics. Effectively ...
    • closedAccess   High performance computing tools in science and engineering II 

      Quintana-Orti, Enrique S.; Vigo Aguiar, Jesús; Ranilla Pastor, José Springer Science+Business Media (2011-12)
      This special issue collects research papers selected among those presented at the second minisymposium “HPC applied to Computational Problems in Science and Engineering” which was held in June 2010, in Almeria, Spain. ...
    • closedAccess   The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations 

      Igual, Francisco; Chan, Ernie; Quintana-Orti, Enrique S.; Quintana-Ortí, Gregorio; Van de Geijn, Robert A. Elsevier (2012)
      Parallel accelerators are playing an increasingly important role in scientific computing. However, it is perceived that their weakness nowadays is their reduced “programmability” in comparison with traditional general-purpose ...