• closedAccess   A complete and efficient CUDA-sharing solution for HPC clusters 

      Peña Monferrer, Antonio J.; Reaño, Carlos; Silla, Federico; Mayo, Rafael; Quintana-Orti, Enrique S.; Duato, José Elsevier (2014)
      In this paper we detail the key features, architectural design, and implementation of rCUDA, an advanced framework to enable remote and transparent GPGPU acceleration in HPC clusters. rCUDA allows decoupling GPUs from ...
    • openAccess   DMR API: Improving cluster productivity by turning applications into malleable 

      Iserte, Sergio; Mayo, Rafael; Quintana-Orti, Enrique S.; Beltrán, Vicenç; Peña Monferrer, Antonio J. Elsevier (2018)
      Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of processes. To carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with ...
    • openAccess   Dynamic reconfiguration of noniterative scientific applications A case study with HPG aligner 

      Iserte, Sergio; Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Mayo, Rafael; Peña Monferrer, Antonio J. Springer (2018-09)
      Several studies have proved the benefits of job malleability, that is, the capacity of an application to adapt its parallelism to a dynamically changing number of allocated processors. The most remarkable advantages of ...
    • closedAccess   Improving the user experience of the rCUDA remote GPU virtualization framework 

      Reaño, Carlos; Silla, Federico; Castelló, Adrián; Peña Monferrer, Antonio J.; Mayo, Rafael; Quintana-Orti, Enrique S. Wiley (2014-10)
      Graphics processing units (GPUs) are being increasingly embraced by the high-performance computing community as an effective way to reduce execution time by accelerating parts of their applications. remote CUDA (rCUDA) was ...
    • closedAccess   On the adequacy of lightweight thread approaches for high-level parallel programming models 

      Castelló, Adrián; Mayo, Rafael; Sala, Kevin; Beltran Querol, Vicenç; Balaji, Pavan; Peña Monferrer, Antonio J. Elsevier (2018-07)
      High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: ...