    • closedAccess   A complete and efficient CUDA-sharing solution for HPC clusters

      Peña Monferrer, Antonio J.; Reaño, Carlos; Silla, Federico; Mayo, Rafael; Quintana-Orti, Enrique S.; Duato, José Elsevier (2014)
      In this paper we detail the key features, architectural design, and implementation of rCUDA, an advanced framework to enable remote and transparent GPGPU acceleration in HPC clusters. rCUDA allows decoupling GPUs from ...
    • closedAccess   A parallel solver for huge dense linear systems  

      Badía, José; Movilla, Jose L.; Climente, Juan I.; Castillo Catalán, María Isabel; Marqués-Andrés, Mercedes; Mayo, Rafael; Quintana-Orti, Enrique S.; Planelles, Josep Elsevier (2011-11)
      HDSS (Huge Dense Linear System Solver) is a Fortran Application Programming Interface (API) to facilitate the parallel solution of very large dense systems to scientists and engineers. The API makes use of parallelism to ...
    • closedAccess   A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Cabrera, Daniel; Durán, Alejandro; González, Marc; Igual, Francisco; Jiménez González, Daniel; Labarta Mancho, Jesús; Martorell, Xavier; Mayo, Rafael; Pérez, Josep M.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2009)
      OpenMP has evolved recently towards expressing unstructured parallelism, targeting the parallelization of a broader range of applications in the current multicore era. Homogeneous multicore architectures from major vendors ...
    • openAccess   A simulator to assess energy saving strategies and policies in HPC workloads 

      Quintana-Orti, Enrique S.; Mayo, Rafael; Iserte, Sergio; Fernández Fernández, Juan Carlos; Dolz, Manuel F. Association for Computing Machinery (ACM) (2012-07)
      In recent years power consumption of high performance computing (HPC) clusters has become a growing problem due, e.g., to the economic cost of electricity, the emission of carbon dioxide (with negative impact on the ...
    • openAccess   A Survey on Malleability Solutions for High-Performance Distributed Computing 

      Aliaga Estellés, José Ignacio; Castillo, Maribel; Iserte, Sergio; Martín Álvarez, Iker; Mayo, Rafael MDPI (2022-05-22)
      Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Performance Computing (HPC) facilities is a cornerstone in the next generation of exascale supercomputers. Process malleability ...
    • openAccess   An Extension of the StarSs Programming Model for Platforms with Multiple GPUs 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Igual, Francisco; Labarta Mancho, Jesús; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2009)
      While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized ...
    • openAccess   Analysis of Threading Libraries for High Performance Computing 

      Castelló, Adrián; Mayo, Rafael; Seo, Sangmin; Balaji, Pavan; Quintana-Orti, Enrique S.; Peña Monferrer, Antonio J. IEEE (2020-01-30)
      With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ...
    • openAccess   Aplicación sintética para el estudio de maleabilidad en computación de altas prestaciones 

      Martín Álvarez, Iker; Aliaga Estellés, José Ignacio; Castillo Catalán, María Isabel; Iserte, Sergio; Mayo, Rafael Universitat d'Alacant (2022)
      Nowadays, improving performance on large computer clusters calls for the development of malleable applications. Thus, while these applications run within a job, the management system for ...
    • openAccess   Architecture-Aware Configuration and Scheduling of Matrix Multiplication on Asymmetric Multicore Processors

      Catalán, Sandra; Igual, Francisco; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Springer US (2016-09)
      Asymmetric multicore processors (AMPs) have recently emerged as an appealing technology for severely energy-constrained environments, especially in mobile appliances where heterogeneity in applications is mainstream. ...
    • openAccess   Assessing Power Monitoring Approaches for Energy and Power Analysis of Computers 

      El Mehdi Diouri, Mohammed; Dolz, Manuel F.; Glück, Olivier; Lefèvre, Laurent; Alonso-Jordá, Pedro; Catalán, Sandra; Mayo, Rafael; Quintana-Orti, Enrique S. Elsevier (2014-06)
      Large-scale distributed systems (e.g., datacenters, HPC systems, clouds, large-scale networks, etc.) consume and will consume enormous amounts of energy. Therefore, accurately monitoring the power dissipation and energy ...
    • openAccess   Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Dolz, Manuel F.; Martín Huertas, Alberto F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer US (2014)
      We investigate the benefits that an energy-aware implementation of the runtime in charge of the concurrent execution of ILUPACK (a sophisticated preconditioned iterative solver for sparse linear systems) produces on the ...
    • openAccess   Attaining High Performance in General-Purpose Computations on Current Graphics Processors 

      Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d'Enginyeria i Ciència dels Computadors, Universitat Jaume I (2008-01)
      The increase in performance of the last generations of graphics processors (GPUs) has made this class of hardware a coprocessing platform of remarkable success in certain types of operations. In this paper we evaluate ...
    • closedAccess   Color and texture analysis using emerging parallel architectures 

      Igual, Francisco; Mayo, Rafael; Hartley, Timothy; Çatalyürek, Ümit V.; Ruiz, Antonio; Ujaldon, Manuel SAGE Publications (2011-11)
      While image texture is effective for use in pattern-recognition and image-analysis algorithms, textural features are time-consuming to calculate on standard CPUs. Therefore, we present novel implementations of textural-feature ...
    • openAccess   DMRlib: Easy-coding and Efficient Resource Management for Job Malleability 

      Iserte, Sergio; Mayo, Rafael; Quintana-Orti, Enrique S.; Pena, Antonio J. IEEE (2020-09-09)
      Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. However, the non-negligible ...
    • closedAccess   DVFS-control techniques for dense linear algebra operations on multi-core processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Igual, Francisco; Mayo, Rafael; Quintana-Orti, Enrique S. Springer (2012-11)
      This paper analyzes the impact on power consumption of two DVFS-control strategies when applied to the execution of dense linear algebra operations on multi-core processors. The strategies considered here, prototyped as ...
    • openAccess   DVFS-Technique for Dense Linear Algebra Operations on Multi-Core Processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Mayo, Rafael; Quintana-Orti, Enrique S. Departament d'Enginyeria i Ciència dels Computadors, Universitat Jaume I (2011-05)
      This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear algebra operations, from the point of view of both computational performance and energy consumption. In particular, ...
    • openAccess   Dynamic Management of Resource Allocation for OmpSs Jobs 

      Iserte, Sergio; Peña Monferrer, Antonio J.; Mayo, Rafael; Quintana-Orti, Enrique S.; Beltran Querol, Vicenç Carretero Pérez, Jesús (2016-02)
      The main purpose of this thesis is to investigate the relationship between task-based programming models and resource management systems in order to provide a smart, autonomous, load-balancing and fault-tolerant system. Thus, ...
    • openAccess   Dynamic reconfiguration of noniterative scientific applications: A case study with HPG aligner

      Iserte, Sergio; Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Mayo, Rafael; Peña Monferrer, Antonio J. Springer (2018-09)
      Several studies have proved the benefits of job malleability, that is, the capacity of an application to adapt its parallelism to a dynamically changing number of allocated processors. The most remarkable advantages of ...
    • openAccess   Enabling big data analytics in the hybrid cloud using iterative MapReduce 

      Clemente-Castelló, Francisco J.; Bogdan, Nicolae; Katrinis, Kostas; Rafique, M. Mustafa; Mayo, Rafael; Fernández Fernández, Juan Carlos; Loreti, Daniela HAL-Inria (2015-12)
      The cloud computing model has seen tremendous commercial success through its materialization via two prominent models to date, namely public and private cloud. Recently, a third model combining the former two ...
    • closedAccess   Energy-efficient execution of dense linear algebra algorithms on multi-core processors 

      Alonso-Jordá, Pedro; Dolz, Manuel F.; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Verlag (2013-09)
      This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear algebra operations, from the point of view of both computational performance and energy consumption. The strategies ...