• closedAccess   A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures 

      Ayguadé, Eduardo; Badía Sala, Rosa María; Cabrera, Daniel; Durán, Alejandro; González, Marc; Igual, Francisco D.; Jiménez González, Daniel; Labarta Mancho, Jesús; Martorell, Xavier; Mayo, Rafael; Pérez, Josep M.; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2009)
      OpenMP has evolved recently towards expressing unstructured parallelism, targeting the parallelization of a broader range of applications in the current multicore era. Homogeneous multicore architectures from major vendors ...
    • openAccess   Analysis of Threading Libraries for High Performance Computing 

      Castelló, Adrián; Mayo, Rafael; Seo, Sangmin; Balaji, Pavan; Quintana-Orti, Enrique S.; Peña Monferrer, Antonio J. IEEE (2020-01-30)
      With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ...
    • openAccess   Argobots: A Lightweight Low-Level Threading and Tasking Framework 

      Seo, Sangmin; Amer, Abdelhalim; Balaji, Pavan; Bordage, Cyril; Bosilca, George; Brooks, Alex; Carns, Philip; Castelló, Adrián; Genet, Damien; Herault, Thomas; Iwasaki, Shintaro; Jindal, Prateek; Kalé, Laxmikant V.; Krishnamoorthy, Sriram; Lifflander, Jonathan; Lu, Huiwei; Meneses, Esteban; Snir, Marc; Sun, Yanhua; Taura, Kenjiro; Beckman, Pete IEEE (2017-10)
      In the past few decades, a number of user-level threading and tasking models have been proposed in the literature to address the shortcomings of OS-level threads, primarily with respect to cost and flexibility. Current ...
    • closedAccess   On the adequacy of lightweight thread approaches for high-level parallel programming models 

      Castelló, Adrián; Mayo, Rafael; Sala, Kevin; Beltran Querol, Vicenç; Balaji, Pavan; Peña Monferrer, Antonio J. Elsevier (2018-07)
      High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: ...
    • openAccess   Strategies to parallelize a finite element mesh truncation technique on multi-core and many-core architectures 

      Badia, Jose M.; Amor-Martin, Adrian; BELLOCH, JOSE A.; Garcia-Castillo, Luis Emilio Springer (2022-12-02)
      Achieving maximum parallel performance on multi-core CPUs and many-core GPUs is a challenging task depending on multiple factors. These include, for example, the number and granularity of the computations or the use of the ...
    • openAccess   Unification of Lightweight Thread Solutions and their Application in High Performance Programming 

      Castelló Gimeno, Adrián Universitat Jaume I (2018-10-02)
      Lightweight thread (LWT) libraries have been developed to tackle fine-grained and dynamic software requirements. These libraries are based on the concept of threads that are managed in the user-space. This thesis aims ...