Cerca

Ara mostrant els elements 1-10 d 58

1
2
3
4
. . .
6

Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models

Castelló, Adrián; Pena, Antonio J.; Mayo, Rafael; Planas, Judit; Quintana-Orti, Enrique S.; Balaji, Pavan (Springer, 2016-06-21)

Directive-based programming models, such as OpenMP, OpenACC, and OmpSs, enable users to accelerate applications by using coprocessors with little effort. These devices offer significant computing power, but their use can ...

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units

Aliaga Estellés, José Ignacio; Anzt, Hartwig; Grützmacher, Thomas; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique (John Wiley and Sons, 2021)

We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and the ...

FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters

Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Quintana-Orti, Enrique S.; Rambla, Jordi; Farré, Xavier; Navarro, Arcadi (Mary Ann Liebert, 2018-08)

We introduce a version of the epistasis test in FaST-LMM for clusters of multithreaded processors. This new software maintains the sensitivity of the original FaST-LMM while delivering acceleration that is close to linear ...

Noise estimation for hyperspectral subspace identification on FPGAs

León, Germán; González, Carlos; Mayo, Rafael; Mozos, Daniel; Quintana-Orti, Enrique S. (Springer, 2019-05)

We present a reliable and efficient FPGA implementation of a procedure for the computation of the noise estimation matrix, a key stage for subspace identification of hyperspectral images. Our hardware realization is based ...

Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures

Catalán, Sandra; Herrero Zaragoza, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael (Elsevier, 2017)

Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at the cost of incurring higher error rates including, among others, Silent Data Corruption (SDC). In this paper, we evaluate ...

1
2
3
4
. . .
6

Autoria

Remón Gómez, Alfredo (11)

Aliaga Estellés, José Ignacio (10)

Repositori Universitat Jaume I

Cerca

Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units

FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters

Noise estimation for hyperspectral subspace identification on FPGAs

Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures

Analysis of Threading Libraries for High Performance Computing

Communication in task-parallel ILU-preconditioned CG solversusing MPI + OmpSs

A framework for genomic sequencing on clusters of multicore and manycore processors

Modeling power consumption of 3D MPDATA and the CG method on ARM and Intel multicore architectures

DMR API: Improving cluster productivity by turning applications into malleable

Cerca

Filtres

Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units

FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters

Noise estimation for hyperspectral subspace identification on FPGAs

Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures

Analysis of Threading Libraries for High Performance Computing

Communication in task-parallel ILU-preconditioned CG solversusing MPI + OmpSs

A framework for genomic sequencing on clusters of multicore and manycore processors

Modeling power consumption of 3D MPDATA and the CG method on ARM and Intel multicore architectures

DMR API: Improving cluster productivity by turning applications into malleable