Buscar
Dynamic Management of Resource Allocation for OmpSs Jobs
(Carretero Pérez, Jesús, 2016-02)
The main purpose of this thesis is to research in the relation between task-based programming models and resource management systems in order to provide a smart autonomous load-balancing and fault-tolerant system. Thus, ...
Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
(Springer, 2016-06-21)
Directive-based programming models, such as OpenMP, OpenACC, and OmpSs, enable users to accelerate applications by using coprocessors with little effort. These devices offer significant computing power, but their use can ...
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units
(John Wiley and Sons, 2021)
We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and the ...
FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters
(Mary Ann Liebert, 2018-08)
We introduce a version of the epistasis test in FaST-LMM for clusters of multithreaded processors. This new software maintains the sensitivity of the original FaST-LMM while delivering acceleration that is close to linear ...
Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics
(Institution of Engineering and Technology, 2016-03)
The end of Dennard scaling has promoted low power consumption into a first-order concern for computing systems. However, conventional power conservation schemes such as voltage and frequency scaling are reaching their ...
An efficient GPU version of the preconditioned GMRES method
(Springer, 2019-03)
In a large number of scientific applications, the solution of sparse linear systems is the stage that concentrates most of the computational effort. This situation has motivated the study and development of several iterative ...
Noise estimation for hyperspectral subspace identification on FPGAs
(Springer, 2019-05)
We present a reliable and efficient FPGA implementation of a procedure for the computation of the noise estimation matrix, a key stage for subspace identification of hyperspectral images. Our hardware realization is based ...
Extending the Gauss-Huard method for the solution of Lyapunov matrix equations and matrix inversion
(Wiley, 2017-05-10)
The solution of linear systems is a recurrent operation in scientific and engineering applications, traditionally addressed via the LU factorization. The Gauss-Huard (GH) algorithm has been introduced as an efficient ...
Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures
(Elsevier, 2017)
Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at the cost of incurring higher error rates including, among others, Silent Data Corruption (SDC). In this paper, we evaluate ...
Analysis of Threading Libraries for High Performance Computing
(IEEE, 2020-01-30)
With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ...