Listar ICC_Articles por autoría "934f640f-222e-42bd-8b2e-effea14491ca"
Mostrando ítems 1-17 de 17
-
A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting
Catalán, Sandra; Herrero Zaragoza, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael; Van de Geijn, Robert A. IEEE (2019-01)We propose two novel techniques for overcoming load-imbalance encountered when implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for the solution of linear systems. Both techniques target ... -
An analytical methodology to derive power models based on hardware and software metrics
Dolz, Manuel F.; Kunkel, Julian; Chasapis, Konstantinos; Catalán, Sandra Springer Berlin Heidelberg (2015-09)The use of models to predict the power con- sumption of a system is an appealing alternative to wattmeters since they avoid hardware costs and are easy to deploy. In this paper, we present a systematic ... -
Architecture-Aware Con guration and Scheduling of Matrix Multiplication on Asymmetric Multicore Processors
Catalán, Sandra; Igual, Francisco D.; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Springer US (2016-09)Asymmetric multicore processors (AMPs) have recently emerged as an appealing technology for severely energy-constrained environments, especially in mobile appliances where heterogeneity in applications is mainstream. ... -
Assessing Power Monitoring Approaches for Energy and Power Analysis of Computers
El Mehdi Diouria, Mohammed; Dolz, Manuel F.; Glückc, Olivier; Lefèvre, Laurent; Alonso-Jordá, Pedro; Catalán, Sandra; Mayo, Rafael; Quintana-Orti, Enrique S. Elsevier (2014-06)Large-scale distributed systems (e.g., datacenters, HPC systems, clouds, large-scale networks, etc.) consume and will consume enormous amounts of energy. Therefore, accurately monitoring the power dissipation and energy ... -
Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures
Catalán, Sandra; Herrero, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael Elsevier (2018)Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at the cost of incurring higher error rates including, among others, Silent Data Corruption (SDC). In this paper, we evaluate ... -
Energy Balance between Voltage-Frequency Scaling and Resilience for Linear Algebra Routines on Low-Power Multicore Architectures
Catalán, Sandra; Herrero Zaragoza, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael Elsevier (2017)Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at the cost of incurring higher error rates including, among others, Silent Data Corruption (SDC). In this paper, we evaluate ... -
Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics
Chalios, Charalampos; Nikolopoulos, Dimitrios S.; Catalán, Sandra; Quintana-Orti, Enrique S. Institution of Engineering and Technology (2016-03)The end of Dennard scaling has promoted low power consumption into a first-order concern for computing systems. However, conventional power conservation schemes such as voltage and frequency scaling are reaching their ... -
Evaluating the performance and energy efficiency of the COSMO-ART model system
Charles, Joseph; Sawyer, William; Dolz, Manuel F.; Catalán, Sandra Springer Berlin Heidelberg (2015-05)In this paper we investigate the energy footprint and performance profiling of COSMO-ART on various HPC platforms. This model is an extension of the operational weather forecast model of the German weather service (DWD), ... -
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD
Rodríguez Sánchez, Rafael; Catalán, Sandra; Herrero, José R.; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique Springer Verlag (2019)We address the reduction to compact band forms, via unitary similarity transformations, for the solution of symmetric eigenvalue problems and the computation of the singular value decomposition (SVD). Concretely, in the ... -
Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors
Catalán, Sandra; Herrero Zaragoza, José R.; Igual, Francisco D.; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S.; Adeniyi-Jones, Chris Elsevier (2018-03)Dense linear algebra libraries, such as BLAS and LAPACK, provide a relevant collection of numerical tools for many scientific and engineering applications. While there exist high performance implementations of the BLAS ... -
Programming parallel dense matrix factorizations with look-ahead and OpenMP
Catalán, Sandra; Castelló, Adrián; Igual, Francisco D.; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Springer (2019)We investigate a parallelization strategy for dense matrix factorization (DMF) algorithms, using OpenMP, that departs from the legacy (or conventional) solution, which simply extracts concurrency from a multi-threaded ... -
Reducing the cost of power monitoring with DC wattmeters
Castaño Álvarez, María Asunción; Catalán, Sandra; Mayo, Rafael; Quintana-Orti, Enrique S. Springer Berlin Heidelberg (2015-05)The use of internal DC wattmeters, connected to the ATX lines that distribute power from the supply unit to the computer components, is a luring method to profile power in server configurations due to the accurate and ... -
Revisiting conventional task schedulers to exploit asymmetry in multi-core architectures for dense linear algebra operations
Costero, Luis; Igual, Francisco D.; Olcoz, Katzalin; Catalán, Sandra; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Elsevier (2017)Dealing with asymmetry in the architecture opens a plethora of questions related with the performance- and energy-efficient scheduling of task-parallel applications. While there exist early attempts to tackle this problem, ... -
Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors
Catalán, Sandra; Herrero, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael Elsevier (2018)We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct “asymmetric” multicore scenarios. The first one corresponds to an actual hardware-asymmetric ... -
Time and energy modeling of a high-performance multi-threaded Cholesky factorization
Catalán, Sandra; Igual, Francisco D.; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Springer (2016-02-05)We present accurate time and energy piece-wise models of high-performance multi-threaded implementations for the general matrix multiplication, triangular system solve with multiple right-hand sides, and symmetric rank-k ... -
Time and energy modeling of high–performance Level-3 BLAS on x86 architectures
Alonso-Jordá, Pedro; Catalán, Sandra; Igual, Francisco D.; Mayo, Rafael; Rodríguez Sánchez, Rafael; Quintana-Orti, Enrique S. Elsevier (2015-06)We present accurate piece-wise models for the time and energy costs of high performance implementations of both the matrix multiplication (gemm) and the triangular system solve with multiple right-hand sides (trsm) on x86 ... -
Two-sided orthogonal reductions to condensed forms on asymmetric multicore processors
Alonso-Jordá, Pedro; Catalán, Sandra; Herrero, José R.; Quintana-Orti, Enrique S.; Rodríguez Sánchez, Rafael Elsevier (2018)We investigate how to leverage the heterogeneous resources of an Asymmetric Multicore Processor (AMP) in order to deliver high performance in the reduction to condensed forms for the solution of dense eigenvalue and ...