Browsing ICC_Articles by Author "c34f8dc8-ea45-4ffc-ba5a-66c7a855eab1"

A customized precision format based on mantissa segmentation for accelerating sparse linear algebra

Grützmacher, Thomas; Cojean, Terry; Flegar, Goran; Göbel, Fritz; Anzt, Hartwig. Wiley (2019)

In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the data in memory. We complement this idea with a customized precision memory ...

Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation

Grützmacher, Thomas; Cojean, Terry; Flegar, Goran; Anzt, Hartwig; Quintana-Orti, Enrique S. Association for Computing Machinery (ACM) (2020-03)

We describe the application of a communication-reduction technique for the PageRank algorithm that dynamically adapts the precision of the data access to the numerical requirements of the algorithm as the iteration converges. ...

Compressed basis GMRES on high-performance graphics processing units

Aliaga Estellés, José Ignacio; Anzt, Hartwig; Tomás Domínguez, Andrés Enrique; Quintana-Orti, Enrique S.; Grützmacher, Thomas. Sage (2022-08-05)

Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the performance of practical realizations of these methods is ...

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units

Aliaga Estellés, José Ignacio; Anzt, Hartwig; Grützmacher, Thomas; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique. John Wiley and Sons (2021)

We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and the ...

Toward a modular precision ecosystem for high-performance computing

Anzt, Hartwig; Flegar, Goran; Grützmacher, Thomas; Quintana-Orti, Enrique S. Sage (2019-05)

With the memory bandwidth of current computer architectures being significantly slower than the (floating point) arithmetic performance, many scientific computations only leverage a fraction of the computational power in ...
