Now showing items 1-5 of 5

    • closedAccess   A customized precision format based on mantissa segmentation for accelerating sparse linear algebra 

      Grützmacher, Thomas; Cojean, Terry; Flegar, Goran; Göbel, Fritz; Anzt, Hartwig Wiley (2019)
      In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the data in memory. We complement this idea with a customized precision memory ...
    • closedAccess   Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation 

      Grützmacher, Thomas; Cojean, Terry; Flegar, Goran; Anzt, Hartwig; Quintana-Orti, Enrique S. Association for Computing Machinery (ACM) (2020-03)
      We describe the application of a communication-reduction technique for the PageRank algorithm that dynamically adapts the precision of the data access to the numerical requirements of the algorithm as the iteration converges. ...
    • openAccess   Compressed basis GMRES on high-performance graphics processing units 

      Aliaga Estellés, José Ignacio; Anzt, Hartwig; Tomás Domínguez, Andrés Enrique; Quintana-Orti, Enrique S.; Grützmacher, Thomas Sage (2022-08-05)
      Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the performance of practical realizations of these methods is ...
    • openAccess   Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units 

      Aliaga Estellés, José Ignacio; Anzt, Hartwig; Grützmacher, Thomas; Quintana-Orti, Enrique S.; Tomás Domínguez, Andrés Enrique John Wiley and Sons (2021)
      We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and the ...
    • openAccess   Toward a modular precision ecosystem for high-performance computing 

      Anzt, Hartwig; Flegar, Goran; Grützmacher, Thomas; Quintana-Orti, Enrique S. Sage (2019-05)
      With the memory bandwidth of current computer architectures being significantly slower than the (floating point) arithmetic performance, many scientific computations only leverage a fraction of the computational power in ...