Browsing ICC_Articles by Keyword "fault tolerance"
Fine-grained bit-flip protection for relaxation methods
Elsevier (2019-09) Resilience is considered a challenging, under-addressed issue that the high-performance computing (HPC) community will have to face in order to produce reliable Exascale systems by the beginning of the next decade. As part ...
Reliability Evaluation of LU Decomposition on GPU-Accelerated System-on-Chip Under Proton Irradiation
IEEE (2022-03-22) Graphics processing units (GPUs) have become a basic accelerator both in high-performance nodes and in low-power systems-on-chip (SoCs). They provide massive data parallelism and very high performance per watt. However, their ...