Analyzing the impact of the MPI allreduce in distributed training of convolutional neural networks
Author(s)
Castelló, Adrián; Catalán Carbó, Mar; Dolz, Manuel F.; Quintana-Orti, Enrique S.; Duato, José
Date
2022-01-10
Publisher
Springer
Bibliographic citation
Castelló, A., Catalán, M., Dolz, M.F. et al. Analyzing the impact of the MPI allreduce in distributed training of convolutional neural networks. Computing, 105, 1101–1119 (2023). https://doi.org/10.1007/s00607-021-01029-2
Type
info:eu-repo/semantics/article
Version
info:eu-repo/semantics/publishedVersion
Abstract
For many distributed applications, data communication poses an important bottleneck
from the points of view of performance and energy consumption. As more cores
are integrated per node, in general the global performance of the system increases
yet eventually becomes limited by the interconnection network. This is the case for
distributed data-parallel training of convolutional neural networks (CNNs), which
usually proceeds on a cluster with a small to moderate number of nodes. In this paper,
we analyze the performance of the Allreduce collective communication primitive, a
key to the efficient data-parallel distributed training of CNNs. Our study targets the
distinct realizations of this primitive in three high performance instances of Message
Passing Interface (MPI), namely MPICH, OpenMPI, and IntelMPI, and employs a
cluster equipped with state-of-the-art processor and network technologies. In addition,
we apply the insights gained from the experimental analysis to the optimization of the
TensorFlow framework when running on top of Horovod. Our study reveals that a
careful selection of the most convenient MPI library and Allreduce (ARD) realization
accelerates the training throughput by a factor of 1.2× compared with the default
algorithm in the same MPI library, and up to 2.8× when comparing distinct MPI
libraries in a number of relevant combinations of CNN model+dataset.
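To illustrate the role the abstract assigns to Allreduce in data-parallel training, the sketch below simulates its semantics in plain Python, without any real MPI library: each simulated rank contributes a local gradient buffer, and after the reduction every rank holds the identical global sum, which it averages before updating its model replica. The function name `allreduce_sum` and the toy gradient values are purely illustrative and not taken from the paper.

```python
# Illustrative sketch (not real MPI): the semantics of Allreduce with a SUM
# operation, as used in data-parallel CNN training. Each "rank" holds a local
# gradient buffer; after the collective, every rank holds the same global sum.

def allreduce_sum(local_buffers):
    """Simulate Allreduce(SUM) over a list of equally sized per-rank buffers."""
    n = len(local_buffers[0])
    # Element-wise sum across all ranks' buffers.
    total = [sum(buf[i] for buf in local_buffers) for i in range(n)]
    # Every rank receives its own copy of the same reduced result.
    return [total[:] for _ in local_buffers]

# Four simulated ranks, each with a local gradient vector.
local_grads = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
reduced = allreduce_sum(local_grads)
# Each rank divides by the number of workers to obtain the averaged gradient.
avg_grads = [[g / len(local_grads) for g in buf] for buf in reduced]
```

In a real MPI program this step is a single `MPI_Allreduce` call, and the paper's point is that its internal algorithm (ring, recursive doubling, etc.) and the MPI library chosen can change end-to-end training throughput substantially.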
Is part of
Computing (2023)
Funder Name
Ministerio de Ciencia, Innovación y Universidades (Spain) | Generalitat Valenciana
Project code
TIN2017-82972-R | Prometeo/2019/109 | CDEIGENT/2018/014 | FJC2019-039222-I
Rights
© The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2021
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
This item appears in the following collection(s)
- ICC_Articles [419]