Using machine learning to model the training scalability of convolutional neural networks on clusters of GPUs
Other documents by the author(s): Barrachina Mir, Sergio; Castelló, Adrián; Catalán Carbó, Mar; Dolz, Manuel F.; Mestre Miravet, Jose Ignacio
Title
Using machine learning to model the training scalability of convolutional neural networks on clusters of GPUs
Publication date
2021-08-30
Publisher
Springer
Bibliographic citation
Barrachina, S., Castelló, A., Catalán, M. et al. Using machine learning to model the training scalability of convolutional neural networks on clusters of GPUs. Computing 105, 915–934 (2023). https://doi.org/10.1007/s00607-021-00997-9
Document type
info:eu-repo/semantics/article
Version
info:eu-repo/semantics/publishedVersion
Abstract
In this work, we build a general piece-wise model to analyze data-parallel (DP) training costs of convolutional neural networks (CNNs) on clusters of GPUs. This general model is based on: (i) multi-layer perceptrons (MLPs) that model the NVIDIA cuDNN/cuBLAS library kernels involved in the training of several state-of-the-art CNNs; and (ii) an analytical model of the NVIDIA NCCL Allreduce collective primitive using the Ring algorithm. The CNN training scalability study performed with this model, in combination with the Roofline technique, across varying batch sizes, node (floating-point) arithmetic performance, node memory bandwidth, network link bandwidth, and cluster dimension, unveils crucial bottlenecks at both the GPU and cluster levels. To support this analysis, we validate the accuracy of the proposed model against a Python library for distributed deep learning training.
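The two analytical ingredients mentioned in the abstract are classical and can be sketched briefly. Below is a minimal, illustrative Python sketch of (a) the standard alpha-beta cost formula for a Ring Allreduce (the algorithm NCCL implements, though the paper's exact parameterization may differ) and (b) the Roofline bound on attainable performance. All parameter names and values here are illustrative assumptions, not figures from the paper.

```python
def ring_allreduce_time(n_bytes: float, p: int, alpha: float, beta: float) -> float:
    """Estimate Allreduce time with the Ring algorithm on p nodes.

    The Ring algorithm runs 2*(p-1) steps (a reduce-scatter followed by an
    allgather), each moving n_bytes/p per node. alpha is the per-message
    latency in seconds; beta is the inverse link bandwidth in seconds/byte.
    """
    if p == 1:
        return 0.0  # no communication needed on a single node
    return 2 * (p - 1) * (alpha + (n_bytes / p) * beta)


def roofline_gflops(arith_intensity: float, peak_gflops: float, mem_bw_gb_s: float) -> float:
    """Attainable performance (GFLOPS) under the Roofline model.

    Performance is capped either by peak arithmetic throughput or by
    memory bandwidth times arithmetic intensity (FLOPs per byte).
    """
    return min(peak_gflops, arith_intensity * mem_bw_gb_s)


# Illustrative use: 100 MB of gradients reduced across 8 GPUs over a
# 10 GB/s link with 5 microseconds of per-message latency.
t = ring_allreduce_time(100e6, p=8, alpha=5e-6, beta=1 / 10e9)
```

Coupling these two models is what enables the kind of scalability study the abstract describes: the Roofline bound caps per-GPU compute time, while the Allreduce formula grows with cluster size, exposing where communication becomes the bottleneck.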
Published in
Computing 105, 915–934 (2023)
Funding entity
CRUE-CSIC agreement with Springer Nature | Ministerio de Ciencia, Innovación y Universidades (Spain) | Generalitat Valenciana
Project or grant code
TIN2017-82972-R | Prometeo/2019/109 | Plan GenT project CDEIGENT/2018/014
Project or grant title
Open Access funding provided
Access rights
© The Author(s) 2021
info:eu-repo/semantics/openAccess
Appears in collections
- ICC_Articles [427]