Colonnelli, I., Casella, B., Mittone, G., Arfat, Y., Cantalupo, B., Esposito, R., … & Aldinucci, M. (2022, May). Federated Learning meets HPC and cloud. In ML4Astro International Conference (pp. 193-199). Cham: Springer International Publishing.

DOI: https://doi.org/10.1007/978-3-031-34167-0_39

Download

Abstract

HPC and AI are fated to meet for several reasons. This article will discuss some of them and argue why this will happen through the methods and technologies underpinning cloud computing. As a paradigmatic example, we present a new Federated Learning (FL) system that collaboratively trains a deep learning model in different supercomputing centers. The system is based on the StreamFlow workflow manager designed for hybrid cloud-HPC infrastructures.