Supervised training of large networks requires large labeled datasets, which in turn demand high computational costs. While active practitioners in deep learning primarily develop and train their networks on local computing devices, with the increase of networks complexity, there is an urgent need to create, train, and test models on clusters.
In this workshop, we overview the basics of Docker and Singularity. (Working knowledge of Singularity as given in the Uppmax workshop on Singularity is desirable.) Distributed training using TensorFlow and Horovod frameworks on a supercomputer will be covered. Moreover, it will be shown how to use Singularity containers in conjunction with TensorFlow and Horovod to upscale an AI app.
The workshop will be entirely online using zoom.
Basic knowledge of UNIX OS and familiarity with NNs are required.
To be announced soon,
The event is fully booked.
For questions regarding this event please contact us at training@enccs.se.
This training is intended for users established in the European Union or a country associated with Horizon 2020. You can read more about the countries associated with Horizon2020 here https://ec.europa.eu/info/research-and-innovation/statistics/framework-programme-facts-and-figures/horizon-2020-country-profiles_en