This event has passed.

High Performance Data Analytics in Python

Name: High Performance Data Analytics in Python
Start: 2023-09-05T09:00:00+02:00
End: 2023-09-07T12:00:00+02:00
Location: Online

5 Sep 2023 @ 09:00 – 7 Sep 2023 @ 12:00 CEST

Python is an industry-standard programming language for working with data on all levels of the data analytics pipeline, thanks to the rich ecosystem of libraries ranging from generic numerical libraries to special-purpose and/or domain-specific packages which are often supported by large developer communities and stable funding sources.

This online workshop is meant to give an overview of working with research data in Python using general libraries for storing, processing, analysing and sharing data. The focus is on improving performance. After covering tools for performant processing (netcdf, numpy, pandas, scipy) on single workstations the focus shifts to parallel, distributed and GPU computing (snakemake, numba, dask, multiprocessing, mpi4py).

Prerequisites

Basic experience with Python
Basic experience in working in a Linux-like terminal
Some prior experience in working with large or small datasets

Preliminary Agenda

To be updated soon.

Registration

Registrations are now closed for this event.

—-

Disclaimer

This training is intended for users established in the European Union or a country associated with Horizon 2020. You can read more about the countries associated with Horizon2020 here https://ec.europa.eu/info/research-and-innovation/statistics/framework-programme-facts-and-figures/horizon-2020-country-profiles_e

ENCCS

Online

Related Events

Event Navigation

This project has received funding from the European High-Performance Computing Joint Undertaking (JU) under grant agreement No 951732. The JU receives support from the European Union’s Horizon 2020 research and innovation programme and its associated countries .