BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//ENCCS - ECPv6.15.16//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://enccs.se
X-WR-CALDESC:Events for ENCCS
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20240331T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20241027T010000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20250330T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20251026T010000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20260329T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20261025T010000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=Europe/Stockholm:20250121T090000
DTEND;TZID=Europe/Stockholm:20250123T120000
DTSTAMP:20260419T132714
CREATED:20241128T094841Z
LAST-MODIFIED:20241218T122942Z
UID:36407-1737450000-1737633600@enccs.se
SUMMARY:High-Performance Data Analytics with Python (Online)
DESCRIPTION:Register here\n\n\n\n\n\n\nAbout the course\n\n\n\nWelcome to the online workshop on High Performance Data Analytics in Python on Jan. 21-23 (2025). Python is a modern\, object-oriented\, and industry-standard programming language for working with data on all levels of the data analytics pipeline. A rich ecosystem of libraries ranging from generic numerical libraries to special-purpose and/or domain-specific packages has been developing using Python for data analysis and scientific computing. \n\n\n\n\nThis three half-day online workshop is meant to give an overview of working with research data in Python using general libraries for storing\, processing\, analyzing and sharing data. The focus is on improving performance. After covering tools for performant processing (netcdf\, numpy\, pandas\, scipy) on single workstations the focus shifts to parallel\, distributed and GPU computing (snakemake\, numba\, dask\, multiprocessing\, mpi4py). \n\n\n\n\nWho is this workshop for?\n\n\n\nHigh-Performance Data Analytics in Python is for all researchers and engineers who work with large or small datasets and who want to learn powerful tools and best practices for writing more performant\, parallelised\, robust\, and reproducible data analysis pipelines. This workshop is an interactive online event\, featuring live coding\, demos\, and practical exercises. We aim to equip you with the tools and knowledge to write efficient\, high-performance code using Python. \n\n\n\nPrerequisites\n\n\n\n\nBasic experience with Python\n\n\n\nBasic experience in working in a Linux-like terminal\n\n\n\nSome prior experience in working with large or small datasets\n\n\n\n\nKey takeaways\n\n\n\nAfter attending the workshop\, you should: \n\n\n\n\nHave a good overview of available tools and libraries for improving performance in Python\n\n\n\nKnow what libraries are available for efficiently storing\, reading and writing large data\n\n\n\nBe comfortable working with NumPy arrays and Pandas dataframes\n\n\n\nBe able to explain why Python code is often slow\n\n\n\nUnderstand the concept of vectorisation\n\n\n\nUnderstand the importance of measuring performance and profiling code before optimizing\n\n\n\nBe able to describe the difference between “embarrasing”\, shared-memory and distributed-memory parallelism\n\n\n\nKnow the basics of parallel workflows\, multiprocessing\, multithreading and MPI\n\n\n\nUnderstand pre-compilation and know basic usage of Numba and Cython\n\n\n\nHave a mental model of how Dask achieves parallelism\n\n\n\nRemember key hardware differences between CPUs and GPUs\n\n\n\nBe able to create simple GPU kernels with Numba\n\n\n\n\n\n\n\n	TimeContents\n\n\n\n\n	Day 1 (Jan. 21)09:00-09:10Welcome\n\n\n	09:10-09:20Motivation\n\n\n	09:20-10:00Scientific data\n\n\n	10:00-10:20Break\n\n\n	10:20-11:00Efficient array computing\n\n\n	11:00-11:20Break\n\n\n	11:20-12:00Efficient array computing\n\n\n	\n\n\n	Day 2 (Jan. 22)09:00-09:40Parallel computing\n\n\n	09:40-09:50Break\n\n\n	09:50-10:20Parallel computing\n\n\n	10:20-10:40Break\n\n\n	10:40-11:20Profiling and optimizing\n\n\n	11:20-11:30Break\n\n\n	11:30-12:00Profiling and optimizing\n\n\n	\n\n\n	Day 3 (Jan. 23)09:00-09:40Performance boosting\n\n\n	09:40-09:50Break\n\n\n	09:50-10:20Performance boosting\n\n\n	10:20-10:40Break\n\n\n	10:40-11:20Dask for scalable analytics\n\n\n	11:20-11:30Break\n\n\n	11:30-12:00Dask for scalable analytics\n\n\n\n\n\n\nMore events & contact\n\n\n\nCheck out more upcoming events from ENCCS and our European network at https://enccs.se/events. \n\n\n\nFor questions regarding this workshop or general questions about ENNCS training events\, please contact training@enccs.se \n\n\n\nSchedules can change!\n\n\n\nTo ensure that everyone has the opportunity to participate\, we kindly request that you let us know as soon as possible if you are unable to attend an event after registering. \n\n\n\nPlease send us an email at training@enccs.se to cancel your attendance. \n\n\n\nWe understand things can change\, but repeated cancellations without notice may unfortunately result in your name being removed from future event registration lists. \n\n\n\n\n\n\n\nRegulations\n\n\n\nDue to EuroCC2 regulations\, we CAN NOT ACCEPT generic or private email addresses. Please use your official university or company email address for registration. \n\n\n\nThis training is for users who live and work in the European Union or a country associated with Horizon 2020. You can read more about the countries associated with Horizon2020 HERE.
URL:https://enccs.se/events/hpda-python-jan-25/
CATEGORIES:ENCCS Event
ATTACH;FMTTYPE=image/webp:https://media.enccs.se/2024/11/python-hpda-25.webp
END:VEVENT
END:VCALENDAR