Oct 29 • 09:00 – Oct 30 • 17:00 CET

KTH, ParaTools, the University of Oregon, and ENCCS (EuroCC National Competence Centre Sweden) come together for the organization of this hybrid workshop. The event brings together code developers, researchers, and research software engineers working on high-performance computing (HPC). It provides an excellent opportunity for sharing innovative ideas and best practices for performance monitoring and software management on supercomputers and cloud platforms. This workshop includes:
The theoretical optimization of complex code is somewhat limited, and great progress in scientific computing has been achieved through performance analysis tools. Indeed, advances in massively parallel systems like the LUMI supercomputer and cloud HPC platforms have further boosted our understanding of complex phenomena at unprecedented resolutions. To leverage these resources effectively, robust tools for performance monitoring, debugging, and software management are required. These tools enable the development of key applications for science, industry, and society.knowledge and enabling key applications for science, industry, and society.
The TAU Performance System® is a state-of-the-art, robust suite of tools for performance measurement, analysis, and visualization in high-performance computing (HPC). It is highly portable and supports the full range of HPC systems, software, and applications. TAU provides profiling and tracing capabilities for performance analysis of parallel programs written in Fortran, C, C++, UPC, Java, and Python.
TAU (Tuning and Analysis Utilities) can instrument code at the level of functions, methods, basic blocks, or statements, or via event-based sampling, thereby capturing detailed performance data. It supports instrumentation inserted automatically (via the Program Database Toolkit), dynamically (for example with DyninstAPI), during runtime in the Java Virtual Machine, or manually via its API. Its visualization tool, ParaProf, presents performance data graphically in both aggregate and per node/context/thread form, allowing users to rapidly identify performance bottlenecks. TAU can also produce event traces compatible with Vampir, Paraver, or JumpShot trace viewers.
The E4S (Extreme-scale Scientific Software Stack) project is an open-source software ecosystem designed for developing, deploying, and running scientific and HPC/AI applications. It is a community effort to curate and maintain a collection of software packages, enabling both from-source builds and containerized deployments across a broad spectrum of platforms.
E4S provides container images, pre-installed binaries, and source builds of over a hundred HPC and AI software packages, including libraries, runtimes, tools, and frameworks, ensuring performance portability across CPU and GPU architectures. It leverages the Spack package manager as its core delivery mechanism, enabling reproducible, customized, and interoperable builds.
The E4S software distribution is rigorously tested on a wide range of platforms, from laptops and departmental clusters to leadership-class supercomputers, and supports container runtimes such as Docker, Singularity, Shifter, and CharlieCloud. Releases of E4S are designed to expand support for evolving hardware architectures (for example, new GPU platforms) and to incorporate widely used AI/ML frameworks and scientific applications.
After attending this workshop, you will:
• Understand core features of performance evaluation tools
• Understand performance evaluation tools and techniques for CPU and GPU platforms
• Understand how to optimize applications on AMD GPUs on LUMI and NVIDIA GPUs on Commercial Cloud Platforms
• Use a comprehensive software stack for HPC and AI applications
The event is going to be held at the Stockholm RISE offices on KTH campus Drottning Kristinas väg 61 in room Sundblassalen. The closest metro station (marked T) is Tekniska Högskolan. However, we will also broadcast the seminars and hands-on sessions live for online participants who will be able to participate actively in the workshop through live Q&A and chat.
Friday, 24 October 2025 (14.00-18.00): HPC onboarding session (Online)
Tuesday, 28 October 2025 (10.00-12.00): HPC onboarding session (Online)
Wednesday, 29 October 2025: Working with TAU Performance System®
| Time (CET) | Topic |
|---|---|
| 08.30-09.00 | Reception |
| 09.00-09.15 | Welcome, logistics and introductions |
| 09.15-10.00 | Introduction to TAU (Tuning and Analysis Utilities) |
| 10.00-10.30 | – TAU performance system setup on LUMI and Cloud HPC platforms – Hands-on: Simple MPI Examples |
| 10.30-11.00 | Coffee break |
| 11.00-12.00 | – Instrumentation: MPI, GPU runtimes, tau_exec – Hands-on: AWS examples using TAU and ParaProf – Performance Application Programming Interface (PAPI) |
| 12.00-14.00 | Lunch |
| 14.00-15.30 | – Hands-on: TAU and GPU (ROCm and CUDA) Runtime examples – Perfetto.dev trace visualiser |
| 15.30-15.45 | Coffee break |
| 15.45-16.45 | – TAUdb (TAU’s performance database) – PerfExplorer (TAU’s perfomance data mining environment) |
| 16.45-17.00 | -Q&A and concluding remarks |
Thursday, 30 October 2025: Working with the E4S Project
| Time (CET) | Topic |
|---|---|
| 08.30-09.00 | Reception |
| 09.00-09.15 | Welcome, logistics and introductions |
| 09.15-10.00 | Introduction to E4S (Exascale Scientific Software Stack) |
| 10.00-10.30 | – E4S system setup on LUMI and Cloud HPC platforms – Simple Docker and Singularity examples |
| 10.30-11.00 | Coffee break |
| 11.00-12.00 | – Containers in HPC – Spack package manager – Hands-on: MPI examples on LUMI and Cloud HPC platforms – Introduction to E4S tools |
| 12.00-14.00 | Lunch |
| 14.00-15.30 | – AI tools – Jupyter notebooks – Codium – Pytorch, JAX – NVIDIA Nemo – NVIDIA BioNemo |
| 15.30-15.45 | Coffee break |
| 15.45-16.45 | – E4S e4s-cl and e4s-alc tools on commercial cloud HPC platforms – Hands-on: HPC and AI tools |
| 16.45-17.00 | Q&A and concluding remarks |
Due to EuroCC2 regulations, we cannot except generic or private email addresses. Please use your official university or company email address.
This training is for users that live and work in the European Union or a country associated with Horizon 2020. You can read more about the countries associated with Horizon2020 here https://ec.europa.eu/info/research-and-innovation/statistics/framework-programme-facts-and-figures/horizon-2020-country-profiles_e
Do you want to learn more about MPI, GPU programming and other supercomputing-related fields? Take a look on our lessons page.