Loading Events

« All Events

  • This event has passed.

Performance monitoring with TAU and E4S on LUMI and Cloud HPC platforms

Oct 29 09:00 Oct 30 17:00 CET

Overview

KTH, ParaTools, the University of Oregon, and ENCCS (EuroCC National Competence Centre Sweden) come together for the organization of this hybrid workshop. The event brings together code developers, researchers, and research software engineers working on high-performance computing (HPC). It provides an excellent opportunity for sharing innovative ideas and best practices for performance monitoring and software management on supercomputers and cloud platforms. This workshop includes:

  • Introduction to TAU, TAU Performance System and the E4S project.
  • Performance Evaluation on LUMI and Commercial Cloud Platforms.
  • Instrumentation of MPI and GPU (ROCm and CUDA) Applications.
  • Performance Data Management with TAUdb (TAU Database)
  • Analyzing performance data with ParaProf and PerfExplorer.
  • Containers and Spack Package Managers in HPC with E4S.
  • E4S (e4s-cl and e4s-alc) Tools on Commercial Cloud HPC Platforms.
  • AI tools with Jupyter Notebooks, Codium, PyTorch, JAX, NVIDIA NeMo and BioNeMo.

The theoretical optimization of complex code is somewhat limited, and great progress in scientific computing has been achieved through performance analysis tools. Indeed, advances in massively parallel systems like the LUMI supercomputer and cloud HPC platforms have further boosted our understanding of complex phenomena at unprecedented resolutions. To leverage these resources effectively, robust tools for performance monitoring, debugging, and software management are required. These tools enable the development of key applications for science, industry, and society.knowledge and enabling key applications for science, industry, and society.

Performance Monitoring with TAU and E4S

The TAU Performance System® is a state-of-the-art, robust suite of tools for performance measurement, analysis, and visualization in high-performance computing (HPC). It is highly portable and supports the full range of HPC systems, software, and applications. TAU provides profiling and tracing capabilities for performance analysis of parallel programs written in Fortran, C, C++, UPC, Java, and Python.

TAU (Tuning and Analysis Utilities) can instrument code at the level of functions, methods, basic blocks, or statements, or via event-based sampling, thereby capturing detailed performance data. It supports instrumentation inserted automatically (via the Program Database Toolkit), dynamically (for example with DyninstAPI), during runtime in the Java Virtual Machine, or manually via its API. Its visualization tool, ParaProf, presents performance data graphically in both aggregate and per node/context/thread form, allowing users to rapidly identify performance bottlenecks. TAU can also produce event traces compatible with Vampir, Paraver, or JumpShot trace viewers.

The E4S (Extreme-scale Scientific Software Stack) project is an open-source software ecosystem designed for developing, deploying, and running scientific and HPC/AI applications. It is a community effort to curate and maintain a collection of software packages, enabling both from-source builds and containerized deployments across a broad spectrum of platforms.

E4S provides container images, pre-installed binaries, and source builds of over a hundred HPC and AI software packages, including libraries, runtimes, tools, and frameworks, ensuring performance portability across CPU and GPU architectures. It leverages the Spack package manager as its core delivery mechanism, enabling reproducible, customized, and interoperable builds.

The E4S software distribution is rigorously tested on a wide range of platforms, from laptops and departmental clusters to leadership-class supercomputers, and supports container runtimes such as Docker, Singularity, Shifter, and CharlieCloud. Releases of E4S are designed to expand support for evolving hardware architectures (for example, new GPU platforms) and to incorporate widely used AI/ML frameworks and scientific applications.

What you will learn

After attending this workshop, you will:

• Understand core features of performance evaluation tools
• Understand performance evaluation tools and techniques for CPU and GPU platforms
• Understand how to optimize applications on AMD GPUs on LUMI and NVIDIA GPUs on Commercial Cloud Platforms
• Use a comprehensive software stack for HPC and AI applications

Prerequisites

  • PhD students, postdocs, industry engineers
  • Basic familiarity with general physics and plasma physic
  • Some previous practical experience running some plasma code
  • Basic familiarity with Unix shell
  • Basic familiarity with HPC environment is helpful but not mandatory

Format/venue

The event is going to be held at the Stockholm RISE offices on KTH campus Drottning Kristinas väg 61 in room Sundblassalen. The closest metro station (marked T) is Tekniska Högskolan. However, we will also broadcast the seminars and hands-on sessions live for online participants who will be able to participate actively in the workshop through live Q&A and chat. 

Agenda

Friday, 24 October 2025 (14.00-18.00): HPC onboarding session (Online)

  • ~2 hour online session to help participants log into the LUMI cluster
  • ~2 hour online session to help participants log into the Cloud HPC cluster

Tuesday, 28 October 2025 (10.00-12.00): HPC onboarding session (Online)

  • ~2 hour online Q&A session to clear unanswered HPC cluster questions.

Wednesday, 29 October 2025: Working with TAU Performance System®

Time (CET)Topic
08.30-09.00Reception
09.00-09.15Welcome, logistics and introductions
09.15-10.00Introduction to TAU (Tuning and Analysis Utilities)
10.00-10.30– TAU performance system setup on LUMI and Cloud HPC platforms
– Hands-on: Simple MPI Examples

10.30-11.00Coffee break
11.00-12.00– Instrumentation: MPI, GPU runtimes, tau_exec
– Hands-on: AWS examples using TAU and ParaProf
– Performance Application Programming Interface (PAPI)

12.00-14.00Lunch
14.00-15.30– Hands-on: TAU and GPU (ROCm and CUDA) Runtime examples
– Perfetto.dev trace visualiser

15.30-15.45Coffee break
15.45-16.45– TAUdb (TAU’s performance database)
– PerfExplorer (TAU’s perfomance data mining environment)

16.45-17.00-Q&A and concluding remarks

Thursday, 30 October 2025: Working with the E4S Project

Time (CET)Topic
08.30-09.00Reception
09.00-09.15Welcome, logistics and introductions
09.15-10.00Introduction to E4S (Exascale Scientific Software Stack)
10.00-10.30– E4S system setup on LUMI and Cloud HPC platforms
– Simple Docker and Singularity examples

10.30-11.00Coffee break
11.00-12.00– Containers in HPC
– Spack package manager
– Hands-on: MPI examples on LUMI and Cloud HPC platforms
– Introduction to E4S tools

12.00-14.00Lunch
14.00-15.30– AI tools
– Jupyter notebooks
– Codium
– Pytorch, JAX
– NVIDIA Nemo
– NVIDIA BioNemo

15.30-15.45Coffee break
15.45-16.45– E4S e4s-cl and e4s-alc tools on commercial cloud HPC platforms
– Hands-on: HPC and AI tools

16.45-17.00 Q&A and concluding remarks


Disclaimer

Due to EuroCC2 regulations, we cannot except generic or private email addresses. Please use your official university or company email address.

This training is for users that live and work in the European Union or a country associated with Horizon 2020. You can read more about the countries associated with Horizon2020 here https://ec.europa.eu/info/research-and-innovation/statistics/framework-programme-facts-and-figures/horizon-2020-country-profiles_e

Additional lesson materials

Do you want to learn more about MPI, GPU programming and other supercomputing-related fields? Take a look on our lessons page.

Drottning Kristinas väg 61
Stockholm, 114 28 Sweden
Free