The High-Performance Computing Centre Stuttgart (HLRS) was founded as Germany’s first federal high-performance computing (HPC) centre. It operates one of the fastest supercomputers in the world and offers a range of HPC solutions and services for universities, research institutions, and industry. HLRS is also a worldwide leader in engineering and global system sciences. Staff scientists at HLRS investigate emerging technologies such as Artificial Intelligence (AI), Cloud Computing, and Quantum Computing (QC), with the aim of realising hybrid workflows and lowering the barrier for non-experts to use HPC technologies. In this context, HLRS plays a significant role in national and international research projects across these research areas.

Federation of High-Performance Computing (HPC)

InHPC-DE is a project aimed at further integrating the three German national HPC centres that comprise the Gauss Centre for Supercomputing (GCS). The project’s overarching goal is to create user-centric services for supercomputing that enable consistent technical and organisational use of HPC and prepare the production systems for future usage. A key aspect of this project’s predecessor was evaluating and adopting Gaia-X, a European initiative for a federated and secure data infrastructure, in an HPC context. As a result, dedicated Cloud infrastructures distributed across the three HPC centres were put into operation to explore the potential of Gaia-X for high-performance computing. This work includes developing new operating concepts for GCS systems and obtaining feedback from test users.

Integration of Cloud Technologies into HPC

A particular focus is on integrating cloud technologies into the HPC landscape to meet the growing demands of the AI community and other scientific disciplines. The increasing complexity of AI models, hybrid workflows, and new application areas such as quantum computing and edge computing requires a flexible and scalable infrastructure. By integrating cloud technologies into HPC, we want to:

  • Increase flexibility and scalability: dynamically adapt resources to meet demands.
  • Improve accessibility: offer HPC services to a wider range of users.
  • Foster innovation: develop new operating models and services that accompany the uptake of AI workloads on HPC systems.

In this context, we are looking for a

Cloud Engineer
(100 %, m/f/x, up to TV-L 13, scientific employee)

to work with us on these topics in the context of the project InHPC-DE.

Your Tasks

  • Integration of Cloud technologies. Implement and evaluate Kubernetes, Infrastructure-as-Code (IaC), and container registries in HPC environments.
  • Developing new operating models. Investigate and implement interactive access models, on-demand resource allocation, and flexible scheduling mechanisms.
  • Support for the AI community. Adapt the HPC environment to the specific requirements of AI researchers and developers.
  • Collaboration and user support. Close cooperation with other project teams and external partners, e. g., to support third parties with onboarding on the provided Cloud infrastructure.
  • Maintenance and system administration. Ensuring high availability, reliability, and performance of the cloud-based HPC systems. This might include setting up a monitoring framework for the underlying infrastructure.
  • Technical Reporting. Contribute to technical documentation and reports related to project activities. This includes documenting your work and troubleshooting steps and reporting on real-world solutions.

Required Skills

  • Bachelor’s or master’s degree in a relevant technical field.
  • Proven experience in DevOps or similar technical roles, with a strong focus on containerisation and orchestration tools (Apptainer, Podman, Kubernetes) and CI/CD tools, including GitLab CI/CD, Jenkins, or GitHub CI.
  • Proficiency in using version control systems (e. g., Git) and automating software installations using Infrastructure-as-Code (IaC) tools, such as Ansible.
  • Very good Linux and programming skills in at least one high-level programming language, such as Python.
  • Excellent technical communication skills, both written and verbal, for collaborating with internal and external stakeholders.
  • You are fluent in English, both written and spoken.

Desired Skills

  • Experience with HPC environments. Knowledge of batch scheduling, resource management, and parallelisation.
  • Security awareness: Knowledge of container security and multi-tenancy concepts.
  • Experience with AI workloads. Knowledge of machine learning frameworks and deep learning.
  • Understanding of Gaia-X. Experience with the Sovereign Cloud Stack (SCS).

What We Offer

  • A professional working environment in a highly motivated international team.
  • Exciting insights into the latest and best simulation, artificial intelligence, and quantum computing technologies.
  • A very good working atmosphere in an interdisciplinary team of top scientists and project partners.
  • Flexible working hours, including trust-based working hours or a flextime model.
  • The possibility of location-independent working arrangements (e. g., home office; German: ortsunabhängiges Arbeiten).
  • Contract and remuneration according to the collective agreement of the federal states (TV-L).
  • Attractive social benefits of the public sector.
  • Allowance of €25 per month for local public transport.
  • Access to a wide range of further education and training opportunities (e. g., soft skills, languages, specialist courses, management seminars) and to the University of Stuttgart sports facilities (on-site and virtual).

Additional information

This is a temporary position offered for scientific employees following the legal regulations. Employment is limited to the project’s duration, scheduled to run until 31.12.2026. The salary for this position is based on your personal qualifications up to the level of TV-L 13.