logo

JobNob

Your Career. Our Passion.

DevOps Engineer for Cloud and HPC (100%, m/f/x, up to TV-L 13, scientific employee) HLRS_05_2024


Höchstleistungsrechenzentrum Stuttgart


Location

Stuttgart | Germany


Job description

Stellenbeschreibung

The High-Performance Computing Centre Stuttgart (HLRS) was founded as Germany's first federal high-performance computing ( HPC ) centre. It operates one of the fastest supercomputers in the world. It offers various HPC solutions and services for universities, research institutions, and industry. Furthermore, HLRS is a worldwide leader in engineering and global system sciences. Staff scientists at HLRS investigate emerging technologies such as Artificial Intelligence ( AI ), Cloud Computing, and Quantum Computing ( QC ) towards realising hybrid workflows and lowering the hurdle for non-experts using HPC technologies. In this context, HLRS is significantly involved in international and national research projects across the abovementioned research areas.

Towards a EuroHPC CI/CD Pilot Platform

The European coordination and support action CASTIEL 2 works closely with the project EuroCC 2 to develop a European network of National Competence Centres (NCCs) for High-Performance Computing (HPC). By promoting collaboration and information sharing among NCCs, CASTIEL 2 is helping address differences in HPC expertise among the participating countries and identify synergies that could enhance the development of HPC competencies Europe-wide. CASTIEL 2 supports the EuroHPC JU’s Centres of Excellence in HPC (CoEs), such as HiDALGO2 and EXCELLERAT P2, concerning training and collaboration. The CoEs are also tasked with executing large-scale codes on European supercomputers to pave the way towards exascale.

For this purpose, the EuroHPC Joint Undertaking (JU) is putting supercomputing centres across Europe into operation, including installing LUMI, the fastest European HPC system. The supercomputers are complemented step-by-step with additional services and support to establish a European HPC ecosystem covering hardware and software aspects.

DevOps Meets HPC

The JU aims to make software developed within European-funded research projects available to a larger user community on all EuroHPC JU supercomputers. The concept of continuous integration and continuous deployment, short CI/CD, fosters this vision. The well-known concept CI/CD simplifies and automates software development, from implementing code over testing and quality checks to deploying the compiled software artefacts on selected target infrastructures. CASTIEL 2 coordinates this activity.

In this context, we are looking for a

DevOps Engineer for Cloud and HPC
(100%, m/f/x, up to TV-L 13, scientific employee)
HLRS_05_2024

to work with us on the above-mentioned objectives in EuroCC 2 and CASTIEL 2. The advertised position offers the possibility of a doctorate, which HLRS actively supports.

Your Role

The job involves designing and coordinating the implementation of a EuroHPC Continuous Integration/Continuous Deployment (CI/CD) Pilot Platform for EuroHPC JU supercomputers to automate software testing and deployment. The project focuses on standardising code deployment across EuroHPC JU supercomputers and addressing technical challenges using the latest technologies, such as GitLab Runner. The job offer includes coordinating stakeholders from all CoEs and EuroHPC Hosting Entities to enable automatic code deployment via CI/CD mechanisms.

Ø  Coordination and Collaboration. Collaborate with representatives from EuroHPC Hosting Entities (supercomputers) and Centres of Excellence to contribute to the definition of the overall CI/CD architecture and solution to be deployed for EuroHPC supercomputers. You will coordinate (virtual) meetings and initiate activities between stakeholders.

Ø  Continuous Integration and Delivery. Establish and manage a robust CI/CD platform for EuroHPC JU in close collaboration with other stakeholders. Your efforts in configuring and maintaining components of this platform will be critical. Tools under consideration are, amongst others, GitLab and GitLab Runner for the deployment of codes.

Ø  Application Workflows. To advance the convergence of HPC and AI, you will be tasked with applying workflow orchestrators such as Apache Airflow to author, schedule, and monitor hybrid workflows in data engineering and data pipeline automation. You will further work on bridging Cloud and HPC systems, enabling users to deploy AI workflows seamlessly.

Ø  Technical Reporting. Contribute to technical documentation and reports related to EuroCC 2 and CASTIEL 2's activities. It includes documenting your work, troubleshooting steps, and solutions implemented.

Anforderungsprofil & Qualifikationen

Your Qualifications

Ø  Bachelor’s or Master’s degree in a relevant technical field.

Ø  Proven experience in DevOps or similar technical roles, with a strong focus on containerisation and orchestration tools (Apptainer, Kubernetes) and CI/CD tools, including GitLab CI/CD, Jenkins, or GitHub CI.

Ø  Proficiency in using version control systems (e.g., Git) and automating software installations using Infrastructure as Code (IaC) tools, such as Ansible or Terraform.

Ø  Very good Linux and programming skills in at least one high-level programming language, such as Python.

Ø  Excellent technical communication skills, both written and verbal, for collaborating with internal and external stakeholders.

Ø  You are fluent in English, both written and spoken.

Ideally, your profile is completed as follows

Ø  Familiarity with HPC and Cloud Computing environments.

Ø  Proficiency in agile software development methodologies.

Ø  Demonstrated experience monitoring and logging tools such as Prometheus or Grafana.

Ø  A strong aptitude for conceptual thinking and a solution-oriented approach to problem-solving.

As a member of our team, you can expect

Ø  A professional working environment in a highly motivated international team.

Ø  Exciting insights into the latest and best technologies in the fields of simulation, big data, artificial intelligence, and quantum computing.

Ø  A very good working atmosphere in an interdisciplinary team of top scientists and project partners.

Ø  Flexible working hours, including trust-based working hours or a flextime model.

Ø  The possibility of arrangements for working independently of location (e.g., home office) (dt. ortsunabhängiges Arbeiten).

Ø  Contract and remuneration according to the collective agreement of the federal states (TV-L).

Ø  Attractive social benefits of the public sector.

Ø  Allowance of €25 per month for local public transport.

Ø  Use of the wide range of further education and training opportunities (e.g., soft skills, languages, specialist courses, management seminars) and the sports facilities of the University of Stuttgart (on-site and virtual).

Additional information

This is a temporary position offered for scientific employees following the legal regulations. Employment in this position is limited to the project's duration, scheduled to run until 31.12.2025. The salary for this position is based on your personal qualifications up to the level of TV-L 13.
When possible, HLRS is committed to supporting and retaining talent even after projects. Before the project ends, HLRS will examine the possibilities for extending your contract based on available funding.
You will be working at HLRS within the Converged Computing (C2) department headed by Dennis Hoppe. If you have any questions about this job offer, please e-mail [email protected].

Are you interested?

Then we look forward to receiving your application! Your application should include a cover letter, your CV, and relevant references.

Please send your application by March, 31st 2024, via e-mail (as one PDF file) with the subject " HLRS_05_2024" to [email protected] .

The University of Stuttgart supports efforts to increase the proportion of women in scientific fields and is therefore particularly interested in applications from women. Full-time positions are generally divisible. Severely disabled persons are given priority in the case of equal suitability. The recruitment of scientific/non-scientific staff is carried out by the Central Administration (Rector's Office).
Information on the handling of applicant data under Art. 13 DS-GVO can be found at:  


Job tags

VollzeitHomeoffice


Salary

All rights reserved