Job Description
The High-Performance Computing Centre Stuttgart (HLRS) was founded as Germany's first federal high-performance computing (HPC) centre. It operates one of the fastest supercomputers in the world. It offers various HPC solutions and services for universities, research institutions, and industry. Furthermore, HLRS is a worldwide leader in engineering and global system sciences. Staff scientists at HLRS investigate emerging technologies such as Artificial Intelligence (AI), Cloud Computing, and Quantum Computing (QC) towards realising hybrid workflows and lowering the hurdle for non-experts using HPC technologies. In this context, HLRS is significantly involved in international and national research projects across the abovementioned research areas.
HammerHAI - The AI Factory for Industry and Science
The HammerHAI project offers to establish an AI Factory at the High-Performance Computing Center Stuttgart (HLRS), supported by a strong consortium from Germany, to successfully meet the growing demand for artificial intelligence (AI) infrastructure across Europe. HammerHAI will be a one-stop shop for many AI users, focusing primarily on start-ups, small and medium-sized enterprises (SMEs), and large industrial companies, as well as supporting academic institutions and the public sector. It will offer tailored services and infrastructure to accelerate AI innovation and help develop a competitive AI ecosystem in Europe. The AI Factory will be located in a region that is one of Europe-s powerhouses in manufacturing and engineering innovation, and it will be integrated into an ecosystem that promotes talent building that will be the basis of an ongoing digital transition.
The AI Factory HammerHAI will provide secure, scalable, and AI-optimised supercomputing resources to meet the needs of start-ups, SMEs, industry, and research institutions. Its infrastructure will enable users to easily migrate their AI applications from laptops or cloud environments to supercomputers, providing the computing power needed to develop large-scale AI models. Hereby, the AI Factory will support the entire AI lifecycle, from data preparation to model training, deployment, monitoring, and retraining, and will provide a comprehensive package of services to ensure efficient and effective AI development and operation.
Shaping the Future of AI in HPC
We seek a highly motivated AI Application Support Engineer to assist users in effectively deploying, managing, and optimizing AI workloads within the AI Factory HammerHAI at HLRS. This role will provide hands-on support to AI users, troubleshoot issues, and ensure best practices for AI development and deployment in high-performance computing environments. The successful candidate will work closely with AI researchers, software developers, and system administrators to ensure a seamless AI application experience. This position requires expertise in AI and, ideally, in high-performance computing systems. In this context, we are looking for a
AI/ML Application Support Engineer
(m/f/d, up to TV-L 13, 100 %)
HLRS_10_2025
to work on HammerHAI with a strong team of AI experts in close collaboration with end users, system administrators and external stakeholders.
This is a temporary position. Employment is limited in accordance with the legal regulations to the project's duration, which is currently scheduled to run for 3 years. The salary for this position is based on your personal qualifications up to the level of TV-L 13.
- Provide one-on-one user support for external users regarding AI and hybrid HPC-AI workflows (simulations).
- Collect and analyse user requirements to tailor AI software architectures and stacks.
- Collaborate with industry partners and researchers to develop next-generation AI solutions leveraging AI-optimised supercomputing resources.
- Benchmark, analyse and optimise AI workflows on HPC systems using profiling tools.
- Contribute to user-oriented documentation and scientific publications (e.g., conferences, white papers, journals).
- A Master-s or PhD in Computer Science, Artificial Intelligence, Computational Sciences, Engineering, or a related field.
- Strong expertise in AI, Machine Learning, and Deep Learning.
- Experience in software development with AI/ML libraries (e.g., PyTorch, JAX, Horovod).
- Understanding of CPU, GPU, parallel computing architectures, and collective communications.
- In-depth knowledge of Linux OS, version control, and issue tracking.
- Familiarity with containerisation (e.g., Singularity, Apptainer) and cloud-based HPC approaches is a plus.
- Excellent technical communication skills, both written and verbal, for collaborating with internal and external stakeholders.
- Ability to work in a collaborative, interdisciplinary environment with technical and non-technical stakeholders.
- Problem-solving mindset with the ability to address and resolve issues effectively.
- You are fluent in English, both written and spoken.
- A professional working environment in a friendly, highly motivated and collaborative international team.
- Flexible working hours with a flexitime model and the possibility of compensating for time off in addition to the regular 30 days of vacation.
- Flexible work hours with currently up to 60% home office (upon request).
- Attractive social benefits of the public service.
- Subsidy of € 25 per month for public transport and the possibility of job bike leasing.
- Use the wide range of further education and training opportunities (e.g., soft skills, languages, specialist courses, leadership seminars) and the sports offers of the University of Stuttgart.
- Fixed-term employment with salary and working conditions up to TV-L13.
The University of Stuttgart invites women to apply for this job opening to strengthen the presence of female workers in the scientific areas. Full-time positions may generally be turned into part-time positions. Disabled people will have priority as long as they are equally qualified. The central administration of the University of Stuttgart will handle the recruitment process.
View More