
Job Description

"Research for a life without cancer" is our mission at the German Cancer Research Center. We investigate how cancer develops, identify cancer risk factors and look for new cancer prevention strategies. We develop new methods with which tumors can be diagnosed more precisely and cancer patients can be treated more successfully. Every contribution counts – whether in research, administration or infrastructure. This is what makes our daily work so meaningful and exciting.

The German Cancer Research Center is seeking a Data Steward / Bioinformatician for the recently launched infrastructure initiative German Human Genome-Phenome Archive (GHGA).

Reference number: 2024-0231

GHGA (www.ghga.de) is part of the national program for research data infrastructures (NFDI). As a node of the federated European Genome-Phenome Archive, GHGA will contribute towards the establishment of an international infrastructure for human genome data. These activities are closely coordinated with leading international initiatives and networks, such as the European Genomics Data Infrastructure (GDI) and the Global Alliance for Genomics and Health (GA4GH). This infrastructure will support genomics data hubs with software tools for secure data / metadata storage, interactive data portals with data visualization, and streamlined data deposition and acquisition solutions. GHGA is also part of genom.DE and the upcoming German national genome sequencing model project ("Modellvorhaben Genomsequenzierung"). The GHGA main office, which is coordinated by Prof. Oliver Stegle (Division of Computational Genomics and Systems Genetics), is located at DKFZ and works in close coordination with the European Molecular Biology Laboratory (EMBL, Prof. Jan Korbel).


We are looking for a Data Steward / Data Manager with a bioinformatics background to operate the data management system within GHGA and the new genome sequencing model project. In this role, you will be responsible for moderating day-to-day activities on the data portal, optimizing and extending existing operational protocols, and guiding product development by incorporating user feedback into new versions of the GHGA toolset. The successful candidate will be part of an interdisciplinary research and data management team that develops and applies a diverse range of state-of-the-art methods to implement and maintain bioinformatics workflows in a cloud computing environment. The role involves close cooperation with other partners in the GHGA network, as well as with international networks and initiatives.

Your tasks:

  • Identification of suitable datasets and analysis of their potential for data sharing
  • Quality control of datasets and preparation of data and metadata for submission into EGA or GHGA
  • Development and implementation of SOPs for data management of human omics data as part of the GHGA infrastructure
  • Definition of strategy and processes for data ingest, metadata validation and data quality control
  • Handling of data access requests and managing the necessary administrative and legal processes
  • Close monitoring of developments in secure data handling and sharing, and interaction with international initiatives such as the Global Alliance for Genomics and Health (GA4GH)

We offer the opportunity to help shape one of the most important emerging scientific data infrastructures for the storage and exchange of omics data in Germany.

Your opportunities:

  • Get exposed to a plethora of fields ranging from AI-guided clinical decision-making to modern web development technologies
  • Work with an interdisciplinary team of experts bringing state-of-the-art biomedical research closer to clinics and patients
  • Take part in international organizations such as the Global Alliance for Genomics and Health (GA4GH), the European Genome-Phenome Archive (EGA) and ELIXIR, shaping the future of genomics across borders
  • Widen your expertise through extensive opportunities for training programs, seminars and conferences
  • Enjoy a flexible work environment with the opportunity to work from home part-time, balancing office presence and remote work

Your profile:

  • Master's degree or equivalent qualification in bioinformatics, computational biology, biological science, computer science, physics, mathematics, engineering, or other fields, possibly with a PhD degree
  • Demonstrated experience and expertise in the development of data management / data curation concepts is expected
  • Expertise in NGS, (human) omics data, data security and data privacy is beneficial, as is experience in communicating results and ideas to colleagues and (inter)national collaboration partners
  • Proficiency with UNIX-based systems and relevant programming languages such as Python or R is a prerequisite
  • Experience with the development and implementation of structured metadata using modeling languages such as LinkML and JSON Schema is beneficial
  • Expertise in software development and cloud computing is beneficial
  • Excellent collaboration and communication skills in English

The ideal applicant should have demonstrated the ability to work independently and creatively. The candidate should have excellent communication skills and be able to clearly articulate technical needs, set clear goals and work in an interdisciplinary setting, communicating with other partners.


We offer:

  • Excellent framework conditions: state-of-the-art equipment and opportunities for international networking at the highest level
  • 30 days of vacation per year
  • Flexible working hours
  • Remuneration according to TV-L incl. company pension scheme and capital-forming payments
  • Possibility of mobile work and part-time work
  • Family-friendly working environment
  • Sustainable travel to work: subsidized Germany job ticket
  • Unleash your full potential: targeted offers for your personal development promote your talents
  • Our Corporate Health Management Program offers a holistic approach to your well-being