Site Reliability Engineer (f/m/d)

1&1 Mail & Media

Jobbeschreibung

With our strong brands GMX, WEB.DE and mail.com and over 43 million active users, we are the leading email and communications platform in Germany, Austria and Switzerland. From this strong market position, services and apps are developed that simplify users' digital lives - from office tools and cloud solutions to personal ID management.

Your Tasks

Do you have a passion for complex and highly available infrastructures and applications? Do you want to make a real difference in a digital and agile environment with flat hierarchies? Then we are looking for you! With us, you will find more than just a job: you can expect cutting-edge technologies such as Kubernetes or Kafka as well as the opportunity to use your skills and ideas to drive topics forward and develop yourself individually. As a Site Reliability Engineer, you will be responsible for ensuring that millions of customers can access their emails, news and cloud data via our websites and mobile apps.

  • You work together with software development teams on CI/CD pipelines, for example to roll out Java applications on multi-datacentre Kubernetes platforms.
  • You advise our product teams on measures to improve resilience and fault tolerance using suitable SLOs and develop solutions, e.g. by using caching or auto-failover technologies.
  • You automate recurring processes, e.g. in the context of certificate management or software updates, with tools such as Renovate or Ansible and develop infrastructure as code (Helm Charts, Docker container images).
  • You improve our observability in order to uncover potential sources of error using tracing (Jaeger, OpenTelemetry) and metrics, e.g. by creating Grafana dashboards and Prometheus alert rules.
  • You develop acceptance tests that allow you to make changes reliably and at any time.

Your Profile

Have you completed a technical degree or comparable training? Are you motivated to become part of an international, agile team that drives change and continuously strives for improvement? Are you eager to learn, interested in cutting-edge technologies and can quickly familiarise yourself with new topics? Are you a true team player and do you have a hands-on approach to work? Then we look forward to receiving your application.

  • You are familiar with the management of large Linux or container environments.
  • You have relevant experience with at least one programming language (e.g. Python or Go).
  • HTTP, REST, TLS, Docker and Git are no foreign words for you.
  • You are enthusiastic about complex, technical challenges and strive for innovative and efficient solutions in an environment that allows you to contribute your ideas and actively shape the future of IT infrastructure.
  • You value an open error culture: `Making mistakes is part of life, as long as you learn from them.'.

Our Benefits

  • Our corporate culture: „You“ culture and no dress code, flat hierarchies, open and transparent communication
  • Individual development opportunities: diverse training courses, e-learning and internal communities, language courses, mentoring
  • Events: Slack Days, open source projects, meet-ups
  • Relocation service: support with the relocation to Germany
  • Benefits and additional services: company pension scheme, capital-forming benefits, discounts on own products, job ticket, bike leasing, corporate benefits portal
  • Attractive working conditions: 30 days holiday, hybrid working, full-time and part-time arrangements, free choice between Linux, Mac or Windows
  • Social: team events, summer and winter parties, family and care service, sports and fitness programmes, subsidised canteen, free fruit and drinks, health courses
  • Topics that are also important to us: Sustainability, diversity and our values and leadership principles - find out more on our website mail-and-media.com

With our strong brands GMX, WEB.DE and mail.com and over 43 million active users, we are the leading email and communications platform in Germany, Austria and Switzerland. From this strong market position, services and apps are developed that simplify users' digital lives - from office tools and cloud solutions to personal ID management.

Your Tasks

Do you have a passion for complex and highly available infrastructures and applications? Do you want to make a real difference in a digital and agile environment with flat hierarchies? Then we are looking for you! With us, you will find more than just a job: you can expect cutting-edge technologies such as Kubernetes or Kafka as well as the opportunity to use your skills and ideas to drive topics forward and develop yourself individually. As a Site Reliability Engineer, you will be responsible for ensuring that millions of customers can access their emails, news and cloud data via our websites and mobile apps.

  • You work together with software development teams on CI/CD pipelines, for example to roll out Java applications on multi-datacentre Kubernetes platforms.
  • You advise our product teams on measures to improve resilience and fault tolerance using suitable SLOs and develop solutions, e.g. by using caching or auto-failover technologies.
  • You automate recurring processes, e.g. in the context of certificate management or software updates, with tools such as Renovate or Ansible and develop infrastructure as code (Helm Charts, Docker container images).
  • You improve our observability in order to uncover potential sources of error using tracing (Jaeger, OpenTelemetry) and metrics, e.g. by creating Grafana dashboards and Prometheus alert rules.
  • You develop acceptance tests that allow you to make changes reliably and at any time.

Your Profile

Have you completed a technical degree or comparable training? Are you motivated to become part of an international, agile team that drives change and continuously strives for improvement? Are you eager to learn, interested in cutting-edge technologies and can quickly familiarise yourself with new topics? Are you a true team player and do you have a hands-on approach to work? Then we look forward to receiving your application.

  • You are familiar with the management of large Linux or container environments.
  • You have relevant experience with at least one programming language (e.g. Python or Go).
  • HTTP, REST, TLS, Docker and Git are no foreign words for you.
  • You are enthusiastic about complex, technical challenges and strive for innovative and efficient solutions in an environment that allows you to contribute your ideas and actively shape the future of IT infrastructure.
  • You value an open error culture: `Making mistakes is part of life, as long as you learn from them.'.

Our Benefits

  • Our corporate culture: „You“ culture and no dress code, flat hierarchies, open and transparent communication
  • Individual development opportunities: diverse training courses, e-learning and internal communities, language courses, mentoring
  • Events: Slack Days, open source projects, meet-ups
  • Relocation service: support with the relocation to Germany
  • Benefits and additional services: company pension scheme, capital-forming benefits, discounts on own products, job ticket, bike leasing, corporate benefits portal
  • Attractive working conditions: 30 days holiday, hybrid working, full-time and part-time arrangements, free choice between Linux, Mac or Windows
  • Social: team events, summer and winter parties, family and care service, sports and fitness programmes, subsidised canteen, free fruit and drinks, health courses
  • Topics that are also important to us: Sustainability, diversity and our values and leadership principles - find out more on our website mail-and-media.com
Mehr