Description
For our client we are looking for a DevOps Engineer (f/m/d) Cloud Infrastructure with Kubernetes
Start: 18.08.2025
Duration: until 31.12.2025 (long-term engagement into 2026)
Capacity: 100% if possible
Location: 75% remote, 25% Berlin (rotating: 1 week in Berlin, 3 weeks remote), up to 50% onsite at peak times
Language: English, German is a plus
Budget: 80.00 EUR net remote, 92.25 EUR net onsite
Team:
The ESL Product Line is responsible for a product portfolio central to the platform, consisting of an Infrastructure as a Service Product, a managed Kubernetes Service, a resource management service to facilitate scalable management of platform permissions and a service lifecycle workflow engine enabling.
All services together constitute a core part of an on-premise private cloud platform for all business applications of the client, including IT/OT critical applications required for maintaining and operating.
For the whole product portfolio, the product line owns the complete product flow, from product management through architecture and delivery up to Tier 3 operations.
Objectives:
- Consult on CI/CD pipelines and ensure operational readiness for deployments
- Ensure operational stability and responsiveness for ESL, with a focus on monitoring and on incident, problem and change management
- Reduce operational toil and improve service reliability
- Ensure platform operations adhere to security and compliance standards
Skills (must-have):
- At least 5 years of operational experience with self-managed Kubernetes clusters, self-managed services providing Kubernetes clusters, and production applications or systems in on-premises environments
- Deep understanding of networking concepts, including protocols, load balancing, and security.
- Profound knowledge of and implementation experience with CI/CD processes, tooling (e.g. GitLab, Jenkins, Tekton, Argo Workflows, and Argo CD) and concepts, including the associated quality and security assurance for software delivery
- Fundamental understanding of core operations processes (incident management, change management, problem management, IT Service Management) as well as SRE concepts
- Experience in gathering operational insights from monitoring or observability data, including SLI/SLO/SLA management and tracking.
- Hands-on experience in documenting procedures properly and enforcing clear runbooks or playbooks.
- Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog).
Skills (should-have):
- Project experience in software engineering (in Go, C/C++ or Python) with significant experience in building RESTful services in distributed environments.