Full-Time Site Reliability Engineer
Ververica is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Experienced and is accepting Germany based applicants remotely. Read complete job description before applying.
Ververica
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About VervericaVerverica empowers businesses with real-time data processing and analytics, leveraging Apache Flink.
Role OverviewAs a Site Reliability Engineer at Ververica, you will maintain infrastructure across AWS, GCP, and Azure, collaborating with engineering teams for feature delivery and security.
Key Responsibilities
- Build and maintain infrastructure for Ververica’s Unified Streaming Data Platform across AWS, GCP, and Azure.
- Design and manage Infrastructure as Code (IaC) using Terraform.
- Implement and enhance observability tooling (Grafana, Prometheus, etc.).
- Ensure system reliability through SRE best practices (SLIs, SLOs, error budgets).
- Improve infrastructure architecture and efficiency.
- Enhance CI/CD pipelines.
- Monitor and resolve security vulnerabilities.
- Contribute to new product launches.
- Participate in on-call rotations.
- Maintain documentation.
Requirements
- Bachelor’s degree in Computer Science or related field.
- Minimum 2 years of experience with Kubernetes, Helm, controllers, and operators.
- Proficiency in Terraform.
- Strong knowledge of observability tools.
- Experience with SRE principles (SLIs, SLOs, error budgets).
- Solid understanding of Linux systems and cloud networking.
- Experience managing multiple Kubernetes clusters.
- Familiarity with distributed systems and streaming data platforms.
- Knowledge of cloud-native security best practices.