Full-Time Site Reliability Engineer
Cognigy is hiring a remote, full-time Site Reliability Engineer. The role is at the Experienced career level and is open to remote applicants based in Germany. Read the complete job description before applying.
This job was posted 3 months ago and is likely no longer active.
Company: Cognigy
Job Title: Site Reliability Engineer
Posted: 3 months ago
Job Type: Full-Time
Career Level: Experienced
Locations Accepted: Germany
Job Details
Cognigy is looking for a Site Reliability Engineer to join our Engineering department.
Your responsibilities will include:
- Automate Everything – Streamline the provisioning of Kubernetes clusters and automate deployment processes for Cognigy.AI, ensuring efficiency and scalability.
- Ensure Stability & Reliability – Proactively monitor product installations and infrastructure to guarantee high availability and seamless performance for our SaaS offerings.
- Optimize & Reduce Costs – Help engineering teams automate repetitive tasks, improving efficiency, reducing operational overhead, and driving cost-effective solutions.
- Mentor & Guide – Provide expertise and mentorship to fellow SREs, fostering a culture of knowledge sharing and continuous improvement.
- Full Lifecycle Involvement – Be part of an agile development team, engaging in all phases of software engineering—from inception and coding to testing, deployment, and operations. Drive automation at every level.
- Performance & Scalability Focus – Enhance development team productivity by optimizing observability, scalability, availability, and reliability. Lead postmortems after major incidents to drive continuous improvement.
- Embrace Continuous Learning – Stay ahead of industry trends, continuously develop your skills, share knowledge, and take ownership of new challenges—all while enjoying the journey.
- User-Centric Mindset – Seek and incorporate customer feedback into development processes, ensuring new features enhance user experience while keeping the codebase deployment-ready at all times.
About you
- Kubernetes & Containerization Expertise – You have several years of experience running containers in production, including building Kubernetes clusters from the ground up. You’re also experienced with managed Kubernetes services like AWS EKS, Azure AKS, or Google GKE.
- Infrastructure Auto-Scaling – You know how to leverage tools like the Horizontal Pod Autoscaler and Cluster Autoscaler to ensure seamless scaling and performance.
- Cloud Proficiency – You have hands-on experience with major cloud platforms like AWS, Microsoft Azure or GCP.
- Strong Networking & Security Knowledge – You understand concepts like VPCs, subnets, internet gateways, and web security best practices.
- Passion for Automation – You have multiple years of experience with CI/CD systems (e.g., Jenkins, GitLab) and believe in automating whenever possible.
- Infrastructure as Code Enthusiast – You’re proficient with tools like Terraform, Helm, and Flux for managing infrastructure and deployments.
- Programming Skills – You have expertise in one or more programming languages such as Golang, Python, Ruby, JavaScript, Java, or Perl.
- Hands-on with Key Technologies – You’re familiar with Docker, Kubernetes, message brokers, NoSQL and SQL databases, and other core technologies.
- Monitoring & Observability – You have experience with monitoring tools like Prometheus, Grafana, and the ELK Stack, ensuring system reliability and performance.
- Proactive & Solution-Oriented Mindset – You anticipate scaling needs, take full ownership of challenges, and apply a systematic approach to problem-solving.
- Incident Response & Decision Making – You stay calm under pressure, make decisions with urgency when necessary, and are comfortable being on call for emergencies.
- Team-Oriented & Globally Minded – You’re eager to collaborate in an international, dynamic, and highly motivated team, contributing to the growth of Cognigy.AI.
FAQs
What is the last date for applying to the job?
The deadline to apply for Full-Time Site Reliability Engineer at Cognigy is 5th of October 2025. We consider jobs older than one month to have expired.
Which countries are accepted for this remote job?
This job accepts applicants based in Germany.