Contractor Site Reliability Engineer

Monks is hiring a remote Contractor Site Reliability Engineer. This is an Experienced-level role open to remote applicants based in India. Read the complete job description before applying.

This job was posted 1 month ago and is likely no longer active.

Monks

Job Title: Site Reliability Engineer
Posted: 1 month ago
Job Type: Contractor
Career Level: Experienced
Locations Accepted: India

Job Details

We are looking for a Site Reliability Engineer (SRE) with a strong background in observability, automation, and platform resilience to drive the operability and reliability of our Disaster Recovery as a Service (DRaaS) solution. This role is essential in ensuring our DR environments are resilient, observable, and continuously improving. You’ll collaborate with DR architects, security, infrastructure, and engineering teams to define SLIs/SLOs for critical systems, reduce operational toil, and lead efforts such as chaos engineering and game-day simulations.
Key Responsibilities:
  • Build and maintain observability dashboards and proactive alerting systems to monitor DR environments across Azure, AWS, and private cloud (e.g., HPE GreenLake).
  • Define and track Service Level Indicators (SLIs) and Error Budgets aligned with strict RPO/RTO targets.
  • Collaborate on runbook automation, synthetic testing, and validation pipelines for DR readiness.
  • Lead chaos engineering initiatives and game-day exercises to proactively identify weak points and ensure high system resilience.
  • Conduct post-incident reviews, implement feedback loops, and own the resulting automation backlog.
  • Work with DR architecture and engineering teams to drive infrastructure as code (IaC) practices and platform reliability improvements.
  • Participate in quarterly failover/failback simulations, monitor performance, and propose observability enhancements.
  • Help define SLOs for protected application groups (Virtual Protection Groups, VPGs) and contribute to reporting for DR testing and compliance audits.
  • Advocate for and implement best practices around toil reduction, incident response, and on-call efficiency.
Requirements:
  • 5+ years of experience in SRE, DevOps, or Platform Engineering roles.
  • Strong hands-on experience with observability tools like Grafana, Prometheus, Datadog, or Splunk.
  • Experience designing and maintaining SLIs/SLOs, error budgets, and availability dashboards.
  • Proficiency in at least one scripting or programming language (e.g., Python, Bash, Go).
  • Knowledge of disaster recovery principles, RPO/RTO targets, and infrastructure failover practices.
  • Experience with incident response, blameless postmortems, and tracking improvement actions.
  • Familiarity with IaC tools such as Terraform, Ansible, or CloudFormation.
  • Experience with CI/CD, automated testing, and cloud-native deployments in Azure or AWS.
  • Strong problem-solving and collaboration skills, with the ability to work across cross-functional teams.
  • Fluent in English (written and spoken).
Nice to Have (strong plus):
  • Experience with Zerto, Veeam, or similar DR orchestration platforms.
  • Background in chaos engineering using tools like Gremlin or LitmusChaos.
  • Exposure to TISAX, ISO 27001, or other compliance-aligned monitoring.
  • Knowledge of Kubernetes and container orchestration for DR environments.
  • Previous experience in platform reliability for mission-critical systems.

FAQs

What is the last date for applying to the job?

The deadline to apply for the Contractor Site Reliability Engineer role at Monks is 6th of November 2025. We consider jobs older than one month to have expired.

Which countries are accepted for this remote job?

This job accepts applicants based in India.
