Contractor Site Reliability Engineer
Virtasant is hiring a remote Contractor Site Reliability Engineer. The career level for this job opening is Experienced and is accepting USA based applicants remotely. Read complete job description before applying.
Virtasant
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
We’re looking for a Site Reliability Engineer to join a high-impact cloud infrastructure team at one of Virtasant’s key technology partners. You'll play a critical role in improving system observability, ensuring platform reliability, and embedding proactive engineering practices across a globally distributed environment.
This is a hands-on technical role ideal for someone who brings a developer’s mindset to SRE work.
What You’ll Do
- Drive the creation and evolution of observability systems — including dashboards, logging, alerting, and instrumentation.
- Identify trends, anomalies, and early warning signs through data analysis.
- Work with engineers to drive the adoption of observability best practices across squads.
- Surface, propose, and implement proactive reliability improvements across AWS environments.
- Contribute to build, test, and deploy workflows (CI/CD), with a strong emphasis on automation.
- Collaborate across teams using agile ceremonies, async-first workflows, and direct feedback loops.
What We’re Looking For
Must-Have Experience
- Deep knowledge of observability tooling, preferably with Datadog
- Hands-on SRE experience within AWS, including Lambda, containers, and IAM
- Strong programming skills in Python and Ruby
- Experience with Terraform and infrastructure as code (IaC) practices
- Familiarity with incident management, on-call rotations, and SLAs
- Ability to identify patterns and risks from telemetry and act on them proactively
Nice-to-Haves
- Previous experience as a software developer or DevOps engineer
- Knowledge of reliability strategies for containerized workloads
- Comfortable contributing to CI pipelines and deployment strategies
- Experience working in environments with limited QA/BA handoffs
Tools & Environment
- Languages: Python, Ruby
- Cloud: AWS (Lambda, ECS, IAM)
- IaC: Terraform
- Observability: Datadog
- Workflow: Agile (Scrum), Jira, Git, CI/CD pipelines