Full-Time Site Reliability Engineer
Keeper Security, Inc. is hiring a remote Full-Time Site Reliability Engineer. This is an Experienced-level position open to remote applicants based in the USA. Read the complete job description before applying.
Keeper Security, Inc.
Job Details
Keeper is hiring a talented Site Reliability Engineer to join our DevOps team. This is a 100% remote position, with the option of a hybrid schedule for candidates in the El Dorado Hills, CA or Chicago, IL metro areas.
About the Role
As the Site Reliability Engineer (SRE), you will ensure the reliability, availability, and performance of our software systems. You'll collaborate with Information Security and DevOps teams to design, build, and maintain scalable and resilient infrastructure. The role's primary focus is CI/CD operations.
Responsibilities
- Design, implement, and manage infrastructure for CI/CD and reliable software deployment.
- Ensure high availability and performance of production systems, monitoring critical services to prevent outages.
- Manage infrastructure automation (Terraform, Ansible, Kubernetes).
- Support security audits and compliance (SOC2, ISO 27001, FedRAMP).
- Collaborate with development teams to optimize release pipelines (automated testing, code coverage, performance monitoring).
- Troubleshoot system performance, software reliability, and capacity management issues.
- Stay current with industry trends in cloud infrastructure, automation, and DevOps.
- Contribute to monitoring and alerting systems to proactively address reliability issues.
- Promote a culture of reliability through team collaboration on reliability goals.
Requirements
- 8+ years of experience as a Site Reliability Engineer or DevOps Engineer, or in a similar role.
- Proficiency with cloud platforms (AWS, Azure, Google Cloud) and infrastructure management tools.
- Experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI).
- Strong understanding of monitoring and observability tools (Prometheus, Grafana, New Relic).
- Solid Linux, Mac OS X, and Windows systems experience, plus scripting (Python, Bash, Go).
- In-depth knowledge of networking, security best practices, and incident management.
- Excellent communication skills and a collaborative mindset.
- Bachelor's degree in Computer Science or related field preferred.
Because this role involves GovCloud, each qualified candidate must be a “U.S. Person”.