Full-Time Senior Site Reliability Engineer
Exygy is hiring a remote Full-Time Senior Site Reliability Engineer. The career level for this job opening is Senior Manager and is accepting USA based applicants remotely. Read complete job description before applying.
Exygy
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About ExygyExygy is a digital innovation studio committed to building resilient communities. We partner with impact-focused organizations to create digital products that solve problems and delight users.
SummaryExygy seeks a passionate and experienced Senior SRE to join our growing team. This full-time remote role will focus on supporting the CiviForm team, ensuring secure, dependable, and scalable production instances.
Responsibilities
- Participate in CiviForm product development, leveraging existing deployment systems and a new Kubernetes-based prototype.
- Manage staging and production environments, including on-call support for outages.
- Collaborate with governments on service-related issues.
- Own and develop deployment systems.
- Participate in the development of a new CiviForm SaaS, owning the Kubernetes deployment system from prototyping to launch.
- Improve the existing Python/Terraform-based infrastructure, deployed on AWS and Azure, to better support government cloud deployments.
- Define and implement metrics gathering and analysis to enhance deployments.
- Partner with the engineering team on testing, release procedures, scaling, and resilience.
- Create Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and implement them.
- Develop playbooks for deployments, including monitoring and alerting strategies.
- Identify and mitigate security risks.
- Contribute to CI/CD implementation and best practices.