Full-Time Technical Senior Manager of Site Reliability Engineering
Coalfire is hiring a remote Full-Time Technical Senior Manager of Site Reliability Engineering. The career level for this job opening is Senior Manager and is accepting USA based applicants remotely. Read complete job description before applying.
Coalfire
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About Coalfire: Coalfire is on a mission to make the world a safer place by solving our clients’ hardest cybersecurity challenges. We advise, assess, automate, and help companies navigate the ever-changing cybersecurity landscape. We are headquartered in Denver, Colorado with offices across the U.S. and U.K.
Position Summary: We’re looking for a Technical Senior Manager of SRE to play a central role in the implementation and maintenance of scalable, secure, and high-performing systems, ensuring our clients’ mission-critical infrastructures remain stable and resilient.
What You'll Do:
- Allocate 70% of time to hands-on engineering tasks, such as developing new deployments, tooling, and automation scripts.
- Dedicate 30% of time to leadership duties, including mentoring junior engineers, ensuring quality deliverables, and managing escalations.
- Act as primary escalation contact for complex technical issues.
- Monitor and uphold quality standards for engineering work, confirming alignment with internal protocols and project milestones.
- Identify and mitigate risks in partnership with consulting and solutions architecture teams.
- Coordinate day-to-day engineering activities, tracking progress and adjusting resources.
- Help create and implement solutions to improve the practice.
What You'll Bring:
- Experience: 9+ years in Systems Engineering and Architecture, Cloud Computing, and Infrastructure-as-Code.
- Proficiency: Hands-on proficiency in Terraform and Ansible.
- SLA and Issue Management: Proven track record of meeting SLAs, particularly regarding availability and response times.
- Operational Excellence: Demonstrated success driving continuous improvement via KPIs.
- Governance and Compliance: Experience guiding the creation of Infrastructure-as-Code solutions and alignment with standards like FedRAMP.
- Team Leadership: Proven track record of managing teams (6–8 contributors).
- Managed Services Expertise: Familiarity with ticket management systems.
- Cloud & Automation: Extensive experience with AWS, Azure, or GCP, and deep knowledge of Terraform, Ansible, GitLab, and CI/CD technologies.
- Technical Collaboration: Proven ability to collaborate with Site Reliability Engineers and cross-functional teams.
- Soft Skills: Strong interpersonal, organizational, and problem-solving skills.
- Documentation & Communication: Capable of creating technical diagrams and comprehensive documentation.
- Security Mindset: Critical thinker capable of balancing security and compliance requirements.
- Bonus Points: Consulting experience, high-availability environments, encryption and hardening expertise.