Full-Time Site Reliability Engineer
Coalfire is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Experienced and is accepting USA based applicants remotely. Read complete job description before applying.
Coalfire
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About CoalfireCoalfire is on a mission to make the world a safer place by solving our clients’ hardest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever-changing cybersecurity landscape.
Position SummaryWe’re looking for a Site Reliability Engineer to join the Coalfire team. If you’re driven by a desire to innovate, excel at operational excellence, and thrive in a collaborative environment, come be part of a team committed to making the world a safer place.
What You'll Do
- Hands-on engineering work, including developing new deployments, automation scripts, and tooling to meet client needs.
- Manage and maintain patch management processes, ensuring timely updates, security compliance, and system stability across cloud and on-prem environments.
- Oversee Identity and Access Management (IAM), implementing and enforcing security best practices to protect sensitive data and ensure proper access controls.
- Perform cloud administration and system administration tasks, such as provisioning resources, optimizing performance, and troubleshooting infrastructure issues.
- Collaborate with senior engineers and solutions architecture teams to address complex technical issues, ensuring timely resolutions and maintaining client satisfaction.
- Adhere to established quality standards for engineering deliverables, aligning with internal protocols, compliance regulations, and project deadlines.
- Identify and communicate potential risks, working with relevant stakeholders to incorporate mitigation strategies that meet regulatory and client expectations.
- Contribute to day-to-day project tasks, including tracking progress, providing updates, and ensuring assigned activities are completed on schedule.
What You'll Bring
- 3–5 years in systems engineering and architecture.
- 3–5 years in cloud computing (AWS, Azure, or GCP).
- 3–5 years working with Infrastructure-as-Code (for example, Terraform, Ansible).
- Experience meeting SLAs through effective issue identification, escalation, and resolution.
- Proven track record of contributing to operational improvements.
- Experience participating in project definition and documentation.
- Managed Services Expertise: Familiarity with ticket management systems and meeting SLA requirements.
- Cloud and Automation: Hands-on experience with AWS, Azure, or GCP; working knowledge of Terraform, Ansible, GitLab, and CI/CD technologies.
- Technical Collaboration: Proven ability to work alongside Site Reliability Engineers and cross-functional teams.
- Soft Skills: Strong interpersonal, organizational, and problem-solving skills.
- Documentation and Communication: Skilled at creating technical diagrams and clear written documentation.
- Security Mindset: Critical thinker capable of meeting security and compliance requirements.
Bonus Points
- Serverless and Modern Architectures: Exposure to serverless, microservices, containerization, or other modern application frameworks
- Network and Firewall Technologies: Experience with cloud-based networking, next-gen firewalls
- Tools and Frameworks: Familiarity with Visio, LucidChart, Jira
- Regulatory Familiarity: Basic understanding of FedRAMP, FISMA, SOC, ISO, HIPAA, HITRUST, PCI
- Experience in technical consulting engagements or cross-functional collaboration