Full-Time Senior SRE - West
Nexthink is hiring a remote Full-Time Senior SRE - West. The career level for this job opening is Senior Manager and is accepting San Francisco, CA based applicants remotely. Read complete job description before applying.
Nexthink
Job Title
Posted
Career Level
Career Level
Locations Accepted
Salary
Share
Job Details
Nexthink is looking for a Site Reliability Engineer passionate about building and running a high-performance cloud platform, enabling best-in-class site reliability and operations practices.
This role will support US-based operations, focusing on enabling Nexthink to deliver to the US Public Sector market, particularly a FedRAMP Moderate offering.
Implement modern, cloud-native SRE processes and manage/operate Nexthink’s multi-tenant, microservices-based cloud platform (multiple global instances). Collaborate closely with cross-functional teams to integrate reliability and security into systems, ensuring federal security standards are met.
Key Responsibilities:
- Infrastructure Management: Design, deploy, and manage scalable and secure cloud infrastructure using IaC tools.
- Monitoring & Performance: Develop/maintain monitoring, logging, alerting systems to ensure high availability and performance. Lead performance tuning.
- Security & Compliance: Implement/maintain security controls to achieve FedRAMP compliance. Conduct security assessments, vulnerability scans, penetration testing, and collaborate with compliance team.
- Incident Management: Lead incident resolution, root cause analysis, and develop incident response strategies.
- Collaboration & Communication: Integrate reliability and security into the software development lifecycle; provide regular updates to stakeholders on system performance/reliability/compliance status.
Qualifications:
- Bachelor’s degree in Computer Science or related field (or equivalent experience)
- 5+ years of experience in SRE, DevOps, or related role
- Proficiency in cloud platforms (AWS, Azure, GCP)
- Strong scripting/programming skills (Python, Bash, Go, etc.)
- Experience with IaC tools (Terraform, CloudFormation, etc.), containerization/orchestration (Docker, Kubernetes)
- Familiarity with CI/CD pipelines/tools, and security tools/practices (SIEM, IDS/IPS, firewalls)
- In-depth knowledge of FedRAMP requirements and best practices
- Strong problem-solving, analytical, communication, and collaboration skills.
- Ability to work independently and as part of a team