Full-Time Staff Site Reliability Engineer - Remote
Cyberark is hiring a remote Full-Time Staff Site Reliability Engineer - Remote. The career level for this job opening is Expert and is accepting Santa Clara, CA based applicants remotely. Read complete job description before applying.
Cyberark
Job Title
Posted
Career Level
Career Level
Locations Accepted
Salary
Share
Job Details
CyberArk is seeking a Staff Site Reliability Engineer to join our team. If you excel at solving scale problems in the cloud, creating visible platforms, and implementing CI/CD pipelines, we want to hear from you!
Responsibilities include:
- Design and implementation of AWS infrastructure components (VPCs, EC2, EKS, S3, tagging schemes, CloudFormation, etc.)
- Lead architecture, design, and feature analysis of deployment and management automation for cloud-based infrastructure and software.
- Guide Site Reliability and DevOps Engineers on managing reliability and performance of SaaS environments and building automation for problem prevention.
- Architect and guide the team with configuration management tools (CloudFormation, Helm, Terraform, Salt, Ansible), both for Windows and Linux.
- Ensure cloud-based architectures meet availability and recoverability requirements.
- Develop and implement cloud-based monitoring, alerting, and reporting systems (Datadog, Logz.io, CloudWatch, Catchpoint, ELK).
- Provide support and guidance on tooling to improve team output and reliability.
- Deep understanding of latest technology solutions and trends.
- Collaborate with Team Leads to identify areas for improvement, create architecture roadmaps, and advocate with Product Management.
Qualifications:
- Minimum 4 years experience managing AWS infrastructure.
- Minimum 7 years experience in a senior, architect, or technical lead role (site reliability, systems engineering, or software development).
- Deep understanding of Site Reliability, infrastructure, and Cloud Platform.
- Expert understanding/experience of containerization services (Docker/Kubernetes).
- Expert in observability tooling (Datadog, NewRelic, Logstash, Elasticsearch).
- Solid understanding/experience of web services, databases, and related infrastructure/architectures.
- Solid understanding of backup/restore best practices.
- Strong expertise programming configuration management languages.
- Strong expertise programming in Python/Java or equivalent.
- Excellent troubleshooting skills.
- Experience supporting enterprise-level SaaS environments (preferred).
- Experience with AI/ML models to improve system performance (preferred).
Education:
- Bachelor's degree in Computer Science or equivalent experience.
Compensation:
$141,000 - $176,000/year, plus commissions/discretionary bonus. Base pay may vary based on skills/experience.
Benefits: Medical, dental, vision, financial benefits, and more.