Full-Time Site Reliability Engineer
Pavilionpayments is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Experienced and is accepting USA based applicants remotely. Read complete job description before applying.
Pavilionpayments
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Pavilion Payments enables gaming entertainment leaders to create amazing consumer experiences and maximize spend.
Our suite of payment solutions enables safe, secure, and trusted cash access at the cage, on the casino floor, or online.
About the Role
As Pavilion Pay's inaugural Site Reliability Engineer (SRE), you will build a resilient infrastructure ensuring high availability across our systems.
Key Responsibilities:
- Reliability and Incident Management: Establish and track reliability metrics (Latency, Traffic, Errors, Capacity). Develop and refine monitoring systems using Grafana.
- Platform Management and Service Objectives: Collaborate with IT leadership to define and maintain service level objectives (SLOs). Structure and optimize platform management.
- Automation, IaC, and CI/CD Pipelines: Develop and maintain Terraform configurations for scalable infrastructure deployment. Optimize CI/CD workflows using Azure DevOps.
- Network and Security Collaboration: Partner with network engineers to optimize F5 load balancers and Palo Alto Networks/Panorama. Collaborate with security teams to ensure network traffic and access patterns align with security best practices.
Requirements:
- Technical Skills: Proficiency with SUSE, AKS, Linux, Azure Cloud, Grafana, Rancher, Terraform, Azure DevOps pipelines.
- Monitoring Tools: Strong experience with Grafana and OpsGenie.
- Automation and Scripting: Proficiency in scripting (e.g., Bash, Python) and experience with TailScale.
- Problem-Solving Mindset: Experience in identifying and remediating performance and security issues.
First 90 Days:
- Understand Pavilion Pay's products and their interdependencies.
- Develop monitoring structures.
- Learn network architecture and monitoring systems.
- Gain familiarity with platform elements.