Full-Time Principal Site Reliability Engineer
Dayforce is hiring a remote Full-Time Principal Site Reliability Engineer. The career level for this job opening is Experienced and is accepting USA, Canada based applicants remotely. Read complete job description before applying.
Dayforce
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Site Reliability Engineers bridge the gap between development and operations organizations and facilitate effective collaboration which leads to faster feature delivery and elevated service quality overall. Join the pioneering Site Reliability Engineering team at Dayforce, where we lead the charge in ensuring our state-of-the-art products set new benchmarks in scalability, availability, and reliability.
What you’ll get to do
- Develop a deep understanding of Dayforce’s cloud infrastructure and applications to build a comprehensive mental model of the ecosystem.
- Contribute to and maintain a robust suite of tools that enhance reliability, support SRE practices, and automate operational tasks.
- Contribute to the design, implementation, and maintenance of cloud infrastructure and networking components to ensure scalability, security, and reliability.
- Collaborate with the team to uphold high coding standards and deliver reliable, maintainable code.
- Respond to and remediate production issues as they arise.
- Write scripts to collect data, monitor services, automate tasks, and support various operational needs.
- Participate in PagerDuty on-call rotations as required.
Skills and experience we value
- Self-starter and passionate individual willing to learn new concepts and technologies as well as contributing to the SRE powered ecosystem.
- 7+ years of hands-on experience managing Azure infrastructure using Terraform.
- Strong understanding of networking concepts, including VNets, Private Endpoints, and Private DNS Zones.
- Proficiency in CI/CD pipelines using GitHub Actions and managing deployment workflows.
- Experience managing alerting and incident response systems such as PagerDuty.
- Experience managing and maintaining SQL Server databases.
- Excellent communication, collaboration, and leadership skills.
What would make you really stand out
- Familiarity with AI/ML technologies or enthusiasm to explore them (e.g., LLMs, ML APIs).
- Openness to learning and adopting new tools and platforms (e.g., Rundeck, Pagerduty).
- Experience with security best practices in DevOps and software development.
- Knowledge of Docker and Kubernetes is considered an asset.Scripting and automation skills using PowerShell and Python.