Full-Time Site Reliability Engineer - Ireland
Arista Networks is hiring a remote Full-Time Site Reliability Engineer - Ireland. The career level for this job opening is Experienced and is accepting Ireland based applicants remotely. Read complete job description before applying.
Arista Networks
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Who You'll Work With
Arista Networks is seeking Site Reliability Engineers to actively participate in the initial rollout of internal and customer-facing services. You will be involved in key architectural decisions, designing, and implementing best practices to advance the Software Defined Networking revolution in the cloud.
The Site Reliability Engineering (SRE) role combines software and systems engineering to build and operate high-performance, massively distributed, and robust systems.
The role is crucial for optimizing system capacity and performance.
SRE roles fall into two areas:
- Internal Tools: Design and operate internal systems, including CI/CD pipelines, source repositories, and other internal tools.
- External SaaS: Actively contribute to a cloud-based public SaaS across all Arista teams.
Both roles offer the opportunity to push the boundaries of quality and availability by designing, selecting, and building best practices and tools to achieve these goals.
What You'll Do
Engage in the entire service lifecycle:
- Inception and design
- Deployment
- Operation
- Refinement
Support services before launch:
- System design consulting
- Software platform and framework development
- Capacity planning
- Launch reviews
Maintain services after launch:
- Measure and monitor availability, latency, and overall system health
Scale and evolve systems:
- Sustainable scaling through automation
- System evolution to improve reliability and velocity
Incident Response and Postmortems:
- Practice sustainable incident response
- Conduct blameless postmortems
Required Qualifications:
- Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience
- Programming experience in Go and Python
- SaaS operation experience
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
- Experience with Jenkins, Docker, and Kubernetes (K8s)
- Debugging, optimizing code, and automating routine tasks
- Understanding of Unix/Linux operating systems