Full-Time Senior Customer Reliability Engineer
Arista Networks is hiring a remote Full-Time Senior Customer Reliability Engineer. The career level for this opening is Experienced, and the role accepts remote applicants based in Austin, TX. Read the complete job description before applying.
This job was posted 2 months ago and is likely no longer active. We encourage you to explore more recent listings on our site, though you may still try applying through the 'Apply Now' link below.
Arista Networks
Job Title: Senior Customer Reliability Engineer
Posted: 2 months ago
Employment Type: Full-Time
Career Level: Experienced
Locations Accepted: Austin, TX
Job Details
This is not a traditional operations role. You will inherit operational responsibilities essential to our customers' success, and we need you to help lead the effort to systematically dismantle this operational burden through automation, tooling, and systems. You will work with a collaborative team of excellent engineers. In the short term, the work involves manual deployments, reactive troubleshooting, and on-call escalations, but we need you to help us build a system where programmatic solutions have replaced human intervention.
What You’ll Do
Your work will follow a deliberate trajectory from reactive execution to proactive design.
Phase 1: Stabilize and Map (First 3-6 Months). You will embed with the team, taking ownership of the existing operational workload. This includes customer deployments, upgrades, and incident response. Your initial goal is to achieve stability while mapping the landscape of our operational toil.
Phase 2: Automate and Influence (Months 6-18). Armed with your map of toil, you will begin to automate. You will write code, build tooling, and deploy declarative infrastructure to eliminate the most critical operational burdens. For larger projects, you will act as a primary stakeholder, providing clear requirements to our internal tooling and platform teams and ensuring their solutions meet the operational need. Your success will be measured by a demonstrable reduction in overall support effort: fewer pages, fewer support escalations, and fewer manual tasks.
Phase 3: Architect and Evangelize (Year 2+). With the most acute operational pains addressed, your focus will shift to architectural concerns. You will define and implement Service Level Objectives (SLOs), influence the design of new products for operability, and help instill SRE principles throughout the engineering organization.
DevOps and SRE Proficiency
You must have a strong background in Site Reliability Engineering or a closely related DevOps function. You must also have a strong command of Linux systems administration and an understanding of networking fundamentals (TCP/IP, DNS, routing).
Customer-Facing Experience
You must have experience working directly with external customers to solve difficult technical problems. Your communication must be clear, empathetic, and precise.
Cloud Infrastructure Expertise
You need production experience with a major cloud provider, preferably AWS. You should be proficient in its core concepts and services (VPC, EC2, IAM, S3) and have experience building and managing infrastructure as code with tools like Terraform.
Monitoring and Observability
You will be responsible for both building and using our observability stack. This requires hands-on experience instrumenting applications and managing the telemetry pipelines for metrics, logs, and traces. A core part of the role is then applying this data to debug complex production incidents, understand system behavior, and define SLOs.
Automation and Software Development
You must be proficient in writing code to automate operational tasks. Expertise in a high-level language like Python or Go is required, as are strong shell scripting skills (e.g., Bash). We have a diverse tech stack including Python, Scala, C++, Haskell, Rust, PureScript, etc., which requires experience monitoring and debugging a complex system using system tools, command-line utilities, networking debug tools, and log filtering.
FAQs
What is the last date for applying to the job?
The deadline to apply for Full-Time Senior Customer Reliability Engineer at Arista Networks is 30th of October 2025. We consider jobs older than one month to have expired.
Which countries are accepted for this remote job?
This job accepts applicants based in Austin, TX.