Full-Time Site Reliability Engineer
Egen is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Experienced and is accepting Naperville, IL based applicants remotely. Read complete job description before applying.
Egen
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Egen is a fast-growing company with a data-first mindset. We bring together engineering talent using advanced technology platforms like Google Cloud and Salesforce to help clients leverage data and insights.
We are committed to a supportive work environment where top talent can apply their engineering skills to envision how data and platforms can change the world for the better.
We seek a Site Reliability Engineer to ensure system reliability and infrastructure support. Responsibilities include:
- Ensuring system reliability and uptime (based on SLAs).
- Monitoring system performance and optimizing it.
- Leading incident management, documenting Root Cause Analysis (RCA), lessons learned, and Standard Operating Procedures (SOPs).
- Collaborating with DevOps and Application teams to align priorities and improve processes.
- Prioritizing response efforts based on severity and impact.
- Evaluating and approving production system changes, maintaining stability.
- Optimizing resource usage and managing costs.
Requirements:
- 3+ years SRE experience with Azure and/or AWS
- Bachelor's Degree preferred or equivalent experience
- Programming skills (Java, SpringBoot, SQL, Bash)
- Monitoring experience (DataDog, Splunk, Grafana)
- Docker, Kubernetes, Linux experience
- Incident/Alerts Management (VictorOps, PagerDuty)
- Git, Bitbucket
- Troubleshooting complex distributed services
- Strong attention to detail
- Experience with testing, monitoring, logging, and alerting
- Excellent documentation skills
- Excellent Incident Management skills
Additional Note: Mention "GAILY" and tag RMjE2LjI0NS4yMjEuOTE= when applying (#RMjE2LjI0NS4yMjEuOTE=).