Full-Time Senior Site Reliability Engineer
Movable Ink is hiring a remote Full-Time Senior Site Reliability Engineer. The career level for this job opening is Senior Manager and is accepting USA based applicants remotely. Read complete job description before applying.
Movable Ink
Job Title
Posted
Career Level
Career Level
Locations Accepted
Salary
Share
Job Details
Movable Ink scales content personalization for marketers through data-activated content generation and AI decisioning. The world’s most innovative brands rely on Movable Ink to maximize revenue, simplify workflow and boost marketing agility.
As one of our Senior Site Reliability Engineers, you will be 100% hands-on with both infrastructure and software development.
We operate a multi-region, active-active content serving platform that serves upwards of 8 Billion requests daily.
Responsibilities:
- Improve the tooling and automation of our infrastructure to minimize manual work, increase performance, and decrease the frequency and severity of incidents
- Build, maintain, and support core applications
- Build and operate our core internal observability platform
- Monitor our systems for capacity, performance, and troubleshooting issues
- Partner with the rest of the SRE team and our service engineering teams to ensure smooth, continued delivery of our service to clients
Qualifications:
- Experience in Site Reliability or Software Engineering, building and maintaining scalable, resilient services
- Building the tooling and automation to manage those services, as well as investigating system and application metrics to diagnose and resolve performance issues.
- 4+ years experience as an SRE or Software Engineer, with a focus on Cloud platforms. We use AWS.
- Experience building and operating large-scale observability platforms. We use Prometheus, Thanos, Loki and Tempo.
- Experience and willingness to operate in an on-call environment, evaluating and improving monitoring and alerting systems, and developing run books to investigate and debug issues. Every member of the SRE team does a week-long on-call rotation every 5 to 6 weeks.
- Strong experience with infrastructure as code tools. We use Terraform and Chef.
- Strong experience with operating Kubernetes and running workloads on it. We use EKS.
- Familiarity with one or more high-level programming languages and a willingness to learn. We use NodeJS, Golang, Ruby, Python, Bash and Shell scripting.
- Linux experience (Ubuntu/Debian)
The base pay range for this position is $190,000 - $210,000/year.