Full-Time Sr. Site Reliability Engineer
Cookunity is hiring a remote Full-Time Sr. Site Reliability Engineer. The career level for this job opening is Experienced and is accepting Latam based applicants remotely. Read complete job description before applying.
Cookunity
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About CookUnity: CookUnity connects chefs with customers for meal delivery.
About the Team: The CookUnity Infrastructure team maintains highly available infrastructure for millions of customers.
The role: The Sr. Site Reliability Engineer architects, implements, and maintains cloud-native infrastructure and deployment pipelines, focusing on reliability, scalability, and automation.
Responsibilities:
- Architect, deploy, and manage highly available and scalable infrastructure on AWS.
- Design, implement, and maintain Kubernetes clusters (EKS).
- Develop and manage GitOps workflows using ArgoCD for automated deployments.
- Write and maintain infrastructure as code (IaC) using tools such as Terraform.
- Build, optimize, and troubleshoot CI/CD pipelines.
- Develop automation scripts using Kotlin, Python, and/or Bash.
- Monitor system performance, reliability, and security.
- Collaborate with software engineers to improve deployment strategies.
- Implement security best practices.
- Maintain comprehensive documentation.
Qualifications:
- 7+ years experience in SRE or related roles.
- Proficiency in deploying, managing, and troubleshooting Kubernetes clusters.
- Advanced hands-on experience with ArgoCD.
- Strong development and scripting skills in Kotlin, Python, and Bash.
- Deep knowledge of CI/CD concepts and tools.
- Demonstrated ability to design and implement infrastructure as code.
- Strong problem-solving skills.
- Excellent communication and collaboration abilities.
Preferred requirements: Bachelor's or Master's degree in Computer Science or related field.