Full-Time Site Reliability Engineer
Platform.sh is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Experienced and is accepting USA, UK, Canada, Germany, France, Spain based applicants remotely. Read complete job description before applying.
Platform.sh
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About Platform.shPlatform.sh is a Platform-as-a-Service (PaaS) that streamlines development-to-production workflows, making it faster to build and deploy applications.
Impact of a Site Reliability Engineer
As a Site Reliability Engineer (SRE), you’ll enhance system reliability, scalability, and efficiency, focusing on infrastructure improvement, automation, and streamlined processes.
- Refine Monitoring and Observability: Enhance system monitoring with tools like Prometheus, Grafana, and ELK Stack.
- Automate Deployments and Workflows: Use IaC tools (Terraform, Ansible) to automate deployments and improve efficiency.
- Optimize CI/CD Pipelines: Improve pipeline architecture for fast, reliable releases.
- Cloud Infrastructure Management: Scale cloud-based systems (AWS, GCP, Azure) while minimizing technical debt.
- Incident Response and Post-Mortem: Support incident management and lead post-mortem analysis.
- Collaborate with Cross-Functional Teams: Integrate reliability practices into the development lifecycle.
- Drive Technical Innovation: Introduce new tools, technologies, and practices to improve reliability, performance, and scalability.
What you bring
- DevOps, Cloud Operations, or SRE Expertise
- Advanced Linux Internals Expertise
- Programming Languages (Go/Python)
- Scripting Skills (Python, Bash, Go)
- Cloud Infrastructure Knowledge
- Containerization and Orchestration (Docker, Kubernetes)
- Problem-Solving and Collaboration