Full-Time Site Reliability Engineer
BforeAI is hiring a remote Full-Time Site Reliability Engineer. The career level for this job opening is Expert and is accepting Americas based applicants remotely. Read complete job description before applying.
BforeAI
Job Title
Posted
Career Level
Career Level
Locations Accepted
Salary
Share
Job Details
BforeAI is a rapidly expanding scale-up focused on preventing cybercrime with cutting-edge AI. We use prescriptive AI to tackle cyber threats, especially brand protection. We are like weather forecasts for cyber threats.
As an SRE at BforeAI, you'll be crucial to our technology team, ensuring the reliability, scalability, and performance of our cloud infrastructure and applications.
Your Responsibilities Include:
- Architecting, deploying, and managing Kubernetes clusters for high availability and scalability.
- Improving database performance through optimization, indexing, and caching.
- Developing and maintaining Infrastructure as Code (IaC) using tools like Terraform, Ansible.
- Implementing monitoring and alerting systems for proactive system health maintenance.
- Enforcing cloud environment best practices for security, access control, and compliance.
- Establishing and maintaining Incident management procedures.
- Collaborating with engineering teams to support their infrastructure needs.
- Ensuring infrastructure and product resilience and recovery through best practices.
- Creating and maintaining detailed documentation for processes and procedures.
Requirements
- 8+ years experience in SRE, system administration, or similar roles.
- Kubernetes expertise, including cluster setup, management, and maintenance (CKA/CKSS certifications preferred).
- Database performance optimization experience (PostgreSQL, MySQL).
- Experience with Infrastructure as Code (IaC) tools (Terraform certification a plus).
- Experience with monitoring and logging tools (Splunk, Prometheus, Grafana, etc).
- Experience with Incident response tools (PagerDuty, OpsGenie).
- Experience with cloud platforms (AWS, Azure, GCP; architect-level certification a plus).
- Experience with secrets management tools (Hashicorp Vault, CyberArk Conjur).
- Strong problem-solving and troubleshooting skills.
- Excellent communication and collaboration skills.
- RHCSA/RHCE certification preferred.
Compensation: Up to $110,000 USD per year (Cost to Company).