Full-Time Systems Reliability Engineer
FairMoney is hiring a remote, full-time Systems Reliability Engineer. The career level for this opening is Expert, and the role is open to Africa-based applicants working remotely. Read the complete job description before applying.
FairMoney
Job Details
FairMoney is a credit-led mobile bank for emerging markets. You will be working as a Systems Reliability Engineer in our Technology Division. We are building Engineering centers of excellence across multiple regions and are looking for smart, talented, driven engineers.
About the Role
The systems reliability engineer (SRE) will apply software development skills to infrastructure and operations work. The SRE will spend up to half of their time on operations-related activities such as issue support, documentation, and system management. The rest of the time will be spent on development tasks such as new features, scaling, and automation.
Roles and Responsibilities
- Provide technical leadership on large/complex systems and platform opportunities.
- Build tooling for automation, management, and reliability of applicable systems.
- Build and support release pipelines for applicable systems.
- Work with the development and delivery teams to incorporate SRE practices into solution design.
- Manage system lifecycle from design to decommissioning.
- Write documentation for peers and business partners.
- Define SLOs and SLIs, and build robust monitoring solutions.
- Resolve P1 and P2 tickets using a systematic approach.
- Lead communications regarding system issues and post-mortems.
- Mentor junior team members.
- Perform other related duties.
- Multitask and prioritize work effectively.
- Apply strong troubleshooting and problem-solving skills.
- Work independently and with the team towards releasing features.
- Support SQL database technologies (SQL Server, Azure SQL PaaS, Azure SQL Managed Instances).
- Provide comprehensive administration of core platforms (backups, recovery, monitoring, maintenance, upgrades).
- Scripting and automation experience (Azure Resource Manager, AWS CloudFormation, Ansible, Terraform).
- Experience writing YAML for Azure DevOps or GitHub Actions pipelines.
- Proficiency with Azure resource management and operations.
- Familiarity with Linux Server administration.
- Familiarity with Azure Kubernetes Service (AKS) or other managed Kubernetes services.
Requirements
- Bachelor’s degree in computer science or a related field and 5+ years of experience, or equivalent.
- Minimum of 4 years in enterprise-level systems engineering or reliability engineering.
- Strong knowledge of operating systems, networking, and security best practices.
- Working knowledge of Agile delivery and DevOps principles.
- Proficiency in one or more programming languages (PowerShell, C#, Go, Python).
- Technical certifications are a plus.