Full-Time Senior DevOps Engineer
Experian is hiring a remote Full-Time Senior DevOps Engineer. The career level for this job opening is Expert and is accepting Costa Rica based applicants remotely. Read complete job description before applying.
Experian
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Job Description
What you'll do
Experian IT Services (EITS) is the centralized technology organization that globally supports all of Experian. Our organization is comprised of about 2,000 employees across the globe supporting our business-critical IT environment/ecosystem. Experian IT Services includes Infrastructure Services, Cyber & Information Security, and Enterprise Architecture. We improve growth through reusable technology providing quicker time to market for solutions, increased productivity, and a more secure environment.
The Lead Devops Engineer is a critical role in our organization, dedicated to ensuring the performance and scalability of our infrastructure and applications through monitoring and observability practices. You will work with Infrastructure as Code (IAC) tooling like Terraform and will have a understanding of open telemetry standards. Familiarity with current monitoring and logging tools like Dynatrace and Splunk is important. You will report to the Director - Cloud Operations & Service Desk
Summary of Primary Responsibilities
- Lead the design and implementation of observability solutions that provide deep insights into application performance, system health, and user experience.
- Establish and advocate for observability best practices across engineering teams.
- Work with the infrastructure teams to automate and increase infrastructure provisioning and scaling using IAC tools like Terraform.
- Ensure infrastructure code is tested, reliable, and efficient.
- Promote the use of open telemetry standards to collect, process, and export telemetry data.
- Use and integrate monitoring tools like Dynatrace and Splunk to provide analytics.
- Guide the evaluation and adoption of new tools to keep us at the forefront of observability and monitoring practices.
- Collaborate with multiple engineering teams to ensure smooth adoption and transition to new technologies.
- Analyze existing monitoring and observability practices, identifying areas for improvement or optimization.
- Foster a culture of learning and improvement within the observability team and across the organization.
- Provide guidance, and mentoring to the observability team.
- Foster a collaborative and inclusive environment that encourages innovation and growth.
Qualifications
What your background looks like
- 5+ years of devops engineering experience
- Information Technology degree and/or technology certifications preferred or substantial equivalent experience.
- Design and implement an observability system for a new microservices-based application.
- Migrate an existing monitoring system to Prometheus and Grafana.
- Develop a new alerting system to detect and respond to performance issues.
- Work with the development team to instrument their code for better observability.
- Mentor other engineers on observability best practices.
- Work with team members, customers, vendors and leadership team.
- Advanced Shell scripting and IaC automation skills with Ansible and Terraform
- Experience with open telemetry standards.
- Experience overseeing and logging tools like Dynatrace and Splunk.
- Working Knowledge of Python and any databases (SQL/NoSQL).
- SMEs in enterprise monitoring like APM, Custom attribute Implementation, synthetic monitoring, browser monitoring, and Log monitoring.
- Knowledge of requirement gathering and rollout monitoring and observability solutions. Partner with the business and development teams to identify requirements, define monitoring solutions, and implement the same.
- Experience in Application Performance Monitoring (APM) and Infrastructure Monitoring for Different Hybrid Business Applications and Infrastructure.
- Provide health and performance reports, developing AIOps rules, creating alerts, creating custom dashboards.
- Experience in create workloads and user onboarding.