Full-Time DevOps Engineer
Nansen.ai is hiring a remote, full-time DevOps Engineer. The career level for this opening is Experienced, and the role is open to remote applicants based in the Americas. Read the complete job description before applying.
Nansen.ai
Job Details
We're hiring a DevOps Engineer to build and maintain our systems and cloud infrastructure. You will design, automate, deploy, and maintain secure, scalable systems that power our staking operations and broader internal infrastructure. You'll be part of the Staking Team and work closely with other teams across the organization to ensure reliability, performance, and security across blockchain validators, RPC endpoints, and shared infrastructure services.
Responsibilities
- Infrastructure Management: Deploy, secure, and manage validator, sentry, and RPC nodes for various PoS blockchains, ensuring high availability and scalability in both cloud and bare-metal environments. Build and operate production infrastructure and pipelines (Kubernetes clusters, etc.).
- Automation and IaC: Implement and maintain infrastructure as code using Terraform and configuration management tools. Contribute to CI/CD for apps and infrastructure.
- Networking: Configure and manage network infrastructure, including load balancing and firewalls.
- Monitoring and Alerting: Implement and maintain comprehensive monitoring and alerting to proactively identify and address performance, health, and security issues.
- Incident Response: Participate in the on-call rotation (including weekends), troubleshoot incidents, and perform root cause analysis.
- Collaboration: Work closely with other engineering teams and maintain detailed documentation of system configurations and procedures.
Requirements
- 3+ years of experience in DevOps, SRE or a related role.
- Strong Linux system administration skills.
- Experience with containers and Kubernetes.
- Experience with Terraform or similar tools for infrastructure automation.
- Experience with monitoring and logging tools (e.g. Prometheus, Grafana, ELK stack).
- Experience managing fleets of machines with configuration management tools (e.g. Chef, Puppet, SaltStack, or Ansible).
- Strong networking knowledge (TCP/IP, HTTP/2, firewalls).
- Experience with gRPC and load balancing.
- Solid understanding of security best practices for Linux systems and networks.