Full-Time Senior Staff Engineer Infrastructure
Pryon is hiring a remote, full-time Senior Staff Engineer, Infrastructure. The career level for this opening is Expert, and applicants based anywhere in the world are accepted. Read the complete job description before applying.
Pryon
Job Details
About Pryon: We're a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. Now we're building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform. Our proprietary, cutting-edge natural language processing capabilities transform unstructured data into meaningful experiences that increase productivity with unmatched accuracy and speed.
We are growing and adding a DevOps/Infrastructure Engineer to our team, focused on platform architecture, CI/CD, and observability infrastructure.
In this role, you will own the technical architecture and implementation of our cloud-native, highly scalable RAG applications. You will design and manage the infrastructure, deployment pipelines, and operational procedures for delivering enterprise-grade AI/ML products. We're looking for someone who will drive DevOps best practices and work with engineering teams to implement them effectively. You will own the platform's reliability, scalability, and operational excellence across multiple cloud environments and on-premises deployments.
In This Role, You Will:
- Design and implement cloud-native architectures for AI/ML applications using Kubernetes (GKE, EKS, AKS)
- Architect and maintain CI/CD pipelines using modern GitOps practices with tools like FluxCD and Bitbucket
- Design and implement observability solutions using Prometheus, Grafana, and other monitoring tools
- Implement operational best practices for scaling RDBMS, OpenSearch, and MinIO
- Create and maintain Infrastructure as Code (IaC) using Terraform
- Implement container orchestration strategies using Docker, Kubernetes, and Helm
- Design and implement multi-cloud deployment strategies
- Establish SLOs/SLIs and implement SRE best practices
- Automate operational tasks and create self-healing systems
- Mentor team members on DevOps best practices
- Collaborate with ML engineers to optimize model deployment and serving infrastructure
- Stay current with emerging technologies and best practices in the DevOps/MLOps space
What You'll Need to Be Successful:
- 7+ years of experience in DevOps/Platform Engineering
- Deep expertise in Kubernetes, Helm, and container orchestration
- Strong experience with a major cloud provider (GCP, AWS, or Azure)
- Experience managing data stores like Yugabyte, OpenSearch, and MinIO
- Experience with CI/CD tools and GitOps practices
- Proficiency in Go, Python, or similar programming languages
- Experience with observability tools (Prometheus, Grafana, etc.)
- Knowledge of security best practices and compliance requirements
- Experience with Infrastructure as Code and configuration management
- BS degree in Computer Science or related field
- Excellent communication and collaboration skills
- Strong problem-solving abilities and systematic thinking
- Experience working in an Agile environment
Salary: $180,000 - $215,000 a year