Full-Time Senior Cloud Operations Engineer

Linux Foundation is hiring a remote Full-Time Senior Cloud Operations Engineer. The career level for this job opening is Senior Manager and is accepting San Francisco, CA based applicants remotely. Read complete job description before applying.

This job was posted 9 months ago and is likely no longer active. We encourage you to explore more recent opportunities on our site. However, you may still try your luck using 'Apply Now' link below. We recommend focusing on newer listings available here.

Linux Foundation

Job Title

Senior Cloud Operations Engineer

Posted

Career Level

Full-Time

Career Level

Senior Manager

Locations Accepted

San Francisco, CA

Salary

YEAR $125000 - $165000

Job Details

The Senior Cloud Operations Engineer will play a crucial role in managing and optimizing our multi-cloud infrastructure and DevOps practices. This position is essential for maintaining and scaling our cloud operations across multiple cloud provider platforms and accelerator technologies.

Responsibilities:

  1. Cloud Infrastructure Management
    • Design and manage multi-cloud environments across AWS, GCP, and Azure
    • Optimize instance selection and utilization across various compute types including AMD and Intel CPU-based instances
    • Configure and manage GPU-accelerated instances (AMD and NVIDIA) and specialized accelerators (TPUs, NPUs)
    • Implement and maintain infrastructure-as-code using Terraform and other IaC tools
    • Optimize cloud resource utilization and implement FinOps practices for cost management
    • Design and implement high-availability solutions across multiple cloud providers
  2. CI/CD and DevOps
    • Design, implement, and maintain CI/CD pipelines using GitHub Actions
    • Configure and manage both github-hosted and self-hosted runners
    • Implement and maintain non-blocking and out-of-tree CI jobs
    • Design and implement matrix testing strategies across different hardware configurations
    • Develop and maintain automated testing frameworks for various testing types (unit, integration, performance)
    • Implement best practices for version control management and branching strategies
    • Experience with agile methodologies and scrum practices
  3. Performance Optimization and Testing
    • Develop and implement performance testing frameworks for various hardware accelerators
    • Optimize workload distribution across different types of compute instances
    • Implement automated performance regression testing
    • Design and maintain benchmarking systems for various hardware configurations
  4. Infrastructure Security and Monitoring
    • Implement security best practices across multi-cloud environments
    • Develop comprehensive monitoring solutions using cloud-native tools
    • Participate in on-call rotations supporting operations and incident response
    • Establish and maintain escalation procedures and resolution processes
    • Manage access control and security policies across cloud platforms

Required:

  • Bachelor's degree in Computer Science, Engineering, or related field
  • 7+ years of experience in cloud operations with extensive multi-cloud expertise (AWS, GCP, Azure)
  • Demonstrated experience with GPU computing (AMD and NVIDIA) and specialized accelerators (TPUs, NPUs)
  • Strong knowledge of CPU architectures and instance type optimization (AMD, Intel)
  • Advanced experience with GitHub Actions, including custom runner configuration and management
  • Expertise in implementing non-blocking and out-of-tree CI jobs
  • Strong background in version control systems and branching strategies
  • Experience with agile methodologies and scrum practices
  • Proficiency in infrastructure-as-code tools, particularly Terraform
  • Strong scripting abilities (Python, Bash, PowerShell, Typescript)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Demonstrated experience in implementing automated testing frameworks

Preferred:

  • Experience optimizing workloads across different hardware accelerators
  • Background in performance testing and optimization
  • Contributions to open-source projects
  • Experience mentoring other engineers
  • Background in machine learning infrastructure
  • Experience with Datadog is a plus

FAQs

What is the last date for applying to the job?

The deadline to apply for Full-Time Senior Cloud Operations Engineer at Linux Foundation is 12th of March 2025 . We consider jobs older than one month to have expired.

Which countries are accepted for this remote job?

This job accepts [ San Francisco, CA ] applicants. .

Related Jobs You May Like

Azure DevOps Engineer

Jersey City, NJ
2 days ago
.NET
Azure
DevOps
Derex Technologies Inc
Full-Time
Experienced

Lead Palantir Developer

Seattle, WA
2 days ago
CI/CD Pipelines
Data Engineering
Palantir Foundry
Logic20/20 Inc.
Full-Time
Experienced
YEAR $156750 - $173329

Cloud AppOps Engineer

Atlanta, GA
3 days ago
Application Support
AWS
Cloud Services (EC2, S3, IAM, ELB, VPC, VPN)
Sutherland
Full-Time
Experienced

Staff DataOps Engineer

Remote, India
3 days ago
AWS
CI/CD
DataOps
Nagarro
Full-Time
Experienced

Query Tuning Specialist - Database Performance - Postgre

Austin, Texas
3 days ago
Database Management
Performance Tuning
Problem-solving
ServiceNow
Full-Time
Experienced

DevOps Engineer, Playout

New York, New York
3 days ago
CICD
Cloud Services (AWS, GCP, Azure)
DevOps
NBCUniversal
Full-Time
Experienced
YEAR $90000 - $110000

Query Tuning Specialist - Database Performance - Postgres

Austin, Texas
3 days ago
Database Management
Performance Tuning
SaaS/PaaS/Cloud Development
ServiceNow
Full-Time
Experienced

Lead Palantir Developer

Seattle, WA
4 days ago
CI/CD Pipelines
Cloud ETL
Palantir Foundry
Logic20/20 Inc.
Full-Time
Experienced
YEAR $156750 - $173329

Cloud AppOps Engineer

Atlanta, GA
4 days ago
Application Support
AWS
Cloud Security
Sutherland
Full-Time
Experienced

Site Reliability Engineer

Stamford, Connecticut
4 days ago
Cloud Platforms (AWS, GCP, Azure)
Configuration Management
Monitoring And Alerting Tools
NBCUniversal
Full-Time
Experienced
YEAR $110000 - $145000

Senior Cloud Platform Engineer (Networking)

Berlin, Germany
5 days ago
AWS
Go
Networking
Scalable GmbH
Full-Time
Experienced

DevOps Engineer

Texas
5 days ago
AWS
GitLab
Kubernetes
InfStones
Full-Time
Experienced

Looking for a specific job?