Full-Time Senior System Software Engineer Cloud Networking
2100 NVIDIA USA is hiring a remote Full-Time Senior System Software Engineer Cloud Networking. The career level for this job opening is Expert and is accepting CA, Santa Clara based applicants remotely. Read complete job description before applying.
2100 NVIDIA USA
Job Title
Posted
Career Level
Career Level
Locations Accepted
Salary
Share
Job Details
We are looking for a Senior System Software Engineer, Cloud Networking to design, prototype, implement and operate programmable software-defined networking solutions for NVIDIA Virtualized Clouds hosting GPU-accelerated applications.
What you'll be doing:
- Develop and maintain CICD environments for Cloud Software Defined Network (SDN) projects.
- Design, enhance and support existing GitLab CICD pipelines across the entire stack - Linux host networking, OVS, OVN and Kubernetes CNIs
- Write and maintain automated tests (unit, integration, system)
- Implement GitOps practices within the team
- Collaborate with multiple teams (SRE, DevOPs, network engineering)
What we need to see:
- BA/BS degree in Computer Science, or in related technical field or equivalent experience
- At least 12 years of experience in designing and supporting complex, multi-stage CICD pipelines across multiple codebases
- Advanced knowledge of networking (datacenter routing and switching, Linux host and VM)
- Advanced scripting, "Infrastructure as a Code" build and deployment tools (Bash, Python, Make, Ansible, Terraform, ArgoCD, Flux)
- Good command in programming languages (C and Go) to support test coverage for OVS, OVN and OVN-Kubernetes CNI code bases
- Deep knowledge of Kubernetes environment, practical experience with deploying and supporting CNIs
Ways to stand out from the crowd:
- MS or PhD in Computer Science or an equivalent technical field
- Expert level knowledge of OVN, OVS, OpenFlow
- Experience with Rust programming language, Nix, Z, Basel
- Practical experience with major cloud providers (AWS, Azure, GCP) and hybrid/multi-cloud deployments
- Contributions to open-source projects as well as experience supporting large scale AI/HPC compute with hardware acceleration (GPU, DPU) and strong communication and collaboration skills