We are seeking a highly skilled Senior Omniverse Infrastructure Software Engineer with Kubernetes-based infrastructure experience to join our Omniverse Infrastructure team! The ideal candidate will have a solid understanding of designing, developing, and maintaining cloud-based back-end distributed services that power NVIDIA Omniverse™ Cloud, a Kubernetes-based platform-as-a-service (PaaS) providing developers and enterprises a full-stack cloud environment to design, develop, deploy, and manage industrial Omniverse applications and workflows.

What you will be doing:

- Extend, design, and develop the distributed services that make up Omniverse Cloud's control plane (e.g., authentication, stream API and controller services, streaming reverse proxy) on top of Kubernetes.
- Design and develop cluster-level system software solutions to map a wide range of Omniverse workloads to high-performance interactive tasks (Kit-based applications), elastic microservices, and simulation/AI tasks.
- Design and develop cluster-level system software solutions to increase the availability of NVIDIA Omniverse™ Cloud.
- Collaborate with multiple Omniverse product teams to understand customer storage and compute requirements and build supporting infrastructure.
- Work across organizational boundaries with hardware and software engineers at different levels, including Kit-based applications, web streaming, and distributed Omniverse services.
- Proactively identify and address system software challenges in compute, networking, and storage resource utilization that affect OVC's availability, fault tolerance, debuggability, operational efficiency, and sustainability.

What we need to see:

- 6+ years of hands-on software engineering experience extending cluster-level services for large-scale Kubernetes.
- 4+ years of experience building large-scale, fault-tolerant distributed services.
- Master's or PhD in Computer Science or a related field (or equivalent experience).
- Experience with cloud infrastructure platforms like Azure, AWS, or Google Cloud.
- Strong systems programming skills, including optimizations using multi-threading, asynchronous programming, concurrency and parallelism, caching, and batching.
- Proficiency in Python, C/C++, and Golang. Working knowledge of elasticity techniques within Kubernetes.
- Deep understanding of cloud technologies, distributed compute systems, and distributed systems and microservices architecture.
- Excellent interpersonal skills and ability to work successfully with multi-functional teams, principals, and architects across organizational boundaries and geographies.
- Strong knowledge of virtualization and containerization technologies like Docker, VMware, KVM, etc.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, we want to hear from you.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.