Senior Infrastructure Software Engineer, Deep Learning Libraries

Company: NVIDIA
Company: NVIDIA
Location: China, Shanghai
Commitment: Full time
Posted on: 2025-06-04 05:32
We are now looking for a Senior Infrastructure Software Engineer for Deep Learning Libraries!NVIDIA's Deep Learning Libraries Group is seeking excellent software engineers to enable the next wave of NVIDIA’s highest performing deep learning libraries. The role spans multiple products, including cuDNN, TensorRT, and CUDA kernel libraries. The mission is to design and develop scalable, modular infrastructure that streamlines development, builds, and tests across NVIDIA’s diverse set of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models. Join our technically diverse team of software engineers and infrastructure experts to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning platforms.What you'll be doing:Driving the infrastructure support for the open-source-first strategy of software development and deploymentBuilding scalable automation for build, test, integration, and release processes for publicly distributed deep learning librariesDeveloping throughout the software stack, from the user experience and user interfaces down to the cluster and database layersConfiguring, maintaining, and building upon deployments of industry-standard tools (e.g. Kubernetes, Jenkins, Docker, CMake, Gitlab, Jira, etc.)Advancing the state of the art in those industry-standard toolsWhat we need to see:A Masters Degree in Computer Science or Computer Engineering or equivalent experience.3+ years of relevant experienceStrong programming skills in Python (or similar) and familiarity with C/C++ developmentExperience setting up, maintaining, and automating continuous integration systems (e.g. Jenkins, GitHub Actions, GitLab pipelines, Azure DevOps)Fluency in SCM (e.g. Git, Perforce) and build systems (e.g. Make, CMake, Bazel)Ways to stand out from the crowd:Experience with infrastructure support for Github open-source projectsTrack record of active involvement in open-source communityExperience designing and developing automation in Jenkins with Groovy (or similar)Background with distributed systems and cluster/cloud computing, especially with KubernetesExperience with mobile/embedded platforms and multiple operating systems (Ubuntu, RedHat, Windows, QNX, or similar)This is an opportunity to have a wide impact at NVIDIA by improving development velocity across our many AI/DL/Compute Software projects. Are you creative, driven, and autonomous? Do you love a challenge? If so, we want to hear from you!
View Original Job Posting