NVIDIA’s Networking Performance team develops advanced software tools that drive the optimization of the world’s fastest networking technologies. Our mission spans benchmarking, telemetry capture, and performance analysis, enabling both NVIDIA's R&D teams and NVIDIA's customers across the industry to root-cause bottlenecks, maximize throughput, and achieve world-class performance for AI and HPC workloads.We are looking for a Senior Software Engineer to lead the design and development of next-generation performance engineering frameworks. In this role, you will focus on Python-based benchmarking and analysis systems, while also contributing to high-performance components in C/C++. You’ll work on challenges at the intersection of networking, distributed computing, and AI, building tools that run at scale on clusters, clouds, and data centers.What you'll be doing:Design and implement performance benchmarking frameworks for next-generation AI and HPC workloadsTake full technical ownership of our core telemetry engineWork across Python (primary) and C/C++ (for performance-critical modules) to deliver reliable and scalable toolsCollaborate with experts in networking, AI, and systems to translate performance engineering needs into powerful software solutionsEnhance our DevOps, owning the CI/CD pipelines and release processes for your projectsDrive technical innovation in the performance engineering ecosystem, including taking part in building our next-gen agentic AI assistant What we need to see:B.Sc. in Computer Science, or a related engineering field5+ years of professional software development experienceA proven track record of technical ownership, making key architectural decisions, driving a technical agenda, and problem solvingExpert-level Python development skills, building robust, well-structured, production-grade applicationsC/C++ experience, especially for performance-critical or low-level componentsExperience with modern CI/CD pipelines and DevOps practicesWays to stand out from the crowd:Linux systems knowledge, including software packaging (RPM, DEB), and an understanding of the complexities of software distribution and dependenciesExperience with the Python data analysis and visualization frameworks (e.g., h5py, pandas, NumPy, Matplotlib/Plotly)Experience with Slurm, Kubernetes, MPI, or other distributed job orchestration and cluster management systemsFamiliarity with agentic AI concepts or frameworks (e.g., RAG techniques, LangChain, LangGraph, LlamaIndex, etc.)Experience contributing to open-source projectsNVIDIA is dedicated to diversity, supporting an inclusive environment and ensuring equal opportunities. Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting