NVIDIA is looking for a Senior Performance Engineer in High-Performance Computing and Deep Learning. You will be part of global Performance Lab team and will work to improve our team’s ability to expertly and accurately test a growing number HPC&DL applications. With your help, our team will reduce request completion time, perform more challenging programming tasks, increase testing coverage of applications and hardware platforms, and better serve our customers through new and improved testing processes. The data that we collect drives marketing/sales collaterals as well as engineering studies for current and future products. We accomplish this by writing scripts that improve the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently.What you’ll be doing:Lead the expansion of inference testing for state-of-the-art models such as Stable Diffusion and GPTAdd multi-GPU inferencing capabilities to our baselineBuild automation to support LLM performance testing on single and multi-node configurationsBuild automation to support DL performance testing across more models for CSP such as AWS, GCP, Azure, OCI, DGX Cloud.Support testing on new GPUs such as H100 NVLWhat we need to see:Minimum of a BS in Computer Science, Electrical Engineering, or the equivalent and 3+ years of experience in software developmentSolid background in software design and programming techniquesStrong programming and debugging skills in a scripting language such as PythonSolid skill set on UNIX/LINUX system, including hands-on experience in system configuration and troubleshootingGood data analysis skills and the ability to summarize findings in a written reportStrong collaborative and interpersonal skills to effectively guide and influence within a multifaceted, technical environment.Ways to stand out from the crowd:Background with GPU/CPU benchmarkingProven experience with the design and implementation of sophisticated systemsHands on experience with Cloud Service Provider such as AWS, GCP, Azure, Alibaba cloudExperience with DL and Machine LearningNVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law
View Original Job Posting