We are seeking Research Engineers to join our strategic AI team, to inform direction with NVIDIA’s AI engineering teams and to collaborate with the world’s leading AI companies innovating on the next generation of AI. If you are a research or software engineer who enjoys working at the forefront of generative AI, is passionate about fast-evolving large language models, enjoys working with teams, and is driven to unite the latest AI research and future hardware designs in a cohesive, full-stack software strategy, we should talk!

Our AI Strategy team is responsible for informing the roadmap of our future stack. We contribute to all steps of the machine learning lifecycle, from conceptualization to applied research and engineering for optimized inference, training and deployment. As a research engineer on the team, you will interact with both internal teams and key strategic partners to surface, define and help implement the strategy of our products.

What you will be doing:
Working closely with our most strategic partners and researchers to surface areas that stretch our hardware and software in unique ways. This may include quick prototyping and collaborating with engineers and researchers to architect new insights.
Working with engineering, research and product teams across all of NVIDIA to ensure flawless transition of concepts to the NVIDIA stack.
Conceptualizing solutions across multiple facets: end-to-end software stacks, data center scaling, networking optimizations, different AI model architectures, and deployment scenarios.

What we need to see:
Doctoral degree in Computer Science, Computer Engineering, or a related field (or equivalent experience).
10+ years of proven experience with deep learning frameworks and NVIDIA GPUs.
Excellent C/C++ and Python programming and software design experience, including debugging, performance analysis and optimization.
Understanding of the latest techniques in deep neural networks, large language models, and scaling.
Great foundation in CPU and/or GPU architecture.
Knowledge of high-performance computing and distributed programming.
Experience with the following technologies is a huge plus: traditional DL frameworks (PyTorch, JAX, etc.), XLA, TVM, MLIR, LLVM, OpenAI Triton, deep learning models and algorithms, and LLM designs.
Strong communication and interpersonal skills, along with the ability to work in a dynamic and highly distributed team.
Comfortable working across highly matrixed organizations and navigating cross-functional relationships, conflicting priorities and challenges.
Ability to think beyond what's possible right now.

Ways to stand out from the crowd:
Experience architecting or developing large-scale distributed deep learning systems.
Background in training 100B+ parameter GPT models from scratch.
Experience with training workloads at the scale of 10,000 to 100,000 GPUs.
Recognized AI applied researcher.
Advanced GPU optimization.
Developed a novel AI model architecture.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

The base salary range is $216,000 - $414,000. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.