Deep Learning Engineer - LLMs and Diffusion Models

Company: NVIDIA
Company: NVIDIA
Location: Switzerland, Remote
Commitment: Full time
Posted on: 2024-03-22 05:55
NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars, robotics, and voice-powered intelligent assistants. Academic and commercial groups around the world are using NVIDIA GPUs to revolutionize deep learning and data science, and to power data centers.We are looking for a Deep Learning Engineer with strong research skills to join the DL Algorithms team that is developing new efficient DL architectures. In this role you will interact with the scientific community creating algorithms to accelerate the training and inference of Large Language Models and Diffusion Models.What you’ll be doing:Invent and develop model optimization algorithms based on Neural Architecture Search, Pruning, Knowledge Distillation, Quantization, Conditional Computation, Model fine-tuning, etc.Work in a dynamic, applied team of researchers and engineersWork on large-scale multi-node ML modelsPublish research papers and implement the results in Nvidia productsCollaborate with academiaWhat we need to see:M.Sc. plus 4 years of commercial experience in Computer Science, Artificial Intelligence, Applied Math, or related fieldMachine learning fundamentals (linear algebra, probability theory, optimization, supervised/unsupervised/self-supervised ML, etc.)Hands-on experience with designing Deep Learning models (Transformers, Diffusion Models, Convolutional Neural Networks etc.)Programming skills (Python, C/C++), algorithms & data structures, debugging, performance analysis, and design skills.Strong experience with deep learning frameworks such as PyTorch or TensorFlowAbility to work independently and handle your own work effortGood communication and documentation habitsWays to stand out from the crowd:Ph.D. degree or equivalent experience in Computer Science, Artificial Intelligence, Applied Math, or related fieldStrong track of publications on Deep Learning in leading international conferences/journalsExperience with ML model optimization techniques such as Neural Architecture Search, Pruning, Distillation, Quantization, Conditional Computation, etc.Knowledge of CPU and/or GPU architectures in the context of ML algorithmsNVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
View Original Job Posting