Deep Learning Performance Architect

Company: NVIDIA
Company: NVIDIA
Location: China, Shanghai
Commitment: Full time
Posted on: 2023-06-08 06:21
NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI ecosystem analysis and perf projection efforts. In this position, you will have a chance to analyze state-of-the-art AI compilers and SW stacks and their performance on various hardware architectures. You will make your contributions to our dynamic technology focused company. What you'll be doing:Analyze state-of-the-art AI compilers and SW stacks on various hardwareIdentify architecture and software performance bottlenecks and propose optimizationsExplore new features and hardware capabilities of current AI compiler and SW ecosystemsWhat we need to see:MS or PhD in relevant discipline (CS, EE, Math, etc.,)3 years work experienceBackground with popular AI compilers (e.g., OpenAI Triton, MLIR, TVM, XLA)Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow)Experience on deep learning models and operatorsKnowledge and experience on hardware architectures for deep learning applications#deeplearning
View Original Job Posting