System Software Engineer, Conversational AI

Company: NVIDIA
Location: India, Pune
Commitment: Full time
Posted on: 2024-01-26 05:06
NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to intelligent assistants. Come join the team and see how you can make a lasting impact on the world! We're looking to grow our company and build our teams with the smartest people in the world. Join us at the forefront of technological advancement.

NVIDIA is looking for a System Software Engineer to develop tools for building powerful, flexible, multi-modal AI agents driven by Large Language Models (LLMs) and to improve the experience of millions of customers. If you're creative and passionate about solving real-world conversational AI problems, come join us.

What you'll be doing:
- Build GPU-accelerated, scalable, LLM-driven Retrieval Augmented Generation (RAG) workflows and a scalable microservice-based architecture deployable in multi-node, multi-cloud environments.
- Build domain-specific agents and workflows, and a framework that supports multi-turn, multi-modal, multi-user conversations with LLM-driven agents.
- Develop knowledge discovery and reasoning capabilities for dialogue systems, including but not limited to disambiguation, clarification, and anticipation.
- Evaluate and benchmark end-to-end RAG and conversational AI agent pipelines for accuracy as well as system performance.
- Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.
- Characterize performance and quality metrics across platforms for various AI and system components.
- Collaborate with various teams on new product features and improvements to existing products.
- Customize and integrate the conversational AI framework with other NVIDIA products.
- Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews; help innovate, identify problems, recommend solutions, and perform triage in a collaborative team environment.

What we need to see:
- Bachelor's or Master's degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math.
- 2+ years of experience.
- Excellent programming skills in Python.
- Working knowledge of Large Language Model applications.
- Experience working across the end-to-end software lifecycle, including release packaging and CI/CD pipelines.
- Hands-on experience with conversational AI technologies such as Large Language Models, Information Retrieval, Natural Language Processing, Dialogue Systems (including system integration, state tracking, and action prediction), and Question Answering.
- General background in version control and code review tools such as Git, Gerrit, and GitLab.

Ways to stand out from the crowd:
- Strong fundamentals in programming, optimization, and software design.
- Strong knowledge of ML/DL techniques, algorithms, and tools, with exposure to Transformers (BERT, GPT, Megatron) and language models.
- Working knowledge of vector databases and embedding models.
- Familiarity with GPU-based technologies such as CUDA, cuDNN, and TensorRT.
- Background in deploying machine learning models on data center, cloud, and embedded systems.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.