NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI, you will be responsible for the design, development, and maintenance of the infrastructure around NVIDIA's internal large language model sought at facilitating chip design.What you'll be doing:Develop and maintain the infrastructure for managing large language models (LLMs) based application specifically adapted for the chip design and hardware domain.Build and maintain LLM based applications to serve hardware engineers, such as LLM based QA bot, code generator etc.Collaborate with HW chip designers and Large Language Models research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well-suited to these needs.Collaborate with LLM research teams to collect & coordinate training / fine-tuning data to train hardware specific language modelOptimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.What we need to see:Pursuing or have recently graduated with a Bachelor's or higher degree in Computer Science, Computer Engineering or related major.Experience in developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.Strong proficiency in Python and web development, and familiarity with LLM related techniques e.g., langchain, vector database, timely engineering, etc.Understanding of chip design and related computational and data challenges.Experience with data management, including doc cleaning, transformation, and secure storage.Excellent problem-solving skills.In depth understanding of Machine Learning / Deep Learning / NLP concepts.Ways to stand out from the crowd:Experience crafting and developing production quality microservicesStrong technical background in cloud/distributed infrastructureAn excellent plus if you are familiar with front-end development using React or Vue.jsStrong understanding of SQL & NoSQL Data platforms.NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us. If you're creative and autonomous, we want to hear from you!The base salary range is 100,000 USD - 224,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting