Principal Cloud Architect - GPU Cloud

Company: NVIDIA
Location: US, CA, Santa Clara
Commitment: Full time
Posted on: 2023-12-06 05:07
NVIDIA is looking for a Principal Cloud Architect to design and develop Artificial Intelligence data center infrastructure architecture. We are looking for an architect who has a deep understanding of high performance computing and AI across software and hardware, outstanding design skills, and a track record of building and delivering large-scale data center infrastructure.

What you'll be doing:
- Design and architect data center and modular infrastructure targeted towards HPC and Deep Learning applications, from rack and stack to application bring-up
- Build the distributed computing infrastructure for large-scale distributed model training
- Plan and coordinate across multi-functional teams, partners, and vendors for execution of infrastructure build-outs
- Work with engineering teams across all of NVIDIA to ensure their requirements are correctly translated into infrastructure needs

What we need to see:
- BS/MS degree in Computer Science or related areas (or equivalent experience)
- Solid technical foundation in distributed computing and storage, including substantial experience with all of the following: server systems, storage, I/O, networking, and system software
- 12+ years of system software engineering experience on large-scale production systems
- 12+ years of architecting high performance computing infrastructure at scale
- Proven experience in high performance computing, Deep Learning, and/or GPU accelerated computing domains
- Expert-level knowledge of high-speed interconnects such as RoCE and InfiniBand
- Ability to clearly and concisely communicate complex designs and requirements to peers, customers, and vendors
- General web networking knowledge (DNS, TCP/IP, HTTP, load balancing, firewalls)
- Understanding of performance, security, and reliability in complex distributed infrastructure
- Familiarity with system-level architecture, such as interconnects, memory hierarchy, interrupts, and memory-mapped I/O
- Excellent data analysis skills and demonstrated ability to solve complex issues involving multiple software or hardware components

Ways to stand out from the crowd:
- Large-scale distributed system, HPC, and DL infrastructure experience
- Deep knowledge of both software and hardware in the data center

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people in the world working for us. If you're creative and passionate about developing cloud services, we want to hear from you!

The base salary range is 268,000 USD - 414,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.