Datacenter Modeling Tools and Analytics Architect

Company: NVIDIA
Location: US, CA, Santa Clara
Commitment: Full time
Posted on: 2024-03-22 05:55
Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars.

The GPU Architecture team is looking for world-class computer architects to design and develop tools and models for the next generation of GPU-accelerated AI servers and datacenters.

What you'll be doing:
Develop comprehensive models covering cost, power, and reliability attributes of the NVIDIA datacenter products.
Calibrate predictive models against empirical data and measurements.
Collaborate with a dedicated and skilled team of architects, silicon, system, software, and quality engineers to model power, cooling, performance, reliability, and TCO at datacenter scale.
Create digital twins for datacenter designs using Omniverse.
Drive continuous model improvement through calibration of models using targeted testing, thermal and electrical characterization, performance profiling, telemetry collection, and detailed analysis.
Provide concise reporting of results to drive architecture, design, and operational actions across the company.

What we need to see:
6+ years of experience developing software and databases covering modeling and predictive behavior of complex systems.
Degree in Computer Engineering, Computer Science, Electrical Engineering or related fields, or equivalent experience.
Hands-on development and coding using Python, Splunk and SPL, and integration of data sources and dashboards.
Strong expertise in data analysis and visualization.
Familiarity with accelerated compute hardware platforms.
Experience building automation into existing dashboards and other data sources.
Excellent interpersonal skills with success leading projects across multi-discipline teams.

Ways to stand out from the crowd:
Datacenter TCO modeling experience including both Capex and Opex components.
Understanding of large-scale accelerated computing architectures for AI training and inference workloads, including networking, compute, and datacenter layout.
Experience with AI, NLP, and ML tools and frameworks.
A love of technology and passion for your work.

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.