Senior Compute Infrastructure Architect

Company: NVIDIA
Company: NVIDIA
Location: US, CA, Santa Clara
Commitment: Full time
Posted on: 2023-11-01 05:29
Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars.What you will be doing:As the lead infrastructure architect for NVIDIA’s Advanced Technology Group (ATG), you will manage and drive all aspects of the ATG compute infrastructure.Work across several engineering teams to understand their infrastructure needs and make sure the storage, compute systems, and networking are deployed to optimally address these needs.Interface with counterparts from IT, Farm Team, and other infrastructure resources, to both align ATG’s infrastructure solutions with corporate standards, and to champion ATG’s unique requirements.Monitor and maintain the health of the large-scale dedicated hardware resources that ATG relies on to perform all of its required functions.Manage hardware and software procurements, service contract extensions, and other capital asset administration.Document current system setups, understand outage tolerances, and analyze where current infrastructure is insufficient and needs to be upgraded or expanded.Work with ATG flow experts to analyze tool requirements and their impact on the hardware and OS needs, and develop optimal configurations to maximize impact.Provide expert application support and help debug failures caused by inadequate hardware and/or missing OS software libraries.Track the latest technology trends and help ATG leaders understand the available upgrade paths to best take advantage of them, in order to develop a roadmap for infrastructure enhancement and replacement.What we need to see:8+ years of experience working in Information Technology and doing enterprise-level hardware support.A broad understanding of data centers and the interactions of compute systems, storage, networking, facilities, and the software layers involved, as well as a deep understanding of at least 2 of these components.Solid grasp of server management standards and tools such as IPMI, Puppet, oVirt, Netbox, Zabbix, etc.Hands on experience in debugging hard problems that span hardware, software, and networking.Excellent interpersonal skills with proven ability to coordinate cross-functional teamsDegree in Computer Engineering, Computer Science, Electrical Engineering or related fields or equivalent experience.Ways to stand out from the crowd:Participation in system management related industry standards bodies like DMTFExperience deploying and overseeing large-scale storage solutions such as GPFS, NAS, CIFS, or SMB.Familiarity with GPU compute and Mellanox networking.Experience with datacenter and cloud server deployment operationsShow that you have a love of technology and are passionate about your workOur technology has no boundaries! NVIDIA is building groundbreaking state of the art compute platforms for the world to use. It’s because of our work that scientists, researchers and engineers can advance their ideas. At its core, our visual computing technology not only enables an amazing computing experience, it is energy efficient! We pioneered a supercharged form of computing loved by the most fast paced computer users in the world - scientists, designers, artists, and gamers.The base salary range is 132,000 USD - 218,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting