Data Center Reliability Engineer

Company: NVIDIA
Company: NVIDIA
Location: US, CA, Santa Clara
Commitment: Full time
Posted on: 2023-10-28 18:35
We are now looking for a Reliability Engineer. Your role identifies risks to the uptime of our assets and partners with multiple teams to improve efficiency and up-time. You should be passionate about technology, a team-player and eager to help NVIDIA succeed in a variety of new and exciting markets.What you’ll be doing:Develop standards and programs in support of reliability programDefine and maintain a health score of environments for reliability and availabilityLead root cause analysis for outages and adjust documentation, workflows, and operating procedures to avoid future incidentsDesign testing methods to predict and isolate points of failureDefine and categorize spaces within an understandable reliability scaleStudy failure data and work with machine learning and AI teams to predict future failuresFacilitate reliability studies such as critical assessments, RAM models, and RCM studiesAssess and advise on maintenance strategiesWhat we need to see:Bachelor’s degree in related field or equivalent experience8+ years of operations experience or environmental health and safety within data centersProficient in developing and driving reliability activities (modeling predictions, life cycle testing, stress testing, etc..)Commercial and financial awareness, with a full comprehension on the impact of failure in translation to business costs, production targets and fulfillment of customer ordersHighly developed numeracy, statistical and reporting, ability to analyze, interpret and apply information, data and trends.Result oriented and organized, able to plan and deliver against expectations.Proficient in the use of asset database and DCIM solutions to extract data and develop meaningful insightsWays to stand out from the crowd:Proven experience in reliability engineering related to electrical or mechanical cooling systemsCertifications such as CMRP, CRL, CRE in Maintenance and ReliabilityKnowledge of relevant ISO standardsDemonstrated expertise in statistics, forecasting, and management information methods and techniques.Strong IT systems knowledge and skills including advanced Excel/G-Suite skills and the ability to learn new software packages.With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and talented people in the world working for us and, due to unparalleled growth, outstanding teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!The base salary range is 140,000 USD - 224,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting