Senior Developer - Data Center Server Management

Company: NVIDIA
Company: NVIDIA
Location: Poland, Remote
Commitment: Full time
Posted on: 2025-05-26 00:12
NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world.NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to design of this massive superchip. We are looking for skilled software engineers to help implement firmware and software components for next generation AI supercomputing platforms.We are looking for a strong senior developer to implement manageability components for these products in data centers. You will collaborate with various teams, understand customer requirements, and develop robust solutions to drive our products to market.What you'll be doing:Develop and optimize server management software for GPU and Grace solutions in large clustersImplement firmware and software components based on performance requirements and architecture specificationsCollaborate with data center architects to understand requirements and ensure timely implementationWork with cross-functional teams to align implementation with design requirementsOptimize firmware components for reliability in data center environmentsSupport cluster validation and resolve technical issues efficientlyContribute to quality, reliability and telemetry performance of firmware delivered to data centersWhat we need to see:5+ years of relevant experience working on server firmware (BMC) and platform software development with BS, MS, or PhD in EE/CS or related fieldExperience with data center health management implementationTrack record of delivering server firmware componentsKnowledge of server architecture and manageability in data centersUnderstanding of hardware management interfaces (USB, SMBus/I2C, PCIe) and familiarity with modern management protocols including Redfish, MCTP, and PLDMStrong proficiency in C/C++ and PythonStrong programming and debugging skills for server platformsExperience with SCM (e.g. Git, Perforce) and project management tools like JiraExcellent written and oral communication skills, good work ethics, team-oriented mentality, and dedication to quality workSelf-starter who can solve sophisticated technical problems with effective coding solutionsWays to stand out from the crowd:Familiarity with x86 or ARM system architectureExperience collaborating effectively within large engineering teamsBackground with performance optimization in firmware componentsExperience with RTOS and bare metal programmingLinux kernel and user space development experienceNVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.
View Original Job Posting