We are looking for a Senior Software Engineer to join our Data and Application Services team to improve observability capabilities. Our team builds and operates sophisticated infrastructure to enable business-critical services and AI applications. You will work with a team of passionate and skilled engineers who are continuously working to provide better tools to build and manage this infrastructure. This role is responsible for all things related to observability, from tooling definition to implementation, and for making sure the strategy aligns with the organization's observability and data needs. The ideal candidate is strong in software development, in designing and creating reliable distributed systems, and has the ability to implement a well-thought-out cloud strategy.

What You'll be Doing:
- Design highly available and scalable systems to meet our observability requirements
- Evaluate new and innovative technologies as the landscape evolves
- Continuously improve infrastructure provisioning and management using automation
- Support a globally distributed, multi-cloud hybrid environment: AWS, GCP, and on-prem
- Build strong cross-functional relationships and align with partners across various business units
- Ensure the highest level of uptime and Quality of Service (QoS) for our users through operational excellence
- Participate in the team's on-call rotation and be a contact for service incidents

What We Need to See:
- 8+ years of experience in design, implementation, and delivery of large engineering projects
- Comfortable with at least two of the following programming languages: Golang, Java, C/C++, Scala, Python, Elixir
- Understands scalability challenges and performance of server-side code. Able to craft and develop horizontally scalable, resilient systems that perform under load.
- Versatile technologist with experience in the full software development lifecycle, from inception and design to deployment, operation, and iterative development
- Proficient in cloud technologies and hands-on with at least one cloud platform: GCP, AWS, or Azure
- Proficient in modern CI/CD techniques, GitOps, and Infrastructure as Code (IaC)
- Strong work ethic and a passion for problem solving
- B.S. degree in Computer Science or a related technical field, or equivalent experience
- Detail-oriented with great communication and collaboration skills

Ways to stand out from the crowd:
- Prior experience building solutions for HPC clusters based on Slurm or Kubernetes
- Strong understanding of the Linux operating system and TCP/IP fundamentals

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most brilliant and talented people in the world working for us and, due to unprecedented growth, our world-class engineering teams are growing fast. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

The base salary range is $176,000 - $333,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.