NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables unique creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence. You will design and build the next-generation architecture and software for managing storage services to support our engineers, architects with NVIDIA GPU design, VLSI, Corporate, and AI/ML teams. You will build the self-service architecture and capabilities to enable NVIDIA GPU design teams and handle services that are very critical for all NVIDIA users that require 24/7/365 availability and support. Your duties will involve leading and supporting the design of storage platform software services that not only impact revenue but also form the backbone of NVIDIA engineering infrastructure services and various business applications.What you’ll be doing: Define, Build, and manage an all-encompassing Enterprise software engineering platform to manage storage infrastructure and services consisting of a variety of enterprise appliances, networks, and open-source technologiesDesign and expand REST API’s that thousands of engineers will rely on for on-demand storage management to manage their workflows.Build and Integrate provisioning, metrics, monitoring, and software to enable management workflows for storage services.Develop tooling to automate deployment and management of large-scale design-storage environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.Document the general procedures and practices, perform technology evaluations, and coordinate and track system orders, installations and deployment.What we need to see: BS in Computer Science (or equivalent experience) with 8+ years of relevant experience, MS with 5+ years of experience or Ph.D. with 3 years of experienceExtensive experience building and owning large-scale, multi-threaded, distributed backend systemsExperience designing and building REST APIs in Python or GoBackground with containerization and orchestration tools (e.g., Docker, Kubernetes)Experience with cloud infrastructure - AWS, Azure or Google Cloud.Background with telemetry stacks, like Grafana, Prometheus monitoring, AlertManager, and KibanaExperience with Strong collaborative and social skills, specifically a shown ability to effectively guide and influence within a dynamic matrix environmentWays to stand out from the Crowd: Demonstrated work with Open-Source software: building, debugging, patching, and contributing code.Experience with solving Linux storage-related problemsExperience with design, deployment, and management of Enterprise NAS solutions like NetApp, Pure Storage, Distributed filesystems like Lustre, and S3 storage.Background with HPC cluster management tools such as Slurm, PBS, LSF, etc.NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and enjoy having fun, then we want to hear from you!The base salary range is 160,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting