NVIDIA is looking for a Senior Cloud Infrastructure Development Engineer to design, develop, and improve a robust and efficient private cloud infrastructure used for batch job execution and bazel remote execution, remote caching, services for its Software groups. As a team we work with various groups within NVIDIA such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Autonomous Vehicles to cater to their infrastructure needs. These cloud services run on thousands of servers and execute half a million automated jobs per day helping with the productivity of thousands of NVIDIA's software engineers worldwide. As part of these services we host a heterogeneous mix of machines with various operating systems (Windows/Linux/Android), and a multitude of hardware platforms (x86/ARM) having both NVIDIA GPUs and Tegra Processors. Are you passionate about infrastructure and looking for complex and meaningful problems, ready to build the next generation of cloud services, design innovative solutions, mine through data to uncover real problems and fix them? We are excited to have a fun-loving person like you!What you'll be doing:Design and implement a scalable, low latency, high throughput and highly reliable remote execution & remote caching services for Bazel Remote execution.Working on challenging problems in the area of infrastructure such as multi-cluster, multi data center services supporting low latency and high throughput for data access.Working on supporting job execution on a heterogeneous mix of machines in Kubernetes cluster having both NVIDIA GPUs (vGPUs) and Tegra processors.Chasing system resiliency across databases, storage, network and web servers to achieve high availability goals.Implement security best practices for the remote execution cluster, ensuring the integrity and confidentiality of data.You want to dig more data, analyze much more, apply deep learning algorithms / machine learning to improve the performance/predictability of the system.What we need to see:.Strong object-oriented programming background, Java, Golang strongly preferredExperience of developing large scale cloud infrastructure applicationsBackground with Relational Databases such as MySQL and NoSQL DBs such as ElasticSearch, MongoDBBackground with Containers (Docker, Kubernetes), Kubevirt, Web Services (SOAP/REST) and Scalable Storage(HDFS/Ceph, Artifactory, Object storage)Experience working messaging technologies such as KafkaExcellent problem-solving and troubleshooting skills.Ability to collaborate across multiple teams and across people working in different time zones.BS/MS in Computer Science or Computer Engineering or equivalent experience10+ years of industry experience.Ways to stand out from the crowd:Worked on computer algorithms and demonstrated ability to choose the best possible algorithms to nail complex problemsBackground in design, implementation and deployment of major infrastructure features across multiple clusters in incremental rollout modeExperience with bazel, bazel remote execution services and familiarity with cloud computing platforms (e.g., AWS, GCP, Azure).Knowledge of build toolchains and dependency management.Previous contributions to open-source projects related to build systems or distributed computing.Experience with Machine Learning and Data Analytics and application of them in InfrastructureNVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens new universes to explore, enables outstanding creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence. Widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and passionate about new technologies we want you on our team!The base salary range is 176,000 USD - 333,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
View Original Job Posting