Please Note:
1. If you are a first-time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)
2. If you already have a Candidate Account, please sign in before you apply.

Job Description:
We are seeking an experienced DevOps Engineer to join our Machine Learning team, responsible for ensuring the smooth operation and scalability of our machine learning platforms and applications. The ideal candidate will have a strong background in DevOps and Site Reliability, some familiarity with machine learning platforms, and experience in deploying and managing complex data pipelines and workflows. The successful candidate will work closely with data scientists, software engineers, and other stakeholders to design, implement, and maintain our machine learning infrastructure, ensuring high availability, scalability, and reliability.

Responsibilities:
Design, implement, and maintain the infrastructure for our machine learning models and applications, including data pipelines, workflows, and data storage solutions.
Collaborate with data scientists and software engineers to develop and deploy machine learning models, ensuring seamless integration with our infrastructure and data pipelines.
Ensure the scalability, reliability, and performance of our machine learning applications, using tools such as Kubernetes, Docker, and cloud providers like GCP.
Develop and maintain automated testing and deployment scripts, using tools like Jenkins, GitLab CI/CD, or CircleCI.
Monitor and troubleshoot issues with our machine learning infrastructure, using tools like Prometheus, Grafana, and ELK Stack.
Participate in code reviews and contribute to the development of new features and tools for our machine learning infrastructure.

Requirements:
Bachelor's degree in Computer Science, Computer Engineering, or a related field.
10+ years of experience in DevOps, cloud computing, and machine learning.
Strong understanding of cloud providers like AWS, GCP, or Azure.
Experience with containerization using Docker and orchestration using Kubernetes.
Familiarity with machine learning frameworks like TensorFlow, PyTorch, or Scikit-Learn.
Experience with data pipelines, data warehousing, and data lakes.
Strong scripting skills in languages like Python, Bash, or PowerShell.
Experience with CI/CD tools like Jenkins, GitLab CI/CD, or CircleCI.
Strong problem-solving skills and attention to detail.
Excellent communication and collaboration skills.

Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, gender identity, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside the USA, please be sure to fill out a home address, as this will be used for future correspondence.