Senior Site Reliability Engineer (SRE), Multi-cloud Platform

Company: Workday
Location: USA, CA, Pleasanton
Commitment: Full Time
Posted on: 2023-05-03 16:58
Your work days are brighter here.

At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a healthy employee-centric, collaborative culture is the essential mix of ingredients for success in business. That’s why we look after our people, communities and the planet while still being profitable. Feel encouraged to shine, however that manifests: you don’t need to hide who you are. You can feel the energy and the passion; it's what makes us unique. Inspired to make a brighter work day for all and transform with us to the next stage of our growth journey? Bring your brightest version of you and have a brighter work day here.

About the Team

Are you a Senior Site Reliability Engineer who loves the challenge of automating, operating and improving pioneering cloud native service platforms? Do you love digging into a production problem and seeing it through to resolution and follow through?

We’re the team that deploys, operates and supports our cloud native technology platform that was designed from scratch for the cloud. We lead the reliability for the complete stack and tools that deliver and support Workday products across public clouds (e.g. AWS, GCP, Azure).

The platform is built using Cloud Native technologies (CNCF), on a foundation of Kubernetes in Public Cloud environments.
This provides a secure platform on which Workday service teams and Platform development teams can build and test their pre-release code, through deployment to production on a continuous basis.

Engineers from this team have shared their experiences at Cloud Native conferences, including KubeCon.

About the Role

The primary function of the SRE team is to ensure the reliability and availability of the platform to meet the desired SLAs, reduce operational load and scale sustainably in alignment with business growth.

- Be a key member of a team of dedicated SREs responsible for software engineering and operations, with an emphasis on reducing operational toil.
- Automation and improvement work is planned following scrum practices with two-week sprints.
- The scrum team is autonomous; the on-call function is follow-the-sun.
- Tech stack is Cloud Native (Kubernetes, Istio, OPA, GoLang, Prometheus, Grafana, etc.).
- Responsible for the safe change and reliability of customer environments, with SLO-gated multi-stage deployment automation. The mission is to improve platform reliability, observability and overall customer satisfaction.
- Develop and launch effective SLIs to ensure that SLOs are achieved through building an extendable Observability architecture, runbook automation, and establishing new processes.
- Partner with platform service teams to craft and implement a range of SRE standards for their respective services to meet. Define benchmarks and automation to qualify services to move to production environments.

About You

You have a passion for identifying and solving problems on distributed environments scaling across configuration, Linux Operating System and network. You have hands-on experience handling distributed environments (Kubernetes experience is a big plus). You have a keen interest in improving operational efficiency, and believe that automation is the key to operating large-scale systems.
You are driven to ensure customer success.

Basic Qualifications:

For Sr SRE:
- BS in Computer Science or related field or equivalent years of experience
- 5+ years in handling and solving distributed systems in a public cloud

For SRE:
- BS in Computer Science or related field or equivalent years of experience
- 3+ years in handling and solving distributed systems in a public cloud

For Sr Associate SRE:
- BS in Computer Science or related field or equivalent years of experience
- 1+ years in handling and solving distributed systems in a public cloud

Other Basic Qualifications:
- 1-3 years of SRE experience in a distributed systems environment
- Experience with AWS, GCP, or Azure
- Experience with Kubernetes
- Experience with Linux
- Proficiency with a programming language such as GoLang, Python, or Ruby (GoLang preferred)
- Experience with software development standard methodologies such as code management, CI/CD, testing

Other Qualifications:
- Passionate automator, with a track record of referenceable examples
- Can work independently and with the demeanor that everything can be automated
- Skills and enthusiasm to operate, maintain, support and sustain the platform
- Excited by working in a fast-paced environment
- Experience collaborating with multi-functional global and remote teams with a diverse set of backgrounds
- Excellent documentation skills, experience with developing detailed runbooks and processes

As a federal contractor, Workday is requiring all new hires to verify that they are fully-vaccinated against COVID-19 within 72 hours of beginning employment with Workday, consistent with applicable law. Workday is an equal opportunity employer.
Candidates who are not vaccinated due to a sincerely held religious belief, medical reasons, or other legally-protected reason should contact accommodations@workday.com to explore what, if any, reasonable accommodations or exemptions Workday is able to offer.

Workday Pay Transparency Statement

The base pay range for the primary location of this job is listed below. Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, future potential and internal pay parity.

Primary Location: USA.CA.Pleasanton
Base Pay Min to Max Range: $160,000 - $240,000

Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.

Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.

Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process!