Software Engineer, Site Reliability

Company: Redpanda Data
Company: Redpanda Data
Location: US / CANADA - REMOTE
Posted on: 2023-09-08 04:18
We are a team of doers, seasoned engineers, hackers and builders, working on the future of streaming data. Funded by premier investors including GV and Lightspeed, Redpanda is building the streaming data platform for developers. We’re evolving streaming beyond the Apache Kafka® protocol into a unified “engine of record” that delivers a categorical reduction in complexity, wicked-fast performance, onboard Wasm transforms, and transparent tiered storage that gives consumers access to both real-time and historical data from a single API.   About the Role:   We are continuing to invest and grow our Site Reliability Engineering team at Redpanda. We are looking for an experienced Site Reliability Engineer (SRE) who is up for the challenge to not only build the systems to lift Redpanda into the cloud, but also shape the technical culture that creates these systems.  Scaling our products requires a significant engineering investment into building the operational aspects of our infrastructure into our delivery systems and we’re looking for someone who will be an invaluable and impactful member of our team.   There are exciting challenges to solve and opportunities to learn from the very best. Come join us to partner closely with product, customer success, and cross-functional engineering teams to build and design together, the very best present and future of real-time data. You Will:  Be a part of our SRE team, working with all of engineering on building new services, automating infrastructure lifecycle on Kubernetes and monitoring our services with the goal of offering a reliable, scalable and high-performance SaaS Build systems and services to turn toil into automation Design and implement observability-as-code Build tools & services to allow automated infrastructure management and self-healing Participate in on-call rotations, working to keep customer workloads running and incident free You Have:  5+ years of experience in an SRE-like role Comfortable working with a 100% distributed engineering team, collaborating on GitHub, in the open Strong understanding of Go Experience with the ecosystem of both commercial and open source observability Strong experience with AWS and GCP Experience managing Kubernetes Experience running highly-scalable production workloads on Kubernetes Experience managing infrastructure predictably through GitOps and IaC Willingness to participate in an on-call rotation Excellent written communication skills B.S. in Computer Science or equivalent experience Nice to have: Understanding of SLIs and SLOs  Experience operating a SaaS platform Experience working with Azure or OpenShift Fluency in any of the other languages we use within our ecosystem (TypeScript, C++, Python) Operated and used streaming platforms either as a user or provider   U.S. base salary range for this role is $150,000 - $185,000 (CO, TX) and $180,000 - $210,000 (CA, NY). Our salary ranges are determined by role, level, and location. As a remote-first company, we strive to consider each candidate's job-related skills, location, experience, relevant education or training to determine individual base salary. Your talent partner will share more about the specific salary range for your preferred location during the hiring process. Redpanda is used by Fortune 1000 enterprises pushing hundreds of terabytes a day, as well as by the solo dev prototyping a React application on her laptop. Think of it as a streaming data API platform that scales with you from the smallest projects to petabytes of data distributed across the globe. Join Redpanda if you’d enjoy being part of a fast-moving, 100% remote organization with team members around the globe and a culture based on trust, transparency, communication, and kindness.  #LI-Remote
View Original Job Posting