Senior Principal Reliability Engineer

Company: Autodesk
Location: AMER - United States - Texas - Offsite/Home
Commitment: Full time
Posted on: 2023-05-03 17:44
Job Requisition ID #22WD65276

Position Overview

Want to help make a better world? As a Senior Site Reliability Engineer (SRE) at Autodesk, you can do just that. How is this possible? As a member of the team responsible for operating critical customer-facing services, you will have the opportunity to contribute to and drive improvements in the operation of mission-critical components that make up, and are dependencies of, hundreds of Autodesk desktop, mobile, and web applications. These services are key business enablers serving millions of customers every day. The responsibilities of this role are part of the foundation for attaining and maintaining our customers' trust to build their businesses around Autodesk’s commercial offerings.

As a Senior Site Reliability Engineer, you will serve as a primary point responsible for building the SRE practice that is focused on the overall health, availability, performance, and capacity of one or more of our production services. In addition, you will be responsible for building and operating our “Gameday” practices, which ensure our systems operate as designed and identify opportunities for continuous improvement. The role will also partner with development teams through the various stages of development to ensure systems are designed with both function and scaled operations in mind.
The ideal candidate will be passionate about operations, with an “Automation first” mindset to drive scale.

This role is eligible for remote work.

Responsibilities

- Build and maintain the site reliability practice, which includes the mentorship of SREs in the organization
- Scale and enhance our gameday practice by growing the scope and complexity of a critical program to validate and improve our reliability and customer trust
- Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our services
- Work closely with development teams to ensure that platforms are designed with "operability" in mind
- Gain deep knowledge of both our complex internally developed applications and enterprise-class services
- Operate and maintain highly available production systems
- Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale Linux and Windows environment
- Participate in a 24x7 rotation for second-tier escalations
- Continuously look for opportunities to automate changes and implement them
- Keep supported services compliant with company and regulatory requirements, including but not limited to security, privacy, SOC 2, and FedRAMP
- Implement and improve monitoring and alerting
- Build, automate, and improve observability dashboards to provide better visibility into the operational aspects of the systems
- Function well in a fast-paced, rapidly changing environment

Minimum Qualifications

- B.S. or higher in Computer Science or other technical discipline, or related practical experience
- Monitoring/logging tools, techniques, and configuration
- Knowledge of information security best practices
- Excellent written and verbal communication skills
- 7+ years of experience in the following areas:
  - Commercial cloud experience building and maintaining AWS and/or Azure offerings for large-scale enterprises
  - Compute: Cloud-based Unix and/or Windows administration
  - Storage: Cloud-based storage provisioning and administration
  - Development Languages: Working knowledge of development and scripting languages
  - Database Technologies: Cloud-based database administration

Preferred Qualifications

- 10+ years of hands-on experience with multiples of these example technologies:
  - Compute: N and N-2 cloud-based Windows and Linux operating systems
  - EC2, ElastiCache, CloudFront, Auto Scaling, containers, API gateways
  - Storage: AWS S3, EFS, EBS
  - Development Languages: Java, Python, Node.js, Perl, JavaScript
  - Database Technologies: MSSQL, MySQL, AWS Aurora, AWS DynamoDB, AWS Postgres
  - Networking: Load balancers (ALB/ELB), SSL/TLS, DNS, firewall
  - Monitoring: Splunk, Grafana, Dynatrace, Datadog, LogicMonitor
  - Scripting languages: Python, PowerShell, Bash; specifically for systems automation
- Experience in 24x7 support of highly available production systems, with experience in keeping stakeholders informed
- Keen eye to learn and improve from incidents
- Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Operators, Product Managers, etc.
- Passion to run and improve customer-facing systems with a high degree of availability (four 9’s)

Learn more about our benefits in the US: https://benefits.autodesk.com/

At Autodesk, we're building a diverse workplace and an inclusive culture to give more people the chance to imagine, design, and make a better world.
Autodesk is proud to be an equal opportunity employer and considers all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender, gender identity, national origin, disability, veteran status or any other legally protected characteristic. We also consider for employment all qualified applicants regardless of criminal histories, consistent with applicable law.

Are you an existing contractor or consultant with Autodesk? Please search for open jobs and apply internally (not on this external site). If you have any questions or require support, contact Autodesk Careers.

Salary is one part of Autodesk’s competitive package. For U.S.-based roles, we expect a starting base salary between $150,900 and $244,090. Offers are based on the candidate’s experience and geographic location, and may exceed this range. In addition to base salaries, we also have a significant emphasis on annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.