Search JobsJob DescriptionYour next adventure at VMware is only a click away!At VMware, we are committed to helping our people grow professionally. Our talented employees exemplify our shared values and continue to drive our company to new heights. If you see a position that might be right for you, we encourage you to apply and continue to be a part of our EPIC2 community.Job DescriptionThe Elevator Pitch: Why will you enjoy this new opportunity?We are looking for someone who likes solving problems and learning new things. You’ll work closely with other SREs and software engineers on building and operating a secure, reliable, and scalable SaaS application that provides real-time data collection and visualization for our customers.VMware Aria Operations for Applications provides unified observability across metrics, traces, and logs for greater insights and unmatched scalability. The Site Reliability Engineering team is tasked with delivering the platform for a SaaS product that is used at scale 24/7 by development and site-reliability teams at leading enterprises such as Snowflake, Intuit, Box and many more! You will:Have cloud platform, security, linux systems and automation experience, and knowledge of running workloads at scale. Support thousands of cloud instances in multiple regions at scale and share your learnings and best practices with others. Be experienced in, and enjoy working remotely within a fully remote and distributed team.What is the primary need, technical challenge, and/or problem you will be responsible for?We need someone who is passionate about security, automation, infrastructure as a code, and configuration as a code who can develop and deploy software that will help drive improvements towards the availability, management, and visibility of our platform. In this role, you will take part in the SRE on-call and drive improvements to continuously increase the signal-to-noise ratio. You will contribute to the security and development of tools for metrics gathering, introspection, monitoring, automated remediation, and orchestration. Success in the Role: What are the performance goals over the first 6-12 months you will work toward completing?Understand the architecture and the product as a user.Participate in an on-call rotation for the services owned by the team, effectively triaging and resolving incidents ensuring minimal downtime.Work cross-functionally with different teams to develop, deliver and improve automation projects towards effective delivery of team objectives.Contribute to team goals: projects, internal improvements, and ad-hoc work.“Infrastructure as code” is critical for our success. Terraform and Ansible are the two tools which we use extensively. Candidate is expected to understand and contribute towards furthering automation in this area. Python and Java are used by our team for some product scripting/functional scenarios. Knowledge of any of these will be added plus.Have a keen eye to learn, understand and contribute towards the reliability of Aria Operations for Applications service.Be part of discussions on automation and effectively suggest improvements.What type of work will you be doing? What assignments, requirements, or skills will you be performing on a regular basis?Passionate about learning new technologies and adopting the right tools to manage these services in production, keeping SLAs and MTTR in mind at all times.Understand the Aria Apps architecture, discover failure points, and work with other teams to design tools/alerts to prevent issues in the future.Develop and maintain documentation, runbooks, and playbooks to help streamline incident response and ensure consistency.Deploy and maintain production services using container technology such as Kubernetes or EKS or ECS. Along with patching/upgrading our fleet.Drive security, reliability and feature improvements within the product by providing feedback to the product management team, influenced by a commitment to act as customer zero.Be a self-starter with a high attention to detail. Have good communication and team presentation skills. Be able to collaborate remotely via Slack/Zoom etc. We know from experience that not ticking every box on the skills sections stops many from applying. Please apply regardless of your self-assessment -- we want to hear from you! We have seen engineers succeed with a diverse range of skills and experiences. What is leadership like for this role? What is the structure and culture of the team like?The hiring manager for this role is Julie Ann Davis, Manager of Site Reliability Engineering for Aria Apps SaaS SRE group, a critical component of the Reliability Engineering program for Aria products. Julie Ann joined VMware to add her SRE/DevSecOps industry expertise to the team. Prior to this role, Julie Ann has over 20 years of working with engineering teams across a broad range of industries and technologies.VMware Aria Operations for Applications continues to have the original startup DNA with the stable backing of VMware, providing a cozy feeling of a startup while having the values and benefits of a large company. The SRE team consists of engineers with diverse cultural backgrounds and different technical expertise. We play on our strengths while finding the opportunities to develop knowledge in new technical areas. VMware Aria Operations for Applications keeps us in the learning mode: we like to learn from each other and give each other a hand when help is needed. We are passionate about delivering products with high quality and strive for work life balance. What are the benefits and perks of working at VMware?We are proud to offer you and your family a comprehensive program of benefits that are among the best in the industry. This site provides detailed information, contacts and resources to ensure you make the most out of your VMware benefits. Below are some highlights, or you can view the complete benefits package by visiting benefits.vmware.com.Medical Coverage, Retirement, and Parental Leave Plans for All Family TypesGenerous Time Off Programs40 hours of paid time to volunteer in your communityFinancial contributions to your ongoing development (conference participation, training, course work, etc.)#TeamAriaAppsVMware is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind: VMware is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at VMware are based on business needs, job requirements and individual qualifications, without regard to race, color, religion or belief, national, social or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV Status, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate. VMware will not tolerate discrimination or harassment based on any of these characteristics. VMware encourages applicants of all ages. VMware will provide reasonable accommodation to employees who have protected disabilities consistent with local law. Search Jobs
View Original Job Posting