Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote)

Company: CrowdStrike
Company: CrowdStrike
Location: India - Remote, MH
Commitment: Full time
Posted on: 2025-07-02 05:40
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.About the Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur.What You’ll Need5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX))Strong proficiency in scripting languages (Python, Bash, PowerShell) for automationExperience with log management platforms (ELK stack, Splunk, LogScale)Working knowledge of cloud services monitoring (AWS CloudWatch, GCP)Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoringKnowledge of SRE principles, SLOs, error budgets, and incident managementExperience with automated alerting, remediation workflows, and CI/CD pipeline monitoringFamiliarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes)Strong incident triage, root cause analysis, and documentation skillsExperience participating in on-call rotations and emergency responseWhat You'll DoMonitoring and ReliabilityDesign and maintain comprehensive monitoring solutions across infrastructure and applicationsConfigure appropriate alerting thresholds to ensure timely response to potential issuesDefine and track SLOs and error budgets for critical servicesCreate and maintain dashboards providing real-time visibility into system healthConduct regular reviews of system reliability and recommend improvementsIncident Management and OperationsParticipate in on-call rotation to respond to alerts and incidentsLead incident response efforts and conduct thorough post-incident reviewsDocument incidents, resolutions, and lessons learnedDevelop and refine incident response procedures to improve MTTRImplement proactive monitoring to detect potential issues before they impact usersAutomation and CollaborationDevelop scripts and automation to streamline monitoring tasks and reduce manual effortCreate self-healing systems that can automatically remediate common issuesIntegrate monitoring tools with other operational systemsWork closely with development, infrastructure, and security teamsProvide guidance on monitoring best practices and observabilityMaintain comprehensive documentation for monitoring systems and proceduresContinuous ImprovementStay current with industry trends in monitoring and site reliability engineeringAnalyze monitoring data to identify patterns and improvement opportunitiesImplement metrics to track the effectiveness of monitoring processesContribute to the evolution of the organization's monitoring strategyPreferred QualificationsSRE, cloud platform, or monitoring tool certificationsITIL Foundation certificationBachelor's degree in Computer Science, Information Technology, or related fieldShift timings - 12PM -9PM IST#LI-DP1#LI-VJ1#LI-RemoteBenefits of Working at CrowdStrike: Remote-friendly and flexible work cultureMarket leader in compensation and equity awardsComprehensive physical and mental wellness programsCompetitive vacation and holidays for rechargePaid parental and adoption leavesProfessional development opportunities for all employees regardless of level or roleEmployee Networks, geographic neighborhood groups, and volunteer opportunities to build connectionsVibrant office culture with world class amenitiesGreat Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
View Original Job Posting