Who are we?Equinix is the world’s digital infrastructure company®, operating over 260 data centers across the globe. Digital leaders harness Equinix's trusted platform to bring together and interconnect foundational infrastructure at software speed. Equinix enables organizations to access all the right places, partners and possibilities to scale with agility, speed the launch of digital services, deliver world-class experiences and multiply their value, while supporting their sustainability goals. Our culture is based on collaboration and the growth and development of our teams. We hire hardworking people who thrive on solving challenging problems and give them opportunities to hone new skills and try new approaches, as we grow our product portfolio with new software and network architecture solutions. We embrace diversity in thought and contribution and are committed to providing an equitable work environment that is foundational to our core values as a company and is vital to our success.Job SummaryWe are seeking a seasoned RabbitMQ and Kafka Platform Engineer with 7+ years of experience, including a strong track record of leading end-to-end design and implementation of enterprise-grade messaging solutions. This role requires deep expertise in RabbitMQ (self-hosted), Kafka, and related technologies, along with a solid understanding of distributed systems and messaging frameworks. The ideal candidate will have demonstrated leadership in guiding engineering teams, setting technical direction, and driving the development of high-performance, scalable, and reliable communication platforms.ResponsibilitiesDesign and ArchitectureArchitect and implement RabbitMQ solutions tailored to the organization's needsDevelop and maintain efficient message-routing strategies, including exchanges, queues, and bindingsOptimize RabbitMQ clusters for scalability, fault tolerance, and performanceDeployment and ManagementDeploy RabbitMQ clusters in on-premise, cloud, or hybrid environmentsManaging users and permissions, ensuring that only authorized users have access to specific queues and exchanges, and implementing security best practicesMonitor and troubleshoot RabbitMQ nodes to ensure system health and uptimeImplementing backup strategies for RabbitMQ data and ensuring that there are recovery plans in place in case of failuresKeeping RabbitMQ and its dependencies up to date with the latest versions and security patchesPerformance OptimizationAnalyze and resolve performance bottlenecks in message processingOptimize resource utilization by fine-tuning RabbitMQ configurationsImplement caching, clustering, or sharding strategies as neededSecurity and ComplianceImplement robust authentication and authorization mechanisms (e.g., TLS, OAuth)Ensure RabbitMQ clusters comply with organizational security policies and best practicesPerform regular security audits and vulnerability assessmentsCollaboration and SupportWork with developers to define messaging patterns and best practicesProvide support and training to teams on RabbitMQ usageCollaborate with DevOps teams to integrate RabbitMQ into CI/CD pipelinesMonitoring and Alerting:Set up monitoring tools (e.g., Prometheus, Grafana) to track RabbitMQ metricsImplement alerting mechanisms to proactively address potential issuesResponsibilitiesKafka Infrastructure Design and ManagementArchitect, deploy, and maintain Kafka clusters in on-premise and cloud environmentsEnsure high availability, scalability, and fault tolerance of the Kafka platformImplement and manage Kafka components, including brokers, Zookeeper, Kafka Connect, Kafka Streams, and Schema RegistryManaging user access and permissions, ensuring that only authorized users and applications can produce or consume messages from specific topicsConfiguring data retention policies for topics, managing log segments, and ensuring that old data is cleaned up according to organizational policiesConfigure and manage Kafka topics, partitions, and replicationDevelop and implement backup and disaster recovery plan for Kafka clusterMonitoring and Performance OptimizationContinuously analyze system performance, troubleshoot issues, and optimize resource utilizationAddress bottlenecks in message throughput and processingSecurity and ComplianceConfigure and manage Kafka security features, including encryption, authentication (SASL), and ACLsEnsure the Kafka environment complies with organizational and regulatory standardsCollaboration and SupportCollaborate with development, data engineering, and DevOps teams to implement event-driven solutionsProvide technical guidance and troubleshooting expertise for Kafka-related issuesMaintain clear documentation for Kafka infrastructure and processesLeadershipLead cross-functional teams in the architecture, deployment, and optimization of messaging solutions, ensuring alignment with enterprise-scale system requirements and business objectivesDrive technical strategy and decision-making for messaging platforms, mentoring junior engineers and fostering best practices across development, deployment, and operationsQualificationsPlatform Management experience in any Messaging Product like Solace etc.Good scripting skills ( Python, etc.)Working knowledge of Ansible & AutomationKnowledge of any Cloud is plusstrong verbal and written communication skillsEquinix is committed to ensuring that our employment process is open to all individuals, including those with a disability. If you are a qualified candidate and need assistance or an accommodation, please let us know by completing this form. Equinix is an Equal Employment Opportunity and, in the U.S., an Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to unlawful consideration of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy / childbirth or related medical conditions, sexual orientation, gender identity or expression, marital or domestic partnership status, age, veteran or military status, physical or mental disability, medical condition, genetic information, political / organizational affiliation, status as a victim or family member of a victim of crime or abuse, or any other status protected by applicable law.
View Original Job Posting