Director of EMS (Element Mgmt Systems)

Company: Calix
Location: Remote - USA
Commitment: Full time
Posted on: 2023-12-05 05:00

Calix provides the cloud, software platforms, systems and services required for communications service providers to simplify their businesses, excite their subscribers and grow their value.

Our Director of Network Element Management Systems will provide leadership and play a key role in building our EMS systems. They will pioneer new strategies for enabling our customers to monitor their systems and help scale their business. They will also perform a highly visible consulting role in helping our support and customer success teams roll out our EMS systems to our customers successfully.

Roles and Responsibilities:
- Hire and retain top talent for software engineering.
- Design the core of the EMS to handle multiple OLT devices, considering the scalability, future-proofing, and manageability of large-scale deployments to manage growing networks.
- Integrate third-party applications or systems when necessary.
- Work with QA teams to ensure the robustness of the EMS software by reviewing test plan design and test cases for the EMS software.
- Ensure the EMS is compatible with different versions of network elements.
- Help build monitoring in EMS logs to handle faults or issues.
- Implement tools and strategies for monitoring the performance of managed network elements.
- Implement software enhancements and modifications as per user feedback and requirements.
- Document software design, user manuals, and troubleshooting guides.
- Provide technical support and guidance to other teams when needed and train network operations teams on the effective usage of the EMS software.
- Collaborate with network teams to understand their requirements and tailor the EMS software accordingly.
- Suggest and implement improvements to the EMS software based on new technologies, industry trends and best practices or methodologies.
- Implement multiple layers of security for on-premises solutions for enhanced data protection, reduced risk, and controlled access.

Qualifications:
- 5+ years experience in microservices architecture using service orchestration / containerization tools like Docker, Kubernetes, Apache Mesos.
- 5+ years experience implementing modules that specifically handle optical network protocols, like GPON, EPON, or XGS-PON, for seamless management of diverse OLT systems and protocol-specific optimizations.
- 5+ years experience defining real-time monitoring and analytics capabilities for rapid fault detection, performance tuning, and predictive maintenance.
- 8+ years experience decomposing network functions like alarm management, configuration, and performance monitoring into independent microservices to provide modular updates, scalability, and independent scaling of specific functions.
- 5+ years using Git-based repositories and modern software development using Gitflow.
- 3+ years experience building asynchronous modules using Apache Kafka, RabbitMQ, ActiveMQ for system decoupling and enhanced performance.
- 3+ years experience with centralized systems to handle logs, alarms, and events, like ELK Stack (Elasticsearch, Logstash, Kibana) and Grafana, that will provide a holistic view of the network, efficient troubleshooting, and trend analysis.
- 5+ years experience with secure channels like VPN and SSH tunneling for configuration, troubleshooting, and updates.
- 5+ years experience with SQL databases (e.g., PostgreSQL) for structured data and time-series databases (e.g., Mimir) for performance metrics that will enable fast data retrieval, efficient storage, and historical analysis.
- 5+ years experience designing with open APIs to allow integration with other network management tools, customer management systems, or OSS/BSS systems.
- 3+ years experience using modular web interface frameworks like Angular, React, Vue.js.
- 3+ years experience with CI / CD for seamless delivery and Test Left mindset.
- Knowledge of Apache, Nginx, or similar servers.
- Strong analytical skills and ability to problem solve on the go.
- Very clear and concise communication skills and ability to interact with various layers of executive leadership.

Preferred Experience:
- 3+ years experience building Intrusion Detection Systems (IDS), firewalls and LDAP/Active Directory for authentication and secure communication with elements.
- 2+ years experience creating data replication modules using Rsync, Bacula or ZFS snapshots.
- 3+ years experience integrating with cloud services for specific functionalities like analytics, backups, or AI-driven insights.
- 3+ years experience with technologies such as HAProxy, Keepalived, Pacemaker to ensure minimal downtime, even in case of node failures, and to provide continuous monitoring, rapid failover, and resilience.
- Familiarity with Jenkins, Travis CI, or similar tools.

Location:
This is a remote-based position that can be located anywhere in the United States or Canada.

Compensation will vary based on geographical location (see below) within the United States. Individual pay is determined by the candidate's location of residence and multiple factors, including job-related skills, experience, and education.

For more information on our benefits click here.

There are different ranges applied to specific locations. The average base pay range (or OTE range for sales) in the U.S. for the position is listed below.

San Francisco Bay Area Only: 188,500.00 - 349,900.00 USD Annual
All Other Locations: 163,900.00 - 304,300.00 USD Annual