Data Analyst (Remote, ROU)

Company: CrowdStrike
Company: CrowdStrike
Location: Romania - Remote
Commitment: Full time
Posted on: 2024-06-09 05:02
​​#WeAreCrowdStrike and our mission is to stop breaches. As a global leader in cybersecurity, our team changed the game. Since our inception, our market leading cloud-native platform has offered unparalleled protection against the most sophisticated cyberattacks. We’re looking for people with limitless passion, a relentless focus on innovation and a fanatical commitment to the customer to join us in shaping the future of cybersecurity. Consistently recognized as a top workplace, CrowdStrike is committed to cultivating an inclusive, remote-first culture that offers people the autonomy and flexibility to balance the needs of work and life while taking their career to the next level. Interested in working for a company that sets the standard and leads with integrity? Join us on a mission that matters - one team, one fight.About the Role:CrowdStrike is looking for a Data Analyst to join our growing Generative AI Research Center. This is a junior/entry-level position with quick advancement opportunities. As Data Analyst you will focus on data and corpus labeling, as well as other data-related tasks critical to supporting our large language models (LLMs) and cybersecurity initiatives. This role is crucial in enhancing our products capabilities by ensuring the accuracy and quality of the data used to train models and detect threats, thereby supporting the overall mission of the Generative AI Research Center. CrowdStrike is a cybersecurity company, but we do not require candidates for this role to have prior security industry experience. We will mentor and train in security topics as needed. We do expect a strong interest in CrowdStrike's mission and a willingness to engage with the needs of our product teams and customers. If you are a hands-on engineer who loves technical challenges and wants to operate at scale, apply & let's talk!  Interviewing process: online and onsite where applicable What You'll Do:            Label and annotate cybersecurity-related datasets to prepare them for analysis and machine learning tasksEnsure labeling accuracy and consistency across different datasets, including threat intelligence data, incident reports, network logs, etc.Gather data from various cybersecurity sources, including threat intelligence feeds, logs, and internal reportsClean and preprocess data to make it suitable for analysis and modelingPerform exploratory data analysis to uncover patterns, trends, and insights related to cybersecurity threats and vulnerabilitiesUtilize statistical methods and tools to interpret data and identify potential security issuesCreate and maintain dashboards and reports to communicate findings to cybersecurity stakeholdersDevelop visualizations to present data in a clear and concise manner, highlighting key security metrics and trendsWork closely with analysts, data scientists, engineers, and other team members to support their data needsSupport the implementation and optimization of MLOps pipelines, leveraging data insights to deploy, monitor and scale machine learning models for different solutionsParticipate in team meetings and contribute to project planning and discussions, providing data-driven insightsDocument processes, methodologies, and insights gained from data analysis and labeling activitiesMaintain clear records of data sources, cleaning steps, and labeling criteria to ensure reproducibility and auditability              What You'll Need:Bachelor's degree in Computer Science or related STEM field      Proficiency in data manipulation and analysis tools (e.g., Python, SQL)Familiarity with relevant libraries and frameworks (e.g., TensorFlow, PyTorch)Experience with data labeling and annotation toolsStrong analytical and problem-solving skills, with an understanding of cybersecurity conceptsExcellent communication and collaboration abilitiesAttention to detail and a commitment to data accuracy        Tech Stack (not mandatory to know everything; a robust learning capacity is essential):Python SQLData Labeling and Annotation Tools like Labelbox, Prodigy, etc.Data Analysis and Visualization like Pandas, NumPy, Matplotlib, Seaborn, etc.DockerKubernetesAWS KafkaGIT    Bonus Points:Existing exposure to Go, AWS, Cassandra, Kafka, ElasticsearchExperience with Language Models, Data Science, Data Engineering Experience with data labeling and annotation tools, particularly in a cybersecurity context   #LI-JP2#LI-EV1#LI-GT1#LI-RemoteBenefits of Working at CrowdStrike:Remote-first cultureMarket leader in compensation and equity awards with option to participate in ESPP in eligible countriesCompetitive vacation and flexible working arrangementsPhysical and mental wellness programs Paid parental leave, including adoption A variety of professional development and mentorship opportunitiesAccess to CrowdStrike University, LinkedIn Learning and JhannaOffices with stocked kitchens when you need to fuel innovation and collaborationBirthday time-off in your local countryWork with people who are passionate in our mission and Great Place to Work certified across the globeWe are committed to fostering a culture of belonging where everyone feels seen, heard, valued for who they are and empowered to succeed. Our approach to cultivating a diverse, equitable, and inclusive culture is rooted in listening, learning and collective action. By embracing the diversity of our people, we achieve our best work and fuel innovation - generating the best possible outcomes for our customers and the communities they serve.CrowdStrike is committed to maintaining an environment of Equal Opportunity and Affirmative Action. If you need reasonable accommodation to access the information provided on this website, please contact Recruiting@crowdstrike.com​, for further assistance.
View Original Job Posting