We are looking for a data acquisition expert with strong skills and knowledge of web scraping, web services, file transfers, and relational and non-relational databases to join our Provider Relations for Data Acquisition team in the position of Web Scraping / Database Administrator. The team has a special focus on novel approaches to data acquisition and analytical functions that emphasize cross-team collaboration and communication, as well as domain expertise. If this sounds like an opportunity you are interested in, then we would love to talk to you!

About You: experience, education, skills, and accomplishments
- Bachelor’s degree in Computer Science or equivalent experience
- 2+ years of experience with the Puppeteer library and/or JavaScript
- 2+ years of experience with relational and non-relational databases
- 2+ years of hands-on industry experience working with large data sets

It would be great if you also had...
- Experience with document processing (OCR, XML parsing, etc.)
- Experience with Python, Java, or any other object-oriented programming language, as well as with Node.js, Selenium, Crawlee, Axios, and/or Playwright
- Excellent knowledge of HTTP protocols, CAPTCHA-solving techniques, and proxy and Tor networks, as well as outstanding knowledge of HTML, CSS, XML, JSON, CSV, and other textual formats; knowledge of AWS or GCP
- Knowledge of intellectual property data sources
- Proven record of communicating and working effectively with multi-disciplinary teams to reach resolution, share knowledge, and maintain strong relationships
- Clear understanding of information retrieval

What will you be doing in this role?
- Design and develop a variety of tools and infrastructure to automate the extraction of publicly available and proprietary information (writing web scrapers, calling third-party APIs, creating SQL queries, etc.)
- Create tools and processes to download data, parse it for relevant content, and store it in existing data management systems
- Gather and process raw data at scale (including writing scripts, web scraping, calling APIs, writing SQL queries, etc.)
- Create documentation and draft requirements for production implementation teams
- Contribute to problem solving, collaboration, process improvement, and information sharing

About the Team
The team is multi-disciplinary, consisting of administrative staff as well as analysts and data acquisition experts. The team is responsible for maintaining the entire vendor network administration and data acquisition to support the Provider Relations team for data acquisition in delivering timely, complete, and accurate data for different Clarivate IPG products as well as for other teams within Clarivate.

Hours of Work
40 hours per week, full-time employment, hybrid (3 days in the office every other week).

We Offer:
- Private health insurance
- Paid lunch
- Yearly bonus
- Yearly merit plan
- My Learning platform
- Family benefits: Bushido kids sports school, tutorship lessons
- FitPass
- Mental health care: psychotherapy sessions
- Company bicycles for rent free of charge
- 25 days of annual leave

Only shortlisted candidates will be contacted.

At Clarivate, we are committed to providing equal employment opportunities for all persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.