Experienced Data Extraction Engineer (Reverse/Crawler Engineer)

Job description

It has been a big year here at OTA Insight as we recently had the pleasure of announcing a new series B funding of 80 million which will fuel our investment in product innovation! To accomplish our ambitions plans we are growing our innovation teams significantly where our Crawling Engineering team plays a crucial part in. Ready to have a big impact in an ambitious scale-up? Read along!


Are you passionate about how the web works and especially about reverse engineering websites to enable large scale data extractions? Great, then read on!


About the position
We are looking for an expert in web scraping techniques and web technology, with an outside the box mindset who has experience or interest in employing this on a large scale.


In this role you will:

  • Reverse engineer data sources like websites, APIs, mobile apps to help us scale our products
  • Investigate the newest browser fingerprinting techniques
  • Perform tracking behavior analysis
  • Think out of the box and come up with creative strategies to extract data
  • Write Python applications that extract data from these external data sources
  • Find ways to monitor and auto-heal the data extraction process
  • Support and rebuild integrations that fail e.g. due to a changing website


About the team

The OTA Insight Crawler team is built around data ingestion and integration with external parties. It's responsibilities lie in ensuring our products receive a continuous stream of high quality data. We do this by building the integrations with external systems, but also by implementing monitoring and QA tooling that allows us to monitor and optimize our systems at scale. The team works in close collaboration with our Product team, as well as a highly talented group of software engineers, devops engineers, and project managers to drive initiatives forward.


Today we process billions of data points and +100TB of data on a daily basis, containing hotels' pricing information, search data, hotel bookings, etc. All of that using modern technologies. Being a growth company enables us to regularly attract new and interesting datasets, which can unlock new product directions.



About OTA Insight

OTA Insight is a scale-up within the hotel industry. Founded in 2012, with a vision to provide user friendly tools to hoteliers. Today we are considered the global leader in hotel BI and are working with 50,000+ hotels worldwide in 168 countries. As there are more than 1 million hotels worldwide, we are still filled with ambition to grow further.
We generate value to our customers by visualising actionable insights out of our vast datasets. Our tools help hotels to analyse their competition’s room pricing, analyse their hotel revenue, and find out where and when guests are looking and booking. Our products have a profound impact on the day to day activities of our customers, taking away guesswork and simplifying their routines. 


About the compensation

As we are a growth company, we offer:

  • A flexible environment that enables you to grow over time and define your role in the way you enjoy it most
  • A compensation that values your work and which we will proactively keep competitive
  • A choice to go green as we take pride in respecting our environment, either with a flexible mobility budget or the best electrical car on the market
  • Ease of mind as we truly care about our team, e.g. by offering the best health & ambulant insurance on the Belgian market
  • An opportunity to make an impact on the entire hospitality industry with 100.000’s of hotels worldwide
  • A motivation to deliver your best work as we have built a high-bar and very talented team of individuals that are friendly, creative, open-minded and passionate about what they do

Requirements

  • Master degree in a STEM field or equivalent experience
  • High-energy self-starter with a passion for data, attention to detail, and a positive attitude
  • Python developer that has well-rounded experience on web technologies (in various domains: browser, server, web infrastructure, security)
  • Experience with cloud platforms (we use GCP)
  • Fluent in English
  • Open for remote work