TL;DR
Senior Data Science Engineer (GCP, Python): Building and optimizing high-scale, secure, and automated data processing pipelines on GCP with an emphasis on cloud compute efficiency and data governance. Focus on solving complex data science challenges, system re-architecture, and fast feature delivery with robust engineering.
Location: Hybrid in Edinburgh or London, UK
Company
Blis is an award-winning global leader in big data analytics and advertising, uniting telco data, real-world movement patterns, and transactions to deliver a complete view of the consumer.
What you will do
- Design, build, monitor, and support large-scale data processing pipelines.
- Support, mentor, and pair with other team members to advance capabilities.
- Help Blis explore and exploit new data streams to innovate and support commercial and technical growth.
- Work closely with Product and deliver against fast-paced decisions to delight customers.
Requirements
- 5+ years of direct experience delivering robust, performant data pipelines.
- Proven experience in architecting, developing, and maintaining Apache Druid and Imply platforms, with a focus on DevOps practices and large-scale system re-architecture.
- Mastery of building pipelines in GCP, making full use of native and native-supporting technologies such as Apache Airflow.
- Mastery of Python for data and computational tasks with fluency in data cleansing, validation, and composition techniques.
- Hands-on implementation experience and architectural familiarity with all forms of data sourcing (streaming, relational and non-relational databases, distributed processing such as Spark).
- Excellent working understanding of server-side Linux.
Nice to have
- Experience optimizing both code and config in Spark, Hive, or similar tools.
- Practical experience working with relational databases, including advanced operations such as partitioning and indexing.
- Knowledge and experience with tools like AWS Athena or Google BigQuery to solve data-centric problems.
- Understanding of complex algorithms and statistical techniques, and the ability to innovate, apply, and optimize them on large data structures.
Culture & Benefits
- Work on fantastically high-scale systems processing over 350 GB of data an hour and responding to 400,000 decision requests each second.
- Tackle challenges across major data science disciplines including classification, clustering, optimization, and data mining.
- Join a growing team with big responsibilities and exciting challenges, aiming for the next 10x level of scale and intelligence.
- We are adherents of Lean Development and work in environments with significant freedom and ambitious goals.
- Company values are Brave, Love our clients, Inclusive, and Solutions driven.
- Global company founded in the UK in 2004, with over 300 employees across 14 offices in 11 countries.
