Data Engineer
SmarketsWho are we?
Smarkets: Predicting the Future of Betting
Smarkets runs one of the world’s most sophisticated prediction markets, handling over £29 billion in volume since 2010 and engaging 200,000+ traders globally. We’re a technology company that happens to revolutionise betting - from sports to political markets, to delivering the sharpest prices and the fairest odds.
Our stack is designed for scale, reliability, and speed: Linux, Kafka, Postgres, Kubernetes, with Python 3, C++, Rust and React powering our platform. We build infrastructure that institutions trust while keeping trading accessible to everyone. Our edge? We’ve thrived through every market cycle, competitor, and industry revolution.
At the heart of our success are our people. We create a high-performance environment where exceptional talent can thrive, blending deep business experience with a commercial mindset to drive strategic growth.
If you’re ready to help shape the future of prediction markets with cutting-edge technology and a customer-first philosophy, Smarkets is where you belong.
The Team
The Data Team is responsible for taking the wealth of data that Smarkets generates and using it to drive insights which improve the business. Since Smarkets produces a huge amount of data - including sports event data, payments information, order flow and user analytics - there are many opportunities for the team to add real business value.
The team’s responsibilities currently span across three different areas:
Data Engineering: development and maintenance of ETL pipelines, services and APIs, and data-related infrastructure like Redshift or BigQuery;
Data Science and Machine Learning: data exploration, ML models training and ML Ops to extract new insights from data;
Analytics and Reporting: creation of data models and dashboards as well as automation of reporting pipelines for different teams, stakeholders and third-parties.
In a typical week, a data engineer in the Data team would:
Add a new python ETL pipeline that segments users interested in specific sports through analysing behaviour which streamlines and tailors marketing communications to those users;
Develop a new endpoint to a Flask API, add unit tests, and deploy the new version of the API into our production Kubernetes cluster;
Train and evaluate an ML model to identify certain user patterns and provide it as service to other engineering teams in a Flask API;
Team Tech Stack
Our current technology stack primarily includes Linux, Docker, Kubernetes, Jenkins, Kafka, Python, Flask/FastAPI, Postgres, AWS Redshift, dbt, Google Bigquery, Prometheus, Grafana, Elastic Search, Kibana, Lightdash.
About the Role:
You will work very closely with the Data team lead and the other team members who will be assisting you whenever needed, making your integration in the company as smooth as possible.
The Data team works in an organised way using Agile methodologies and tools such as Jira and regular standups. You will find an environment where you have a clear engineering direction, can focus on your work and hone your skills as a data engineer through exciting projects. You will always be able to count on the support of many engineers across the company.
What you will do:
As a member of the data team, your responsibilities will include contributions to:
Developing and maintaining our Data ETL pipelines, some of which are real-time. The pipelines are fundamental to helping teams and stakeholders understand and drive business direction. Data components can also be user facing e.g. sending notifications to users;
Ensuring our data lake is kept in a healthy state, particularly our data warehouses: Redshift and Bigquery;
Developing and maintaining Flask services and Postgres databases within the Data team to provide access to data or manage certain business entities relevant to Data.
Assisting the different team
Similar roles
Design & Tech
Related reads from TCHNX

The Quiet Revolution in Local-First Software
As major platforms face outages and data breaches, a new generation of developers is building applications that prioritise local data storage and peer-to-peer sync, challenging the cloud-first orthodoxy that's dominated tech for two decades.

The Quiet Revolution in Edge AI: Why Your Next Computer Might Not Need the Cloud
As neural processing units become standard in consumer devices, we're witnessing a fundamental shift in how AI applications work. Local processing is no longer a fallback; it's becoming the preferred architecture.

The Rise of AI-Assisted Code Generation 2: Are Developers Becoming Prompt Engineers?
As AI coding assistants reshape software development, the industry grapples with a fundamental question: is writing code giving way to writing prompts? We examine how London's tech scene is adapting to this seismic shift.


