Applied Scientist / Machine Learning Engineer
WayveWayve is building embodied AI for the physical world, starting with autonomous driving. Instead of the hand-engineered, modular stacks that defined the first era of self-driving, we pioneered AV2.0: a single, end-to-end neural network that learns to drive from raw sensor data and generalises to new cities, vehicles, and conditions. Our foundation models, the GAIA family of generative world models and the LINGO family of vision-language-action models, let vehicles perceive, reason, and act in the open world. We have driven zero-shot across hundreds of cities on three continents, and we are now scaling from proving the science to deploying it with leading automakers and mobility partners, including Nissan, Stellantis, and Uber.
The role
This role sits in the AI Platform organisation, on the data flywheel that powers every model we ship. The thesis is simple and compounding: the more intelligently we curate, enrich, and evaluate the real-world driving experience our fleet generates, the faster our foundation models improve, and the further they generalise across geographies, embodiments, and OEM platforms. As deployment scales, the bottleneck is shifting from raw model capacity to the quality and intelligence of the data engine and the rigour of how we measure progress. That is the problem you will own.
This is a dual-track role: we are hiring at either Applied Scientist or Machine Learning Engineer, at TC3 (Senior) or TC4 (Staff / Tech Lead), calibrated to your background. We are open on specialisation. There are three areas we are hiring into, and you can go deep in any one of them:
Data curation: mine world-scale fleet data for the rare, long-tail, and safety-critical moments that move the model.
Data enrichment: turn raw driving experience into high-signal training data through (semi-)automated enrichment, labeling, and data quality at scale.
Foundation model evaluation: define how we know a driving foundation model is genuinely getting better, offline and in closed loop.
Day to day, the role also spans the broader foundation-model stack, including vision-language-action and vision-language models for embodied AI, world modeling, policy learning, reinforcement learning, and reward modeling.
Key responsibilities
Mine world-scale fleet data for rare, long-tail, and safety-critical events using active learning, smart sampling, and embedding-based retrieval and dedup.
Figure out what makes a good training dataset: which data, mix, and balance actually move the model, and turn that into repeatable curation across cities, sensor rigs, and embodiments.
Build high-quality enrichments that teams across the company depend on, through (semi-)automated enrichment and labeling pipelines and data quality at scale.
Build and fine-tune large-scale pretrained models, and run smaller-scale experiments to test and derisk ideas before committing serious compute.
Help build the best embodied VLM / VLA in the world for driving (the LINGO line): push multimodal perception, reasoning, language, and action.
Design rigorous offline and closed-loop evaluation: metrics and benchmarks that correlate with real on-road behaviour and safety, with deliberate coverage of rare and safety-critical scenarios.
Use world-model-based evaluation (GAIA) to probe counterfactual “what if” scenarios safely, repeatably, and at scale.
Contribute across the wider foundation-model stack as the work demands: generative world models (GAIA), policy learning, reinforcement learning, and reward modeling.
About you
Essential
A Masters with around 6 or more years of relevant experience, or a PhD with 2 or more years, in computer science, machine learning, robotics, mathematics, or a related field (required).
Strong ML and software fundamentals, and a track record of taking ML from research into production systems that run at scale.
Hands-on strength in one or more of: data curation, foundation model training, large-scale data wrangling, and foundat
Similar roles
Design & Tech
Related reads from TCHNX

The Quiet Revolution in Local-First Software
As major platforms face outages and data breaches, a new generation of developers is building applications that prioritise local data storage and peer-to-peer sync, challenging the cloud-first orthodoxy that's dominated tech for two decades.

The Quiet Revolution in Edge AI: Why Your Next Computer Might Not Need the Cloud
As neural processing units become standard in consumer devices, we're witnessing a fundamental shift in how AI applications work. Local processing is no longer a fallback; it's becoming the preferred architecture.

The Rise of AI-Assisted Code Generation 2: Are Developers Becoming Prompt Engineers?
As AI coding assistants reshape software development, the industry grapples with a fundamental question: is writing code giving way to writing prompts? We examine how London's tech scene is adapting to this seismic shift.


