Principal Engineer, Data & Compute
WayveThe Role
At Wayve, we are teaching machines to drive—not by coding rules, but by training end-to-end neural networks that learn from vast streams of real-world data. Achieving this requires unprecedented scale in both data infrastructure and compute orchestration. Our workloads span thousands of GPUs, petabytes of driving data, and geographically distributed training and inference clusters.
As Architect for AI Infrastructure, you will design and guide the evolution of the foundational compute and storage systems that fuel our model development lifecycle. Your leadership will directly accelerate AI research, enable rapid model deployment, and ensure our platform meets the demands of a company pushing the boundaries of autonomy.
You’ll sit at the strategic core of AI, systems, and cloud infrastructure—owning challenges that few companies have the ambition or scale to tackle.
Key Responsibilities
Global Compute Strategy – Define and evolve the architecture for how Wayve allocates and orchestrates training and inference workloads across thousands of GPUs and multiple data centers, ensuring optimal throughput, resiliency, and cost efficiency.
Petabyte-Scale Data Federation – Design systems that enable fast, reliable access to high-volume sensor and simulation data across geographies, ensuring the right data is always available for training, evaluation, and inference. Furthermore, preparing Wayve for being an exabyte-scale company.
Cross-Region GPU Job Execution – Build the foundations that enable large-scale AI workloads to run seamlessly across hybrid and multi-cloud environments.
Cloud Infrastructure Advisory – Act as a trusted partner to leadership in aligning compute investments and architecture with company strategy, growth plans, and performance goals.
Technical Leadership & Mentorship – Uplift the broader engineering org through architectural coaching, technical deep dives, and by cultivating a culture of operational and engineering excellence.
About You
In order to set you up for success at Wayve, we’re looking for the following skills and experience.
Essential
10+ years designing and building large-scale distributed systems, with at least 4 years focused on GPU-based cloud infrastructure.
Proven experience enabling large-scale AI training, inference, or computer vision workloads in GPU clusters.
Deep understanding of petabyte-scale data architecture, including storage federation, high-throughput access, and data locality for AI workloads.
Strong technical leadership with a track record of defining and communicating architectural strategy, balancing long-term vision with delivery needs.
A natural mentor with a history of developing engineers and influencing technical direction across teams.
Advanced degree in Computer Science, Electrical Engineering, or a related field—or equivalent industry experience.
Desirable
Experience with multi-cloud orchestration, particularly in latency- or cost-sensitive training and inference pipelines.
Familiarity with systems like Ray, Kubernetes, Airflow, or Flyte, and deep fluency in AI/ML job scheduling, model lifecycle management, and infrastructure-as-code practices.
Background in supporting safety-critical or real-time inference use cases (e.g., robotics, autonomous vehicles, aerospace).
Passion for building infrastructure-as-a-product that delivers performance and simplicity to research and product teams alike.
This is a full-time role based in-office. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. This role is a full-time role based in Sunnyvale, CA (hybrid) and the reasonably estimated salary for this role ranges from $370,300 to $418,200, plus a competitive equity package. Actual compensation is based on the candidate's skills,
Similar roles
Design & Tech
Related reads from TCHNX

The Quiet Revolution in Local-First Software
As major platforms face outages and data breaches, a new generation of developers is building applications that prioritise local data storage and peer-to-peer sync, challenging the cloud-first orthodoxy that's dominated tech for two decades.

The Quiet Revolution in Edge AI: Why Your Next Computer Might Not Need the Cloud
As neural processing units become standard in consumer devices, we're witnessing a fundamental shift in how AI applications work. Local processing is no longer a fallback; it's becoming the preferred architecture.

The Rise of AI-Assisted Code Generation 2: Are Developers Becoming Prompt Engineers?
As AI coding assistants reshape software development, the industry grapples with a fundamental question: is writing code giving way to writing prompts? We examine how London's tech scene is adapting to this seismic shift.


