Software Engineer, RL Data
Anthropic
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
This is a senior, foundational role on a new team: you'll make architecture decisions the rest of the team builds on, and help shape what we build first. The work is hands-on and varied. Some weeks you'll be deep in pipeline or infrastructure engineering; others you'll be tuning prompts until the output is good, or sitting with a research team that depends on your systems and shipping the fixes they need. We're looking for experienced engineers who own outcomes end-to-end — down to reading transcripts, supporting users, and wrangling vendors.
Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale. Our goal is to make Claude great at real work — especially the work that matters most, like AI safety research and beneficial deployments of AI. (To be upfront: this is dual-use work — it advances general capabilities too.)
Key responsibilities
- Own significant parts of our stack end-to-end, from technical architecture through the unglamorous operational work that makes it succeed.
- Build data collection pipelines, read the transcripts they produce, and iterate on prompts, evals, and graders until the output is good.
- Develop and improve QA frameworks to catch reward hacking and ensure environment quality.
- Build interfaces that make collecting human data fast and painless for the people providing it.
- Harden execution environments — sandboxing, snapshotting, tool coverage — so tasks hold up at training scale.
- Embed with the teams and domain experts who use our systems day-to-day, and work with operations, security, and compliance partners to roll our systems out to new users and vendors.
Minimum qualifications
- A track record of owning major projects end-to-end in fast-paced, ambiguous environments — for example as a founder or CTO, forward deployed engineer, tech lead, founding engineer at a startup, or creator of a substanti
Listed via
Greenhouse