Manager I, Engineering - AI Platform - Training & Serving
Datadog
The AI platform is responsible for all AI infrastructure across Datadog. Our mission is to provide tools and platforms that enable data scientists and engineers to conduct large-scale training and inference with ease. We support products such as Bits AI, LLMObs and all our AI research.
As an engineering manager for the Training & Serving team, you’ll join a new and fast-growing team and organization. You will support building and scaling the team, define our technical vision and help shape the roadmap. Your team will lead the charge on multiple critical technical challenges: distributed training of foundation models, serving at scale, and designing the user experience.
You’ll work closely with sister teams in the AI platform organization to ensure a seamless AI development cycle. You’ll also partner with the Applied AI org and with Datadog infrastructure & tooling teams to build out systems from the ground up.
At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do:
- Manage and grow the Training & Serving team, directly managing 10+ engineers
- Define our technical roadmap in alignment with AI platform goals and the Applied AI team roadmap
- Work with our core platform teams to tailor Datadog's storage, infrastructure and data pipelines to our needs
- Create a strong team culture aligned with our engineering standards and our customer focus
- Participate in hands-on work: code reviews, design reviews and some coding
Who You Are:
- Previous experience (1+ years) leading software engineering teams, as a tech lead or people manager
- Strong technician with a mix of backend, data engineering and infrastructure experience who is interested in remaining a hands-on leader
- Excellent leader with strong interpersonal skills, and the ability to build and lead high-performing teams
- Interested in working on an early-stage project with many challenges to solve and a fast iteration cycle
- You build stron
About the company
Datadog
Monitoring and security platform for cloud applications.
Listed via
Greenhouse