Senior Software Engineer - Linux/eBPF
Datadog
The eBPF Platform team owns the shared eBPF infrastructure inside the Datadog Agent, and is responsible for its reliability, performance, and evolution across a wide variety of Linux distributions and kernel versions. We build tooling and agent functionality for product teams utilizing eBPF (Network Performance Monitoring, Universal Service Monitoring, Cloud Workload Security, GPU Monitoring), enable new teams exploring eBPF, and centralize deep kernel expertise across the organization. The team contributes to open source projects such as btfhub and cilium/ebpf.
Datadog is investing heavily in GPU Monitoring to give customers deep visibility into GPU utilization, health, and performance across their infrastructure. The eBPF Platform team builds the agent-side foundation that makes this possible, from eBPF programs that capture GPU activity at the kernel level, to the metrics pipelines and validation infrastructure that ensure data quality at scale.
In this role, you will work at the intersection of eBPF, the Linux kernel, and GPU infrastructure. You'll contribute to GPU Monitoring capabilities within the Datadog Agent while also working across the broader eBPF platform, investigating production incidents, improving reliability, and helping shape the architecture of one of the most widely deployed eBPF solutions in the industry.
At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.
What You’ll Do:
- Contribute to GPU Monitoring feature development end-to-end, from ideation to implementation within the Datadog Agent
- Build and maintain shared eBPF functionality for product teams to use in their eBPF-based products
- Investigate and debug complex production issues that span the kernel, eBPF programs, and agent runtime
- Research, prototype, develop, and document solutions to hard problems in the eBPF and GPU monitoring space
- Provide technical input to product and architecture decisions
Who You Are:
- You have experience working with and developing in the Linux kernel, either using eBPF, as a kernel developer, or building low-level programs in C
- You have significant experience building high-throughput, low-latency systems
- You have been building applications and tooling for 5+ years and know the systems you’ve worked on in depth
- You are motivated by solving problems for our customers first and foremost and buil
About the company
Datadog
Monitoring and security platform for cloud applications.