Tokens-as-a-Service (TaaS) Software Engineer
OpenAI
Technical Community Lead, Campus Leaders
OpenAI
Engineering Manager, Core Services
OpenAI
Workload Porting & Performance Engineer
OpenAI
Security Engineer, Application Security
OpenAI
AI Success Engineer - Tokyo
OpenAI
Recruiter, AI/ML Research
OpenAI
SMS Prototype Handling Specialist
OpenAI
Developer Marketing Lead, India
OpenAI
Customer Marketing Manager, Enterprise and Digital Natives
OpenAI
Technical Threat Investigator, Threat Intel Engineering
OpenAI
Performance Modeling Lead
OpenAI
Manager, AI Success Engineers - San Francisco
OpenAI
Market Research Lead
OpenAI
Software Engineer, Youth Well-Being
OpenAI
Program Manager, Outreach & Audiences
OpenAI
Software Engineer, GPU Infrastructure - HPC
OpenAI
Software Engineer, Cloud Agents
OpenAI
Product Manager, Cyber Safety
OpenAI
Workday Engineer
OpenAI
Threat Modeler, Preparedness
OpenAI
Software Engineer, Data Infrastructure
OpenAI
Security Engineer, Insider Threat Detection & Response
OpenAI
AI Deployment Engineer, Startups
OpenAI
Account Director, Federal Partnerships
OpenAI
Software Engineer, Privacy
OpenAI
AI Deployment Engineer
OpenAI
Product Manager, API Agents
OpenAI
Data Scientist, Business
OpenAI
Technical Sourcer, Research
OpenAI
OpenAI
Software Engineer, Data Infrastructure
About OpenAI
AI research and deployment company.
About the role
About the Team
Data Platform at OpenAI owns the foundational data stack powering critical product, research, and analytics workflows. We operate some of the largest Spark compute fleets in production; design and build data lakes and metadata systems on Iceberg and Delta with a vision toward exabyte-scale architecture; run high-throughput streaming platforms on Kafka and Flink; provide orchestration with Airflow; and support ML feature engineering tooling such as Chronon. Our mission is to deliver reliable, secure, and efficient data access at scale and accelerate intelligent, AI-assisted data workflows.
Join us to build and operate these core platforms that underpin OpenAI products, research, and analytics.
We’re not just scaling infrastructure – we’re redefining how people interact with data. Our vision includes intelligent interfaces and AI-assisted workflows that make working with data faster, more reliable, and more intuitive.
About the Role
This role focuses on building and operating data infrastructure that supports massive compute fleets and storage systems, designed for high performance and scalability. You’ll help design, build, and operate the next generation of data infrastructure at OpenAI. You will scale and harden big data compute and storage platforms, build and support high-throughput streaming systems, build and operate low-latency data ingestion, enable secure and governed data access for ML and analytics, and design for reliability and performance at extreme scale.
You will take full lifecycle ownership: architecture, implementation, production operations, and on-call participation.
You’ve supported Spark, Kafka, Flink, Airflow, Trino, or Iceberg as platforms. You’re well-versed in infrastructure tooling like Terraform, experienced in debugging large-scale distributed systems, and excited about solving data infrastructure problems in the AI space.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security
Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient
Accelerate company productivity by empowering your fellow engineers & teammates wit