Senior Staff Applied AI Engineer - Context Retrieval

Mountain View, California; San Francisco, California · On-site · 2d ago

P-1549

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.

The Mission

Databricks agents are only as good as the context they can retrieve. Whether an agent is answering a question about last quarter's revenue, debugging a failing job, generating SQL against a 10,000-table lakehouse, or summarizing a Wiki page, its quality is bounded by what it can find — and how well it understands what it finds.

We are hiring a Senior Staff Applied AI Engineer to own context retrieval for Databricks agents across SaaS providers. This is a zero-to-one role with two deeply connected charters:

Build the retrieval stack (query understanding, content understanding, ranking, retrieval, and evaluation) over enterprise SaaS data stored across multiple systems.
Build the search subagents that sit on top of that stack and reason about what context is needed, how to retrieve it, and whether the right thing actually came back — closing the loop between an agent's intent and the substrate that serves it.

If you have deep Information Retrieval wisdom, have shipped retrieval systems for RAG and agentic workloads, and want to build the substrate — and the agents on top of it — that make every Databricks agent measurably smarter, this role is for you.

What You Will Do

Build the full retrieval stack from scratch. Own the end-to-end system: query understanding, content understanding and indexing, hybrid retrieval, ranking, and evaluation. Make the architectural calls that will define how Databricks agents access context for years to come.
Retrieve across heterogeneous data — structured and unstructured. Index and rank across structured assets (tables, columns, SQL queries, dashboards, code, notebooks, jobs) and unstructured content (docs, wikis, tickets, chat, images, video, audio). Each modality has its own signals — design retrieval that exploits them rather than flattens them.
Connect to the SaaS surface area customers actually use. Build connectors and retrieval adapters for the systems where enterprise knowledge lives. Treat each retrieval source with its
