GatherJob
Back to jobs
Supabase

Site Reliability Engineer

Supabase
On-site Today

About Supabase

Supabase is the Postgres development platform, built by developers for developers. We provide a complete backend solution including Database, Auth, Storage, Edge Functions, Realtime, and Vector Search. All services are deeply integrated and designed for growth.

About the Role

Supabase manages millions of Postgres instances and is growing. We have strong teams across observability, release engineering, and incident management — and we're concentrating our reliability efforts into a dedicated SRE practice that ties the discipline together across the platform.

You'll be embedded within Service Operations, and your primary job is to make every engineering team more reliable — not by owning their infrastructure, but by establishing the practices, frameworks, and feedback loops that let them own reliability themselves. You'll work across the org: sometimes setting the standard, sometimes pair-programming a fix, sometimes helping a team define their error budget, sometimes telling them it's exhausted.

This role is ideal for someone who has a strong vision for how SRE should work and thrives in async, fast-paced environments where influence matters more than authority.

What You'll Own

  • Partner with service teams to define meaningful SLIs and SLOs grounded in customer experience, and build the error budget policies that turn them into engineering decisions

  • Own and evolve the Operational Readiness Review (ORR) process — conducting reviews for new services and major changes across observability, alerting, runbooks, capacity, and graceful degradation

  • Strengthen the incident-to-improvement pipeline: connecting postmortem findings to operational readiness gaps, identifying repeat failure patterns, and driving systemic fixes

  • Act as the reliability expert teams pull in for architecture reviews, failure mode analysis, dependency mapping, and resilience design

  • Identify and quantify operational toil across the org, and build or advocate for automation that eliminates it

  • Help teams design sustainable on-call practices: alert quality, escalation paths, runbook coverage, and noise reduction

  • Track and report on org-wide operational maturity, surfacing systemic gaps and driving remediation

You Might Be a Good Fit If You

  • Have 7+ years of experience in SRE, production engineering, or reliability-focused role

Apply now

Opens the company's application page

About the company

Supabase

Supabase

The open source Firebase alternative.