GatherJob
Back to jobs
O
Okta

Senior Database Reliability Engineer (DBRE)

Okta
Bellevue, Washington; Chicago, Illinois; New York, New York; San Francisco, California; Washington, DCOn-site 5d ago

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Senior Database Reliability Engineer (DBRE)

Experience Level: Mid–Senior (4+ years PostgreSQL experience)

About the Role

We are looking for a highly skilled Database Reliability Engineer (DBRE) with deep expertise in PostgreSQL at scale and solid experience with MySQL. In this role, you will design, operationalize, and optimize the data persistence layer that powers our large-scale, mission-critical systems. You will work closely with SRE, Platform, and Engineering teams to ensure performance, reliability, automation, and operational excellence across our database environment.

This is a hands-on engineering role focused on building resilient data infrastructure, not just administering it.

Responsibilities:

Architecture, Reliability & Performance

  • Design, implement, and operate highly available PostgreSQL clusters (physical replication, logical replication, sharding/partitioning, failover automation).
  • Optimize query performance, indexing strategies, schema design, and storage engines.
  • Perform capacity planning, growth forecasting, and workload modeling.
  • Own high-availability strategies including automatic failover, multi-AZ/multi-region setups, and disaster recovery.

Automation & Tooling

  • Develop automation for any and all tasks including but not limited to: provisioning, configuration, backups, failovers, vacuum tuning, and schema management using tools such as Terraform, Ansible, Kubernetes Operators, or custom tooling.
  • Build monitoring, alerting, and self-healing systems for PostgreSQL and MySQL.

Operations & Incident Response

  • Lead response during database incidents—performance regressions, replication lag, deadlocks, bloat issues, storage failures, etc.
  • Conduct root-cause analysis and implement permanent fixes.

Cross-Functional Collaboration

  • P
Apply now

Opens the company's application page

About the company

Okta

Okta

Identity and access management.

Listed via

G

Greenhouse