Senior Site Reliability Engineer, Kubernetes w/ active TS/SCI
OktaSecure Every Identity, from AI to Human
Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.
This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.
About the Team
At Okta, our motto is "Always On." Within the Technical Operations (TechOps) team, we live this mission by building the most reliable and performant systems on the planet. We empower organizations to do their most significant work by securely connecting any person, on any device, to the technologies they need.
The Role
We are looking for an experienced Senior Site Reliability Engineer (SRE) who thrives on the challenge of managing large-scale cloud production systems. The ideal candidate is a self-starter who lives by the ethic: "If you have to do it twice, automate it." Based in the Washington, D.C. area, with on-site customer travel, you will ensure our infrastructure maintains uncompromising reliability and performance while supporting the most sensitive national security missions.
Security Requirement: Must be able to obtain and maintain a U.S. security clearance (Secret or Top Secret) to the extent required by U.S. Government contracts.
The selected candidate may be subject to drug testing to the extent required by U.S. Government contracts.
What You’ll Do
- Infrastructure Excellence: Design, deploy, and monitor Okta’s production infrastructure to ensure peak performance and reliability.
- Incident Management: Serve as a frontline responder to production incidents, performing deep-dive troubleshooting and implementing permanent preventive solutions.
- Aggressive Automation: Eliminate manual toil by developing automation scripts, evolving monitoring tools, and documenting technical workflows.
- Scalability: Support a highly available, large-scale environment as part of an on-call rotation, ensuring "Always On" service delivery. <
Listed via
Greenhouse
Similar roles
Sr. Customer Support Engineer, Raipur
Danaher
Collibra Platform Developer (Mid to Senior)
Arch Capital Group Ltd.
Scheduling Director (Renewables Construction)
MasTec Industrial
Mom and Baby Care Manager - RN - Must reside in Nevada
CareSource
Design & Tech
Related reads from TCHNX

The Quiet Revolution in Local-First Software
As major platforms face outages and data breaches, a new generation of developers is building applications that prioritise local data storage and peer-to-peer sync, challenging the cloud-first orthodoxy that's dominated tech for two decades.

The Return of Physical Controls: Why Haptic Feedback Is Reshaping Digital Interfaces
After years of pursuing flat, buttonless designs, tech companies are rediscovering the value of tactile interaction. A new wave of products proves that touching isn't just feeling it's understanding.

The Quiet Revolution of Parametric Design Tools in Everyday Products
Parametric design is migrating from architecture studios to consumer products. As tools democratize and manufacturers adopt flexible production, we're entering an era of mass customization that challenges fundamental assumptions about design.