Staff Site Reliability Engineer, Networking (FedRAMP)
Okta
Secure Every Identity, from AI to Human
Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.
This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.
The Team
The TCore team is a specialized engineering group that owns and operates all of Okta's networks. The team focuses on ensuring the reliability, performance, and security of Okta's core infrastructure, particularly its global traffic entry points and its entire internal network.
The Staff Site Reliability Engineer Role
We are looking for a Staff Site Reliability Engineer to join the TCore team. The ideal candidate is a self-starter who takes pride in designing and implementing durable solutions to network problems. They are passionate about network responsiveness and performance.
What you’ll be doing
- Work with various teams to design and implement scalable and reliable network solutions
- Maintain a highly available cloud infrastructure edge for the Okta identity platform
- Collect and analyze data to identify root causes for network-specific events
- Automate AWS infrastructure with Terraform and/or Chef
- Evolve the system by introducing changes to improve efficiency, scalability, and velocity
What you’ll bring to the role
- 8+ years of experience in a Cloud Network Engineer or related role
- Demonstrated in-depth understanding of the TCP/IP networking stack (layers 2 through 7). Ability to implement a highly available VPC network, including inter-VPC connectivity. Working knowledge of stateless and stateful firewalls. Familiarity with DNS, web application firewalls, and the various load-balancing methods available in the cloud.
- Deep knowledge of AWS/GCP network concepts such as Transit Gateway / Network Connectivity Center (NCC), Site-to-Site VPN / HA VPN, and Direct Connect / Cloud Interconnect
- Ability to troubleshoot network issues using AWS VPC Flow Logs and CloudWatch metrics, as well as
Listed via
Greenhouse