GatherJob
Back to jobs
B
Braze

Senior Site Reliability Engineer

Braze
São PauloOn-site 3w ago

At Braze, we have found our people. We’re a genuinely approachable, exceptionally kind, and intensely passionate crew.

We seek to ignite that passion by setting high standards, championing teamwork, and creating work-life harmony as we collectively navigate rapid growth on a global scale while striving for greater equity and opportunity – inside and outside our organization.

To flourish here, you must be prepared to set a high bar for yourself and those around you. There is always a way to contribute: Acting with autonomy, having accountability and being open to new perspectives are essential to our continued success.

Our deep curiosity to learn and our eagerness to share diverse passions with others gives us balance and injects a one-of-a-kind vibrancy into our culture.

If you are driven to solve exhilarating challenges and have a bias toward action in the face of change, you will be empowered to make a real impact here, with a sharp and passionate team at your back. If Braze sounds like a place where you can thrive, we can’t wait to meet you.

WHAT YOU'LL DO

Braze runs one of the largest MongoDB deployments in the world – powering real-time customer engagement for thousands of the world’s leading brands. We process hundreds of billions of data points each month across more than 3.3 billion monthly active users, with MongoDB at the core of how we store, query, and serve that data at scale.

As a Senior SRE on the MongoDB Platform team, your primary mission is to make MongoDB better for Braze – and to do so with the rigor, automation-first mindset, and engineering discipline of a world-class SRE. You won’t just keep the lights on; you’ll architect a more reliable, scalable, and observable MongoDB platform that the entire engineering organization depends on.

Main responsibilities:

Own MongoDB Reliability at Scale

  • Design and operate Braze’s MongoDB infrastructure to meet strict enterprise-grade SLAs, with deep ownership of availability, durability, and query performance
  • Build proactive monitoring and alerting that fires on symptoms – before customers feel impact – with rich MongoDB-specific observability (oplog lag, replication health, lock contention, index hit rates, etc.)
  • Lead capacity planning and sharding strategy as data volumes and query patterns evolve
  • Drive root-cause analysis on MongoDB incidents and translate findings into permanent system improvements

Improve the MongoDB Developer Experience

  • Partner with product engineering teams to review schema designs, index strategies, and aggregation pipelines – catching scalability a
Apply now

Opens the company's application page

About the company

Braze

Braze

Customer engagement platform.