Site Reliability Engineer at Supabase | Torre
warning

Heads-up

The job you’re trying to post already exists in Torre:

Site Reliability Engineer

You'll shape SRE practices across engineering teams, driving reliability and systemic improvements at scale.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Compensation is to be agreed upon.
location_on
Remote (anywhere)
Shared by
Emma of Torre.ai
about 7 hours ago

Requirements and responsibilities


About SupabaseSupabase is the Postgres development platform, built by developers for developers. We provide a complete backend solution including Database, Auth, Storage, Edge Functions, Realtime, and Vector Search. All services are deeply integrated and designed for growth.About the RoleSupabase manages millions of Postgres instances and is growing. We have strong teams across observability, release engineering, and incident management — and we're concentrating our reliability efforts into a dedicated SRE practice that ties the discipline together across the platform.You'll be embedded within Service Operations, and your primary job is to make every engineering team more reliable — not by owning their infrastructure, but by establishing the practices, frameworks, and feedback loops that let them own reliability themselves. You'll work across the org: sometimes setting the standard, sometimes pair-programming a fix, sometimes helping a team define their error budget, sometimes telling them it's exhausted.This role is ideal for someone who has a strong vision for how SRE should work and thrives in async, fast-paced environments where influence matters more than authority.What You'll OwnPartner with service teams to define meaningful SLIs and SLOs grounded in customer experience, and build the error budget policies that turn them into engineering decisionsOwn and evolve the Operational Readiness Review (ORR) process — conducting reviews for new services and major changes across observability, alerting, runbooks, capacity, and graceful degradationStrengthen the incident-to-improvement pipeline: connecting postmortem findings to operational readiness gaps, identifying repeat failure patterns, and driving systemic fixesAct as the reliability expert teams pull in for architecture reviews, failure mode analysis, dependency mapping, and resilience designIdentify and quantify operational toil across the org, and build or advocate for automation that eliminates itHelp teams design sustainable on-call practices: alert quality, escalation paths, runbook coverage, and noise reductionTrack and report on org-wide operational maturity, surfacing systemic gaps and driving remediationYou Might Be a Good Fit If YouHave 7+ years of experience in SRE, production engineering, or reliability-focused roles, including experience shaping SRE practices and driving adoption across engineering teamsHave a software engineering mindset — you write code and build tools, not just configure themHave hands-on experience defining and operationalizing SLOs/SLIs at scale, including error budget policies that actually influenced engineering decisionsHave deep experience with incident response, postmortem facilitation, and turning incident learnings into systemic improvementsHave worked with large-scale multi-tenant systems (bonus: managed database platforms or Postgres)Are proficient with cloud infrastructure (AWS preferred) and infrastructure-as-code (Pulumi preferred, Terraform/CDK also acceptable)Communicate clearly and persuasively — this role requires influencing without authority across a distributed orgHave experience in async or globally distributed teamsAre energized by making other teams more effective rather than being the one who fixes everythingNice to HaveExperience with Kubernetes-based platform operationsFamiliarity with OpenTelemetry, VictoriaMetrics, Grafana, or similar observability toolingExperience building developer-facing reliability tooling (SLO dashboards, ORR frameworks, toil tracking, DORA metrics)What We OfferFully RemoteWe hire globally. We believe you can do your best work from anywhere. There are no Supabase offices, but we provide a WeWork membership or co-working allowance you can use anywhere in the world.ESOPEvery team member receives ESOP (equity ownership) in the company. We want everyone to share in the upside of what we’re building together.Tech AllowanceUse this budget to set up your ideal work environment—laptop, monitor, headphones, or whatever helps you do your best work.Health BenefitsSupabase covers 100% of health insurance for employees and 80% for dependents, wherever you are. Your wellbeing and your family’s health are important to us.Annual Off-SitesOnce a year, the entire company gathers in a new city for a week of connection, collaboration, and fun. It’s a highlight of our year.Flexible WorkWe operate asynchronously and trust you to manage your own time. You know what needs to be done and when.Professional DevelopmentEvery team member receives an annual education allowance to spend on learning—courses, books, conferences, or anything that supports your growth.About the TeamSupabase was born-remote and open-source-first. We believe our globally distributed team is our secret weapon in building tools developers love.280+ team members55+ countries20+ languages spoken$500M raised500,000+ community membersWe move fast, build in public, and use what we ship. If it’s in your project, we probably use it in ours too. We believe deeply in the open-source ecosystem and strive to support—not replace—existing tools and communities.Hiring ProcessWe keep things simple, async-friendly, and respectful of your time:Apply – Our team will review your application.Intro Call – A short video chat to get to know each other.Interviews – Up to four calls with:Team LeadsFuture teammatesSomeone cross-functional from product, growth, or engineering (depending on the role)Someone from our leadership/founding teamDecision – We may follow up with a final question or go straight to offer.All communication is remote and we aim to move fast.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.