Site Reliability Engineer at Cleo | Torre

Site Reliability Engineer

Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: To be defined

USD75.4K - 100K/year

~COP150M - 200M/year

+ Equity

+ Bonuses

location_on
Remote (for United Kingdom residents)
flightsmode
Visa sponsorship: No
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted over 2 years ago

Requirements and responsibilities


Most people come to Cleo to do work that matters. Every day, we empower people to build a life beyond their next paycheck, building a beloved AI that enables you to forge your own path toward financial well-being. Backed by some of the most well-known investors in tech, we’ve reached over 7 million users and plan to double that number each year... which is where you come in. We are looking for a Site Reliability Engineer to join our Platform team. You will be responsible for troubleshooting performance and reliability issues in our ruby and rails monolith, instrumenting our codebases to emit the right telemetry for our observability stack, coaching our engineering squads on effective implementation of monitoring and alerting, maintaining our services estate across Heroku and AWS, supporting with ad-hoc security and compliance tasks, and working closely with the rest of the Platform team to ensure stability and reliability of our estate. This is a full-time position with a competitive compensation package, clear progression plan, flexibility, and other benefits. If you are passionate about making a positive difference in society by improving the financial health of our users and have experience with Ruby on Rails and infrastructure-as-code tools like Terraform on AWS, we would love to hear from you! Responsibilities: - Troubleshooting performance and reliability issues in our ruby and rails monolith - Instrumenting our codebases to emit the right telemetry for our observability stack (using opentelemetry libraries) - Coaching our engineering squads on effective implementation of monitoring and alerting - Maintaining our services estate across Heroku and AWS - Support with ad-hoc security and compliance tasks - Working closely with the rest of the Platform team to ensure stability and reliability of our estate - Proactively diving into support during user-facing outages and incidents
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.