Staff Site Reliability Engineer at Zscaler | Torre

Staff Site Reliability Engineer

Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: To be defined

USD75.4K - 100K/year

~COP150M - 200M/year

+ Equity

+ Bonuses

location_on
Bangalore, India
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted over 2 years ago

Requirements and responsibilities


Zscaler accelerates digital transformation and protects customers from cyberattacks and data loss. With over 10 years of experience, Zscaler serves thousands of enterprise customers around the world, including 450 of the Forbes Global 2000 organizations. Zscaler's purpose-built security platform puts a company’s defenses and controls where the connections occur—the internet—so that every connection is fast and secure, no matter how or where users connect or where their applications and workloads reside. We are seeking a Staff Site Reliability Engineer to join our Site Reliability Engineering Rapid Response Team. You will drive Toil reduction through automation and tooling, drive organizational excellence, and ensure seamless implementation with built-in scalability over time. The role involves supporting and troubleshooting large-scale distributed software applications and networks, championing best practices for reliability, and participating in projects from intake to closure. The ideal candidate has 8+ years of professional SRE experience, proficiency in Python, Go, and Bash, and expert-level knowledge across infrastructure components such as Linux, Networking, Observability, and databases. Responsibilities: - Drive Toil reduction through automation and tooling - Drive organizational excellence and build scalable process frameworks - Drive holistic observability, high fidelity and low frequency alerting, zero touch operations, and automation first strategies - Drive standardized tooling around deployment, capacity management, service onboarding, and operationalization - Drive infrastructure, tooling, and process improvements to improve overall system reliability - Work closely with Product SRE teams to ensure adoption of SRE best practice, tooling, and capability - Build technical training capabilities and support and troubleshoot multiple large-scale distributed software applications and networks - Champion best practices for reliability within Engineering Department - Participate and/or lead projects from intake to closure
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.