Staff Site Reliability Engineer at Zscaler

Zscaler accelerates digital transformation and protects customers from cyberattacks and data loss. With over 10 years of experience, Zscaler serves thousands of enterprise customers around the world, including 450 of the Forbes Global 2000 organizations. Zscaler's purpose-built security platform puts a company’s defenses and controls where the connections occur—the internet—so that every connection is fast and secure, no matter how or where users connect or where their applications and workloads reside. We are seeking a Staff Site Reliability Engineer to join our Site Reliability Engineering Rapid Response Team. You will drive Toil reduction through automation and tooling, drive organizational excellence, and ensure seamless implementation with built-in scalability over time. The role involves supporting and troubleshooting large-scale distributed software applications and networks, championing best practices for reliability, and participating in projects from intake to closure. The ideal candidate has 8+ years of professional SRE experience, proficiency in Python, Go, and Bash, and expert-level knowledge across infrastructure components such as Linux, Networking, Observability, and databases. Responsibilities: - Drive Toil reduction through automation and tooling - Drive organizational excellence and build scalable process frameworks - Drive holistic observability, high fidelity and low frequency alerting, zero touch operations, and automation first strategies - Drive standardized tooling around deployment, capacity management, service onboarding, and operationalization - Drive infrastructure, tooling, and process improvements to improve overall system reliability - Work closely with Product SRE teams to ensure adoption of SRE best practice, tooling, and capability - Build technical training capabilities and support and troubleshoot multiple large-scale distributed software applications and networks - Champion best practices for reliability within Engineering Department - Participate and/or lead projects from intake to closure

Staff Site Reliability Engineer

Requirements and responsibilities

Skills wanted:

Language(s) required:

About Zscaler:

mission:

www.zscaler.com/

Admin access needed

Payment confirmed

A member of the Torre team will contact you shortly