Sunny Arora

Sunny Arora

About

Detail

Delhi, India

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • A
    Site Reliability Engineer
    Abacus.ai,
    Jan 2025 - Current (10 months)
    • Managing ML infrastructure, GPU workloads and inferences over k8s • Managing Multi cloud Infra fleet using kops over AWS, GCP and Azure. • CI/CD using Github-action custom python toolings
  • Ubie
    Site Reliability Engineer
    Ubie
    Jan 2024 - Jan 2025 (1 year 1 month)
    • Manage Infrastructure for product over Google cloud, managing multiple K8s clusters • Setting up Infra for US launch of the product. • Improving observability of system and setting up SLO/SLI frameworks for product using sloth • Internal tooling for ease of infra-dev workflows
  • R
    Lead Platform Engineer
    Razorpay
    Mar 2023 - Dec 2023 (10 months)
    • Led the Dev-Productivity team, managing ephemeral on-demand cloud fleets for developers over Kubernetes clusters using devspace. • Managed multiple Kubernetes clusters. • Led the formation of SLO and error budgeting framework in the organization, from implementation as a Git workflow to educating developers on SLOs and driving adherence across teams. Additionally, implemented cybersecurity measures by securing policies in infrastructure and applications for the financial system at Razorpay, ensuring robust protection and compliance within the development environment.
  • S
    Senior Platform Engineer
    Oct 2021 - Mar 2023 (1 year 6 months)
    • Led the observability team and collaborated with the Hypertrace OSS project to build an in-house distributed tracing platform over Kafka Streams and Apache Pinot, successfully scaling it to 1M messages/second (20TB daily volume). • Optimized infrastructure, achieving cost savings of $100k+ annually. • Implemented log shipping pipelines over Fluentbit, efficiently handling 50 TB/day. • Led the SaaS vendor migration project for logging, resulting in an organizational savings of $300k. • Utilized knowledge of cybersecurity principles to enhance the security posture of the platform, while managing Kubernetes (k8s) and VPC networking in AWS and GCP clouds to ensure robust and secure infrastructure.
  • P
    Platform Engineer
    Jun 2019 - Oct 2021 (2 years 5 months)
    • Implemented safe deployment practices across multiple microservices, including blue/green deployments, canary deployments, and auto-rollbacks on metrics deviation . • Managed & migrated testing infrastructure components over Kubernetes • Worked with the Payments team, involved in performance and load testing and contributed to the in-house payment routing system design. • Implement mocking server over Java Spring Boot with local & redis cache for performance runs & functional testing
  • I
    Intern
    Jan 2019 - May 2019 (5 months)
    • Worked with the Payments team improving post rollouts sanity, integration tests, frontend integration tests over Selenium • deployment sanity suites • CI/CD pipelines for multiple microservices over github action and spinnkaer • working infra provisioning with terraform • setting up env fleet, on k8s cluster using helm charts
Education verified_user 0% verified
  • Vellore Institute of Technology
    Bachelor of Technology
    Vellore Institute of Technology
    Jan 2015 - Jan 2019 (4 years 1 month)