Lead DevOps Engineer (M/F/D) at Kayzen | Torre

Lead DevOps Engineer (M/F/D)

You'll lead engineering to build and scale a global, automated, self-healing infrastructure for petabytes of data.
Full-time

Legal agreement: To be defined

Compensation is to be agreed upon.
Remote (for India residents)
Shared by
Emma of Torre.ai
2 days ago

Requirements and responsibilities


Lead DevOps Engineer (m/f/d)
Bangalore or Fully Remote from India

Hello 👋 I am Servesh, Co-founder and CTO at Kayzen, and I am now looking for a Lead DevOps Engineer who will be part of and lead our Engineering team. 🙌

But wait, you have not heard of Kayzen before? 😃

Kayzen is a mobile demand-side platform (DSP) dedicated to democratizing programmatic advertising. We enable leading apps, agencies, media buyers, and brands to run programmatic customer acquisition, retargeting, and brand performance campaigns through our self-serve and managed service options. Built on the three core pillars of performance, transparency, and control, Kayzen powers the world’s best mobile marketing teams with bespoke solutions that fuel business growth and deliver a competitive advantage. With an unprecedented scale of 160B+ daily ad requests from 1.6B+ unique users worldwide, we serve up to 1B+ ads per day in 180 countries. Kayzen is accessible through our APIs and user interface.

The Team

The Platform Engineering team owns the real-time bidding (RTB) platform, a real-time budget system, real-time event processing, a stream data processing engine, and multiple other complex, large-scale distributed components and data pipelines. We work closely with data scientists, data analysts, and product and business teams. We are responsible for some of the most technically challenging work, for example:
- Handling ~2 million requests/sec with sub-millisecond latency
- Handling and managing petabytes of data
- Managing distributed systems deployed across multiple data centers
- Optimizing the JVM and Linux kernel for optimal performance
- Managing our own data centers spread across the globe, consisting of thousands of powerful servers
- Working on some of the most challenging problems in Ad-Tech

Sounds interesting, doesn't it?

About the Role

We are looking for a Lead DevOps Engineer to build and scale the backbone of our global RTB platform. You will move beyond manual server management to engineer automated, self-healing infrastructure that handles petabytes of data with sub-millisecond latency. This role is about treating our private data centers like a programmable cloud.

Responsibilities

Engineering and Provisioning
- You develop and maintain automated provisioning pipelines (PXE, ZTP) to deploy bare-metal servers at scale across global data centers, configuring hardware, peripherals, services, settings, directories, storage, etc. in accordance with standards and project/operational requirements.
- You research and recommend innovative and automated approaches for system administration tasks.

Operations and Support
- You perform regular security monitoring to identify any possible intrusions.
- You repair and recover from hardware or software failures, coordinating with impacted teams.

Maintenance
- You apply OS patches and upgrades on a regular basis, and upgrade administrative tools and utilities.
- You maintain data center environmental and monitoring equipment.
- You perform ongoing performance tuning, hardware upgrades, and resource optimization as required.

Team Leadership & Collaboration
- You act as a technical lead and trusted point of contact for the infrastructure team, helping drive operational excellence and engineering best practices.
- You support mentoring and onboarding of engineers, improve team collaboration and communication, and contribute to scaling the team as the infrastructure organization grows.
- You report directly to the CTO while partnering closely with other tech teams to improve reliability, automation, monitoring, and incident response processes across the infrastructure stack.

Requirements
- You have at least 8 years of experience in DevOps, system administration/debugging, scripting, and related tools.
- You have 2+ years of team lead/people management experience.
- You are flexible to work in rotational shifts as part of the incident response team.
- You have good knowledge of coding with Shell and Python/Java, and of SQL commands.
- You have hands-on experience with Terraform, Ansible, or Puppet/Chef for managing bare-metal configurations and automations.
- You have a strong understanding of L4/L7 load balancing (HAProxy/Nginx) and network performance tuning (TCP/IP stack optimization).
- Good knowledge of pipeline/orchestration tools such as Jenkins/Airflow and similar platforms is mandatory.
- Good knowledge of observability platforms (Prometheus, Grafana, InfluxDB) is mandatory.
- Good knowledge of Kubernetes is an added advantage.
- You have hands-on experience with Unix/Linux.
- You have a bachelor's degree with a technical major, such as engineering or computer science.
- A system administration/system engineer certification in Unix is an added advantage.

What do we offer?
- Reporting directly to the Co-founder & CTO
- Direct access to top management and an extremely “visible” role
- Exceptional career growth and learning opportunity
- Fully remote work setup
- A unique opportunity to be part of an experienced team of industry experts and entrepreneurs who bring massive change to the Adtech market
- A fun, driven, and multinational team located across Germany, India, UK, Argentina, Ukraine, Turkey, Spain, and many more countries
- A flexible work-from-home arrangement
- A 500-dollar home-office setup budget
- A 1000-dollar annual learning and development budget

- We will store your data for 18 months
- You can also withdraw your consent at any given point.
- To read more about our privacy policy, click here