Senior AI Infrastructure Engineer with 11+ years of experience designing, deploying, and scaling production ML and LLM systems. Expertise in RAG pipelines, GPU-backed model serving, distributed systems, and Kubernetes-based infrastructure, delivering highly available AI platforms that handle millions of inference requests per month. Technical leader who has mentored backend teams of 5-10 engineers, defined engineering standards, driven architecture decisions, and led end-to-end ML infrastructure delivery from research validation to production deployment. Track record of measurable improvements in latency, throughput, reliability, and cost efficiency.