Data Engineer with 5+ years of experience designing and optimizing large-scale, cloud-native data infrastructure at OpenAI, Apple, and JPMorgan Chase. Specialized in real-time and batch data processing, scalable ETL/ELT pipelines, and ML feature engineering. Proficient in Python, PySpark, Kafka, dbt, and Airflow, with deep hands-on experience across AWS, Azure, Snowflake, and the Hadoop ecosystem (HDFS, YARN, Hive, Spark). Adept at containerization (Docker, Kubernetes), CI/CD automation, ML experiment tracking with MLflow, and data quality validation with Great Expectations. Strong foundation in data warehousing, SQL optimization, and Agile development practices.