Data Engineer
Itransition Group
Mar 2025 - Dec 2025 (10 months)
As a Data Engineering Intern, I built and maintained scalable ETL/ELT pipelines using Microsoft Fabric, PySpark, Airflow, and AWS Lambda. Implemented Medallion Architecture (Bronze → Silver → Gold) with Delta Lake to ensure data quality, deduplication, schema enforcement, and performance optimization.
Key achievements:
• Developed an interactive bookstore analytics dashboard (Python Pandas + Streamlit) analyzing sales, orders, and user data — identified top revenue days and customer spending patterns.
• Performed advanced data cleaning and aggregation on large sales datasets, creating visualizations for daily trends and top customers.
• Completed 6-month intensive training focused on commercial data engineering workflows, ETL best pract