AI-Powered E-Commerce Analytics Engine with RAG and Agentic Workflows
Independent project
Apr 2026 - May 2026 (2 months)
• Engineered a Medallion Architecture pipeline (Bronze → Silver → Gold) orchestrated by Apache Airflow, transforming 100K+ raw Olist e-commerce records into a star schema with fact and dimension tables in PostgreSQL
• Built a RAG pipeline using Voyage AI embeddings and ChromaDB, enabling semantic search over 40K+ customer reviews for qualitative business insights with sub-second query response times
• Designed an agentic LangGraph workflow with intelligent query routing — automatically classifying user questions and directing them to SQL, RAG, or hybrid retrieval paths for comprehensive answers
• Containerized the full stack (FastAPI, Streamlit, Airflow, PostgreSQL, ChromaDB) using Docker Compose, enabling one-command deployment and full re