Lead Data Engineer & Data Scientist
Smart Alto
May 2022 - Jan 2024 (1 year 9 months)
• Led end-to-end GCP data engineering for a US residential real estate platform, ingesting and serving billions of property data points across analytical and transactional workloads.
• Designed and maintained scalable streaming and batch ETL/ELT pipelines using Apache Beam on Cloud Dataflow; implemented Pub/Sub topics and subscriptions for event-driven, exactly-once message processing at scale.
• Architected BigQuery data warehouse with multi-layer data modeling (raw → curated → serving), leveraging partitioned and clustered tables, authorized views, column-level security, and row-level access policies for strict governance.
• Orchestrated all workflows via Cloud Composer (Airflow 2.x), managing DAG lifecycle, SLA monitoring, alerting, and