Yash Hanumant Chaudhari

Yash Hanumant Chaudhari

About

Detail

India

Contact Yash regarding: 
connect_without_contact
Finding mentors
Finding co-founders
groups
Networking

Timeline


work
Job
school
Education
folder
Project

Résumé


Jobs verified_user 0% verified
  • T
    Data Engineer Analyst
    TREDENCE(Walmart Project)
    Aug 2025 - Current (11 months)
    • Migration of data pipelines from Azure to GCP within Databricks, implementing PySpark and SQL-based ETLjobs to ensure accurate and efficient data transfer. • Automated schema mapping, validation, and retrofitting scripts in PySpark, reducing manual validation effort by 60% and improving data consistency across environments. • Created detailed technical documentation, data validation reports, and job runbooks, improving maintainability and cross-team knowledge transfer.
  • accenture
    Associate Software Engineer
    accenture
    Oct 2023 - Aug 2025 (1 year 11 months)
    •Developed and optimized SQL queries, joining tables with more than 200 columns and calculating metrics that provided actionable insights, directly influencing leadership decisions. • Created 10+ views and production-ready datasets for the Data Science team, improving their model development efficiency by 30%. • Ensured high-quality data transformations by adhering to best practices in ETL and big data processing, improving system efficiency and data reliability. • Automated daily data transformations using Airflow, reducing manual intervention by 80% and ensuring real-time availability of the latest data in the production environment.
Education verified_user 0% verified
  • D
    Databricks Certified Data Engineer Associate
    Nov 2024 - Current (1 year 8 months)
  • M
    Microsoft Azure Data Fundamentals
    Nov 2024 - Current (1 year 8 months)
  • G
    Google Cloud Associate Engineer
    Nov 2024 - Current (1 year 8 months)
  • G
    B.tech
    G.H Raisoni College of Engineering
    Sep 2019 - Jul 2023 (3 years 11 months)
Projects (professional or personal) verified_user 0% verified
  • C
    Credit Card Fraud Detection
    Feb 2025 - Jun 2025 (5 months)
    • Built an end-to-end fraud detection system using AWS S3, Databricks, and PySpark, ensuring real-time fraud alerts with Kafka streaming. • Implemented real-time stream processing using Kafka and PySpark to ingest and process JSON card_transactions, enabling real-time fraud detection. • Validated key fraud parameters with a Delta format lookup table, reducing processing time by 40%.
  • E
    ETL Pipeline for Spotify API using AWS
    Oct 2024 - Jan 2025 (4 months)
    • Automated ETL Workflow: Extracted Top 50 - India playlist data from Spotify API, triggered by CloudWatch and processed via AWS Lambda, storing raw and transformed data in S3. • Metadata & Querying: Used AWS Glue Crawler to infer schemas and update the Data Catalog, enabling efficient querying via Amazon Athena. • & Serverless: Leveraged AWS Lambda, S3, and Athena for a cost-effective, serverless pipeline, ensuring automatic scaling, monitoring (CloudWatch), and efficient storage.