Mehroz Alam

Mehroz Alam

About

Detail

Sindh, Pakistan

Contact Mehroz regarding: 
Flexible work
Starting at USD25/hour

Timeline


work
Job
school
Education
folder
Project

Résumé


Jobs verified_user 0% verified
  • B
    SeniorDataEngineer public Remote experience
    Berlin Brands Group,
    Nov 2023 - Current (2 years 7 months)
    – Implemented medallion architecture (bronze/silver/gold) for structured data processing, reducing query time by 40% – Built efficient ETL pipelines with Python, Spark, and Azure Data Factory, achieving 3x faster processing and 30% cost reduction. – Established robust data quality frameworks with automated validation checks, ensuring 99% data accuracy and increasing stakeholder satisfaction through reliable reporting – Built and maintained data models in Azure Synapse Analytics to support business intelligence and analytics requirements – Delivered 15+ ETL pipelines, cutting report generation time by 75% for data-driven decisions – Mentored 5 junior engineers, improving team delivery speed by 30% and streamlining code review process
  • O
    SeniorDataEngineer
    Odyssey Solutions,
    Sep 2022 - Jul 2023 (11 months)
    – Revitalized end-to-end web data crawlers for various websites using Python and Selenium, efficiently scheduled through Airflow, processing 100K+ daily records with 99% accuracy rate – Architected cost-efficient data pipelines with AWS Lambda and Glue Jobs, reducing processing expenses by 45% – Utilized Pyspark to create and deploy Glue Jobs, enabling seamless file drops to S3 and data loading into databases. – Led database evaluation initiative that tested 5 database solutions, resulting in 40% lower operational costs and 50% faster query performance after final implementation
  • O
    Data Scientist &Engineer
    OnStak,
    Apr 2019 - Aug 2022 (3 years 5 months)
    – Engineered and deployed production ML models achieving 25% improved prediction accuracy through advanced algorithm optimization – Developed comprehensive feature engineering processes that identified critical patterns in complex datasets, enhancing model performance by 10% – Built robust data preprocessing pipelines using SQL operations in pandas, ensuring 99% data quality standards for model training – Constructed scalable ETL workflows with PySpark for daily batch processing, reducing data preparation time by 15% – Applied statistical analysis and data visualization techniques to uncover actionable insights and trends in complex datasets
Education verified_user 0% verified
  • FAST NUCES
    Bachelor of Science in
    FAST NUCES
    Sep 2014 - Jun 2018 (3 years 10 months)
Projects (professional or personal) verified_user 0% verified
  • G
    Google Ads Marketing Pipelines
    Jan 2025 - Current (1 year 5 months)
    Goal: Analyze Google Ads data in the Data Warehouse (DWH) for informed decision-making – Designed & implemented a robust medallion architecture for efficient data processing and integration into the Data Warehouse (DWH) – Extracted data from the Google Ads API and stored it in Azure's landing (Raw) storage – Executed data transformation & validation procedures – Ingested data into Data Warehouse for seamless flow
  • Odyssey
    Vehicle sales Prediction System for Ford Corporation
    Odyssey
    Jan 2022 - Mar 2023 (1 year 3 months)
    Goal: Optimize vehicle sales strategy by predicting regional demand based on vehicle features – System that predicts the region based sales of a specific vehicles based on it's features. – Running ETL process on (7 GBs) of data for mining – Data preprocessing by SQL operation in pandas. – Random Forest model is used for sales predictions. Libraries: Numpy, Pandas, Scikit-learn, Matplotlib, Regex.