S

Saddam Khan

About

Detail

AI | ML | LLM | Cloud | BigData | GCP | AWS | Azure | DevOps | SRE
Berlin, Germany

Contact Saddam regarding: 
work
Full-time jobs

Timeline


work
Job

Résumé


Jobs verified_user 0% verified
  • FIFA
    Freelance Data Engineer
    FIFA
    Feb 2024 - Current (2 years 4 months)
    • Archi tected end to end delivery of Enterpri se Data and Analyti cs products whi ch i ncludes various data platforms li ke Enterpri se Football Data Hub ( FDH) , Enterpri se Data Warehouse (EDW) , Data Sci ence and Analyti cs platforms i n AWS Cloud . • Understand the data needs across the organi zation and for developi ng and executi ng a roadmap to maximi ze data capabi li ti es . Bui ld and mai ntai n a well-governed , trusted enterpri se data and analyti cs platform for AI use cases . • Develop HVR and FiveTran framework to support modern real-time data repli cation across diverse databases and systems to mi grate large datasets seamlessly across cloud and on-premi se envi ronments wi th FiveTran . • Developed framework for proces
  • TIER Mobility
    Senior Data Engineer
    TIER Mobility
    Feb 2022 - Feb 2024 (2 years 1 month)
    • Archi tecti ng and developi ng Framework for Data Mi gration i n AWS for processi ng data as Batch , Streami ng and Events for AI and ML use cases . • Developed ETL data processi ng pi peli nes for scalable data mi gration by processi ng Batch (S3) and Streami ng(Ki nesi s) Data on AWS usi ng Spark EMR, Lambda , Athena , Glue , S3 , Athena , RDS , DynamoDB, Redshi ft , etc . • Developed Glue Spark jobs to perform i n memory data processi ng and improvi ng performance by applyi ng operations such as i ndexi ng, wi ndowi ng or rollup queri es on large dataset . • Developed EMR spark jobs for cleani ng, processi ng and analyzi ng raw data residi ng i n DataLake thereby creati ng subsequent DataMarts and Data Warehouses . • Developed Lam
  • Wayfair
    Senior Data Engineer
    Wayfair
    Mar 2020 - Jan 2022 (1 year 11 months)
    Technology : GCP , Beam, Spark, Kafka , DBT, Snowflake , Ai rflow, Gi t , Terraform, Docker , Kubernetes , Grafana , Prometheus , Python , Java , SQL , Unix • Archi tected and desi gned ETL data processi ng pi peli nes for scalable data mi gration by processi ng Batch (GCS Buckets) and Streami ng(PubSub) Data on GCP usi ng Dataflow, Dataproc , CloudFunction/Cloud Run , Bi gQuery, Bi gTable , Cloud SQL , Cloud Scheduler , Workflows , Cloud Composer , etc . • Developed framework for processi ng batch and streami ng data under Lambda archi tecture , leveragi ng Apache Spark Dataproc and Dataflow. • Developed framework for handli ng records level batch and streami ng data processi ng under Lambda archi tecture , leveragi ng Apache Beam Dataf
  • Emirates Airlines
    Data Engineer
    Emirates Airlines
    Mar 2018 - Jan 2020 (1 year 11 months)
    • Led Desi gni ng, Archi tecti ng and Creati ng Data Mesh around the exi sti ng Data warehouse for enabli ng fast data movement across multi ple busi ness uni ts wi th long term vi sion of removi ng data dependenci es between teams . • Developed data mi gration framework for enabli ng bulk data mi gration from on premi se systems (RDBMS , Log Servers , Fi les Folders , etc) to HDFS (CDH) wi th Apache Camel an orchestration usi ng Oozi e Scheduler . • Developed Query Translator (Teradata queri es to Hive Queri es) leveragi ng ANTLR for translati ng structured Teradata SQL to Hive SQL , whi ch boosted i n-house mi gration from Teradata to Hive . • Bui lt Near to Real time ODS under Lambda Archi tecture (i . e . Batch & Streams) to Collect
  • Target
    Data Engineer
    Target
    May 2016 - Apr 2018 (2 years)
    Kafka, Data Science, Machine Learning, Sci-Kit Git, Qlikview, Python, Java, Scala, SQL, Unix • Developed MapReduce jobs for batch for cleaning, processing and loading raw data residing in DataLake thereby creating subsequent DataMarts and Data Warehouses for Data Scientists and Business Stakeholders. • Developed Hive jobs and Pig Latin scripts for applying data transformation and aggregation on raw data and improving performance by applying operations such as indexing, windowing or rollup queries on large dataset. • Developed Spark jobs to perform in memory data processing and improving performance by applying operations such as indexing, windowing or rollup queries on large dataset. • Developed and maintained Oozie workflows to orchest
  • P
    Data Engineer
    PUBLICIS.SAPIENT
    May 2014 - Apr 2016 (2 years)
    • Microservice Architecture Implementation in AWS:- Technology used : Python, SQL, Unix ◦ Designed and implemented scalable microservices architecture using AWS Lambda, API Gateway, and DynamoDB. ◦ Built RESTful APIs in Python and integrated them with AWS services to support real-time data processing. ◦ Developed and maintained backend services in Node.js and Python for high-traffic applications hosted on AWS EC2 and RDS. ◦ Integrated AWS SQS and SNS for asynchronous task processing, enabling reliable and scalable background operations. ◦ Designed CloudWatch Alarms and dashboards to proactively monitor system health and reduce downtime by 20%. ◦ Collaborated with the security team to implement AWS WAF (Web Application Firewall)
Education verified_user 0% verified
  • V
    Bachelor of Technology(Computer
    Visvesvaraya Technical University
  • G
    Google CProfessional Cloud Architect
  • C
    Cloudera Certified Professional Data Engineer
  • G
    Google Certified Cloud Associate
  • A
    AWS Certified Solution Architect
  • O
    Oracle certified Java Programmer
This is a community-created genome.