T

Tariq Farooq

About

Detail

United States

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • InfluxData
    Lead Data Engineer
    InfluxData
    Jan 2022 - Current (4 years 5 months)
    Architectural Leadership: Led the modernization of enterprise data platforms, migrating on-prem systems to a cloud-native AWS/Databricks architecture, cutting operational costs by 35% and boosting platform performance by 60%. Analytics Engineering: Architected and built scalable ELT pipelines using Airflow and dbt to create complex dimensional models in Snowflake and Redshift, ensuring high-quality, analytics-ready data assets for BI and reporting. Real-Time Streaming: Delivered low-latency, real-time streaming solutions using Kafka, Spark Streaming, and AWS Lambda, reducing data latency by 50% and enabling continuous operational decision-making across supply chain and production. MLOps & GenAI Readiness: Partnered with ML teams to build ML
  • Immuta
    Senior Data Engineer
    Immuta
    Mar 2018 - Dec 2021 (3 years 10 months)
    Data Platform Ownership: Served as a key leader in the data team, taking ownership of major data infrastructure components and migrating systems to AWS S3, Glue, Athena, Redshift, EMR, and Databricks. Pipeline Development: Designed and developed robust batch and real-time data workflows using Airflow, dbt, and PySpark to support large-scale data processing and transformation needs. Mentorship & Best Practices: Defined and enforced data engineering best practices, including CI/CD automation, testing strategies, and version control (Git), leading code reviews, and providing mentorship to junior team members. Cloud Deployment: Proven experience deploying data solutions on AWS and Azure, leveraging native services and Infrastructure-as-Code (Te
  • Magnite
    Data Engineer
    Magnite
    Mar 2015 - Feb 2018 (3 years)
    Developed core ETL/ELT workflows using Python (Flask), SQL, and AWS Glue, transforming Financial and healthcare/CMS data into Redshift. Optimized Airflow DAGs, reducing runtimes by 30% and containerizing pipelines with Docker for robust deployment. Integrated SageMaker ML models via Flask APIs for fraud detection, improving anomaly accuracy and supporting real-time decision pipelines. Ensured SOX, HIPAA, GDPR, and CMS compliance with encryption, audit trails, and secure data governance. Enhanced Tableau and Power BI performance via SQL tuning, dimensional modeling, and optimized queries using Athena and Redshift.
Education verified_user 0% verified
  • B
    Bachelor of Computer Science
    Feb 2010 - Feb 2014 (4 years 1 month)