A

Anish Simkhada

About

Detail

Data Engineering Leader: Transforming Raw Data into Strategic Insights with Python, SQL & AWS | Excellence in Scalable and Robust Data Solutions
Troy, Michigan, United States

Contact Anish regarding: 

work
Full-time jobs

Timeline


work
Job

Résumé


Jobs verified_user 0% verified
  • Capital One
    Data Engineer public Remote experience
    Capital One
    Apr 2020 - Current (5 years 2 months)
    Engineered and maintained scalable data pipelines using AWS Glue, Apache Spark, Azure Data Factory, focusing on efficient data extraction, transformation, and loading, and processing over 5TB of data daily. Led real-time data processing with Apache Kafka, Spark Streaming, PySpark, and Apache Flink, deriving insights from streaming data sources that enhanced decision making process. Collaborated with data science teams on Databricks and Azure Databricks platforms for data transformation, analysis, and machine learning model deployment. Managed cloud-based data warehouses including Amazon Redshift, Azure SQL DB, and Snowflake, implementing indexing, partitioning, and optimization techniques that improved data access speed by 30%. Designed
  • General Motors
    Data Engineer/Analyst
    General Motors
    Sep 2018 - Mar 2020 (1 year 7 months)
    Drove data analysis initiatives using SQL and Python (Pandas, NumPy), enhancing operational efficiency by 25% through insightful data interpretation and statistical analysis. Implemented data pipelines for data ingestion, cleaning, and transformation, ensuring 99% data accuracy and integrity. Contributed to the creation and optimization of data models, upholding data integrity and best practices. Analyzed data to identify trends and patterns, aiding data-driven decision-making. Developed and presented actionable insights through reports and visualizations in Power BI, effectively communicating findings to stakeholders.
  • UnitedHealth Group
    Data Engineer
    UnitedHealth Group
    Jan 2017 - Aug 2018 (1 year 8 months)
    Designed and implemented scalable data architectures using Hadoop, Hive, and Spark for processing vast amounts of structured and unstructured data. Developed robust ETL pipelines with PySpark and Databricks, ensuring high data quality and timely delivery for analytics. Optimized data processing tasks, including in-memory processing, for optimal throughput and speed. Collaborated with Database Administrators for seamless data flow between traditional databases and big data platforms. Leveraged Snowflake for cloud-based data warehousing, ensuring easy access for analytics. Administered user roles and privileges in Azure Data Lake Storage and other cloud platforms for data security and compliance. Drove continuous data migration and integratio
Education verified_user 0% verified
  • U
    Bachelor's of Science in Computer & Information Systems
    University of MichiganDearborn
  • U
    Bachelor's of Science in Cyber Security & Information Assurance
    University of MichiganDearborn