Data Engineer
Cargill
Jun 2018 - Oct 2019 (1 year 5 months)
● Performed Data Analysis, Data Migration, Data Cleansing, Transformation, Integration, Data Import, and Data Export through Python.
● Implemented Hadoop jobs on a EMR cluster performing several Spark, Hive & Map Reduce Jobs for processing data for building recommendation Engines, Transactional fraud analytics and Behavioral insights.
● Worked on Analyzing and Developing Complex SQL queries, Stored Procedures, ETL Mapping for application development.
● Developed the code for Importing and exporting data into HDFS and Hive using Sqoop.
● Worked on AWS and BIG Data Technologies like HDFS, HIVE, SQOOP, EMR, SPARK AWS, REDSHIFT, EMR, EC2, DATA PIPELINE.
● Developed a Python script to integrate DDL changes between on-Prem Talend warehouse and sn