● 4.7+ years of experience in analysis, design, development, and implementation as a Data Engineer.
● Good knowledge of project life cycle and SDLC methodologies including Agile and Waterfall.
● Highly skilled in the Big Data ecosystem, encompassing Hadoop, MapReduce, Hive, Apache Spark, Pig, Sqoop, and PySpark.
● Proficient in ETL processes using SSIS, Apache NiFi, Apache Kafka, and Talend, with a focus on designing and implementing scalable data solutions.
● Well-versed in cloud technologies, with expertise in AWS and Azure.
● Familiar with Python packages such as NumPy, Pandas, Matplotlib, Scikit-learn, Seaborn, and TensorFlow for advanced data analytics and machine learning.
● Experienced in reporting and visualization tools such as Tableau, Power BI, and SSRS to create interactive dashboards and reports.
● Good database management capabilities across MongoDB, MySQL, Teradata, and Snowflake, with a proven track record in query optimization.