Shravika Pinna

E

Data Engineer

EnerSys

Jan 2023 - May 2023 (5 months)

Built optimized data pipeline architecture in Azure using Azure Data Factory and Azure Data bricks, handling hashed and un hashed data from XML files to data lake. Designed and implemented database solutions in Azure Synapse and utilized HQL to create Hive target tables for storing data after PIG ETL operations. Proficient in dealing with views, indexes, stored procedures, SQL, T-SQL, and PL/SQL scripts, among other database application components. Partnered with VP-level leaders to define and prioritize new initiatives supporting the migration of existing SSIS on-premises systems to Azure cloud.

Data Analyst Intern

Rowan University

Jan 2022 - Dec 2022 (1 year)

Designed use cases to capture functional requirements, UML diagrams to model system architecture, and ER diagrams to represent data relationships. Prepared comprehensive BRDs that capture business needs, objectives, and constraints, focusing on data relationships and requirements.

L

Data Engineer

Larsen Toubro Infotech Ltd LTI

Apr 2021 - Dec 2021 (9 months)

Engineered and maintained components for HDFS, Hive, Spark, and Kafka, handling an average of 1TB of data daily and improving data throughput by 25%. Implemented parallelization strategies in Hive, optimizing over 500 searches and reducing query times by up to 40%. Spearheaded a proof-of-concept cluster implementation for HBase, improving its performance and reducing its drawbacks by 30%. Built and managed 100+ Hive target tables using HQL, facilitating the analysis of over 500GB of semi-structured data through PIG Latin Scripts.

Data Engineer

CGI Group Inc.

Jan 2020 - Apr 2021 (1 year 4 months)

Created functions in AWS Lambda for event-driven processing and optimized storage classes on AWS S3, resulting in a 30% cost reduction. Contributed to the integration of EMR with Spark 2, S3 storage, and Snowflake using AWS. Built a data lake and optimized computational resources using PySpark on AWS EMR. Utilized AWS Glue, Spark, and Airflow to build data pipelines, cutting data sync time with the source system in half to just 4 hours. Initiated new ETL pipelines to and from the data warehouse, developing key reports using advanced SQL queries in Snowflake. Automated Terraform scripts using Jenkins for data quality management.

D

Data Analyst

Jan 2018 - Jan 2020 (2 years 1 month)

Responsible for gathering requirements, system analysis, and design focusing on data-centric projects. Coordinated with the testing team to ensure adequate test coverage and facilitated user acceptance testing (UAT) for data related changes and implementations. Defined and communicated the impact of system changes on business processes and users, ensuring smooth data transitions and integrations.

Shravika Pinna

About

Detail

Timeline

Résumé