Data Engineering Consultant
accenture
Jun 2021 - Feb 2024 (2 years 9 months)
Developed and maintained data pipelines for processing and analyzing large datasets using PySpark and Hive.
Worked on cloud platforms such as AWS (EMR, S3, Managed Apache Airflow, Lambda) and GCP (Dataproc,
Composer, PubSub, Cloud Task, CloudRun) to build scalable and reliable solutions.
Utilized databases such as Oracle, PostgreSQL, and Elasticsearch to store and manage data.
• Managed source code using Git and SVN for version control and collaborated with teams for code review.
• Analyzed production incidents and logs, performed data quality checks, and wrote unit tests to ensure high-quality
deliverables.
• Integrated code quality checks and security scans in the CI/CD pipelines using SonarQube, Snyk, and Blackduck
to ensure high-qual