H

Harish Nagallapati

About

Detail

Bavaria, Germany

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • C
    Data Engineer/Backend Software Engineer
    Carfax Europe
    May 2024 - Current (2 years 1 month)
    Building a central data platform to enable independent data retrieval from all external raw data sources with microservices built around Kafka using TDD and pair programming. Implemented a framework on top of Apache Airflow enabling data analysts to easily setup data retrieval from a new raw data source. Tech: Python, Java, Kotlin, Spring Boot, Airflow, AWS RDS/Aurora, S3, Athena, EMR, Spark, Kafka, DuckDB, Postgres, MongoDB, Kubernetes (EKS), Helm, Gradle.
  • CARFAX
    Associate Backend Software Engineer/Data Engineer
    CARFAX
    Feb 2023 - Apr 2024 (1 year 3 months)
    Worked on a team of 10+ people to determine the quantified "reliability" of over 200 million US vehicles which will be consumed by major customer-facing products within the company. Developed, optimized and automated Apache Spark data pipeline ingesting billions of vehicle service records to generate a final dataset in the 1000s of records using pair-programming and TDD. Led efforts to backup HDFS datasets to AWS S3 for disaster recovery and decommissioned all legacy AWS constructs with security vulnerabilities. Spearheaded the jobs-as-code automation refactor initiative of team's Spark data pipeline. Mentored an intern who will likely return in the following year. Led the team's initiative to migrate deployment of Spark pipeline from Jenki
  • CARFAX
    Associate Backend Software Engineer
    CARFAX
    Mar 2022 - Jan 2023 (11 months)
    Worked in the Data Technologies department reducing the run-time of a legacy system by 2 orders of magnitude while in a culture of pair-programming and TDD. Created applications to export more granular data to AWS S3 on a faster basis empowering business-minded teams to make higher quality decisions faster. Planned the migration of the company's file archiving solution by replacing an on-premises Mongo database with AWS S3 which will eliminate $20k+ per year costs. Created reusable Gitlab CI templates reducing pipeline creation time for many teams within the department. Tech: Java, Spring Boot, AWS, SQL, Docker, Okta, Gradle, Mockito, Vault, Gitlab, JUnit.
Education verified_user 0% verified
  • Western University
    Bachelor of Science in Computer Science
    Western University
    Sep 2016 - May 2021 (4 years 9 months)