Kalyan S.

Kalyan S.

About

Detail

Data Engineer
Texas, United States

Contact Kalyan regarding: 
work
Full-time jobs

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • PNC
    Data Engineer
    PNC
    Feb 2023 - Current (3 years 3 months)
  • University of North Texas
    Graduate Assistant
    University of North Texas
    Sep 2021 - Jan 2022 (5 months)
  • Tech Mahindra
    Data Engineer
    Tech Mahindra
    Jul 2019 - Aug 2021 (2 years 2 months)
    Developed multiple POCs using Pyspark and deployed on the Yarncluster,compared the performance of Spark,with Hive and SQL/Teradata and developed code in reading multiple data formats on HDFS using Pyspark.  Loaded the data into Spark data frames and perform in-memory data computation to generate the output as per the requirements. Worked on AWS Cloud to convert all on premise,existing processes and databases to AWSCloud. Design and Develop ETL Processes in AWS Glue to migrate Campaign data from external sources like S3, ORC/Parquet/Text Files into AWS Redshift. Experience in ETL/Pipeline Development using tools such as Azure Databricks,Matillion,Apache Spark,andPython  Extract Transform and Load data from Sources Systems to Azure Data Stor
  • Tech Mahindra
    Data Engineer
    Tech Mahindra
    Jul 2019 - Aug 2021 (2 years 2 months)
    Developed multiple POCs using Pyspark and deployed on the Yarncluster,compared the performance of Spark,with Hive and SQL/Teradata and developed code in reading multiple data formats on HDFS using Pyspark.  Loaded the data into Spark data frames and perform in-memory data computation to generate the output as per the requirements. Worked on AWS Cloud to convert all on premise,existing processes and databases to AWSCloud. Design and Develop ETL Processes in AWS Glue to migrate Campaign data from external sources like S3, ORC/Parquet/Text Files into AWS Redshift. Experience in ETL/Pipeline Development using tools such as Azure Databricks,Matillion,Apache Spark,andPython  Extract Transform and Load data from Sources Systems to Azure Data Stor
  • S
    Data Engineer
    Souxe Technologies
    Jun 2018 - Jun 2019 (1 year 1 month)
    Designed,developed,and maintained data pipe lines using tools such as ApacheKafka,AWS Kinesistoingest data from various sources and load it into data warehouses such as Snowflake, Redshift, or BigQuery. Built and optimized ETL workflows using technologies like Apache Spark, Apache Airflow to transform data into the desired format for downstream analysis and visualization. Implemented data quality checks and validationprocesses to ensure the accuracy and consistency of data, including outlier detection, missing value imputation, and data profiling. Managed and maintained data bases such as MySQL,PostgreSQL,orOracle,including schema design,query optimization, and performance tuning.
  • S
    Data Engineer
    Souxe Technologies
    Jun 2018 - Jun 2019 (1 year 1 month)
    Designed,developed,and maintained data pipe lines using tools such as ApacheKafka,AWS Kinesistoingest data from various sources and load it into data warehouses such as Snowflake, Redshift, or BigQuery. Built and optimized ETL workflows using technologies like Apache Spark, Apache Airflow to transform data into the desired format for downstream analysis and visualization. Implemented data quality checks and validationprocesses to ensure the accuracy and consistency of data, including outlier detection, missing value imputation, and data profiling. Managed and maintained data bases such as MySQL,PostgreSQL,orOracle,including schema design,query optimization, and performance tuning.
Education verified_user 0% verified
  • University of North Texas
    Master's degree, Data science
    University of North Texas
    Aug 2021 - Dec 2022 (1 year 5 months)
  • GITAM Deemed University
    Bachelor of Technology, Computer Science Engineering
    GITAM Deemed University
    Jan 2014 - Jan 2018 (4 years 1 month)