
Manasvini V Kajjam

About

Detail

Actively looking for full-time roles as Data Scientist / Data Engineer / Data Analyst | Master's in CIS & IT | University of Central Missouri
Frisco, Texas, United States

Contact Manasvini regarding: 
Full-time jobs

Timeline


Job
Education
Project

Résumé


Jobs
  • Data Engineer
    Quebec Sol
    Jul 2023 - Current (3 years 1 month)
    As a Data Engineer, I have been extensively involved in the installation and configuration of the Cloudera Hadoop Distribution.
    • Performed data exploration, data visualization, and feature selection using Python and Apache Spark.
    • Used Python libraries such as NumPy and Matplotlib, and scaled scikit-learn machine learning algorithms using Apache Spark.
    • Worked with data scientists and MLOps engineers to build data pipelines using Azure Data Factory, Azure Databricks, and Azure SQL.
  • Data Engineer
    Myntra
    Dec 2020 - Dec 2022 (2 years 1 month)
    • Responsible for the execution of big data analytics, predictive analytics, and machine learning initiatives.
    • Developed Scala scripts and UDFs using both DataFrames/SQL and RDDs in Spark for data aggregation, queries, and writing results back to the S3 bucket.
    • Wrote, compiled, and executed Apache Spark programs in Scala to perform ETL jobs on ingested data.
    • Dockerized applications by creating Docker images from Dockerfiles.
    • Used Spark Streaming to divide streaming data into batches as input to the Spark engine for batch processing.
    • Wrote Spark applications for data validation, cleansing, transformation, and custom aggregation, and used the Spark engine and Spark SQL for data analysis and provided them to the data sci
  • Data Analyst
    Sparsh Technologies
    Jan 2020 - Nov 2020 (11 months)
    As a seasoned Data Analyst, I developed Spark applications for large-scale transformation and denormalization of relational datasets.
    • Developed Spark code using Scala and Spark SQL/Streaming for faster data processing.
    • Designed a custom Spark REPL application to handle similar datasets.
    • Used Hadoop scripts for HDFS (Hadoop Distributed File System) data loading and manipulation.
    • Performed Hive test queries on local sample files and HDFS files.
    • Created and maintained optimal data pipeline architecture in Microsoft Azure using Data Factory and Azure Databricks.
    • Developed the application in the Eclipse IDE.
    • Gained hands-on experience with Kafka-Storm on the HDP 2.2 platform for real-time analysis.
    • Involved in importing real-time
Education
  • Master's in CIS & IT
    University of Central Missouri
    Jan 2022 - May 2023 (1 year 5 months)
  • Bachelor of Technology - BTech
    Sri Indu College Of Engineering Technology
    Aug 2017 - Jul 2021 (4 years)
Projects (professional or personal)
  • Streaming Data Analytics with Kafka and Spark Streaming
    Jun 2020 - Nov 2020 (6 months)