Avinash Sidhwani

Sr. Data Engineer (3+ yrs)
Riverside, California, United States


Résumé


Jobs
  • Big Data Research Assistant
    University of California
    Apr 2022 - Mar 2023 (1 year)
    • Used Spark, Spark SQL, and Scala to add data integration features and compute metadata for geospatial datasets in UCR STAR
    • Improved the performance and scalability of data processing APIs in UCR STAR (star.cs.ucr.edu) by 15% by developing parallel processing pipelines with Scala, Akka, and Slick (ORM)
    • Learned the code base of UCR BEAST, a Spark-based Java library built over several years by a diverse team of developers with limited documentation, in order to integrate it and fix bugs
    • Implemented location-aware features and visualizations based on the current or a selected location, using Java, JavaScript, and MongoDB
    • Regularly discussed high-level system architecture, design, and any blockers
  • Senior Data Engineer
    Quantiphi Inc
    Jul 2019 - Oct 2021 (2 years 4 months)
    Designed, developed, and tested batch and real-time data pipelines, optimizing data processing efficiency and costs, using Spark, Hive, and Hadoop distributed frameworks, AWS (Glue, Kinesis, EMR, Redshift), and GCP (Dataflow, Dataproc, Pub/Sub, BigQuery). Experienced with programming languages (Python, Scala, Java), ETL tools (Informatica), and developing REST APIs.
    • Enterprise Data Platform modernization for a US-based insurance company
      ○ Collaborated with solution architects to define and implement procedures for data asset migration and cutover from Teradata to AWS & Snowflake, while managing security aspects and internal dependencies and ensuring a seamless transition
      ○ Integrated 10+ diverse data sources into Enterprise Operational Data
Education
  • Master of Science - MS, Computer Science
    University of California Riverside
    Jan 2022 - May 2023 (1 year 5 months)
    GPA: 3.9/4.0
    Coursework: CS218 Design & Analysis of Algorithms; CS226 Big-Data Management; CS236 Database Management Systems; CS247 Principles of Distributed Computing; CS235 Data Mining Techniques; CS202 Advanced Operating Systems; CS225 Spatial Computing; CS208 Cloud Computing & Cloud Networking; CS206 Advanced Software Testing & Analysis; CS246 Software Verification
  • Bachelor of Engineering - BE, Computer Engineering
    University of Mumbai
    Sep 2016 - May 2019 (2 years 9 months)
    CGPI: 8.13/10.0
    Coursework: CPC801 Data Warehouse & Data Mining; CPC603 Distributed Databases; CSC402 Analysis of Algorithms; CSC302 Object Oriented Programming Methodology; CSC303 Data Structures; CPC504 Computer Networks; CSC405 Theoretical Computer Science; CSC305 Discrete Structures; CPC601 Compiler Construction; CPC702 Cryptography & Computer Security; CPC703 Artificial Intelligence; CPC803 Parallel & Distributed Systems; CSC406 Computer Graphics; CSC403 Computer Architecture; CPL601 Network Programming; CPL701 Network Threats & Attacks
Projects
  • Real-time Twitter Data Analysis in Spark
    University of California Riverside
    Sep 2022 - Dec 2022 (4 months)
    Developed a Spark streaming pipeline with a Kafka source to analyze scraped data. Developed the web app's backend REST APIs in Python to accept user input and visualize the output.
  • Credit Card Fraud Detection System
    University of California Riverside
    Sep 2022 - Dec 2022 (4 months)
    Developed a streaming pipeline in Apache Flink that reads a stream of transactions and outputs alerts for fraudulent transactions.
Awards
  • AWS Certified Solutions Architect - Associate
    Jan 2020 - Jan 2023 (3 years 1 month)
  • Google Cloud Certified Associate Cloud Engineer
    Oct 2019 - Oct 2022 (3 years 1 month)
Publications
  • Two-Level Text Summarization with Natural Language Processing
    Springer
    Jan 2020
    Presented the technical paper “Two-Level Text Summarization with Natural Language Processing” at the International Conference on Computer Networks and Inventive Communication Technology (ICCNCT 2019) (Springer) in May 2019.