Sushma Bandugula

About

Detail

Texas, United States

Jobs
  • Baylor Scott & White Health
    Azure Data Engineer
    Jan 2022 - Current (4 years 6 months)
    Designed and implemented data models for various business units using Power BI, improving data accuracy and accessibility. Wrote complex Hive, HBase, and DataStage queries to load and process data in the Hadoop File System, and tuned their performance. Loaded data into Spark DataFrames and used Spark SQL to explore the data. Read files from the source (S3) and made code changes to job modules and their dependent tables according to business user needs. Led the development of a centralized data warehouse using SAS, consolidating data from multiple sources for unified reporting. Optimized ETL processes in SAS to improve data processing efficiency and reduce load times by 25%. Designed and implemented data pipelines using Azure Synapse Pipel
  • Goldman Sachs
    Azure Data Engineer
    Dec 2019 - Dec 2022 (3 years 1 month)
    Used Databricks extensively with Azure Data Factory (ADF) to process large volumes of data efficiently. Ran ETL operations in Azure Databricks, connecting to a range of relational database source systems through JDBC connectors. Developed Python scripts in Databricks for file validation and automated them with ADF. Built an automated Azure cloud process for daily data ingestion from web services, loading the data into Azure Data Lake Gen2. Analyzed data in place by mounting Azure Data Lake and Blob storage to Databricks. Used Logic Apps to drive decision-making actions within the workflow. Engineered custom alerts ut
  • Capgemini
    Azure Data Engineer
    Aug 2018 - Dec 2019 (1 year 5 months)
    Designed and implemented data storage solutions using Azure services such as Azure SQL Database, Azure Cosmos DB, and Azure Data Lake Storage. Developed PySpark scripts to ingest data from source systems such as Azure Event Hubs into Delta tables in Databricks in reload, append, and merge modes. Familiar with Azure Data Explorer (ADX) for real-time analytics and monitoring of streaming data sources in Azure environments. Optimized performance on both Teradata and Snowflake SME platforms, using query tuning, indexing strategies, and resource allocation to improve query performance and reduce latency. Wrote PySpark and Spark SQL transformations in Azure Databricks to perform complex tra
Education
  • Lindsey Wilson College
    Master's in Computer Information Systems