Sai Achyuth Dasari

Sai Achyuth Dasari

About

Detail

data engineer
Albany, New York, United States

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • c
    cigna
    May 2023 - Current (3 years 2 months)
  • Cigna
    Data Engineer
    Cigna
    May 2022 - Nov 2023 (1 year 7 months)
    • Worked on creating a new project and new repositories in Bitbucket. Implemented bitbucket repositories in Bamboo for code deployment. • Implemented a 'serverless' architecture using API Gateway, Lambda, and Dynamo DB and deployed AWS Lambda code to Amazon S3 buckets. Created a Lambda Deployment function, and configured it to receive events from your S3 bucket. • Designed the data models to be used in data-intensive AWS Lambda applications which are aimed at doing complex analysis and creating analytical reports for end-to-end traceability, lineage, and definition of Key Business elements from Aurora. • On-demand, secure EMR launcher with custom spark-submit steps using S3 Event, SNS, KMS, and Lambda functions. Worked on developing Pyth
  • Cigna Healthcare
    Data Engineer
    Cigna Healthcare
    Apr 2022 - Current (4 years 3 months)
    complex analysis and creating analytical reports for end-to-end traceability, lineage, and definition of Key Business elements from Aurora. ● On-demand, secure EMR launcher with custom spark-submit steps using S3 Event, SNS, KMS, and Lambda functions. Worked on developing Python scripts to invoke a lambda using Secrets Managers in AWS for developing a pipeline. ● Developed a pipeline using lambda to get the data from the Postgres database to the AWS S3 bucket. Worked on creating an S3 bucket with subfolders using cloud formation templates. ● Used AWS glue catalog with crawler to get the data from S3 and perform SQL query operations. ● Used Pandas, NumPy, Seaborn, SciPy,
  • Cigna Healthcare
    Data Engineer
    Cigna Healthcare
    Apr 2022 - Current (4 years 3 months)
    complex analysis and creating analytical reports for end-to-end traceability, lineage, and definition of Key Business elements from Aurora. ● On-demand, secure EMR launcher with custom spark-submit steps using S3 Event, SNS, KMS, and Lambda functions. Worked on developing Python scripts to invoke a lambda using Secrets Managers in AWS for developing a pipeline. ● Developed a pipeline using lambda to get the data from the Postgres database to the AWS S3 bucket. Worked on creating an S3 bucket with subfolders using cloud formation templates. ● Used AWS glue catalog with crawler to get the data from S3 and perform SQL query operations. ● Used Pandas, NumPy, Seaborn, SciPy,
  • A
    Data Engineer
    AI9 Solutions INC
    Aug 2021 - Jan 2022 (6 months)
    ● Performed data cleansing and applied transformations using Databricks and Spark data analysis. ● Designed and automated Custom-built input adapters using Spark, Sqoop and Oozie to ingest and analyze data from RDBMS to Azure Data lake. ● Involved in the development of automated workflows for daily incremental loads, moving data from traditional RDBMSs to data lakes. ● Worked on Azure Synapse analytics service that brings together enterprise data warehousing and Big Data analytics. ● Experience in the creation of database objects such as tables, views, stored procedures, triggers, packages, and functions using T-SQL to provide efficient data management and structure. ● Extract Transform and Load data from Sources Systems to Azure Data Stora
  • A
    Data Engineer
    AI9 Solutions INC
    Aug 2021 - Jan 2022 (6 months)
    ● Performed data cleansing and applied transformations using Databricks and Spark data analysis. ● Designed and automated Custom-built input adapters using Spark, Sqoop and Oozie to ingest and analyze data from RDBMS to Azure Data lake. ● Involved in the development of automated workflows for daily incremental loads, moving data from traditional RDBMSs to data lakes. ● Worked on Azure Synapse analytics service that brings together enterprise data warehousing and Big Data analytics. ● Experience in the creation of database objects such as tables, views, stored procedures, triggers, packages, and functions using T-SQL to provide efficient data management and structure. ● Extract Transform and Load data from Sources Systems to Azure Data Stora
  • A
    Data Engineer
    AI9 Solutions INC, USA
    Jun 2021 - May 2022 (1 year)
    • Worked on complete data conversion (Extract, Transform, Load) using MS-SQL for a project. Which accounts for 50% of the project's work. • Performed data cleansing and applied transformations using Databricks and Spark data analysis. • Designed and automated Custom-built input adapters using Spark, Sqoop and Oozie to ingest and analyze data from RDBMS to Azure Data lake. • Involved in the development of automated workflows for daily incremental loads, moving data from traditional RDBMSs to data lakes. • Worked on Azure Synapse analytics service that brings together enterprise data warehousing and Big Data analytics. • Experience in the creation of database objects such as tables, views, stored procedures, triggers, packages, and func
  • W
    Data Engineer
    Websparx IT Solutions
    Dec 2017 - Jun 2019 (1 year 7 months)
    ● Involved in building a data pipeline and performed analytics using AWS stack (EMR, EC2, S3, RDS, Lambda, Glue, SQS, and Redshift). ● Strong knowledge and experience on Confidential Web Services (AWS) Cloud services like EC2, S3, IAM. ● Utilized Spark’s in memory capabilities to handle large datasets on S3 Datalake. Loaded data into S3 buckets, then filtered and loaded into Hive external tables. ● Involved heavily in setting up the CI/CD pipeline using Jenkins, Terraform and AWS ● Performed end- to-end Architecture & implementation assessment of various AWS services like Amazon EMR, Redshift, S3 ● Used AWS EMR to transform and move large amounts of data into and out of other AWS data stores and databases, such as Amazon Simple Storage Serv
  • W
    Data Engineer
    Websparx IT Solutions
    Dec 2017 - Jun 2019 (1 year 7 months)
    ● Involved in building a data pipeline and performed analytics using AWS stack (EMR, EC2, S3, RDS, Lambda, Glue, SQS, and Redshift). ● Strong knowledge and experience on Confidential Web Services (AWS) Cloud services like EC2, S3, IAM. ● Utilized Spark’s in memory capabilities to handle large datasets on S3 Datalake. Loaded data into S3 buckets, then filtered and loaded into Hive external tables. ● Involved heavily in setting up the CI/CD pipeline using Jenkins, Terraform and AWS ● Performed end- to-end Architecture & implementation assessment of various AWS services like Amazon EMR, Redshift, S3 ● Used AWS EMR to transform and move large amounts of data into and out of other AWS data stores and databases, such as Amazon Simple Storage Serv
  • W
    Data Engineer
    Websparx IT Solutions
    Dec 2017 - Jun 2019 (1 year 7 months)
    • Involved in building a data pipeline and performed analytics using AWS stack (EMR, EC2, S3, RDS, Lambda, Glue, SQS, and Redshift). • Strong knowledge and experience on Confidential Web Services (AWS) Cloud services like EC2, S3, IAM. • Utilized Spark's in memory capabilities to handle large datasets on S3 Datalake. Loaded data into S3 buckets, then filtered and loaded into Hive external tables. • Involved heavily in setting up the CI/CD pipeline using Jenkins, Terraform and AWS • Performed end- to-end Architecture & implementation assessment of various AWS services like Amazon EMR, Redshift, S3 • Used AWS EMR to transform and move large amounts of data into and out of other AWS data stores and databases, such as Amazon Simple Storag
Education verified_user 0% verified
  • University at Albany, SUNY
    Master of Science
    University at Albany, SUNY
    Aug 2019 - May 2021 (1 year 10 months)
  • University at Albany SUNY
    masters, Data science
    University at Albany SUNY
    Aug 2019 - May 2021 (1 year 10 months)
    Courses: 1. Modern Computing for Mathematicians 2. Introduction to Theory of Statistic I 3. Function Theory and Functional Analysis 4. Topological Data Analysis I 5. Topological Data Analysis II 6. NonParametric Statistics 7. Optimization Methods and Nonlinear Programming 8. Data Mining 9. Machine Learning 10. Practical Methods in Machine Learning 11. Databases and Business Intelligence
  • J
    Bachelor of Technology in Computer Science and Engineering
    JNTU
    Aug 2013 - May 2017 (3 years 10 months)