Pragna Katasani

About

Detail

Michigan, United States

Résumé


Jobs (0% verified)
  • Q
    Software Full Stack Engineer
    QBE North America - Envision Infosolutions
    Mar 2025 - Current (1 year 3 months)
    Support platform development, focusing on migrating legacy data workflows to Palantir Foundry and Databricks. Develop full-stack internal tools using TypeScript and React to visualize complex data lineage and migration status. Build and maintain Node.js microservices to bridge enterprise data platforms with front-end analytics dashboards. Write and optimize PySpark jobs for data transformation, ensuring high performance within Snowflake and Azure environments. Use GitHub Copilot to accelerate unit testing and documentation, maintaining high code quality in an Agile environment. Experiment with LangChain to prototype RAG-based chatbots that query internal metadata, reducing manual data discovery time.
  • R
    Genomic Data Engineer III - Full Stack Development
    RxMapper
    Dec 2024 - Feb 2025 (3 months)
    Developed and maintained a React-based internal portal used by clinical researchers to visualize and query large-scale genomic datasets migrated from AWS to GCS. Built and optimized Node.js and TypeScript REST APIs to serve as a middleware layer between clinical frontends and backend data warehouses, ensuring secure and low-latency data retrieval. Supported the implementation of a Medallion Architecture (Bronze, Silver, Gold) in a multi-cloud environment, utilizing Python and PySpark to automate ETL pipelines for structured clinical data. Facilitated cross-platform data migration by developing automated scripts to transfer metadata and large datasets from AWS S3 to Google Cloud Storage (GCS) and Snowflake. Integrated GitHub Copilot into the deve
  • E
    Data Engineer & Business Intelligence Analyst
    Envision Infosolutions
    Aug 2023 - Nov 2024 (1 year 4 months)
    Wrote SQL queries and optimized query performance for high-volume transactional workloads, reducing dashboard latency by 30%. Contributed to the development of internal data-management tools and maintained Node.js backend services and REST APIs to facilitate communication between legacy ERP systems and a modern Snowflake data warehouse. Partnered with data engineers to build robust Python and PySpark scripts that automated data validation and error handling within Azure and AWS environments. Actively participated in daily scrums and code reviews, ensuring all code met enterprise documentation standards. Managed automated deployment workflows using GitHub Actions and Azure DevOps, streamlining the path from development to production for internal
  • Southern Illinois University
    Graduate Research Assistant
    Southern Illinois University
    Aug 2021 - May 2023 (1 year 10 months)
    Developed Python scripts for data collection, cleaning, and analysis using pandas and NumPy, enhancing workflows with open-source NIH and Illinois public health data. Built and tested predictive machine learning models, achieving 97% accuracy on government health datasets. Applied SQL and data profiling techniques to extract and validate insights from both structured and unstructured public health data. Collaborated on complex data engineering projects, presenting findings at conferences and contributing to research papers.
  • Babylon Health
    Software Engineer
    Babylon Health
    Jul 2019 - Jun 2021 (2 years)
    Optimized healthcare data processing and analysis on AWS EMR and Lambda, reducing costs by 50%, while developing real-time data pipelines and visualizations to enhance patient data analysis. Improved performance of MapReduce and Spark jobs through optimization and orchestration with AWS Step Functions. Built and deployed model prediction pipelines using EMR and Flink, enabling real-time insights for end-users. Designed ETL pipelines for batch and real-time healthcare data extraction with AWS Batch and Lambda, ensuring smooth integration with legacy Epic EMR data. Created data visualizations and reports with Pandas, Python, and PySpark, facilitating better decision-making for healthcare providers through enhanced patient data analysis.
  • HDFC Bank
    Data Analyst Intern
    HDFC Bank
    Jan 2019 - Jun 2019 (6 months)
    Assisted in developing a web-based reporting dashboard using Python and Plotly to help team members visualize daily financial metrics. Wrote SQL scripts to extract and clean transaction data, improving the speed of weekly report generation. Created data visualizations using Matplotlib and Power BI to identify trends in customer engagement data. Collaborated with senior analysts to document data sources and ensure the accuracy of internal reporting tools.
Education (0% verified)
  • Southern Illinois University
    Master's in Computer Science
    Southern Illinois University
    Jan 2022 - May 2023 (1 year 5 months)
    3.6 GPA. Dean's List 2022 - Top 10 for 100% Scholarship
  • J
    B.Tech in Electronics and Communications Engineering
    JNTU, India
    Aug 2016 - Current (9 years 10 months)
    3.7 GPA.