A

Aastha Khatgarh

About

Detail

Illinois, United States

Timeline


work
Job
school
Education
folder
Project (professional or personal)

Résumé


Jobs verified_user 0% verified
  • E
    Data Analytics & Machine Learning Fellow Trainee
    ElevateMe Bootcamp,
    Jun 2025 - Current (4 months)
    • Engaging in 150+ hours of hands-on learning and project work, including active participation in a live project and two optional capstone projects. • Apply data analytics and machine learning techniques to address real-world problems. • Utilize advanced tools such as Python, Jupyter Notebooks, Pandas, Numpy, Matplotlib, Seaborn, and Scikit-learn to build and evaluate machine learning models. • Deploy machine learning models on Microsoft Azure, integrating Azure ML with Azure SQL databases to enable real-time analytics and predictions. • Perform data preprocessing, feature engineering, and model selection, and evaluate regression, classification, and clustering algorithms to optimize performance. • Collaborate in a professional team e
  • N
    AI Researcher
    Northwestern University
    Jan 2025 - Current (9 months)
    • Developed a PPO-based reinforcement learning framework to unlearn restricted content from LLMs while preserving fluency and coherence. • Engineered a hybrid reward system to suppress specific tokens without causing catastrophic forgetting. • Initiated a project on automated LLM-based grading using rubrics, prior grading patterns, and course material. • Utilised PyTorch, Hugging Face Transformers, prompt engineering, PEFT, LoRA, and Reinforcement Learning while documenting experiments for collaborative research.
  • Allegion
    Data Science Intern
    Allegion
    Sep 2024 - Dec 2024 (4 months)
    • Collaborated with cross-functional teams to analyze warranty claims using PySpark workflows and LLM-based extraction techniques. • Delivered insights across geography, product category, and time using OpenAI embeddings, clustering methods, and exploratory data analysis. • Proposed mitigation strategies that led to over 80% reduction in recurring claims by automating root cause analysis and categorization. • Worked with Python, NumPy, Pandas, Jupyter Notebooks, Power BI, and OpenAI API for data processing, visualization, and modeling.
  • N
    AI Research Intern
    Northwestern University
    Apr 2024 - Sep 2024 (6 months)
    • Designed and deployed a dual-mode LLM pipeline (live and batch processing) to automate key data extraction from global poverty research PDFs, reducing manual effort by over 60% . • Built and containerized front-end and back-end services using Docker, with model integration (Llama 3. 1 via AWS Bedrock and GPT-4) for answer extraction based on user-selected accuracy levels. • Engineered scalable AWS-based infrastructure using Lambda, S3, MongoDB, and CI/CD pipelines with GitHub Actions and CodeDeploy for real-time deployment and document processing.
Education verified_user 0% verified
  • E
    Data Analytics and Machine Learning Certificate of Completion
    ElevateMe Bootcamp
    Jun 2025 - Current (4 months)
  • N
    Master's of Science
    Northwestern University
    Sep 2023 - Dec 2024 (1 year 4 months)
  • Thapar Institute of Engineering and Technology
    Bachelor's of Engineering
    Thapar Institute of Engineering and Technology
    Jun 2015 - Jul 2019 (4 years 2 months)
Projects (professional or personal) verified_user 0% verified
  • A
    Automated Reddit Moderator
    Sep 2024 - Dec 2024 (4 months)
  • I
    Image to Text Captioning - mPLUG Implementation
    Sep 2024 - Dec 2024 (4 months)
    • Executed mPLUG model integrating Cross-Modal Attention to enhance text embeddings using image embeddings. • Compared performance across Transformer (ResNet + BERT) and Vision Transformer (ViT) base models, achieving a 27% performance improvement with mPLUG. Tools / Python, PyTorch, BERT, ResNet, ViT, mPLUG, Transformer Architecture
  • R
    Real-Time Loan Application Processing|
    Jun 2024 - Aug 2024 (3 months)
    https://github.com/khatgarhaastha/Real-Time-Loan-Application-Processing • Built a real-time ML pipeline to evaluate credit risk from streaming data and model retraining. • Designed and deployed Logistic Regression and Decision Tree models with Kafka-based ingestion and auto retraining. • Tools / Languages: Python, Kafka, Docker, Regression, Decision https://github.com/khatgarhaastha/Automated_reddit_moderator • LLM-powered dashboard for real-time Reddit content moderation using multi-rule enforcement. • Built moderation pipeline, integrated Ollama with AWS Lambda, DynamoDB, Eventbridge and S3. • Tools / Languages: Python, Streamlit, PRAW, Ollama, AWS(lambda, S3, DynamoDB, EventBridge)