Graduate Research Assistant
Southern Illinois University
Aug 2021 - May 2023 (1 year 10 months)
Developed Python scripts for data collection, cleaning, and analysis using pandas and NumPy, streamlining workflows built on publicly available NIH and Illinois public health data. Built and tested predictive machine learning models, achieving 97% accuracy on government health datasets. Applied SQL and data profiling techniques to extract and validate insights from both structured and unstructured public health data. Collaborated on complex data engineering projects, presenting findings at conferences and contributing to research papers.