February 28, 2026
For my UC Berkeley MIDS DATASCI 207 final project, my team and I studied next-day rainfall prediction in Australia using the WeatherAUS dataset. We framed the...
grad
ml
February 28, 2026
For my UC Berkeley MIDS DATASCI 201 final project, our team studied how large language models can be integrated into 8th-grade science classrooms in New Jersey...
grad
nlp
education
February 28, 2026
For my UC Berkeley MIDS DATASCI 205 final project, our team analyzed global forced displacement to understand where refugee movement is concentrated, how large flows are,...
grad
graph
policy
April 16, 2025
Under guided research from Professor Cantay Caliskan and Ezgi Siir Kibris at the University of Rochester, we investigated the usability of large language models to conduct...
nlp
research
February 01, 2025
Using MLB’s API, I developed a website utilizing React and Javascript to display MLB scores and relevant information for completed games, and preview information for upcoming...
baseball
software
December 20, 2024
I created a Streamlit app that hosts data on how pitchers change their pitch usage depending on the time through the opposing lineup. My hypothesis was...
baseball
software
December 13, 2024
As part of my undergraduate coursework, I completed a capstone project with a group analyzing statements of purpose submitted to our school’s Master’s of Data Science...
undergrad
ml
April 22, 2024
As part of my undergraduate coursework, Jared Coffey and I analyzed how the COVID-19 pandemic affected the state of Ohio. We first conducted descriptive analysis, calculating...
undergrad
ml
February 11, 2024
As part of my leadership role with the University of Rochester’s sports analytics club, I developed and led two coding tutorials for our club. For the...
undergrad
baseball
football
January 06, 2024
This was my group’s project to the 2024 NFL Big Data Bowl. We created a novel tracking data based model for expected yards after catch using...
football
ml
December 23, 2023
Following framework from 538’s MLB ELO model, I developed my own code base and model to analyze team strengths and generate pre-game predictions. Trained on data...
baseball
ml
December 15, 2023
As part of my undergraduate coursework, I, with a group, completed a final project for Data Mining where we built predictive models aimed at classifying metabolic...
undergrad
ml
December 08, 2023
As part of my undergraduate coursework, Tochi Onwuasoanya and I created a Flask app hosting hypothetical basketball data through MySQL, using HTML and CSS to design...
undergrad
software
June 01, 2023
Sean Sullivan and I published extensive work on analyzing pitch sequencing in MLB. Using the difflib package in Python, we generated sequence similarity ratios that would...
baseball
ml
April 18, 2023
Using pitch-level data from MLB, I developed visualization reports in R utilizing the ggplot2 and gt packages to summarize a starting pitcher’s outing. To my knowledge,...
baseball
March 09, 2023
Along with good friend Ben Wieland, we created contract predictions for MLB free agents using publicly available statistics and a zero-inflated Poisson modeling technique. Our full...
baseball
ml
January 06, 2023
This was my group’s project for the 2023 NFL Big Data Bowl, where we were selected as a finalist and presented our work at the NFL...
football
ml
August 22, 2022
With the rise in publicity of pitch-level modeling, I made my own model using Random Forests to isolate and quantify how “good” pitches are. I detailed...
baseball
ml
August 01, 2022
This was Ben Wieland and I’s submission to the 2022 SMT Data Challenge. We tackled minor league defense, particularly outfield defense, by measuring route efficiency and...
baseball
ml
July 18, 2022
I developed an adjusted exit velocity metric for hitters and pitchers that accounts for the quality and location of the pitch seen to further isolate power...
baseball
ml