Most Recent:

New York Times Movie Reviews (R rvest)

Scraping New York Times Movie Reviews

December 20, 2018

All code can be found here

New York Times Movie Reviews

I love reading movie reviews. I avidly look forward to the Friday New York Times because that is when most of the movie reviews get printed (yes - printed, I still get the physical copy sent to my house on weekends). Manohla Dargis, A.O. Scott, …keep reading

The Wu-Tang Clan Network (python graphlab networkx)

The Wu-Tang Clan Network

December 10, 2018

All code can be found here

In honor of Wu-Tang Clan day, I dug out this old post I created for one of my grad school classes. We were learning about graph analysis and this is what I put together. All code is in python using the networkx package and graphlab package. Graphlab is pretty great.

For this project, we were tasked with exploring and analyzing a bi-modal network. It was important to me to choose a data set that I was familiar with. Familiarity with the data helped me make sure the calculations produce coherent results. …keep reading

538 NBA Predictions (R dplyr)

Posting Up 538 NBA Predictions Using R

November 25, 2018

All code can be found here


FiveThirtyEight is the best. From politics to pop culture, Nate Silver and his team do a great job creating interesting articles and visuals using various data science techniques.

A big part of what FiveThirtyEight does revolves around sports. A major focus of mine is also sports, specifically the NBA. FiveThirtyEight assigns win probabilities to every NBA game during the regular season and playoffs. I have been using the NBA regular season to test my modelling skills.

Predicting wins for every NBA game is difficult…keep reading