We had the chance to work with the NFL play-by-play dataset covering 2002 through 2013, and the best part is that the analysis was carried out within Hadoop using Cloudera Impala. We wanted to analyze the data at the individual game level, but it was of mixed grain and included the play-by-play rows. So we ended up applying some SQL filters to restrict it to the first row of each pla…