This post is a look at how affordable it is for people living in the direct vicinity of a stadium to go to a baseball game. I classify direct vicinity as a county for several reasons discussed below. This isn’t about policy or fairness, just a look at some data I got my hands on. Enjoy!
When coming to Spark from a background in R or Python pandas, you’ll likely get tripped up on a few things. The most notable is the difference between the R and pandas DataFrame APIs and the Spark DataFrame API. Furthermore, not all models in Spark are fit with a DataFrame, and the interplay between DataFrames and RDDs (resilient distributed datasets) is not so obvious.