After posting my most recent blog using census data to illustrate handling “large” dataframes in R exploiting fst and feather file formats, I realized I c...
Many statistics, such as correlations or R-squared, depend on the sample size, making it difficult to compare values computed on two data sets of different sizes. Here, w...
Artificial intelligence (AI) seemingly has been discussed everywhere over the last few years, and now it’s made its way into the commercial insurance industry. Organiza...
Originally posted by John Bowden. There is a meeting of the world’s most renowned scientists once in two years. These meetings are held to tackle biological puzzles t...
Blog key points: Google open-sourced TensorFlow to gain tens of thousands of more users across hundreds (thousands) of new use cases to improve the predictive effectivene...
A schema is a conceptual framework. It can function as a lens through which to study data. When I was conducting research on workplace stress to do my graduate degree...
In the nascent field of Data Science, myths are abound. Here’s my top 10, scoured from the internet (where better than to find a myth or two?). Myth #1: It’s ...
Until very recently, most organizations have seen two distinct, non-overlapping work streams when building an AI enabled application: a development path and a data scienc...
Logistic regression (LR) models estimate the probability of a binary response, based on one or more predictor variables. Unlike linear regression models, the dependent va...
Y’all it may have taken me a little time, but I did listen. Thank you for your emails. Because of you, I have now updated my ggmap tutorial to address the Google Static...