Guest blog by Michael Grogan. Here is how we can use the maps, mapdata and ggplot2 libraries to create maps in R. In this particular example, we’re going to cre...
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, c...
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, c...
The positive reactions on my last post: “Different kinds of loops in R” lead me to compare some different versions of loops in R, RCPP (C++ integration of R). To see ...
“The thinking in AI has changed from ‘What’s possible?’ to ‘How do I do this?’” explains Rafiq Ajani at McKinsey educational AI forum. Natural Language Proc...
Pooled, also referred to as “converged”, clusters in a unified data environment support disparate workload better than separate, siloed clusters. Vendors now provide ...
It has been suggested the role conflicts can lead to poorer performance in the workplace. Below I present the general dynamics: more role conflicts equate to less perfo...
K-means is a centroid based algorithm that means points are grouped in a cluster according to the distance(mostly Euclidean) from centroid. Centroid-based Clustering Cent...
I recently wrote a blog “Interweaving Design Thinking and Data Science to Unleash Economic V…” that discussed the power of interweaving Design Thinkin...
The gradient descent algorithm is one of the most popular optimization techniques in machine learning. It comes in three flavors: batch or “vanilla” gradient desce...