Guest blog by Michael Grogan. Here is how we can use the maps, mapdata and ggplot2 libraries to create maps in R. In this particular example, we’re going to cre...
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, c...
The positive reactions on my last post: “Different kinds of loops in R” lead me to compare some different versions of loops in R, RCPP (C++ integration of R). To see ...
“The thinking in AI has changed from ‘What’s possible?’ to ‘How do I do this?’” explains Rafiq Ajani at McKinsey educational AI forum. Natural Language Proc...
Python is the most loved, dreaded, and wanted programming languages by most developers, according to StackOverflow survey. Popular among most professional software deve...
Machine learning algorithms are extremely computationally intensive and time consuming when they must be trained on large amounts of data. Typical processors are not opti...
Data science is a growing and promising discipline that has impacted various domains, including higher education. Owing to its ability to use precise methods and platform...
Pooled, also referred to as “converged”, clusters in a unified data environment support disparate workload better than separate, siloed clusters. Vendors now provide ...
Summary: The annual Burtch Works salary survey tells us a lot about which industries are using the most data scientists and the difference between higher and lower skil...
It has been suggested the role conflicts can lead to poorer performance in the workplace. Below I present the general dynamics: more role conflicts equate to less perfo...