Apache Spark is currently one of the most active projects in the Hadoop ecosystem, and as such, there’s been plenty of hype about it in recent months, but how much of t...
Organizations of all sizes and across all industries are using data science to solve problems, find new opportunities and achieve actionable results. Empowered by sharing...
Data visualization has a rich and detailed history. The challenges faced by the early pioneers of data visualization are still relevant today. The ways they were solved b...
In large businesses, many entities purchase a variety of products from many vendors. In these very complex systems, tracking activity that is typical versus fraudulent or...
You can download the new machine learning cheat sheet here (PDF format, 14 pages.) Originally published in 2014 and viewed more than 200,000 times, this is the oldest...
Facts that are reported in the media are increasingly based on numbers. These lend themselves not only to visualization, but also to exploration by readers. Creating comp...
The explosion of new data sources enables companies to gain insights that were not previously available, but with these new opportunities also come new challenges like re...
Big Data integration is a key operational challenge for today’s enterprise IT departments. IT groups may find their skill sets, workload, and budgets over-stretched...
Do your dashboards tell the story you want to get across or does your data get lost in a sea of pixels? Tableau strives to keep our users in the flow with software cent...
According to research, in 2012, our digital universe created 2.5 Quintilian bytes of data every day. (Quintilian = 1 followed by 18 zeroes). Even now, many businesses...