The constant search for something bigger might be part of American culture. However, big data is often critical: without real-time credit card fraud detection, itself a big data application, no store would accept credit cards.
There have been a few people questioning the value of big data recently, and predicting that big data is going to get smaller in the future. While most of these would-be oracles are traditional statisticians working on small data and worried about their…
Added by Vincent Granville on October 31, 2014 at 9:00pm — No Comments
Summary: We’ve scoured the literature to bring you a complete listing of possible definitions of Big Data, with the goal of being able to determine what’s a Big Data opportunity and what’s not. Our conclusion is that Volume, Variety, and Velocity still make the best definitions, but none of these stands on its own in distinguishing Big Data from not-so-big data. Understanding these characteristics will help you analyze whether an opportunity calls for a Big Data solution but…
Added by William Vorhies on October 31, 2014 at 2:00pm — No Comments
Last weekend, I was waiting in New York’s Penn Station, when the public announcer gave the familiar “See Something Say Something” message. It took a minute to sink in, but I had to laugh. Midtown Manhattan IS suspicious and unusual activity.
Speaking of outliers
In practice, data is dirty and big data is filthy. Analysts munge, wrangle and clean their…
Added by Michael Bryan on October 31, 2014 at 11:33am — No Comments
Nikitinsky Nikita is the first to complete our DSA, using NLP, web crawling, statistical techniques and Python to cluster our content in top categories: click here to check his project.
To be fair, our intern …
Added by Vincent Granville on October 31, 2014 at 10:30am — No Comments
When we perform classification-type machine learning, the target variable is a categorical (nominal) variable that has a set of unique values, or classes. It could be a simple two-class target variable like "approve application?" with classes (values) of "yes" or "no". Sometimes the classes indicate ranges like "Excellent", "Good", etc. for a target variable like satisfaction score. We might also convert continuous variables like test scores (1 - 100) into classes like grades (A, B, C…
Added by Kumaran Ponnambalam on October 30, 2014 at 7:00am — 2 Comments
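The conversion of continuous test scores into grade classes mentioned in the classification teaser above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the cutoffs (60, 70, 80, 90), the grade labels, and the helper name `score_to_grade` are hypothetical and do not come from the original article.

```python
from bisect import bisect_right

# Hypothetical grade boundaries; the source does not specify any cutoffs.
CUTOFFS = [60, 70, 80, 90]
GRADES = ["F", "D", "C", "B", "A"]


def score_to_grade(score: float) -> str:
    """Map a continuous test score (1-100) to a categorical grade label."""
    if not 1 <= score <= 100:
        raise ValueError(f"score must be between 1 and 100, got {score}")
    # bisect_right counts how many cutoffs are <= score,
    # which is exactly the index of the matching grade.
    return GRADES[bisect_right(CUTOFFS, score)]


if __name__ == "__main__":
    scores = [55, 60, 79.5, 80, 100]
    labels = [score_to_grade(s) for s in scores]
    print(labels)
```

Once the scores are binned this way, the resulting labels can serve as the categorical target variable of a classifier; where the boundaries are placed is a modeling decision, not something the data dictates.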
The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday.
Featured
Added by Vincent Granville on October 29, 2014 at 4:00pm — No Comments
Data visualization is everywhere. Whether you check your online bank account, monitor your workouts, discover the energy consumption of your house, check your pipeline in your CRM system or view remaining vacation days on your HR application, visualizations are part of the large majority of web applications.
When data visualizations…
Added by Michael Singer on October 28, 2014 at 8:34am — No Comments
As a long-term member of the Linked Data community, which has evolved from W3C's Semantic Web, I have found the latest developments around Data Science more and more attractive because of their complementary perspectives on similar challenges. Both disciplines work on questions like these:
Added by Andreas Blumauer on October 28, 2014 at 12:27am — No Comments
This is an announcement regarding my upcoming book: Data Science 2.0. The subtitle is Automation, survival kit, career resources.
Just like our first book, it will first be available as a free PDF document to members of our community. It will…
Added by Vincent Granville on October 27, 2014 at 1:30pm — 16 Comments
In this article, I compare two approaches (with their advantages and drawbacks) to compute a simple metric: the number of unique visitors ("uniques") per year for a website. I use the words user and visitor interchangeably.
Source for picture: …
Added by Vincent Granville on October 27, 2014 at 9:30am — 7 Comments
I’ve been thinking a lot about data, where it comes from, and what it looks like. I can’t help it. I’ve been a data geek for almost 15 years. And I find data beautiful. Not necessarily in its raw form, mind you. Then it’s just messy and more often than not a pain to deal with, especially when it gets really, really big. But when smart, creative people start to clean it up and use it in different ways to find the hidden stories that make sense, it can help us learn things in ways that we…
Added by Anne Russell on October 27, 2014 at 6:30am — No Comments
Introduction
Any author would like to know if his/her article will be successful or not. Here is an attempt to deal with this task.
Data and tools
Added by Nikitinsky Nikita on October 26, 2014 at 10:30am — 1 Comment
Given the nature of the community, presumably many visitors already have a strong understanding of quantitative data. Perhaps more mysterious is the idea of qualitative data, especially since it can sometimes be expressed in quantitative terms. For instance, "stress" as an internal response to an externality differs from person to person; yet it would be possible to canvass a large number of people and express stress levels as an aggregate based on a perceptual gradient: minimal,…
Added by Don Philip Faithful on October 25, 2014 at 6:37am — No Comments
This happened tonight, shortly after Facebook took the same decision. Even Bit.ly itself is banned, see picture below. This happens only with Chrome, but not with other browsers such as IE or Firefox. The ban will probably be lifted in several hours.
This raises interesting questions:
Added by Vincent Granville on October 24, 2014 at 11:00pm — No Comments
Zipfian Academy has graduated more than 50 alumni, placing graduates into data science roles at Facebook, Twitter, Airbnb, Tesla, Uber, Square,…
Added by Molly Larkin on October 24, 2014 at 10:22am — 3 Comments
Summary: Is the addition of “Prescriptive” analytics to our nomenclature really worthwhile or are we just confusing our customers?
I admit to being annoyed when this or that industry wag tries to coin a new term to describe some portion of the discipline we are already practicing. Some of these folks I think are…
Added by William Vorhies on October 23, 2014 at 10:00am — 9 Comments
For a very long time, businesses had their documents filed in folders and stored in huge metal cabinets. But thanks to advances in technology, those documents were eventually encoded and stored digitally. As we advance through the Age of Information, traditional digital storage devices such as floppy disks, compact discs, and flash drives have given way to cloud storage. Eric Griffith, a…
Added by Kyle Albert on October 23, 2014 at 5:51am — No Comments
It should be detailed, featuring appropriate and properly scaled business charts, organized in a meaningful way and commented:
Added by Andrej Lapajne on October 23, 2014 at 4:00am — 1 Comment
Or to put it differently, when your metrics lie to you: how to find out, and what should you do?
The purpose of this article is to make Google aware of the problem so that it can fix its Google Analytics reports (by filtering out the fake traffic). This scheme also impacts many companies computing website rankings. Tons of websites now have their traffic…
Added by Vincent Granville on October 22, 2014 at 6:30pm — 4 Comments
The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday.
Featured
Added by Vincent Granville on October 22, 2014 at 3:00pm — No Comments