Subscribe to DSC Newsletter

All Blog Posts (7,248)

The Deadly Data Science Sin of Confirmation Bias

Confirmation bias occurs when people actively search for and favor information or evidence that confirms their preconceptions or hypotheses while ignoring or slighting adverse or mitigating evidence. It is a type of cognitive bias (pattern of deviation in judgment that occurs in…


Added by Michael Walker on April 24, 2014 at 7:30pm — 5 Comments

Interesting new contests on Kaggle

This morning, I received the following in my mailbox, from Kaggle:…


Added by Mirko Krivanek on April 24, 2014 at 8:06am — No Comments

Data science displacing traditional science

Here's an interesting piece of email my wife received today - looks like it was sent to all students at university of Washington.

Subject: Update on the Closure of the Physics and Astronomy Reading Room

Hello Huskies,

As of June 13, 2014, the Physics and Astronomy Reading Room will be replaced with a new…


Added by Vincent Granville on April 23, 2014 at 6:30pm — 1 Comment

This page has been moved

The new content can be found here.

Added by Vincent Granville on April 23, 2014 at 2:30pm — No Comments

Data sets and other machine learning resources from UC Irvine

They maintain 284 data sets as a service to the machine learning community.…


Added by Mirko Krivanek on April 21, 2014 at 6:00pm — No Comments

Nine new, great articles and resources posted externally

Here we go. Enjoy the reading!

MapReduce NextGen Architecture

Illustration of YARN (from first article below)

Articles from external publishers and bloggers:


Added by Vincent Granville on April 21, 2014 at 12:08pm — No Comments

Manage performance of enterprise applications

Managing performance of enterprise applications and achieving high levels of Performance with minimum resources is topic of discussion in today’s large enterprises. Resolving performance issues is essential for database administrators (DBAs) when it happens however it is best to react to the problems proactively. Proactive management requires very high level of attention and to help make sense of the overwhelming data provided by the database engine.

In database management being…


Added by Muhammad Saeed on April 21, 2014 at 3:43am — No Comments

Data Embodiment – an Ecosystemic Approach

Embodiment is comparable to the idea of an “ecosystemic” or “holistic” approach. In an ecosystem, each thing affects everything else. In light of the interrelationship, a person would not attempt to correct a problem by considering only a single piece of the puzzle. Instead, there is a need to bring together many aspects of the body. To understand embodiment, it is necessary to recognize how “the body” separates an organism from its environment; in a manner of speaking, the body represents…


Added by Don Philip Faithful on April 19, 2014 at 7:30am — No Comments

Business Intelligence and Data Warehousing

In the modern era, business environment is changing rapidly. They are seeking for valuable business information as being essential assets which will not only lead organisation towards the path of success but also help to sustain business in a competitive environment. Business Intelligence (BI) is a model which relates managerial values, and a tool which is used in an organisation to handle and filter information in order to make healthy business decisions. It refers to the appropriate…


Added by Avesh Dhakal on April 18, 2014 at 9:30pm — No Comments

The data science alphabet

Feel free to add your keywords. Here's a start:

The alphabet:

  • Algorithm (also: API, accountability)
  • Big data…

Added by Vincent Granville on April 17, 2014 at 6:30pm — 2 Comments

Employee Churn 201: Calculating Employee Value

Guest blog pst by Pasha Roberts, Chief Scientist, Talent Analytics @pasharoberts

Much has been written about customer churn - predicting who, when, and why customers will stop buying, and how (or…


Added by Vincent Granville on April 17, 2014 at 4:00pm — No Comments

Weekly digest - April 21

Sponsored Announcements

  • Predictive Analytics World, June 16-19 in Chicago is the business event for predictive analytics professionals, managers and commercial practitioners, covering today's commercial deployment of predictive analytics, across industries and across software vendors.…

Added by Vincent Granville on April 17, 2014 at 3:30pm — No Comments

How to identify the right data scientist for your company

Should you hire someone who knows all the most recent flavors of logistic regression? Or an Hadoop developer?

In my opinion, this is the wrong strategy. These employees are very expensive (at least $120k per year), and they might not bring the ROI that you expect. At least, if going in that direction, hire someone favoring simple, scalable, robust, automated solutions over anything else. To automate, you need someone great at developing…


Added by Vincent Granville on April 16, 2014 at 4:30pm — 1 Comment

Top 10 List – The V's of Big Data

Recently I wrote about the "Top 10 Big Data Challenges – A Serious Look at 10 Big Data V’s", which summarizes some of the big issues associated with the deployment of big data projects. The use of the letter V may seem forced and contrived, but it is used primarily as a mnemonic device to label and recall these critical challenges, in much the same way the…


Added by Kirk Borne on April 12, 2014 at 8:30am — 9 Comments

4000 copies of data science book pre-ordered

I've heard from Wiley that our data science book had already 4,133 pre-ordered copies, which is (according to Wiley) a great start. It was published last Monday.

I invite you to check the final table of content or check out the book on…


Added by Vincent Granville on April 12, 2014 at 8:00am — 6 Comments

Another 10 external articles and resources worth reading

Here we go. This a a follow up to our previous 9 must-read articles. The map below illustrates the data science (route optimization) problem described in the UPS Truck article.…


Added by Vincent Granville on April 10, 2014 at 12:30pm — No Comments

Big data: are we making a big mistake? My reaction

This is a recent, very popular article published in the FT Magazine. A similar one - Eight (No, Nine!) Problems With Big Data - was published in the New York Times. In both cases, this is an attack against big data. It features Google's failure…


Added by Vincent Granville on April 10, 2014 at 9:30am — No Comments

Data Science for business hacking

You can call it business or data hacking, but the idea is to use analytic intelligence to reverse-engineer algorithms, transform, manipulate and modify data in external databases, without even accessing the databases in questions, for your business advantage.

A few examples:

  • Query tag hijacking. You find an…

Added by Mirko Krivanek on April 5, 2014 at 1:00pm — 1 Comment

Big Data -- Seven Meta-Resources for Best Practices, Lessons Learned, Data Stories, Opportunities, and Insights

The flood of articles, webinars, and conferences related to Big Data is generating its own "infoglut". Consequently, it is really helpful when you find resources that summarize many of the latest developments in one place -- a sort of "Cliff Notes" for Big Data.  Here are six meta-resources that I have found useful, plus one additional collection that I…


Added by Kirk Borne on April 5, 2014 at 8:30am — No Comments

Blog Topics by Tags

Monthly Archives













  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service