All Blog Posts (2,455)

Associations and Correlations - The Essential Elements

In writing this book, my primary aim is to guide the beginner to the essential elements of associations and correlations and to help them avoid the most common pitfalls.

Here, I discuss a holistic method of discovering the story of all…


Added by Vincent Granville on February 12, 2016 at 7:00pm — No Comments

Setting up Your Analytics Stack with Jupyter Notebook & AWS Redshift


From querying your data and visualizing it all in one place, to documenting your work and building interactive charts and dashboards, to running machine learning algorithms on top of your data and sharing the results with your team, there are very few limits to what one can do with the Jupyter + Redshift stack. However, setting everything up and resolving all the package dependencies can be a painful experience.

In this blog post I will walk…


Added by Yevgeniy Slutskiy Meyer on February 12, 2016 at 1:30am — No Comments

Ember Data: A Comprehensive Tutorial for the ember-data Library

Ember Data (a.k.a. ember-data) is a library for robustly managing model data in Ember.js applications. The developers of Ember Data state that it is designed to be agnostic to the underlying persistence mechanism, so it works just as well with JSON APIs over HTTP as it does with streaming WebSockets or local IndexedDB storage. It provides many of the facilities you’d…


Added by Irina Papuc on February 11, 2016 at 5:30am — No Comments

Data Team for Data Driven Organization

A data-driven organization uses data as critical evidence to inform and influence strategy. To be data-driven means cultivating a mindset throughout the business to continually use data and analytics to make fact-based business decisions. Becoming a data-driven organization is no longer a choice, but a necessity. Making decisions based on data-driven approaches not only increases the accuracy of results but also provides consistency in how the results are interpreted and fed…


Added by Raghavan Madabusi on February 11, 2016 at 5:00am — No Comments

Robot Company: Can a True AI Run a Business Autonomously?

Imagine a company that lives in the cloud and requires no human interaction at all. What would it look like? Is this possible? What purpose would it have? 

Originally posted by sublimecattree

Let’s start with email: What percentage of emails could be eliminated if all your employees shared the same brain? 90%? 99%? You would only need external facing…


Added by Eduardo Siman on February 11, 2016 at 3:53am — No Comments

Will Big Data solve the Riemann Hypothesis?

The Riemann Hypothesis is arguably the most important unsolved problem in mathematics. It falls into an area called Analytic Number Theory, which is essentially number theory with complex numbers thrown into the mix. The hypothesis states that all non-trivial zeros of the Riemann zeta function fall on the critical line. What!?? Ok, sorry. That is not very helpful. Let's just say that there is a critical relationship between this function and our…


Added by Eduardo Siman on February 11, 2016 at 3:30am — 1 Comment

18 Analytics Tools Every Business Manager Should Know

The goal of any business analytic tool is to…


Added by Bernard Marr on February 10, 2016 at 2:30pm — No Comments

Weekly Digest, February 15

Starred articles are new additions or updated content, posted between Thursday and Sunday. The weekly digest has six sections: (1) Featured Resources and Technical Contributions, (2) Featured Articles and Case Studies, (3) From our Sponsors, (4) News, Events, Books, Training, Forum Questions, (5) Picture of the Week, and (6) Syndicated Content.

The full version is always published Monday.…


Added by Vincent Granville on February 10, 2016 at 10:30am — No Comments

Data is Not the Enemy of Intuition

Summary: There is a common misconception that data is the enemy of intuition.  It’s true that sometimes our clients are misled by their preconceived notions.  But all the same, there are two practical lessons here, one about being effective in explaining data science to our clients, and another about methodology. 



Added by William Vorhies on February 10, 2016 at 10:30am — 2 Comments

50 Years of Data Science

Very interesting document, relatively recent (September 2015), authored by David Donoho (Statistics professor at Stanford) and posted on one of the MIT websites, here (41 pages, PDF). 

Below you will find the abstract and the table of contents. Interestingly, Andrew Gelman and Vincent Granville (our data scientist)…


Added by Laetitia Van Cauwenberge on February 10, 2016 at 8:30am — No Comments

Big Data Analytics Forecast: The Streaming Future

"Information is the oil of the 21st century, and Analytics is the combustion engine."

The Volume, Variety and Velocity of data coming into your organization continue to reach unprecedented levels. This phenomenal growth means that not only must you understand big data in order to decipher the information that truly counts, but you…


Added by Anuj Tripathi on February 9, 2016 at 2:20pm — No Comments

The lipstick robot - A great way to explain Deep learning

I love motivational examples in teaching complex ideas! I use this simple little video to teach Deep Learning to my students.

When we consider Deep Learning, we think of ideas like teaching a computer to …


Added by ajit jaokar on February 9, 2016 at 12:00pm — No Comments

The company that perfects Data Visualization in Virtual Reality will be the next Unicorn

Fortune 500 companies are investing staggering amounts into data visualization. Many have opted for Tableau, Qlik, MicroStrategy, etc., but some have created their own in HTML5, full-stack JavaScript, Python, and R. Leading CIOs and CTOs are obsessed with being the first adopters of whatever is next in data visualization.

The next frontier in data visualization is clearly immersive experiences. The 2014 paper "Immersive and Collaborative Data Visualization Using Virtual Reality…


Added by Eduardo Siman on February 9, 2016 at 10:30am — No Comments

Avoid the "Fishing Expedition" Approach to Analytics Projects

By Pasha Roberts, Chief Scientist & Co-founder, Talent Analytics, Corp.

Over the years, our firm has had many discussions with employers on the eve of a new talent analytics project. Often, it is the firm’s first deep-dive look at employee data. Sometimes we act as a strategic sounding board, and sometimes we can help them move directly forward into predictive analytics. It is always interesting.

This article will discuss two analytics approaches that we have…


Added by Mike Kennedy on February 9, 2016 at 4:30am — No Comments

How Hotels use Big Data to Generate New Revenues

Hotel revenue management and the use of analytics for room sales have remained largely unchanged since the early 1980s, when hotels started looking at yield and how they could optimize the revenue each room could generate. By the mid-1990s, Marriott’s successful execution of revenue…


Added by Mark Ross-Smith on February 9, 2016 at 2:00am — No Comments

Relax: Automation isn't coming for your job

By Justin Tenuto

For the past few years, the drumbeat of think pieces about automation taking your job (yes, your job) has gotten both louder and more incessant. Smart people like the folks at …


Added by Leena Kamath on February 8, 2016 at 9:41am — 1 Comment

Ten Favorite Open Data Libraries by Justin Tenuto

There are precious few things that everybody adores. Once you get past breakfast in bed and two dollar bills, the list starts to look a little barren. But if there's one thing we can agree on as a society it's this: free stuff is good and cool and you want some of it right now.

In the spirit of this immutable law, we've compiled a list of our ten favorite places to find open data. Here they are, in no particular order.…


Added by Leena Kamath on February 8, 2016 at 9:37am — No Comments

What skills should data scientists have in 2016?

By Justin Tenuto

It's a brand new year and that means you'll be seeing scads of trend pieces this week. As you can probably assume from the title, we're not necessarily immune to that sort of thing…


Added by Leena Kamath on February 8, 2016 at 9:35am — 5 Comments

34 External Machine Learning Resources and Related Articles

Starred articles are candidates for the picture of the week. A comprehensive list of all past resources is found here. We are in the process of automatically categorizing them using indexation and automated tagging…


Added by Vincent Granville on February 7, 2016 at 8:30pm — 2 Comments

k-nearest neighbor algorithm using Python

This article was written by Natasha Latysheva. Here we publish a short version, with references to the full source code in the original article.

Our internal data scientist had a few questions and comments about the article:

  • The example used to illustrate the method in the source code is the famous iris…

Added by Laetitia Van Cauwenberge on February 6, 2016 at 6:00pm — 1 Comment

© 2016 Data Science Central