
December 2015 Blog Posts (116)

The Data-Driven Weekly #1.5

This week, we continue the parallel themes of deep learning and natural language processing. Last week I mentioned some papers that use deep learning for NLP. In deep learning, these tasks are modeled as a prediction problem, which is why such an extensive training set is required. I think it's important to remember this amongst the flurry of sensationalist headlines around deep learning. While I…


Added by Brian Rowe on December 9, 2015 at 7:01am — No Comments

10 Open Source ETL Tools

ETL tools…


Added by Techroba on December 9, 2015 at 4:00am — 1 Comment

Top 6 Hadoop Vendors Providing Big Data Solutions in Open Data Platforms

This article is no longer available. We apologize for the inconvenience. To read more about Hadoop vendors, click here. Below are great Hadoop vendors providing Big Data solutions in open data platforms:

  • 1. Amazon Web Services Elastic MapReduce Hadoop Distribution…

Added by William Vorhies on December 8, 2015 at 10:30am — No Comments

20 Big Data Repositories You Should Check Out

This is an interesting listing created by Bernard Marr. I would add the following great sources:


Added by Vincent Granville on December 7, 2015 at 8:12pm — 2 Comments

96% of Companies Are Failing Miserably When it Comes to Marketing Data Insights

At a time in our history when there is more data than ever before, the overwhelming majority of companies have…


Added by Larisa Bedgood on December 7, 2015 at 12:00pm — 2 Comments

Interview with Gideon Mann, Head of Data Science at Bloomberg

Interview with Gideon Mann, Head of Data Science at Bloomberg, where he guides the strategic direction for machine learning, natural language processing, and search on the core terminal. He joined Bloomberg from Google Research. At Google, in addition to academic research, his team built the core middleware libraries…


Added by William Vorhies on December 7, 2015 at 10:30am — No Comments

Great people, great grocery — and the link to operational excellence in support of the shopper

In a recent interview on WGN, Bob Mariano, the CEO of Roundy’s, was asked, “What makes a great grocery store?” His response focused on customer care: “A great grocery store is made up of great people that care about their customers and go out of their way to make them feel appreciated.” Today, Mariano’s competes against global companies like Trader Joe’s and others and plans to merge with The Kroger Company in late 2015. But they still regard themselves as…


Added by Tony Agresta on December 7, 2015 at 7:30am — No Comments

4 Data Processing Architectures of Web Companies

Have you struggled in your data science function because of underlying data processing issues? Here is the list of 4 data processing…


Added by Banjog on December 7, 2015 at 3:10am — No Comments

49 Machine Learning Resources and Related Articles from Top Bloggers

Starred articles are candidates for the picture of the week. A comprehensive list of all past resources is found here. We are in the process of automatically categorizing them using indexation and automated tagging…


Added by Vincent Granville on December 6, 2015 at 8:46pm — No Comments

R Programming: 35 Job Interview Questions and Answers

Read the questions. At the bottom, you will find a link to the answers.

The Questions

First Set

  1. Explain what R is.
  2. List some of the functions that R provides.
  3. Explain how you can start the R commander…

Added by L.V. on December 6, 2015 at 9:00am — 2 Comments

10 Harvard Business Review Articles that you should Read

There are various outlets publishing high-quality articles about data science, analytics, big data, machine learning, and related fields. These outlets can be broken down into the following categories:

  • Professional associations: ASA (Amstat News), IEEE/Spectrum, Informs
  • Corporate blogs and magazines: IBM big data hub, Pivotal, Teradata, Tableau
  • Niche publishers: Data Science Central (check our…

Added by L.V. on December 6, 2015 at 9:00am — No Comments

Top 10 Machine Learning Algorithms

This was the subject of a question asked on Quora: What are the top 10 data mining or machine learning algorithms?

Some modern algorithms, such as collaborative filtering, recommendation engines, segmentation, or attribution modeling, are missing from the lists below. Algorithms from graph theory (to find the shortest path in a graph, or to detect connected components),…


Added by L.V. on December 6, 2015 at 9:00am — 4 Comments

Big Data at NASA

“Billions and billions…”

It’s a phrase made famous by astronomer Carl Sagan on his popular Cosmos TV show, and he was referring to the number of stars in our universe.

But it could also easily be applied to the bits of data NASA is collecting about that same universe.

It’s a daunting task — and with its dozens of missions and hundreds of scientists taken together, it may constitute the biggest big data project ever undertaken.…


Added by Bernard Marr on December 5, 2015 at 6:30pm — No Comments

It's Sunny Skies For Big Data

Big Data has been around, in one form or another, for quite some time. In fact, it even pre-dates computers, if you can believe such a thing! The reason for all the Big Data hype is that advances in computers, the Internet, wireless networking, and the cloud have combined to really bring out the best in Big Data. It's through all of the innovations of the past two decades that the extent of Big Data's true potential is finally being realized.

Naturally, there are still questions and…


Added by Stephen Jeske on December 4, 2015 at 2:44pm — 3 Comments

MetaDapper: Data Mapping and Conversion Made Easy With the Right Tools

Data conversion, translation, and mapping are by no means rocket science, but they are by all means tedious. Even a simple data conversion task (e.g., reading a CSV file into a list of class instances) can require a non-trivial amount of code. While all of these tasks share much in common, they are all “just different enough” to require their own data conversion methods.

In virtually every system that we build, we will at some point find ourselves…


Added by Irina Papuc on December 4, 2015 at 11:30am — No Comments

4 Open Source & Cloud Machine Learning, Data Analytics & Visualization projects by Google

Google is a…


Added by Jogmon on December 4, 2015 at 2:02am — No Comments

Global Green Data Center Market to reach US$221.50 billion by 2022

Rising awareness of the benefits that green data centers offer is significantly driving the global green data center market. In 2014, the market stood at US$25.87 billion. Aided by increasing demand for energy-efficient data centers from various enterprises, coupled with rules and regulations set by several governments to promote the use of these eco-friendly data centers, the global market is likely to expand at a CAGR of 30.8%…


Added by Madhuri Pawar on December 4, 2015 at 12:23am — No Comments

The Role of Big Data Analytics in the Petabyte Age

The data flood that you are witnessing every minute is trying to tell you hidden secrets about your business growth. Are you listening? In previous years, we may have taken a glass-half-empty view of the analytics sector, but this is definitely about to change. Big data analytics is indeed one of the fastest growing markets and is expected to mature in 2016 and…


Added by Aureus Analytics on December 3, 2015 at 9:30pm — No Comments

Control: The "Uncle Fester" of the Data Science Family (part 2--Optimization)

"Darling, this must be really bothering you. The last time you shook Uncle Fester like that you were trying to get the termites out of his sinuses." 

Morticia Addams…


Added by Jamie Lawson on December 3, 2015 at 6:00pm — No Comments

Control: The "Uncle Fester" of the Data Science Family (part 1--The Knowledge Pyramid)

Three parts dynamite, with a nitroglycerin cap. It's perfect for small homes, carports and tool sheds.

Fester Addams

I am a co-founder…


Added by Jamie Lawson on December 3, 2015 at 2:00pm — 1 Comment

© 2020 Data Science Central ®
