Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.
Featured Resources and Technical Contributions
Added by Vincent Granville on August 26, 2016 at 9:30am — No Comments
Relation, Relationship and Association
While most players in the IT sector adopted graph or document databases and Hadoop-based solutions (Hadoop is an enabler of the HBase column store), it went almost unnoticed that several new DBMS, AtomicDB,…
Added by Athanassios Hatzis on August 26, 2016 at 4:00am — No Comments
Ever wondered why people prefer certain apps over others, even though the other apps provide the same functionality? Or why they are more attracted to certain features in the tab because they are just easier to use? If not, then as a UX/UI designer it’s time for you to find out,…
Added by Siddhi Shroff on August 25, 2016 at 1:00pm — No Comments
Here are our most recent featured articles and resources, including three interesting books, as well as tutorials about R Data Frames and Stats with Google Sheets.
Added by Vincent Granville on August 25, 2016 at 8:02am — No Comments
Open source software has become so powerful that large corporations have started preparing their traditional business analysts to move to open source tools, particularly R. I have prepared a basic document to train some of my clients and local communities in Dallas. This article is not intended for people who have been exposed to R before, but for people who are new and want to learn the ABCs of R.
Downloading R is rather simple on …
Added by Meltem Ballan on August 25, 2016 at 7:00am — No Comments
Data frames are tables for storing data. If you recall the vectors from the first R notes, a data frame can be thought of as a collection of vectors of the same length. We have already created vectors, named them, and plotted them as histograms.
In this note we will create data frames, aggregate them, and plot them.
Let’s start with baby steps and create a small data frame in a new script. You can open a new script by clicking File and then New script. You can copy and paste the following…
Added by Meltem Ballan on August 25, 2016 at 6:30am — No Comments
In this note I will quickly cover CSV files using a basic scenario.
I have uploaded two CSV files with customer complaints to my GitHub account. Each complaint is unique, and a customer might complain more than once. The customer ID is encrypted as XX000; assume that missing values don't have the same pattern and number of strings.
After the basic preprocessing, we want to know the number of complaints by customer and…
Added by Meltem Ballan on August 25, 2016 at 6:30am — No Comments
Five years ago, we had the technology available to enable us to undertake advanced analytics, yet there was no real interest in analysing data in a fast and efficient manner. Whilst the benefits were undeniable, businesses did not think that it was necessary to analyse information in real time. However, this way of thinking has become dated, and now all those who once deemed it unnecessary are rushing to adopt advanced analytics and harness the insights it can provide them…
Added by Aaron Auld on August 25, 2016 at 2:30am — No Comments
This is part of a new series of articles: once or twice a month, we repost articles that were very popular when first published. These articles are at least 6 months old but no more than 12 months old. The first digest in this series was posted here two weeks ago. Below is our second edition.…
Added by Vincent Granville on August 24, 2016 at 11:30am — No Comments
Added by Emmanuelle Rieuf on August 24, 2016 at 11:00am — No Comments
This article was originally posted here. It was written by Steven Scott, a Bayesian statistician interested in data augmentation methods and Markov chain Monte Carlo. Steven has applied these methods to problems in educational testing, network security, biometrics, web browsing, e-commerce, and medical applications.…
Added by Emmanuelle Rieuf on August 24, 2016 at 10:30am — No Comments
The term Big Data is no longer a buzzword; it’s become an institution, and businesses all over the world are hiring Data Scientists, Chief Data Officers and the like to help them make sense of it all. But Big Data shouldn’t be thought of as some scary, untouchable thing. We’ve been collecting data for decades, and Big Data is, well, just more of it.
Considering we now have a lot more data coming in, day on day, how best can we make it work for us? The first step is to ensure that the data…
Added by Gareth Forbes on August 24, 2016 at 12:30am — No Comments
In my experience, some of the most talented analytics professionals I’ve managed were ones who had intimate knowledge of the system limitations required to meet customer needs. These individuals came from a variety of roles, some from engineering and others from customer service. Their strength was in forming specific hypotheses to pinpoint customer experience issues and then leveraging their curiosity to do whatever it took, including learning new statistical techniques and…
Added by Valiance Solutions on August 23, 2016 at 9:00pm — No Comments
There are many who believe that all the truly transformational opportunities for quantum improvements in business are to be found only by exploring the unknown-unknowns through big-data analytics.
So what exactly are the unknown-unknowns and…
Added by Krishna Pera on August 23, 2016 at 12:30pm — No Comments
Go from messy, unstructured artifacts stored in SQL and NoSQL databases to a neat, well-organized dataset with this quick reference for the busy data scientist. Understand text mining, machine learning, and network analysis; process numeric data with the…
Added by Emmanuelle Rieuf on August 23, 2016 at 9:00am — No Comments
Summary: Sensors that know how you feel? Sensors that want to change the way you feel? When did that happen and, better yet, how?
Added by William Vorhies on August 23, 2016 at 3:00am — No Comments
Added by Emmanuelle Rieuf on August 22, 2016 at 4:00pm — No Comments
[Introduction of Association Rules]
Sometimes an anecdote helps you understand a new concept. But this story is real. About 15 years ago, at Walmart, a salesperson was trying to boost sales in his store. His idea was simple: he bundled products together and applied discounts to the bundles. (This has since become common practice in marketing.) For example, he bundled bread with jam so that customers could easily find them together. Moreover,…
Added by Gregory Choi on August 22, 2016 at 7:30am — No Comments
Big data is seeping into every facet of our lives. Smart home gadgets are becoming part of the nervous systems of new and remodeled homes, and many renters are demanding these interconnected gadgets from landlords.
But nowhere has Big Data created a bigger buzz than in business.…
Added by Larry Alton on August 22, 2016 at 6:00am — No Comments
Currently, many of us are overwhelmed by the mighty power of Deep Learning, and we are starting to forget about humble graphical models. CRF is not as trendy as LSTM, but it is robust, reliable and worth noting.
In this post, you will find a short summary of CRF (aka Conditional Random Fields): what it is, what it is for, and some interesting facts. Enjoy!…
Added by Nikitinsky Nikita on August 22, 2016 at 5:00am — No Comments