Subscribe to DSC Newsletter

February 2018 Blog Posts (83)

Weekly Digest, February 26

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

Added by Vincent Granville on February 24, 2018 at 8:36am — No Comments

A Common Data Analysis Pattern with a Simple Solution in R

It seems that much of the data analysis work I've done over the last few months has followed a "script". First, identify data, often government-sponsored and freely-available, that's of keen interest. Next, find the websites that house the data and download the relevant files to my notebook. The downloads might ultimately be one-off or included in the data analysis programs. Finally, load the data into either R or python and have at it with queries, visualizations, and…

Continue

Added by steve miller on February 23, 2018 at 6:00am — No Comments

Difficult Probability Problem: Distribution of Digits in Rogue Systems

I recently posted a table summarizing probabilistic properties of digits in various number representation systems, see here.  The topic is already rather difficult for well-behaved systems (those listed in my table) but some systems are rogue, and do not have these nice statistical properties. Here we focus on one of these less known systems,…

Continue

Added by Vincent Granville on February 22, 2018 at 7:00pm — No Comments

Topology Data Analysis (TDA)

Topology is the branch of pure mathematics that studies the notion of shape.  In the context of large, complex, and high dimensional data sets, topology takes on two main tasks, the measurement of shape and the representation of shape.  One can measure shape related properties within the data, and create compressed representations of data sets retaining features which reflect the relationships among the points in the data set. The…

Continue

Added by Valentina Kibuyaga on February 22, 2018 at 5:30pm — No Comments

Thursday News: Correlation, Regression, R, AI, Books, Deep Learning, NLP

Here is our selection of featured articles and resources posted since Monday:

Forum Questions and Answers

Continue

Added by Vincent Granville on February 22, 2018 at 11:30am — No Comments

15 Great Articles About Decision Trees

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression, Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, dataviz, AI and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on February 21, 2018 at 6:30pm — No Comments

Foolproof R package Install

The number of R packages associated cool new tricks available continues to grow every month.  To understand the current state of R packages on…
Continue

Added by Laura Ellis on February 21, 2018 at 12:30pm — No Comments

Text Classification: Applications and Use Cases

 text classification







Text analysis, as a whole, is an emerging field of study. Fields  such as Marketing, Product Management, Academia, and Governance are already leveraging the process of analyzing and extracting information from textual data. We discussed the technology behind Text Classification, one of the…

Continue

Added by Shashank Gupta on February 21, 2018 at 5:30am — No Comments

Building a Data Quality Strategy

In this article I explore some of the key concepts of data quality management and how to build a strategy for continuous improvement. I won’t be covering every possible scenario, process, method or problem; only those that are common across most industries and those that have proved useful on my own personal journey.

Hopefully, we already agree that good data quality is an essential part of business intelligence and a foundation on which you build your systems, processes and…

Continue

Added by Richard Cook on February 21, 2018 at 5:00am — No Comments

An Executive Primer to Deep Learning

Continue

Added by Pradeep Menon on February 20, 2018 at 4:30pm — No Comments

Double-Yolk "Bayesian Egg": Bayes, Frequentist and a 250 years-old puzzle

The Backdrop. Bayesians and Frequentists have long been ambivalent toward each other. The concept of “Prior” remains the center of this 250 years old tug-of-war: frequentists view prior as a weakness that can cloud the final inference, whereas Bayesians view it as a strength to incorporate expert knowledge into the data analysis. So, the question naturally arises, how can we develop a Bayes-frequentist consolidated data analysis workflow that enjoys the best…

Continue

Added by Subhadeep (DEEP) Mukhopadhyay on February 20, 2018 at 3:30pm — No Comments

Selected Recent Articles from Top DSC Contributors - Part 6

This is a new series, featuring great content from our top contributors. Some of these articles are rather technical in nature, but many are business-oriented and written in simple English. The entire series consists of about 120 articles. We intend to publish a new set every two weeks or so. Click here to check out the…

Continue

Added by Vincent Granville on February 20, 2018 at 3:30pm — No Comments

New Marketing Insight from Unsupervised Bayesian Belief Networks

Introduction

“Limited-Service Restaurants” (LSRs) is how the restaurant industry refers collectively to fast food and fast-casual dining establishments.  Marketers who specialize in LSRs often employ marketing research to evaluate hypotheses about their brands or to detect segments within their markets.  An important additional purpose of market research is to understand the total structure of a market, to find out what guests consider important…

Continue

Added by Charles Hammerslough on February 20, 2018 at 10:30am — No Comments

Off the Beaten Path - HTM-based Strong AI Beats RNNs and CNNs at Prediction and Anomaly Detection

Summary: This is the second in our “Off the Beaten Path” series looking at innovators in machine learning who have elected strategies and methods outside of the mainstream.  In this article we look at Numenta’s unique approach to scalar prediction and anomaly detection based on their own brain research.

 …

Continue

Added by William Vorhies on February 20, 2018 at 8:30am — No Comments

Adding Program Evaluation to the Data Science Curriculum

We tried to do XYZ. Did it make a difference?”

Whether you are in the for-profit world or the not-for profit world, this is a very basic question that many people try to answer.  

You could be working at a bank trying to figure out which offer is most appealing to customers, at an online retailer figuring out which ad display gets the most clicks, at the Department of Education trying to test the effect of smaller class sizes, at the city government office trying to see if the…

Continue

Added by Howard Friedman on February 20, 2018 at 5:30am — No Comments

4 Ways To Hire Web Testing Services: Upscale Your App Productivity

A great web application testing plan makes sure that a web application is functional and user-friendly.

By allowing the testing phase to estimate important areas of user experience, organizations can develop applications that are instantly…

Continue

Added by Alisha Henderson on February 20, 2018 at 4:30am — No Comments

Application of Image Processing and Convolution Networks in Intelligent Character Recognition for Digitized Forms Processing

ABSTRACT

Image processing is a rapidly evolving field with immense significance in science and engineering. One of the latest

applications of Image processing is in Intelligent Character Recognition (ICR), that is the computer translation of

handwritten text into machine-readable and machine-editable…

Continue

Added by Valiance Solutions on February 20, 2018 at 12:30am — No Comments

List of Free Must-Read Machine Learning Books

machine learning books

Machine learning is an application of artificial intelligence that gives a system an ability to automatically learn and improve from experiences without being explicitly programmed. In this article, we have listed some of the best free machine learning books that you should consider going through (no order in particular).

Mining of Massive Datasets

Author: Jure Leskovec, Anand Rajaraman, Jeff…

Continue

Added by Shashank Gupta on February 19, 2018 at 11:00pm — No Comments

Top Trends in AI in 2018

Continue

Added by Pradeep Menon on February 19, 2018 at 10:00pm — No Comments

Data Science Simplified Part 11: Logistic Regression

In the last blog post of this series, we discussed classifiers. The categories of classifiers and how they are evaluated were discussed. We have also discussed regression models in depth. In this post, we dwell a little deeper in how regression models can be used for classification tasks.

Logistic Regression is a widely used regression model used for classification tasks. As usual, we will discuss by example. No Money bank approaches us with a problem. The bank wants…

Continue

Added by Pradeep Menon on February 19, 2018 at 10:00pm — No Comments

Monthly Archives

2018

2017

2016

2015

2014

2013

2012

2011

1999

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2018   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service