Subscribe to DSC Newsletter

January 2019 Blog Posts (95)

Thursday News: ML Coding, Classification, Regression Trees, Python, AI, Case Studies

Here is our selection of featured articles, technical contributions, and forum questions posted since Monday:

Technical Contributions

Continue

Added by Vincent Granville on January 31, 2019 at 10:00am — No Comments

Classification and Regression Trees

Learn about CART in this guest post by Jillur Quddus, a lead technical architect, polyglot software engineer and data scientist with over 10 years of hands-on experience in architecting and engineering distributed, scalable, high-performance, and secure solutions used to combat serious organized crime, cybercrime, and fraud.

Although both linear regression models allow and logistic regression models allow us to predict a categorical outcome, both of these models assume…

Continue

Added by Packt Publishing on January 31, 2019 at 4:09am — No Comments

Learn #MachineLearning Coding Basics in a weekend – a new approach to coding for #AI

image source - wikipedia

Update

Hello all,

The first book is posted on data science…

Continue

Added by ajit jaokar on January 30, 2019 at 12:00pm — 375 Comments

Degrees of Freedom and Sudoku

This article is about Intuitive explanation of Degrees of Freedom and How Degrees of Freedom affects Sudoku.

A lot of aspiring Data Scientists take courses on statistics and get befuddled with the concept of Degrees of Freedom. Some memorize it by rote as ‘n-1'.

But there is a intuitive reason why it is ‘n-1’.…

Continue

Added by Venkat Raman on January 30, 2019 at 1:51am — 3 Comments

The Challenges to Tackle Before You Start With AI

Artificial Intelligence and the technology behind it are growing at a furious pace. Marketers have realized its vast potential and are striving to extract the technology’s opportunities in full. There are numerous advancements being made in this regard, and many organizations have taken center stage of the AI world with in depth data analysis and data…

Continue

Added by Ronald van Loon on January 29, 2019 at 10:22pm — No Comments

Taming 1.5 Billion Rows of "Big Apple” Data

Diving into the many underlying trends throughout the entire 1.5 Billion rows of NYC Taxi data with Pivot Billion…


Continue

Added by Benjamin Waxer on January 29, 2019 at 7:26am — No Comments

Data Acquisition

When building applications that ingest a large amount of customer data sets, what is your preferred method of data transfer? Which APIs do you leverage to acquire and transmit? 

Added by Fahad Zaidi on January 29, 2019 at 4:19am — No Comments

Cross-Validation: Concept and Example in R

This article was written by Sondos Atwi.

What is Cross-Validation?

In Machine Learning, Cross-validation is a resampling method used for model evaluation to avoid testing a model on the same dataset on which it was trained. This is a common mistake, especially that a separate…

Continue

Added by Andrea Manero-Bastin on January 28, 2019 at 11:30pm — No Comments

One Shot Learning and Other Strategies for Reducing Training Data

Summary: Not enough labeled training data is a huge barrier to getting at the equally large benefits that could be had from deep learning applications.  Here are five strategies for getting around the data problem including the latest in One Shot Learning.

 

For at least the last two years we’ve been in an…

Continue

Added by William Vorhies on January 28, 2019 at 9:56am — 1 Comment

A Blast from Python Past

I had an interesting discussion with one of my son's friends at a neighborhood gathering over the holidays. He's just reached the halfway point of a Chicago-area Masters in Analytics program and wanted to pick my brain on the state of the discipline.

Of the four major program foci of business, data, computation, and algorithms, he acknowledged…

Continue

Added by steve miller on January 28, 2019 at 8:24am — No Comments

Machine Learning for Transactional Analytics: Customer Life time Value v/s Acquisition Cost

Understanding customer transactional behaviour pays well for any business. With the tsunami of start ups in recent times and the immense money flow in businesses, customers find lucrative offers from companies for acquisition, retention & referrals strategies. Understanding transactional behaviour of a customer has become even more complex with the invent of new business houses everyday. Although, with the rise of powerful machines, one can…

Continue

Added by PS Dhillon on January 28, 2019 at 4:00am — No Comments

Top 10 Technology Trends of 2019

First days after the celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year. The amount of data generated every minute…

Continue

Added by Igor Bobriakov on January 28, 2019 at 2:00am — No Comments

Top 10 Technology Trends of 2019

Guest blog by Igor Bobriakov.

First days after celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year.…

Continue

Added by Capri Granville on January 27, 2019 at 9:30am — No Comments

Weekly Digest, January 28

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this…

Continue

Added by Vincent Granville on January 27, 2019 at 9:00am — No Comments

Best dynamically-typed programming languages for data analysis

One can seriously argue about what programming language is the best for data analysis, but there is one universal metric that can define your choice: speed of calculations. Therefore, the word "best" in the title means the languages that lead to most performant applications. If most performant program can also be written in an easy-to-use, easy-to-learn, dynamically-typed…

Continue

Added by jwork.ORG on January 26, 2019 at 2:54pm — No Comments

Data-driven Marketing Strategy: Spatial Analytics for Micro-marketing

Data-driven Marketing Strategy: Spatial Analytics for Micro-marketing

Organizations, often in their me-too hurry to adopt a new technology, just pour their old-wine (data) into a new bottle. What was…

Continue

Added by Krishna Pera on January 26, 2019 at 5:45am — No Comments

New Home Sales Projection: A Time Series Forecasting

1. Background

New home construction plays a significant role in housing economy, while simultaneously impacting other sectors such as timber, furniture and home appliances. New house sales is also an important indicator of country’s overall economic health and direction. In the last 50 years there has been few significant bumps and turning points in this sector that shaped the trajectory of the overall economy.  Here I review the…

Continue

Added by Mab Alam on January 25, 2019 at 8:15pm — No Comments

How to Flourish in Industry 4.0, the Fourth Industrial Revolution

Call it a “Forrest Gump moment;” an instance of being in the right place at the right time for no other reason than just plain luck.  A “Forrest Gump moment” is based upon Tom Hanks’ character in the movie “Forrest Gump,” a guy who always seemed to be in the right place at the right time meeting Presidents Kennedy, Johnson and Nixon at critical points in American history.

I too have had a Forrest Gump moment in meeting President Reagan, however, my deeper Forrest Gump…

Continue

Added by Bill Schmarzo on January 25, 2019 at 1:50pm — No Comments

SIP text log analysis using Pandas

SIP application server (AS) text logs analysis may help in detection and, in some specific situations, prediction of different types of issues within a VoIP network. SIP server text logs contain the information which is difficult to obtain or even cannot be obtained from other sources, such as CDRs or signaling traffic captures.

The following parameters, among others, can help in estimating…

Continue

Added by Ilya Selitser on January 25, 2019 at 12:00am — No Comments

23 Statistical Concepts Explained in Simple English - Part 7

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on…

Continue

Added by Vincent Granville on January 24, 2019 at 12:30pm — No Comments

Blog Topics by Tags

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service