### Comparing Regression Lines with Hypothesis Tests

How do you compare regression lines statistically? Imagine you are studying the relationship between height and weight and want to determine whether this relationship differs between basketball players and non-basketball players. You can graph the two regression lines to see if they look different. However, you…

### Multicollinearity in Regression Analysis: Problems, Detection, and Solutions

Multicollinearity occurs when independent variables in a regression model are correlated. This correlation is a problem because independent variables should be independent. If the degree of correlation between variables is high enough, it can cause problems when you fit the model and interpret the results.…
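As a rough illustration of detection, one widely used diagnostic is the variance inflation factor (VIF). The excerpt above does not name it, so treat the choice as an assumption. For a model with exactly two predictors, the VIF of either one reduces to 1 / (1 - r²), where r is their Pearson correlation. The sketch below uses made-up data and hypothetical variable names; a common rule of thumb treats values above roughly 5 to 10 as a warning sign.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model: 1 / (1 - r^2)."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)

# Hypothetical data: arm span tracks height closely, shoe size does not.
height_cm = [160, 165, 170, 175, 180, 185]
arm_span_cm = [158, 166, 169, 177, 181, 186]
shoe_size = [42, 38, 44, 39, 43, 40]

vif_high = vif_two_predictors(height_cm, arm_span_cm)  # strongly correlated pair
vif_low = vif_two_predictors(height_cm, shoe_size)     # uncorrelated pair

print(f"height vs arm span: VIF = {vif_high:.1f}")
print(f"height vs shoe size: VIF = {vif_low:.1f}")
```

With more than two predictors, each VIF comes from regressing one predictor on all the others, which this two-variable shortcut does not cover.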

### The 17 equations that changed the course of history

From Ian Stewart's book, these 17 math equations changed the course of human history

• A 2013 book by mathematician and science author Ian Stewart looked at 17 mathematical equations that shaped our understanding of the…

### #TextAnalytics concepts can be used to deal with credibility issues in the main stream media

Main stream media's credibility at an all-time low

Credibility of the media has taken a…

### Limits of linear models for forecasting

In this post, I will demonstrate the use of nonlinear models for time series analysis and contrast them with linear models. I will use a (simulated) noisy and nonlinear time…

### Finding organic clusters in complex data-networks

A common task for a data scientist is to identify clusters in a given data set. The idea is to simply find groups of objects that have more connections or similarities to one another than they do to outsiders. In the study of networks, we use clustering to recognize…

### Five Regression Analysis Tips to Avoid Common Problems

Regression is a very powerful statistical analysis. It allows you to isolate and understand the effects of individual variables, model curvature and interactions, and make predictions. Regression analysis offers high flexibility but presents a variety of potential pitfalls. Great power requires great…

### Compressing information through the information bottleneck during deep learning

I read an article in Quanta Magazine (New theory cracks open the black box of deep learning) about a talk (see 18: Information Theory of Deep Learning, YouTube video) given a month or so ago by Professor Naftali (Tali) Tishby on his theory that all deep learning convolutional neural networks (CNN) exhibit an “information bottleneck”…

### Introduction to Numpy - A Math Library for Python

Let’s get started quickly. NumPy is a math library for Python. It enables us to do computation efficiently and effectively. It is better than regular Python because of its amazing capabilities.

In this article I’m just going to introduce you to the basics of what is mostly required for machine learning and…

### Regularization in Machine Learning

One of the major aspects of training your machine learning model is avoiding overfitting. An overfit model will have low accuracy on data it has not seen. This happens because your model is trying too hard to capture the noise in your training dataset. By noise…
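A minimal sketch of one regularization technique, assuming an L2 (ridge) penalty on a one-feature linear model with no intercept. The excerpt does not say which method the article covers, so this choice and the data are illustrative only. Minimizing Σ(y − w·x)² + λ·w² gives the closed form w = Σxy / (Σx² + λ), so a larger λ pulls the slope toward zero and makes the fit less sensitive to noise.

```python
def ridge_slope(xs, ys, lam):
    """Slope w minimizing sum((y - w*x)^2) + lam * w^2 (no intercept term)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)

# Hypothetical data: roughly y = x with a little noise added.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [1.2, 1.9, 3.2, 3.8]

# lam = 0 is ordinary least squares; larger values shrink the slope.
slopes = [ridge_slope(xs, ys, lam) for lam in (0.0, 1.0, 10.0)]
for lam, w in zip((0.0, 1.0, 10.0), slopes):
    print(f"lambda = {lam:5.1f} -> slope = {w:.4f}")
```

In practice λ is usually chosen by checking accuracy on held-out data rather than fixed by hand.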

### Text Classification using Neural Networks

Understanding how chatbots work is important. A fundamental piece of machinery inside a chatbot is the text classifier. Let’s look at the inner workings of an artificial neural network (ANN) for text classification.

We’ll use 2 layers of neurons (1 hidden…

### The 10 Deep Learning Methods AI Practitioners Need to Apply

Neural networks are one type of model for machine learning; they have been around for at least 50 years. The fundamental unit of a neural network is a node, which is loosely based on the biological neuron in the mammalian brain. The connections between neurons are also modeled on…

### Scikit-learn Classification Algorithms

Scikit-learn is the de facto official machine learning library in use in the Python ecosystem. As described on its official website, Scikit-learn is:

• Simple and efficient tools for data mining and data analysis
• Accessible to everybody, and reusable in various contexts
• Built on NumPy, SciPy, and matplotlib
• Open…

### Where have you seen Machine Learning in your everyday life?

This article features the following applications; one of them (a recommendation engine) is pictured above.

• Ridesharing Apps Like Uber and Lyft
• Commercial Flights Use an AI…

Anybody who has tried Google Photos would agree that this free photo storage and management service from Google is smart. It packs in various smart features like advanced search, ability to categorize your pictures by locations and dates, automatically create albums and videos based on similarities, and walk you down the memory…

### Essentials of Deep Learning : Introduction to Long Short Term Memory

Sequence prediction problems have been around for a long time. They are considered among the hardest problems to solve in the data science industry. They cover a wide range of problems, from predicting sales to finding patterns in stock markets’ data, from understanding movie plots to recognizing your…

### A Majority of Data Scientists Lack Competency in Advanced Machine Learning Areas and Techniques

Data science requires the effective application of skills in a variety of machine learning areas and techniques. A recent survey by Kaggle, however, revealed that a limited number of data professionals possess competency in advanced machine learning skills. About half of data professionals said they were competent in…

### Regression analysis using Python

This tutorial covers regression analysis using the Python StatsModels package with Quandl integration. For motivational purposes, here is what we are working towards: a regression analysis program which receives multiple data-set names from Quandl.com, automatically downloads the data, analyses it, and plots the results in a new window.…

### R Package Install Troubleshooting

One of the reasons why I love R is that I feel like I’m constantly finding out about cool new packages through an ever-growing community of users and teachers.

To understand the current state of R packages on CRAN, I ran some code provided by Gergely Daróczi on GitHub. As of today there have been almost 14,000 R packages published on CRAN and the rate of…

### Deep Learning from first principles in Python, R and Octave – Part 1

This is the first in a series of posts I intend to write on Deep Learning. This post is inspired by the Deep Learning Specialization by Prof Andrew Ng on Coursera and Neural Networks for Machine Learning by Prof Geoffrey Hinton, also on Coursera.…

