*This article was written by Joyce Xu.*

With the rise of autonomous vehicles, smart video surveillance, facial detection and various people counting applications, fast and accurate object detection systems are rising in demand. These systems involve not only recognizing and classifying every object in an…

Added by Andrea Manero-Bastin on June 15, 2020 at 12:00am — No Comments

*This article was written by Madhu Sanjeevi (Mady).*

In the previous story we talked about Linear Regression for solving regression problems in machine learning. In this story we will talk about Logistic Regression for classification problems.

You may be wondering why the name says regression…

Added by Andrea Manero-Bastin on June 14, 2020 at 11:30pm — No Comments
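
As a rough illustration of the logistic regression teaser above: the model passes a linear score through the sigmoid function to get a probability, then thresholds that probability to pick a class. The sketch below is not taken from the article; the weights `w`, the bias `b`, and the 0.5 threshold are made-up values chosen only for illustration.

```python
import numpy as np

def sigmoid(z):
    """Map any real-valued score to the interval (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def predict(X, w, b, threshold=0.5):
    """Return class labels (0 or 1) from a linear score passed through the sigmoid."""
    probs = sigmoid(X @ w + b)
    return (probs >= threshold).astype(int)

# Hypothetical parameters, picked by hand rather than fitted to data.
w = np.array([2.0, -1.0])
b = -0.5
X = np.array([[1.0, 0.0],   # linear score = 1.5, so probability is above 0.5
              [0.0, 2.0]])  # linear score = -2.5, so probability is below 0.5
labels = predict(X, w, b)
print(labels)
```

The output of the sigmoid is a continuous value, which is one common explanation for why a classification method carries "regression" in its name.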

*This article was written by Andrey Nikishaev.*

Today I will show you a very simple but powerful example of how to count traffic capacity with an algorithm that you can run on devices.

So this algorithm works in 4 steps:

1. Get frame edges.

2. Blur…

Added by Andrea Manero-Bastin on June 9, 2020 at 11:30am — No Comments

*This article was written by Suresh Kondamudi.*

Dealing with large datasets is often daunting. With limited computing resources, particularly memory, it can be challenging to perform even basic tasks like counting distinct elements, membership check, filtering duplicate elements, finding minimum, maximum, top-n elements, or set operations like union,…

Added by Andrea Manero-Bastin on May 5, 2020 at 11:00am — No Comments

*This article was written by Lixun Zhang.*

As a data scientist, I have experience with R. Naturally, when I was first exposed to Microsoft R Open (MRO, formerly Revolution R Open) and Microsoft R Server (MRS, formerly Revolution R Enterprise), I wanted to know the answers to three questions:

- What do R, MRO, and MRS…

Added by Andrea Manero-Bastin on May 5, 2020 at 10:30am — No Comments

*This article was written by Tristan Handy.*

Over the past nine months I’ve worked with over a dozen venture-funded startups to build out their internal analytics. In doing so, there’s a single SQL function that I have come to use surprisingly often. At first it wasn’t at all clear to me why I would want to use this function, but as time goes on I have found ever more uses for…

Added by Andrea Manero-Bastin on April 25, 2020 at 12:30am — 1 Comment

*This article was written by Jim Frost.*

How do you compare regression lines statistically? Imagine you are studying the relationship between height and weight and want to determine whether this relationship differs between basketball players and non-basketball players. You can graph the two regression lines to see if they look different. However, you…

Added by Andrea Manero-Bastin on April 18, 2020 at 7:30am — No Comments

*This article was written by Jim Frost.*

Multicollinearity occurs when independent variables in a regression model are correlated. This correlation is a problem because independent variables should be independent. If the degree of correlation between variables is high enough, it can cause problems when you fit the model and interpret the results.…

Added by Andrea Manero-Bastin on April 12, 2020 at 2:00am — No Comments
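
To make the multicollinearity teaser above more concrete, one common diagnostic is the variance inflation factor (VIF): regress each independent variable on the others and compute 1 / (1 - R²). The sketch below is an illustration under stated assumptions, not the article's own code; the helper `vif` and the simulated variables `x1`, `x2`, `x3` are hypothetical.

```python
import numpy as np

def vif(X):
    """Variance inflation factor for each column of X (n_samples x n_features)."""
    n, p = X.shape
    out = np.empty(p)
    for j in range(p):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        A = np.column_stack([np.ones(n), others])      # intercept plus the other columns
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)   # ordinary least squares fit
        resid = y - A @ coef
        r2 = 1.0 - resid.var() / y.var()               # R^2 of column j on the rest
        out[j] = 1.0 / (1.0 - r2)
    return out

# Simulated data (an assumption for illustration): x2 is nearly a copy of x1.
rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = x1 + 0.1 * rng.normal(size=200)
x3 = rng.normal(size=200)              # unrelated to x1 and x2
factors = vif(np.column_stack([x1, x2, x3]))
print(factors)
```

In this setup the first two factors should come out large and the third close to 1, which matches the idea that correlated predictors carry overlapping information. Thresholds for a "too high" VIF vary by field, so treat any cutoff as a rule of thumb.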

*This article was written by Andy Kiersz.*

**From Ian Stewart's book, these 17 math equations changed the course of human history**

- A 2013 book by mathematician and science author Ian Stewart looked at 17 mathematical equations that shaped our understanding of the…

Added by Andrea Manero-Bastin on April 3, 2020 at 7:00am — 1 Comment

*This article was written by Ramsundar Lakshminarayanan.*

**Mainstream media's credibility at an all-time low**

Credibility of the media has taken a…

Added by Andrea Manero-Bastin on March 29, 2020 at 7:00am — No Comments

*This article was written by Blaine Bateman.*

In this post, I will demonstrate the use of nonlinear models for time series analysis, and contrast them with linear models. I will use a (simulated) noisy and nonlinear time…

Added by Andrea Manero-Bastin on March 29, 2020 at 7:00am — No Comments

*This article was written by Graph Commons.*

A common task for a data scientist is to identify clusters in a given data set. The idea is to simply find groups of objects that have more connections or similarities to one another than they do to outsiders. In the study of networks, we use clustering to recognize…

Added by Andrea Manero-Bastin on March 9, 2020 at 3:30am — No Comments

*This article was written by Jim Frost.*

Regression is a very powerful statistical analysis. It allows you to isolate and understand the effects of individual variables, model curvature and interactions, and make predictions. Regression analysis offers high flexibility but presents a variety of potential pitfalls. Great power requires great…

Added by Andrea Manero-Bastin on March 9, 2020 at 3:00am — No Comments

*This article was written by Ray.*

I read an article in Quanta Magazine (New theory cracks open the black box of deep learning) about a talk (see 18: Information Theory of Deep Learning, YouTube video) given a month or so ago by Professor Naftali (Tali) Tishby on his theory that all deep learning convolutional neural networks (CNNs) exhibit an “information bottleneck”…

Added by Andrea Manero-Bastin on March 6, 2020 at 3:00am — No Comments

*This article was written by Vasudev.*

Let's get started quickly. NumPy is a math library for Python. It enables us to do computation efficiently and effectively. It is better than regular Python for numerical work because of its amazing capabilities.

In this article I’m just going to introduce you to the basics of what is mostly required for machine learning and…

Added by Andrea Manero-Bastin on March 1, 2020 at 12:00pm — 1 Comment
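
For a flavor of the NumPy basics the teaser above refers to, here is a minimal sketch of array arithmetic without explicit Python loops. The arrays and variable names are made up for illustration and are not drawn from the article.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])

elementwise_sum = a + b          # vectorized addition, no Python loop needed
dot_product = a @ b              # 1*4 + 2*5 + 3*6
M = np.arange(6).reshape(2, 3)   # 2x3 matrix holding the integers 0..5
column_means = M.mean(axis=0)    # average down each column

print(elementwise_sum, dot_product, column_means)
```

Operations like these run in compiled code under the hood, which is the usual reason NumPy is preferred over plain Python lists for numerical work.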

*This article was written by Prashant Gupta.*

One of the major aspects of training your machine learning model is avoiding overfitting. A model that overfits will have low accuracy on unseen data. This happens because your model is trying too hard to capture the noise in your training dataset. By noise…

Added by Andrea Manero-Bastin on February 20, 2020 at 6:00am — No Comments
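
The overfitting idea in the teaser above can be sketched with polynomial fits of different flexibility. This is an illustration under assumptions, not the article's code: the sine-plus-noise data, the sample sizes, and the degrees 3 and 10 are arbitrary choices. The more flexible fit always matches the training points at least as closely, and it will typically do worse on held-out points because it has chased the noise.

```python
import numpy as np

rng = np.random.default_rng(42)

def make_data(n):
    """Simulated data (an assumption): a smooth signal plus Gaussian noise."""
    x = rng.uniform(-1.0, 1.0, size=n)
    y = np.sin(3.0 * x) + 0.3 * rng.normal(size=n)
    return x, y

def mse(coeffs, x, y):
    """Mean squared error of a fitted polynomial on the points (x, y)."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

x_train, y_train = make_data(15)    # small training set
x_test, y_test = make_data(200)     # held-out data from the same process

simple = np.polyfit(x_train, y_train, deg=3)     # modest flexibility
flexible = np.polyfit(x_train, y_train, deg=10)  # enough freedom to fit noise

train_simple, train_flexible = mse(simple, x_train, y_train), mse(flexible, x_train, y_train)
test_simple, test_flexible = mse(simple, x_test, y_test), mse(flexible, x_test, y_test)

print("train:", train_simple, train_flexible)
print("test: ", test_simple, test_flexible)
```

Comparing the two printed rows shows the gap between training error and held-out error for each model; a large gap is the usual sign of overfitting.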

*This article was written by gk_.*

Understanding how chatbots work is important. A fundamental piece of machinery inside a chatbot is the text classifier. Let’s look at the inner workings of an artificial neural network (ANN) for text classification.

We’ll use 2 layers of neurons (1 hidden…

Added by Andrea Manero-Bastin on February 9, 2020 at 12:30pm — No Comments

*This article was written by James Le.*

Neural networks are one type of model for machine learning; they have been around for at least 50 years. The fundamental unit of a neural network is a node, which is loosely based on the biological neuron in the mammalian brain. The connections between neurons are also modeled on…

Added by Andrea Manero-Bastin on February 9, 2020 at 12:00pm — No Comments

*This article was written by Matthew Mayo.*

Scikit-learn is the de facto official machine learning library in use in the Python ecosystem. As described on its official website, Scikit-learn is:

- Simple and efficient tools for data mining and data analysis
- Accessible to everybody, and reusable in various contexts
- Built on NumPy, SciPy, and matplotlib
- Open…

Added by Andrea Manero-Bastin on February 1, 2020 at 9:00am — No Comments

*This article is on the blog artificialintelligenceml.*

This article features the following applications, one of which is pictured above (recommendation engine).

- Google’s AI-Powered Predictions
- Ridesharing Apps Like Uber and Lyft
- Commercial Flights Use an AI…

Added by Andrea Manero-Bastin on January 3, 2020 at 7:30am — No Comments

- Deep Learning for Object Detection: A Comprehensive Review
- Logistic Regression with Math
- Tutorial: Counting Road Traffic Capacity with OpenCV
- How to remove duplicates in large datasets
- A Data Scientist’s Perspective on Microsoft R
- The most underutilized function in SQL
- Comparing Regression Lines with Hypothesis Tests

- Machine Learning’s Limits (Part 1): Why machine learning works in some cases and not in others.
- Object Detection with 10 lines of code
- What is a Generative Adversarial Network?
- How to Visualize a Decision Tree from a Random Forest in Python using Scikit-Learn
- Regression analysis using Python
- Advanced cross-validation tips for time series
- The 10 Statistical Techniques Data Scientists Need to Master

© 2020 Data Science Central ® Powered by


**DSC Podcast**

- Data Science Fails – If It Looks Too Good To Be True…

You’ve probably seen amazing AI news headlines such as: AI can predict earthquakes. Using just a single heartbeat, an AI achieved 100% accuracy predicting congestive heart failure. AI can diagnose covid19 in seconds from a chest scan. A new marketing model is promising to increase the response rate tenfold. It all seems too good to be true. But as the modern proverb says, “If it seems too good to be true, it probably is”. Download now.

**Most Popular Content on DSC**

- Book: Statistics -- New Foundations, Toolbox, and Machine Learning Recipes
- Book: Classification and Regression In a Weekend - With Python
- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
