Subscribe to DSC Newsletter

Andrea Manero-Bastin's Blog (86)

Deep Learning for Object Detection: A Comprehensive Review

This article was written by Joyce Xu.

With the rise of autonomous vehicles, smart video surveillance, facial detection and various people counting applications, fast and accurate object detection systems are rising in demand. These systems involve not only recognizing and classifying every object in an…

Continue

Added by Andrea Manero-Bastin on June 15, 2020 at 12:00am — No Comments

Logistic Regression with Math

This article was written by Madhu Sanjeevi (Mady).

In the previous story we talked about Linear Regression for solving regression problems in machine learning, This story we will talk about Logistic Regression for classification problems.

You may be wondering why the name says regression…

Continue

Added by Andrea Manero-Bastin on June 14, 2020 at 11:30pm — No Comments

Tutorial: Counting Road Traffic Capacity with OpenCV

This article was written by Andrey Nikishaev.

Today I will show you very simple but powerful example of how to count traffic capacity with the algorithm that you can run on devices.

So this algorithm works in 4 steps:

1. Get frame edges.

2. Blur…

Continue

Added by Andrea Manero-Bastin on June 9, 2020 at 11:30am — No Comments

How to remove duplicates in large datasets

This article was written by Suresh Kondamudi.

Dealing with large datasets is often daunting. With limited computing resources, particularly memory, it can be challenging to perform even basic tasks like counting distinct elements, membership check, filtering duplicate elements, finding minimum, maximum, top-n elements, or set operations like union,…

Continue

Added by Andrea Manero-Bastin on May 5, 2020 at 11:00am — No Comments

A Data Scientist’s Perspective on Microsoft R

This article was written by Lixun Zhang.

As a data scientist, I have experience with R. Naturally, when I was first exposed to Microsoft R Open (MRO, formerly Revolution R Open) and Microsoft R Server (MRS, formerly Revolution R Enterprise), I wanted to know the answers for 3 questions:

  • What do R, MRO, and MRS…
Continue

Added by Andrea Manero-Bastin on May 5, 2020 at 10:30am — No Comments

The most underutilized function in SQL

This article was written by Tristan Handy.

Over the past nine months I’ve worked with over a dozen venture-funded startups to build out their internal analytics. In doing so, there’s a single SQL function that I have come to use surprisingly often. At first it wasn’t at all clear to me why I would want to use this function, but as time goes on I have found ever more uses for…

Continue

Added by Andrea Manero-Bastin on April 25, 2020 at 12:30am — 1 Comment

Comparing Regression Lines with Hypothesis Tests

This article was written by Jim Frost.

How do you compare regression lines statistically? Imagine you are studying the relationship between height and weight and want to determine whether this relationship differs between basketball players and non-basketball players. You can graph the two regression lines to see if they look different. However, you…

Continue

Added by Andrea Manero-Bastin on April 18, 2020 at 7:30am — No Comments

Multicollinearity in Regression Analysis: Problems, Detection, and Solutions

This article was written by Jim Frost.

Multicollinearity occurs when independent variables in a regression model are correlated. This correlation is a problem because independent variables should be independent. If the degree of correlation between variables is high enough, it can cause problems when you fit the model and interpret the results.…

Continue

Added by Andrea Manero-Bastin on April 12, 2020 at 2:00am — No Comments

The 17 equations that changed the course of history

This article was written by Andy Kiersz.

From Ian Stewart's book, these 17 math equations changed the course of human history

  • A 2013 book by mathematician and science author Ian Stewart looked at 17 mathematical equations that shaped our understanding of the…
Continue

Added by Andrea Manero-Bastin on April 3, 2020 at 7:00am — 1 Comment

#TextAnalytics concepts can be used to deal with credibility issues in the main stream media

This article was written by Ramsundar Lakshminarayanan.

Main stream media's credibility at an all time low

Credibility of the media has taken a…

Continue

Added by Andrea Manero-Bastin on March 29, 2020 at 7:00am — No Comments

Limits of linear models for forecasting

This article was written by Blaine Bateman.

In this post, I will demonstrate the use of nonlinear models for time series analysis, and contrast to linear models. I will use a (simulated) noisy and nonlinear time…

Continue

Added by Andrea Manero-Bastin on March 29, 2020 at 7:00am — No Comments

Finding organic clusters in complex data-networks

This article was written by Graph Commons.

A common task for a data scientist is to identify clusters in a given data set. The idea is to simply find groups of objects that have more connections or similarities to one another than they do to outsiders. In the study of networks, we use clustering to recognize…

Continue

Added by Andrea Manero-Bastin on March 9, 2020 at 3:30am — No Comments

Five Regression Analysis Tips to Avoid Common Problems

This article was written by Jim Frost.

Regression is a very powerful statistical analysis. It allows you to isolate and understand the effects of individual variables, model curvature and interactions, and make predictions. Regression analysis offers high flexibility but presents a variety of potential pitfalls. Great power requires great…

Continue

Added by Andrea Manero-Bastin on March 9, 2020 at 3:00am — No Comments

Compressing information through the information bottleneck during deep learning

This article was written by Ray.

Read an article in Quanta Magazine (New theory cracks open the black box of deep learning) about a talk (see 18: Information Theory of Deep Learning, YouTube video) done a month or so ago given by Professor Naftali (Tali) Tishby on his theory that all deep learning convolutional neural networks (CNN) exhibit an “information bottleneck”…

Continue

Added by Andrea Manero-Bastin on March 6, 2020 at 3:00am — No Comments

Introduction to Numpy - A Math Library for Python

This article was written by Vasudev.

Lets get started quickly. Numpy is a math library for python. It enables us to do computation efficiently and effectively. It is better than regular python because of it’s amazing capabilities.

In this article I’m just going to introduce you to the basics of what is mostly required for machine learning and…

Continue

Added by Andrea Manero-Bastin on March 1, 2020 at 12:00pm — 1 Comment

Regularization in Machine Learning

This article was written by Prashant Gupta.

One of the major aspects of training your machine learning model is avoiding overfitting. The model will have a low accuracy if it is overfitting. This happens because your model is trying too hard to capture the noise in your training dataset. By noise…

Continue

Added by Andrea Manero-Bastin on February 20, 2020 at 6:00am — No Comments

Text Classification using Neural Networks

This article was written by gk_.

Understanding how chatbots work is important. A fundamental piece of machinery inside a chat-bot is the text classifier. Let’s look at the inner workings of an artificial neural network (ANN) for text classification.

We’ll use 2 layers of neurons (1 hidden…

Continue

Added by Andrea Manero-Bastin on February 9, 2020 at 12:30pm — No Comments

The 10 Deep Learning Methods AI Practitioners Need to Apply

This article was written by James Le.

Neural networks are one type of model for machine learning; they have been around for at least 50 years. The fundamental unit of a neural network is a node, which is loosely based on the biological neuron in the mammalian brain. The connections between neurons are also modeled on…

Continue

Added by Andrea Manero-Bastin on February 9, 2020 at 12:00pm — No Comments

Scikit-learn Classification Algorithms

This article was written by Matthew Mayo.

Scikit-learn is the de facto official machine learning library in use in the Python ecosystem. As described on its official website, Scikit-learn is:

  • Simple and efficient tools for data mining and data analysis
  • Accessible to everybody, and reusable in various contexts
  • Built on NumPy, SciPy, and matplotlib
  • Open…
Continue

Added by Andrea Manero-Bastin on February 1, 2020 at 9:00am — No Comments

Where have you seen Machine Learning in your everyday life?

This article is on the blog artificialintelligenceml.

This article features the following applications, one of them is pictured above (recommendation engine).

  • Google’s AI-Powered Predictions
  • Ridesharing Apps Like Uber and Lyft
  • Commercial Flights Use an AI…
Continue

Added by Andrea Manero-Bastin on January 3, 2020 at 7:30am — No Comments

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service