All Blog Posts (6,178)

Machine Learning: Association Rule Mining

‘Users who bought this… Also bought this…’: I consider this the statement of this generation. There is not a single shopping application that does not showcase this feature to sell more to buyers. This rule is another by-product of Machine Learning. We humans always look for more of the things we like; for example, a reader who starts to like a book of a specific genre, say ‘biography’, is inclined to want more similar books. And this is what we call…

Added by Neeraj on June 13, 2019 at 8:38am — No Comments
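
The "also bought" idea in the teaser above is usually quantified with two standard association-rule measures, support and confidence. The following is a minimal sketch, not the article's own code; the basket data and the function names `support` and `confidence` are made up for illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = support(transactions, set(antecedent) | set(consequent))
    return both / support(transactions, antecedent)


# Hypothetical baskets of book genres, one set per buyer.
baskets = [
    {"biography", "memoir"},
    {"biography", "memoir", "history"},
    {"biography"},
    {"history"},
]

# How often do biography buyers also buy a memoir?
rule_conf = confidence(baskets, {"biography"}, {"memoir"})
print(round(rule_conf, 3))
```

A rule such as biography -> memoir is typically kept only when both its support and its confidence exceed thresholds chosen by the analyst.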

Machine Learning : Unsupervised – k-means Clustering and Bootstrapping

This article is a continuation of our previous topic, ‘Unsupervised Machine Learning’. Today I’m giving you another powerful tool on this topic named ‘k-means Clustering’. The work in this article continues with the WHO data set featured in ‘Machine Learning: Unsupervised – Hierarchical Clustering and Bootstrapping’. This article demonstrates implementing k-means clustering and bootstrapping to make sure that the algorithm and the clusters formed hold up.…

Added by Neeraj on June 13, 2019 at 8:37am — No Comments

Machine Learning : Unsupervised – Hierarchical Clustering and Bootstrapping

Preface

This article is based on an unsupervised learning algorithm: hierarchical clustering. It is a brief illustration, with a practical working example, of forming unsupervised hierarchical clusters and testing them to make sure you have formed the right clusters. It is a real-world example that can be studied and evaluated, as the data is provided for personal use and practice. There are variations to each topic in data science but there is a brief basic pattern…

Added by Neeraj on June 13, 2019 at 8:36am — No Comments

How Cloud Computing Technology Helps in Big Data Analysis?

Do you know that the global business community will be spending $310 billion on the Internet of Things (IoT) by 2020?

Isn’t that a huge investment? Why is this trend booming?

Actually, the IoT has encapsulated devices. Software, sensors, actuators and connectivity are penetrating deeply through vehicles and home appliances. This networking model is skyrocketing in almost every walk of life. That’s why everyone from small entrepreneurs to big industrialists is hungry…

Added by Moses Vandenberg on June 13, 2019 at 5:30am — No Comments

The Recommender Canvas: Everything You Wanted To Know About Recommender System Design & Its Necessity Today

The internet is evolving day by day, and when users shop online, they are flooded with thousands of results, leaving them in a dilemma over which product best suits their requirements. Have you ever thought of how Google Ads precisely knows what you need and display…

Added by Guido van Capelleveen on June 13, 2019 at 2:44am — No Comments

Simplified Logistic Regression

Logistic regression is typically used when the response Y is a probability or a binary value (0 or 1). For instance, it can model the chance that an email message is spam, based on a number of features such as suspicious keywords or IP address. In matrix notation, the model can be written as

where X is the observations matrix,…

Added by Vincent Granville on June 12, 2019 at 9:00am — No Comments

Optimal Allocation of Ambulances

Our client was an Emergency Response Management organization that handles medical, police and fire emergencies through the "1-0-8 Emergency service". Currently the organization runs around 690 ambulances. An analysis is run on historic data extracted from the client's management database. The live data input is fed to a simulation model to propose an optimal ambulance allocation, providing an opportunity for cost reduction for the organization.

In probability theory and statistics, the…

Added by Dr. Moloy De on June 12, 2019 at 6:51am — No Comments

Impact of IoT on Mobile App Development Industry



Innovation and advancement mean subtracting the obvious things and adding meaningful things. Technology is successfully on this path and is doing wonders. It is clear in this era that technology is winning the race in innovation and advancement and has outpaced humanity. The sectors and trends that are pushing…

Added by Smith Johnson on June 12, 2019 at 2:49am — No Comments

Enhancing Trading Models with AI

Dramatically improving currency trading models with AI using Keras Deep Learning and PivotBillions.…

Added by Benjamin Waxer on June 12, 2019 at 1:25am — No Comments

29 Statistical Concepts Explained in Simple English - Part 16

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, TensorFlow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on…

Added by Vincent Granville on June 11, 2019 at 3:33pm — No Comments

30 Statistical Concepts Explained in Simple English - Part 15

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, TensorFlow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on…

Added by Vincent Granville on June 11, 2019 at 3:16pm — No Comments

33 Statistical Concepts Explained in Simple English - Part 14

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, TensorFlow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on…

Added by Vincent Granville on June 11, 2019 at 3:00pm — No Comments

How to Lie with P-values

P-values are used in statistics and scientific publications, much less so in machine learning applications where re-sampling techniques are favored and easy to implement today thanks to modern computing power. In some sense, p-values are a relic from old times, when computing power was limited and mathematical / theoretical formulas were favored and easier to deal with than lengthy computations.

Recently, p-values have been criticized and even banned by some…

Added by Vincent Granville on June 11, 2019 at 7:30am — No Comments

Interesting Charts and Maps Obtained Using Census Data

I found an interesting website featuring hundreds of charts derived from US census data. It shows contrasts between states and cities regarding education, jobs, languages spoken, salaries, and even discrepancies between men and women or Asians and Caucasians, across various metrics broken down by location, education, or other criteria. I selected four of these charts.…

Added by Capri Granville on June 10, 2019 at 5:00pm — No Comments

Seduced by the Big Data meme: Hadoop vs the Public Cloud

 

Currently, Cloudera is in the news for all the wrong reasons (Cloudera stock down 42%).

Since Cloudera now also incorporates Hortonworks, the current issues are just the latest in the Big Data woes. Apparently, the third vendor…

Added by ajit jaokar on June 10, 2019 at 10:30am — No Comments

Big Ideas in AI for the Next 10 Years

Summary: Despite our concerns about China taking the lead in AI, our own government's efforts, mostly through DARPA, continue to provide powerful leadership and funding to maintain our lead. Here’s their plan to maintain that lead over the next decade.

Think all those great ideas that have powered AI/ML for the…

Added by William Vorhies on June 10, 2019 at 8:28am — No Comments

Alternatives to R-squared (with pluses and minuses)

R-squared can help you answer the question "How does my model perform, compared to a naive model?". However, R-squared is far from a perfect tool. Probably the main issue is that every data set contains a certain amount of unexplainable data. R-squared can't tell the difference between the explainable and the…

Added by Stephanie Glen on June 10, 2019 at 5:30am — No Comments
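
The "compared to a naive model" reading of R-squared in the teaser above follows from its usual definition: one minus the ratio of the model's squared error to the squared error of a naive model that always predicts the mean. The following is a minimal sketch under that standard definition, not code from the article; the sample values and the name `r_squared` are illustrative assumptions.

```python
def r_squared(y_true, y_pred):
    """1 - SS_res / SS_tot, where SS_tot is the error of always predicting the mean."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical observations and two sets of predictions.
y_true = [1.0, 2.0, 3.0, 4.0]
naive_pred = [2.5, 2.5, 2.5, 2.5]   # always predict the mean
model_pred = [1.5, 2.0, 3.0, 3.5]   # some fitted model

r2_naive = r_squared(y_true, naive_pred)
r2_model = r_squared(y_true, model_pred)
print(r2_naive, round(r2_model, 3))
```

A score of 0 means the model does no better than the mean, and a score near 1 means it leaves little squared error; the score alone does not say how much of the remaining error is explainable.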

Time Series Forecasting with RNNs

This article was written by Marek Galovič.

In this article I want to give you an overview of an RNN model I built to forecast time series data. The main objectives of this work were to design a model that can not only predict the very next time step but also generate a sequence of predictions and utilize multiple driving time series together with a…

Added by Andrea Manero-Bastin on June 9, 2019 at 11:30am — No Comments

Interesting Type of Chart: Hexagonal Binning

This chart communicates the same insights as a contour plot. What is interesting is the choice of hexagonal buckets (rather than squares) to aggregate data. In fact, any tessellation would work, in particular Voronoi tessellations.…

Added by Capri Granville on June 9, 2019 at 8:00am — 1 Comment

Hiring the right data scientist for the organisation

Any organisation needs talented, hardworking and skilled employees, irrespective of department, business unit or team. But finding and nurturing such talent can be challenging. In data science, where technology and demand change rapidly, many organisations have set up data science teams. A successful data science team has three major strengths: (A) availability of data, (B) infrastructure and, most importantly, (C) the “right” data scientists.

The…

Added by Rohit Walimbe on June 9, 2019 at 6:03am — No Comments

© 2019 Data Science Central ®