Exploring San Francisco police incident data and visualizing the density distribution of incidents involving mentally ill individuals using Pivot Billions and Tableau.…
Added by Benjamin Waxer on February 6, 2019 at 8:06am —
Chatbots are the hot thing right now. Many surveys claim that XX% of companies plan to deploy a chatbot in Y years. Consider the infographic below, which shows the chatbot market value in the US in millions of dollars.
However, where did it all start? Where are we now? In this article, we try to answer these questions.
Joseph Weizenbaum, a German professor of Computer Science at the Massachuset…
Added by Divya Singh on February 6, 2019 at 4:00am —
I am researching the general use of process models in data mining projects. If you have already participated in a data mining project in any role, I would like to ask you to support my analysis.
It just takes up to 5… Continue
Added by Silke Michelle Wildenberg on February 5, 2019 at 11:35pm —
Last week I posted the first of a three-part series on basic data programming with Python. For that article, I resurrected scripts written 10 years ago that deployed core Python data structures and functions to assemble a Python list for… Continue
Added by steve miller on February 5, 2019 at 7:55am —
This article has 4 sections:
- Introduction: Introduction to the gambler's ruin problem
- Methodology: How the simulation will be carried out
- Pseudocode: Summary of the Python Code
- Theory: A summary of the theory
The full code and theory can be found here: …
Added by Tansel Arif on February 5, 2019 at 4:55am —
Most modern companies have processes that they want to automate. This need arises for various reasons, in particular the routine, repetitive, and boring nature of manual processes. Another shortcoming is that such processes often require a lot of time and human resources; additionally, office… Continue
Added by Igor Bobriakov on February 5, 2019 at 2:32am —
Over time, data science has proved its value and efficiency. Data scientists keep finding new ways to apply big data solutions in daily life. Nowadays data is the fuel a successful company needs.
Telecommunication companies are no exception. Due to these… Continue
Added by Igor Bobriakov on February 5, 2019 at 1:55am —
This article is currently under revision.
Added by William Vorhies on February 4, 2019 at 9:30am —
The following articles were hand-picked and curated by one of our interns. They cover dozens of topics of interest to data scientists.
Added by Vincent Granville on February 3, 2019 at 2:30pm —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.
Added by Vincent Granville on February 3, 2019 at 10:00am —
Below is an extract of a 10-page cheat sheet about probability, compiled by William Chen (http://wzchen.com) and Joe Blitzstein, with contributions from Sebastian Chiu, Yuan Jiang, Yuqi Hou, and Jessy Hwang. The material is based on Joe Blitzstein’s introductory probability course at Harvard (@stat110, http://stat110.net) and Blitzstein / Hwang’s Introduction to Probability textbook (… Continue
Added by Capri Granville on February 3, 2019 at 8:00am —
Interesting picture comparing linear, logistic and Poisson regression. For more about regression, read our other articles on this subject, here. For other ML concepts summarized in one picture, follow this link. …
Added by Capri Granville on February 3, 2019 at 8:00am —
Until very recently, most organizations have seen two distinct, non-overlapping work streams when building an AI-enabled application: a development path and a data science path.
Often, both groups are actually building similarly scripted functional solutions using something like Python or C/F#. Further, once a data scientist finishes the evaluation and… Continue
Added by Laura Edell on February 2, 2019 at 12:35pm —
Logistic regression (LR) models estimate the probability of a binary response based on one or more predictor variables. Unlike in linear regression models, the dependent variable is categorical. LR has become very popular, perhaps because the procedure is widely available in software. Although LR is a good choice in many situations, it does not work well in all of them. For example:
- In propensity score analysis where there are many…
Added by Stephanie Glen on February 2, 2019 at 6:55am —
I loved playing StarCraft, though I seldom played against other humans (my sons in particular, because they absolutely kick my butt). But ah, there is finally revenge for “Dad the Data Nerd”, and it’s known as AlphaStar. AlphaStar was developed by Google’s DeepMind AI group to leverage artificial intelligence (AI)… Continue
Added by Bill Schmarzo on February 1, 2019 at 2:40pm —
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, TensorFlow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on… Continue
Added by Vincent Granville on February 1, 2019 at 6:00am —
Agriculture is a major segment powering the Asian economy. A small transformation in agricultural outcomes can have a huge impact on two dimensions: economic and human. With this in mind, CANOPY ONE brings you our pick of the top 3 use cases that would have a massive impact.
Agriculture AI use case-1: Early detection of pests, disease and… Continue
Added by Mahesh Kumar CV on February 1, 2019 at 5:12am —
Here is our selection of featured articles, technical contributions, and forum questions posted since Monday:
Added by Vincent Granville on January 31, 2019 at 10:00am —
Learn about CART in this guest post by Jillur Quddus, a lead technical architect, polyglot software engineer and data scientist with over 10 years of hands-on experience in architecting and engineering distributed, scalable, high-performance, and secure solutions used to combat serious organized crime, cybercrime, and fraud.
Although linear regression models allow us to predict a numerical outcome and logistic regression models allow us to predict a categorical outcome, both of these models assume… Continue
Added by Packt Publishing on January 31, 2019 at 4:09am —
Image source: Wikipedia
we are now closing this
we have been… Continue
Added by ajit jaokar on January 30, 2019 at 12:00pm —