Determining the number of clusters when performing unsupervised clustering is a tricky problem. Many data sets don't exhibit well separated clusters, and two human beings asked to visually tell the number of clusters by looking at a chart, are likely to provide two different answers. Sometimes clusters overlap with each other, and large clusters contain sub-clusters, making a decision not easy.
For instance, how many clusters do you see in the picture below? What is the optimum number… Continue
Added by Vincent Granville on March 13, 2019 at 2:30pm —
By Ajit Jaokar and Cheuk Ting Ho.
Exclusively for Data Science Central members, with free access. You can download this book (PDF) here.
Enterprise AI: An applications perspective takes a use case driven approach to understanding the deployment of AI in the Enterprise. Designed for strategists and… Continue
Added by Vincent Granville on October 10, 2018 at 6:00am —
image source - wikipedia
we are now closing this
we have been… Continue
Added by ajit jaokar on January 30, 2019 at 12:00pm —
Note: This article was originally posted on OpenDataScience.com.
Witnessing the data science field’s meteoric rise in demand across pretty much all industries and areas of scientific research, it’s easy to anticipate efforts to create shortcuts to satisfy the need for more data science practitioners. The current trend… Continue
Added by Alexander Landa on March 14, 2019 at 6:00am —
Summary: Recurrent Neural Nets (RNNs) are at the core of the most common AI applications in use today but we are rapidly recognizing broad time series problem types where they don’t fit well. Several alternatives are already in use and one that’s just been introduced, ODE net is a radical departure from our way of thinking about the solution.
Added by William Vorhies on March 11, 2019 at 7:43am —
The last decade has seen unprecedented advancements in artificial intelligence. We have moved towards a data-centric approach, and data is the center of everything digital. The… Continue
Added by Ronald van Loon on March 11, 2019 at 7:18am —
The EM algorithm finds maximum-likelihood estimates for model parameters when you have incomplete data. The "E-Step" finds probabilities for the assignment of data points, based on a set of hypothesized probability… Continue
Added by Stephanie Glen on March 12, 2019 at 7:00pm —
I was comparing home prices in San Francisco between 1994 and 2018, and I noticed that it has increased by a factor 4 over 25 years. In the meanwhile, the inflation index increased by a factor 1.7 (see here.) I am not saying here that my sources are correct or wrong -- entire books have been written on the subject -- but instead, my purpose here is to show how some visualizations can be… Continue
Added by Capri Granville on March 10, 2019 at 5:00pm —
A system is an entity that behaves based on the intrinsic characteristics of its components and the external forces that drive these elements to react as a result of their interaction with the environment.
When optimizing a system, three scenarios should be defined. The first is identifying all of its components, boundaries and the conditions acting upon it. The system intrinsically has a homeostatic point where it wants to exist and that may be not optimal for the… Continue
Added by Jose Bautista on March 12, 2019 at 9:00am —
What is Automated Machine Learning? Quite simply, it is the means by which your business can optimize resources, encourage collaboration and rapidly and dependably distribute data across the enterprise and use that data to predict, plan and achieve revenue goals.
With the right tools, today’s average business user can become a Citizen Data Scientist, using data integrated from various sources to learn, test theories and make decisions. AutoML comes into play as… Continue
Added by Kartik Patel on March 13, 2019 at 3:00am —
Here is our list of featured articles and technical resources posted since Monday. The picture is from the article flagged with a +.
Resources and Forum Questions
Added by Vincent Granville on March 14, 2019 at 9:30am —
Thanks to evolving technology, retail marketers nowadays have access to wide range of data and analytics tools, which is making everyone’s life easier. It gives businesses a chance to solve problems and provide excellent customer experience.
The ability of machines to process data and perform tasks in a way that mimics human intelligence such as voice and visual recognition or decision making is opening many doors for online retailers, and brand owners look for ways to… Continue
Added by Karolina Sposob on March 11, 2019 at 12:30am —
Nothing perhaps rattles a regulatory professional as much as an FDA inspection! It can send the regulatory professional who is in charge of compliance into panic mode for a variety of reasons. As the one who faces the heat from the FDA directly, the regulatory professional is answerable to the FDA, most of whose questions are challenging and awkward. If anything goes wrong at any stage, it is the company that suffers.
Added by Adam Fleming on March 11, 2019 at 3:57am —
The blockchain is one of the hottest and fastest growing skills in the IT sector today. It is said that there are around 44% of organizations that have adopted blockchain globally. We all know that this technology has taken quite a turn in the industry given its popularity in providing safe and secured online transactions.
This technology is… Continue
Added by Yoey Thamas on March 11, 2019 at 8:22pm —
As of now, chatbots are among the most trending technology for which the industry is excited to get in integrated. They get touted as the next rendition of applications, similar to an immense change in the correspondence business. Since Facebook has extended access to its messenger administration, it is enabling firms to achieve clients better… Continue
Added by Harikrishna on March 15, 2019 at 12:56am —
Last month I had an honor to participate in data science project reviews for the new graduates of General Assembly's Data Science Immersive program. In the span of just three months of full-time studies and endless nights of homework Chicago campus students mastered Python programming skills, machine learning, and…
Added by Alex Blyakhman on March 14, 2019 at 5:15am —
Added by Benjamin Waxer on March 18, 2019 at 5:00am —
Summary: The fourth and final AI strategy we’ll review is Systems of Intelligence (SOI). This is getting nearly as much attention as the Vertical strategy we previously reviewed. It’s appealing because it seems to offer the financial advantages of a Horizontal strategy but its ability to create a defensible moat requires some fine tuning.
Added by William Vorhies on July 24, 2018 at 9:00am —
With a ROC curve, you're trying to find a good model that optimizes the trade off between the False Positive Rate (FPR) and… Continue
Added by Stephanie Glen on March 9, 2019 at 9:00am —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.
Featured Resources and Technical… Continue
Added by Vincent Granville on March 10, 2019 at 7:00am —