If you still can’t figure out what exactly Hadoop is, don’t worry, you’re not alone. But hopefully this multi-part blog series will help! Hadoop is very difficult to understand if, like most people, you’re gathering bits and pieces of info here and there. And, sorry Wikipedia, but your page isn’t very helpful if you aren’t a… Continue
Added by Michael Burke on September 30, 2019 at 1:44pm —
You may have figured out already that statistics isn't exactly a science. Lots of terms are open to interpretation, and sometimes there are many words that mean the same thing—like "mean" and "average"—or sound like they should mean the same thing, like significance level and confidence level.
Although they sound very similar, significance level and confidence level are in fact two completely different concepts. Confidence levels and confidence… Continue
Added by Stephanie Glen on September 30, 2019 at 12:00pm —
Co-relation does not equal causation – is a mantra drilled into a Data Scientist from an early age
That’s fine ..
But very few talk of the follow-on question ..
How exactly do you determine causation?
This problem is… Continue
Added by ajit jaokar on September 30, 2019 at 6:33am —
There are a lot of on-going controversies on the impact of Artificial Intelligence on designing and development. While, designing is a bit of a complex process; as the humans only have the capability to set the context and create what the user’s really are looking for. But researchers and designers are constantly working on the technology in an attempt to find out how AI will affect the future of designing industry. …
Added by Ashok Sharma on September 29, 2019 at 7:30pm —
The evolution of the technologies in Big Data in the last 20 years has presented a history of battles with growing data volume. The challenge of big data has not been solved yet, and the effort will certainly continue, with the data volume continuing to grow in the coming years. The original relational database system (RDBMS) and the associated OLTP (Online Transaction Processing) make it so easy to work with data using SQL in all aspects, as long as the data size is small enough to… Continue
Added by Stephanie Shen on September 29, 2019 at 4:00pm —
Power BI provides slicers for a single column, but there are scenarios where it could be useful to consolidate alternative filters for multiple columns in a single slicer. Technically, this is not possible in Power BI through the standard visualizations, but you can use a particular data modeling technique to obtain the desired result.
Consider the case of a Customer table with a geographical hierarchy with Continent, Country,…
Added by Adetayo Kolade on September 29, 2019 at 10:30am —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.
Added by Vincent Granville on September 29, 2019 at 7:00am —
They say that the best ideas sometimes come to you while you are in the shower, and this idea of how to explain two important Neural Network concepts – Backpropagation and Stochastic Gradient Descent – actually did come to me as I was trying to set the perfect water temperature for my morning shower.
As I was struggling to adjust the two shower handles – one handle that controlled scolding hot and the other handle that controlled flash freezing – it occurred to me that I was a simple… Continue
Added by Bill Schmarzo on September 29, 2019 at 4:29am —
By now you may already know that to add scholarly articles to the English version of Wikipedia is difficult due to the "notability" concept and tight control from anonymous editors (see this article). In recent years, entire Wikipedia topics and articles dedicated to software and data…
Added by jwork.ORG on September 28, 2019 at 4:30am —
With intense urbanization, the transportation industry works round the clock to suffice the demands of… Continue
Added by Sanjeev Verma on September 27, 2019 at 1:30am —
Every industry in the world is moving towards data-driven decision making, then one of the most popular and … Continue
Added by Sandra Moraes on September 26, 2019 at 6:56pm —
Here is our selection of featured articles and technical resources posted since Monday:
Added by Vincent Granville on September 26, 2019 at 11:00am —
Overview of changes TensorFlow 1.0 vs TensorFlow 2.0
Earlier this year, Google announced TensorFlow 2.0, it is a major leap from the existing TensorFlow 1.0. The key differences are as follows:
Ease of use: Many old libraries (example tf.contrib) were removed, and some consolidated. For example, in TensorFlow1.x the model could be made using Contrib, layers, Keras or estimators, so many options for the same task confused many new users.… Continue
Added by ajit jaokar on September 25, 2019 at 11:30pm —
The new era of BigData and advances in technology have made significant transitions towards the high functionality of IoT devices. The popularity of IoT devices has led to more easier methods for BigData collection, analysis, and distribution at a rapid rate. According to a report by… Continue
Added by Smith Johnson on September 25, 2019 at 7:30pm —
Bayesian Machine Learning (part -6)
Probabilistic Clustering – Gaussian Mixture Model
Continuing our discussion on probabilistically clustering of our data, where we left out discussion on part 4 of our Bayesian inference series. As we have seen the modelling theory of… Continue
Added by Ashutosh vyas on September 25, 2019 at 9:30am —
Imagine you are sitting at your home and your entire house is connected to Google Home. You ask Google to dim the lights, turn on the tv, shut the windows, pull out your favourite throw cover, put up your favourite movie and finally order your favourite snacks. Or imagine you are managing your day-to-day schedule over your phone by just giving voice commands to your “intelligent assistant.” Sounds… Continue
Added by Emily Williamson on September 25, 2019 at 12:20am —
Even though Amazon is the leader in e-commerce marketplaces – eBay still has its fair share in the online retail industry. Brands selling online should be monitoring prices on eBay as well to gain a competitive advantage.
Extracting data from eBay at a huge scale regularly is a challenging problem for data scientists. Here is an example of scraping eBay using python to identify prices of mobile phones.
Lets us imagine a use case where you need… Continue
Added by Sandra Moraes on September 24, 2019 at 8:00pm —
Big data analytics has been applied to different spheres of human life. One of the best features of data analytics is its adaptability and wide application specter. We have come through the whole series of articles concerning data science application in various spheres that are… Continue
Added by Igor Bobriakov on September 24, 2019 at 6:31am —
This article was written by Soft Media Lab.
It has the following sections.
- What is Machine Learning?
- How to apply machine learning to…
Added by Andrea Manero-Bastin on September 24, 2019 at 6:30am —
Summary: Autonomous vehicles (AUVs) and many other systems that need to accurately perceive the world around them will be much better off when image classification moves from 2D to 3D. Here we examine the two leading approaches to 3D classification, Point Clouds and Voxel Grids.
One of the well-known… Continue
Added by William Vorhies on September 23, 2019 at 2:24pm —