“Big Data is dead.” “Big Data is passé.”
“We no longer need Big Data; we need Machine Learning now.”
As we end 2017 and look forward to big (data) things in 2018, the most important lessons of 2017 – in fact, maybe the most important lesson going forward – is that Big Data is NOT a thing. Big Data isn’t about the volume, variety or velocity of data any more than car…
Added by Bill Schmarzo on January 20, 2018 at 5:30am —
Since I am sometimes asked to explain phenomena in the absence of data, it becomes necessary to determine what data is required to explain phenomena. Some would say the best approach is to develop and test a hypothesis - to start filling a void of space with pinholes of light - until there are enough lit pinholes to provide a… Continue
Added by Don Philip Faithful on January 20, 2018 at 6:00am —
In a previous post I talked about how to preprocess and explore image dataset. In this post, I will talk about how to model image data with neural networks having a single neuron, using sigmoid function. Original version of this blog can be found here. This is equivalent to logistic…
Added by Jobil Louis on January 20, 2018 at 5:00pm —
How many times you have compared your homeland to another country when you have watched a foreign language movie/show or when you have traveled to a different country for a business or leisure trip ? I have found myself doing so many times and also had conversations and debates with friends.What I have observed in my discussion is that our brain very quickly comes to a decision on the basis of what it sees at the first glance, very few people control their mind and think about various… Continue
Added by vishal sethia on January 22, 2018 at 4:00pm —
Summary: This is a lesson in how it may be possible to snatch victory from the jaws of defeat. 1.) A good ROC score does not necessarily mean a good model. 2.) Even a weak model may be good at the top and bottom – consider how you can use that.
This is a lesson in how it may be possible to snatch victory from the jaws of defeat. In our world, defeat is ending up with a poor model that doesn’t do what you’d hoped. This story about a particular project… Continue
Added by William Vorhies on May 23, 2017 at 5:55am —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.
Featured Resources and Technical Contributions
Added by Vincent Granville on January 20, 2018 at 1:00pm —
Artificial intelligence is getting smarter by leaps and bounds -- within this century, research suggests, a computer AI could be as "smart" as a human being. And then, says Nick Bostrom, it will overtake us: "Machine intelligence is the last invention that humanity will ever need to make." A philosopher and technologist, Bostrom asks us to think hard about the world we're building right now, driven by thinking machines. Will our smart machines help to preserve humanity and our values -- or… Continue
Added by Capri Granville on January 20, 2018 at 12:30pm —
This article was posted by Dikesh Jariwala on R Bloggers.
With ever increasing volume of data, it is impossible to tell stories without visualizations. Data visualization is an art of how to turn numbers into useful knowledge.
R Programming lets you learn this art by offering a set of inbuilt functions and libraries to build visualizations and present data. Before the… Continue
Added by Emmanuelle Rieuf on January 13, 2018 at 9:00am —
We may be years away from the “AI-enabled Coworker,” but the first implementations of machine-learning capabilities are finding their way into the everyday data-analysis tools used by businesses of all types. Cognitive assistance promises to reshape business processes, but only if app development and deployment tools are adapted to support machine learning.
While it has become fashionable to hypeAIas the next game-changing technology promising to have an impact greater than either… Continue
Added by Emmanuelle Rieuf on November 15, 2016 at 11:00am —
This article is written by Gunnar Carlsson.
Topological data analysis has been very successful in discovering information in many large and complex data sets. In this post, I would like to discuss the reasons why it is an effective methodology.
One of the key messages around topological data analysis is that data has shape and the shape matters. Although it may… Continue
Added by Amelia Matteson on January 18, 2017 at 12:30pm —
I was recently a guest lecturer at the University of California Berkeley Extension in San Francisco. On a lovely Saturday afternoon, the classroom was crowded with students of all ages learning the tools of the modern economy. The craftspeople of the “Analytics Revolution” were busy learning new skills and tools that will prepare them for this Brave New World of analytics. I was blown away by their dedication!
As we teach the next generation, it’s important…
Added by Bill Schmarzo on January 19, 2018 at 5:00am —
As the world is progressing towards an age of development in the Internet of Things (IoT) and other aspects related to it, the concept of the digital consumer is on the rise. Consumers of today want to experience the feasibility that is promised through this method. From experiencing customer support resources… Continue
Added by Ronald van Loon on January 19, 2018 at 5:30am —
In the past few years, machine learning (ML) has revolutionized the way we do business. A disruptive breakthrough that differentiates machine learning from other approaches to automation is a step away from the rules-based programming. ML algorithms allowed engineers to leverage data without explicitly programming machines to follow specific paths of problem-solving. Instead, machines themselves arrive at the right answers based on the data they have. This capability made business… Continue
Added by Olexander Kolisnykov on January 18, 2018 at 3:00am —
We’ve compiled a list of the hottest events and conferences from the world of Data Science, Machine Learning and Artificial Intelligence happening in 2018.
Below are all the links you need to get yourself to these great events!
17th – 19th Continue
Added by Matt Reaney on January 9, 2018 at 3:30am —
In this article a few simple applications of Markov chain are going to be discussed as a solution to a few text processing problems. These problems appeared as assignments in a few courses, the descriptions are taken straightaway from the courses themselves.
1. Markov Model of Natural…
Added by Sandipan Dey on January 16, 2018 at 8:30pm —
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression, Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting,… Continue
Added by Vincent Granville on January 16, 2018 at 7:00pm —
In this series, I will talk about training a simple neural network on image data. To give a brief overview, neural networks is a kind of supervised learning. By this I mean, the model needs to train on historical data to understand the relationship between input variables and target variables. Once trained, the model can be used to predict target variable on new input data. In the previous posts, we have written about linear, lasso and ridge regression. All those methods come under… Continue
Added by Jobil Louis on January 16, 2018 at 8:00pm —
Most organizations’ IOT Strategy look like a game of ‘Twister’ with progress across important IOT capabilities such as architecture, technology, data, analytics and governance; variables comprising a series of random investments and decisions.…
Added by Bill Schmarzo on January 13, 2018 at 5:00am —
There is no doubt that SQL is the most widely-used working language for processing structured data. Not only is the language adopted by all relational database products, but its implementation is the goal of many newly-invented big data platforms. But in many aspects SQL isn’t so convenient to use in handling various computational and query demands. The procedurality issue stated in the last article is just a superficial one. SQL’s problems are rooted in its theory foundation, the relational… Continue
Added by JIANG Buxing on January 14, 2018 at 10:30pm —
In this script, we will explore the open roles at Google, and try to see what common attributes Google is looking for, in future employees.
This dataset contains text information about job location, title, department, minimum and preferred qualifications and the responsibilities of the position. Using this dataset we will try to answer the following questions:
- Where are the open roles?
- What are the minimum and preferred educational qualifications needed…
Added by Anupama Rajaram on January 15, 2018 at 5:30pm —