A little less than a year ago, I posted a blog on generating multivariate frequencies with the Python Pandas data management library, at the same time showcasing Python/R graphics interoperability. For my… Continue
Added by steve miller on April 25, 2019 at 5:33am —
Hadoop and Spark are software frameworks from Apache Software Foundation that are used to manage ‘Big Data’. There is no particular threshold size which classifies data as “big data”, but in simple terms, it is a data set that is too high in volume, velocity or variety such that it cannot be stored and processed by a single computing…
Added by Divya Singh on April 24, 2019 at 8:30pm —
So many fascinating and deep results have been written about the number (1 + SQRT(5)) / 2 and its related sequence - the Fibonacci numbers - that it would take years to read all of them. This number has been studied both for its applications (population growth, architecture) and its mathematical properties, for over 2,000 years. It is still a topic of active research.…
Added by Vincent Granville on April 24, 2019 at 4:00pm —
The Merriam-Webster dictionary defines the word 'augment' this way: 'to make greater, more numerous, larger or more intense'. If you are wondering how this applies to the term 'augmented analytics', you are not alone. Let's take a closer look at Augmented Analytics and talk about why it has gotten so much attention in the business intelligence world.
What is Augmented… Continue
Added by Kartik Patel on April 24, 2019 at 3:00am —
At the Data Science for IoT course at the University of Oxford – I have been working on a strategy implementing Artificial Intelligence holistically on the Cloud and Edge. This is a complex approach with many new concepts to learn.… Continue
Added by ajit jaokar on April 23, 2019 at 11:00am —
Summary: Finally there are tools that let us transcend ‘correlation is not causation’ and identify true causal factors and their relative strengths in our models. This is what prescriptive analytics was meant to be.
Just when I thought we’d figured it all out, something comes along to make… Continue
Added by William Vorhies on April 22, 2019 at 8:47am —
The world has taken several leaps in the technical aspects in the past few years. This has resulted in tremendous growth for the entire globe. With proper modes of communication and travel, the world has discovered ways of becoming stronger by every passing day.
With the flow of time, technology evolved and discovered those… Continue
Added by Sanjeev Verma on April 22, 2019 at 12:24am —
When dealing with building machine learning models, Data scientists spend most of the time on 2 main tasks when building machine learning models
Pre-processing and Cleaning
The major portion of time goes in to collecting, understanding, and analysing, cleaning the data and then building features. All the above steps mentioned are very important and critical to build successful machine learning… Continue
Added by Rohit Walimbe on April 21, 2019 at 9:00pm —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this… Continue
Added by Vincent Granville on April 21, 2019 at 10:00am —
Quora contribution written by Chomba Bupe.
I am actually not even aware of any machine learning (ML) problem that is considered to have been solved recently or in the past. This tells you a lot about how hard things really are in ML. Of course, if you read media outlets, it may seem like researchers are sweeping the floor clean with deep learning (DL), solving ML problems one… Continue
Added by Andrea Manero-Bastin on April 21, 2019 at 6:00am —
The Big Data craze caught fire with a provocative declaration that “Data is the New Oil”; that data will fuel the economic growth in the 21stcentury in much the same way that oil fueled the economic growth of the 20thcentury. The “New Oil” analogy was a great way to contextualize the economic value of data; to give the Big Data conversation an easily recognizable face. The Economist recently declared data “… Continue
Added by Bill Schmarzo on April 19, 2019 at 11:15am —
Kaggle is an excellent place for learning. And I learned a lot of things from the recently concluded competition on Quora Insincere questions classification in which I got a rank of
182/4037. In this post, I will try to provide a summary of the things I tried. I will also try to summarize the ideas which I missed but were a part of other winning solutions.
As a side note: if you want to know more about NLP, I would… Continue
Added by Rahul Agarwal on April 19, 2019 at 10:15am —
Here is our selection of featured articles and resources posted since Monday:
Added by Vincent Granville on April 18, 2019 at 12:00pm —
Imagine you’re completing a mission in a computer game. Maybe you’re going through a military depot to find a secret weapon. You get points for the right actions (killing an enemy) and lose them for the wrong ones (falling into a pit or getting hit). If you’re playing on high difficulty, you might not conclude this task in just one attempt. Try after try, you learn which consecutive actions are needed to get out of a location safe, armed, and equipped with bonuses like extra health points or… Continue
Added by Kateryna Lytvynova on April 18, 2019 at 2:00am —
Artificial Intelligence and Machine Learning have empowered our lives to a large extent. The number of advancements made in this space has revolutionized our society and continue making society a better place to live in.
In terms of perception, both Artificial Intelligence and Machine Learning are often used in the same context which leads to confusion. AI is the concept in which machine makes smart decisions whereas Machine Learning is a sub-field of AI…
Added by Divya Singh on April 17, 2019 at 9:00pm —
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on… Continue
Added by Vincent Granville on April 17, 2019 at 8:30am —
In this post, I explain
- How you can participate further in the free book series which we are launching based on the early experiences and
- Useful resources we recommend based on our experience for learning coding for Data Science (using Python – tensorflow and keras)
To provide some context, I posted… Continue
Added by ajit jaokar on April 17, 2019 at 8:06am —
In February 2019 Polish government added an amendment to a banking law that gives a customer a right to receive an explanation in case of a negative credit decision. It’s one of the direct consequences of implementing GDPR in EU. This means that a bank needs to be able to explain why the loan wasn’t granted if the decision process was automatic.
In October 2018 world headlines reported about … Continue
Added by Michał Frącek on April 17, 2019 at 7:30am —
Sales data analyses can provide a wealth of insights for any business but rarely is it made available to the public. In 2018, however, a retail chain provided Black Friday sales data on Kaggle as part of a Kaggle competition. Although the store and product lines are…
Added by Ayumi Owada on April 17, 2019 at 6:30am —
Data science has become a widely used term and a buzzword as well. It is a broad field representing a combination of multiple disciplines. However, there are adjacent areas that deserve proper attention and should not be confused with data science.… Continue
Added by Igor Bobriakov on April 17, 2019 at 1:30am —