Currently, Cloudera is in the news for all the wrong reasons(Cloudera stock down 42%)
Since Cloudera now also incorporates Hortonworks – the current issues are just the latest in the Big Data woes. Apparently, the third vendor… Continue
Added by ajit jaokar on June 10, 2019 at 10:30am —
Summary: Despite our concerns about China taking the lead in AI, our own government efforts mostly through DARPA continue powerful leadership and funding to maintain our lead. Here’s their plan to maintain that lead over the next decade.
Think all those great ideas that have powered AI/ML for the… Continue
Added by William Vorhies on June 10, 2019 at 8:28am —
R-squared can help you answer the question "How does my model perform, compared to a naive model?". However, r2 is far from a perfect tool. Probably the main issue is that every data set contains a certain amount of unexplainable data. R-squared can't tell the difference between the explainable and the… Continue
Added by Stephanie Glen on June 10, 2019 at 5:30am —
This chart communicates the same insights as a contour plot. What is interesting is the choice of hexagonal buckets (rather than squares) to aggregate data. In fact, any tessellation would work, in particular Voronoi tessellations.…
Added by Capri Granville on June 9, 2019 at 8:00am —
Any organisation needs talented, hardworking and skilled employees irrespective of department, business unit or a team. But finding and nurturing such talent can be challenging sometimes. When it comes to data science field, with rapid change and demand in the technology, many organisations have set up the data science teams. A successful data science team has 3 major strengths, A-availability of data, B- infrastructure and most importantly C - the “right” data scientists.
Added by Rohit Walimbe on June 9, 2019 at 6:03am —
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.
Added by Vincent Granville on June 9, 2019 at 5:30am —
You don't need a sophisticated model nor advanced machine learning techniques to quickly get a high level picture and trends for bottom-line business metrics. Not only the concepts explained here are easy to grasp, but while being high level, it nevertheless includes granular effects. The methodology presented here was used in business contexts in the past, when I was working with enterprise executives, particularly finance people, to assess the overall health of their business, and the… Continue
Added by Vincent Granville on June 8, 2019 at 4:49pm —
By Ajit Jaokar and Ayse Mutlu.
Exclusively for Data Science Central members, with free access. You can download this book (PDF) here.
This tutorial is the second book in the ‘in a weekend’ series – after Classification and Regression… Continue
Added by Vincent Granville on June 8, 2019 at 5:22am —
In my previous blog “How DevOps Drives Analytics Operationalization and Monetization”, I discussed the critical and complementary role of DevOps to operationalize and monetize the analytics that came out of the Data Science development process. While the combination of Design Thinking and Data Science accelerate the creation of more effective, more… Continue
Added by Bill Schmarzo on June 6, 2019 at 11:14pm —
Here is our selection of featured articles and technical resources posted since Monday:
Added by Vincent Granville on June 6, 2019 at 9:30am —
No matter how intelligent and sophisticated your technology is, what you ultimately need for Big Data Analysis is data. Lots of data. Versatile and coming from many sources in different formats. In many cases, your data will come in a machine-readable format…
Added by Max Ved on June 6, 2019 at 1:29am —
Data science is one of the hottest topics in the 21st century because we are generating data at a rate which is much higher than what we can actually process. A lot of business and tech firms are now leveraging key benefits by harnessing the benefits of data science. Due to this, data science right now is really booming.
In this blog, we will deep dive into the world of machine learning. We will walk you… Continue
Added by Divya Singh on June 4, 2019 at 8:30pm —
Summary: The ability to train large scale CNNs directly on your cell phone without sending the data round trip to the cloud is the key to next gen AI applications like real time computer vision and safe self-driving cars. Problem is our current GPU AI chips won’t get us there. But neuromorphic chips look like they will.
Added by William Vorhies on June 4, 2019 at 9:00am —
A significant sector with direct influence on our lives is the role Government plays in terms of the services it offers to citizens and operations of the government.
In view of the penetration of a data-driven approach to all businesses, there is a clear need to adopt a data-driven approach to government services and operations. Hence the penetration of AI is a case of positive influence on Govt products and services.
Some key areas where government operations and services can… Continue
Added by Mahesh Kumar CV on June 4, 2019 at 6:23am —
Value of adopting Data Science Skills
Data Science is responsible to provide meaning to the large amounts of complex data called big data. It involves different fields of work in statistics and computation to interpret data for decision-making.
Advances in the internet and social media is increasing access to big data. Extraction of meaningful information requires the use of AI and ML by data science. Big data is used in every… Continue
Added by Yoey Thamas on June 4, 2019 at 2:33am —
As artificial intelligence progresses and technology becomes more sophisticated, we expect existing concepts to embrace this change — or change…
Added by Max Ved on June 4, 2019 at 12:30am —
Four years ago, the software engineer Jack Alciné caused a storm by pointing out to Google that their algorithm had the unsavoury tendency to classify his black friends as Gorillas. Following a public outcry for blatant racism, the giant apologised and diligently ‘fixed’ the problem. Last year Amazon got into hot water by finding its advanced AI hiring software heavily favoured men for technical positions. Again, retraction followed the outcry. In a more newsworthy style, an unfortunate… Continue
Added by Dany Majard on June 3, 2019 at 3:10am —
Many statistics, such as correlations or R-squared, depend on the sample size, making it difficult to compare values computed on two data sets of different sizes. Here, we address this issue.
Below is an example with 20 observations. The 10 last observations (the second half of the data set) is a mirror of the first 10, and the two correlations, computed on each subset, are identical and equal to 0.30. The full correlation computed on the 20 observations is 0.85.…
Added by Vincent Granville on June 2, 2019 at 7:30am —
My first computer was a Commodore Vic-20 in 1981. I bought the device because of this incredible urge to program in BASIC as a result of Mr. Ted Becker’s course on computer programming. I vaguely remember the leap from the pain-staking process of programming using punch cards to writing code and watching your program run immediately, once you resolved all of the syntax errors of course. Nonetheless, it was thrilling and addictive! In hindsight, a… Continue
Added by Richard Charles, PhD on June 2, 2019 at 12:00am —
The analysis and classification of ordinal categorical data are central in most scientific domains and ubiquitous in governments and businesses.
Examples of ordinal data are either found in questionnaires for measuring opinions or self-reported health status. A well-known example of ordinal data is the Likert Scale 
(DISLIKE = 1, DISLIKE SOMEWHAT… Continue
Added by Ludovico Pinzari on June 1, 2019 at 3:35pm —