Machine Learning? Data Mining?
Well, there is a little bit difference between machine learning and data mining although I don't see any difference between them.
See the Stackexchange debate on the difference between machine learning and data mining. Continue
At the end, it is about training the machine to…
Added by Gregory Choi on April 7, 2016 at 4:30pm —
Find out how Data to Value’s Graph Data software partners Neo4j and Linkurious have been used in the Panama Papers investigation.
Recently there has been a lot of interest around the newly published Panama papers. This giant trove of data that is said to contain a whopping 11.5 million documents or… Continue
Added by Zygimantas Jacikevicius on April 7, 2016 at 4:59am —
For a modern business, data is everything. It can tell you who your customers are, what they like best, what ideas you should pursue, and how those ideas will eventually perform. If you can ask the right questions and use the right tools to uncover and crunch the data, you’ll be in a much better position to sustain your profitability and outcompete your rival businesses.
The Ignorance of Data
The problem is, many businesses are ignoring data, or neglecting it in crucial… Continue
Added by Larry Alton on April 7, 2016 at 2:24am —
Starred articles are new additions or updated content, posted between Thursday and Sunday. The weekly digest has six sections: (1) Featured Resources and Technical Contributions, (2) Featured Articles and Case Studies, (3) From our Sponsors, (4) News, Events, Books, Training, Forum Questions, (5) Picture of the Week, and (6) Syndicated Content.
The full version is always published Monday.… Continue
Added by Vincent Granville on April 6, 2016 at 2:30pm —
A business intelligence platform is used by healthcare organizations to build helpful healthcare applications that assist them in ensuring the provision of quality healthcare to patients. BI platforms help control healthcare costs and provide several benefits to healthcare organizations such as analysis capability, providing information on delivery, and integration.
Healthcare BI platforms offer a very useful function called financial analytics. The availability of the financial… Continue
Added by Ankit Jain on April 6, 2016 at 10:30am —
The E-Commerce boon didn’t just change the way we buy goods, it revolutionized the way we look at our customers completely. E-Commerce uses analytics as a tool to study consumer behavior further boosting sales, acquiring new customers and building a brand. There are 4 major heads in any e-commerce business that need to be analyzed to ensure proper functioning of the business:
- As most of the commerce businesses just act as a platform to connect buyers…
Added by Tanmay Bhandari on April 6, 2016 at 10:00am —
Data storytelling is undergoing a big change, one that will influence not only data journalism but also how data is shown in the business world. And 2016 will only accelerate this trend.
Five years ago, data journalism focused on rich interactives designed primarily for a monitor or laptop screen. Those interactives were complete applications, destinations to be browsed. You could explore, drill down, filter, and play with them. This was a rich experience in the days when we used a… Continue
Added by Andy Cotgreave on April 6, 2016 at 9:26am —
[The goal of this page] Continue
When I have read all R introductions, the books were filled with just instructions. The goal of R is to solve our real life problem. That's why I want to minimize this page. In the real though, we need to understand some key concepts that might be useful for you to tackle the real life problem. Here's basic data structures and data manipulation method.
Still, I believe the best way to learn R programming language is to tackle the real life…
Added by Gregory Choi on April 6, 2016 at 8:53am —
I recently wrote an article entitled ‘First Big Data initiative – why you need Big Data governance now!’ and one of the comments received was from metadata expert and noted industry metadata presenter and speaker Bob Schork. I had the privilege of working with Bob in the past and have benefited from his extensive metadata insights over the years. What made me write this article was his comment stating that “metadata which is and will be ignored by many working on a BD (Big Data) project, to… Continue
Added by John P. Stevens on April 6, 2016 at 7:00am —
Whatever your business sector, data is your most valuable asset. Along with the machinery and stock you hold, data and insights hold the key to profit and growth. But it has the unique ability to unite every department, and every function. It can reveal problems in processes, drive productivity among your staff and ensure everyone is ‘singing from the same hymn… Continue
Added by Martin Doyle on April 6, 2016 at 3:30am —
Guest blog post by Xiu-Shen Wei, originally posted here.
Deep Neural Networks, especially Convolutional Neural Networks (CNN), allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods… Continue
Added by Vincent Granville on April 5, 2016 at 4:00pm —
Summary: Every year the Strata+Hadoop conference offers a unique opportunity to look into the future and see what trends will dominate our profession in the near term. Here’s my take.
I’m just back from the Strata+Hadoop Conference in San Jose last week and I’m charged up. This is a traditionally sold-out crowd of 5,000 Big Data and data science practitioners meeting in the heart of the Silicon Valley once a year to… Continue
Added by William Vorhies on April 5, 2016 at 3:27pm —
Guest blog post by Kenneth Soo, originally posted here.
Take a look at the picture below and try to identify what it is:
One should be able to tell that it is a giraffe, despite it being strangely fat. We recognize images and objects… Continue
Added by Vincent Granville on April 5, 2016 at 3:01pm —
These are the findings from a CrowdFlower survey. Data preparation accounts for about 80% of the work of data scientists. Cleaning data is the least enjoyable and most time consuming data science task, according to the survey. Interestingly, when we asked the question to our data scientist, his answer was:…
Added by L.V. on April 5, 2016 at 9:30am —
Bigger is always better, right?
Well, not necessarily. Even in the realm of big data, companies and governments are beginning to see the value in a “less is more” approach. This is actually in stark contrast with things data-driven CEOs like Jeff Bezzos at Amazon believed when he said “We never throw away data”.
In fact, the European Union has recently included this in new laws of the Data Protection Act that will come into effect soon. The act says, “Personal data shall be… Continue
Added by Bernard Marr on April 5, 2016 at 8:30am —
A blank face is the most common reaction of people when I tell them I work in big data. Most people I meet in my private life haven’t even heard of the term ‘big data’, let alone have a solid understanding of what it is.
So, for those people in my (and maybe your) life, I’ve put together a little cheat sheet answering the three biggest questions I get asked about big data — each in about a minute or less, and in terms anyone can understand.
Ready? Start the timer. Here we… Continue
Added by Bernard Marr on April 5, 2016 at 8:30am —
It's nothing new that Machine Learning and Big Data have been making the headlines for quite some time now, almost half a decade at least. The Data Scientist job has even been identified by Glassdoor as the
best job in America (as of April 2016).
However, according to Gartner's 2015 Hype Cycle for emerging technologies, Machine Learning has now passed the peak of inflated expectations and is… Continue
Added by Alex Marandon on April 5, 2016 at 3:00am —
After more than three years of discussion the EU General Data Protection Regulation or GDPR framework has been finally agreed on. This directive will replace the current 1998 Data Protection Act. As with most major legislative change it will not be enforced immediately and will likely become compulsory at the first half of 2018. The main intent of the… Continue
Added by Zygimantas Jacikevicius on April 4, 2016 at 11:44pm —
Earlier it was Random forest , the go-to algorithm for classification problems in most of the data science competitions. Correctly formulated problem , with smart feature engineering and minimal tuning of the RF algorithm ( ntree, mtry) using grid search could get you past the bulk of the crowd .
Then came Xgboost and it soon became the hot favorite. It isn't very tough to say Deep learning is running the show at the moment. Although, GPU powered…
Added by Ashish kumar on April 4, 2016 at 5:52pm —
It’s 5:05pm EST. Bob, CFO of ABC Inc is about to get on an earnings call after just reporting a 20% miss on earnings due to slower revenue growth than forecasted. Company ABC’s stock price is plummeting, down 25% in extended hour trading. The board is furious and investors demand answers on the discrepancies.
Inaccurate revenue forecast remains one of… Continue
Added by Winnie Cheng on April 4, 2016 at 10:43am —