Featured Blog Posts – June 2015 Archive (66)

Decision Boundaries for Deep Learning and other Machine Learning classifiers

This article was originally posted here, by Dr. Takashi J. OZAKI. Below is a short version.

H2O, one of the leading deep learning frameworks in Python, is now available in R. We will show how to get started with H2O, how it works, how to plot decision boundaries, and finally lessons learned during…


Added by Vincent Granville on June 15, 2015 at 7:30am — No Comments

What are data scientists interested in? Insights from our search engine data

We've gathered data from our newly created DSC search box, and based on 20,000 search queries over the last four months (most of them in the last 30 days), we discovered that the top queries so far are:

Top queries

  • data sets (1266) 
  • excel (846) 
  • python (732) 
  • bernard marr may 2015 (615) 
  • r (608) 
  • visualization (567) 
  • cheat…

Added by Vincent Granville on June 14, 2015 at 8:30pm — No Comments

Harnessing Big Data for Security: Intelligence in the Era of Cyber Warfare.


According to Gartner.com, Big Data is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.

Are there examples of Big Data in action at major global IT giants?

It is…


Added by Christopher Alvin Mokaya on June 14, 2015 at 11:00am — No Comments

Document Similarity Analysis Using ElasticSearch and Python

Elasticsearch is an open source search engine based on Lucene. It's used by market leaders such as Wikipedia, LinkedIn, and eBay. It has an official Python client, elasticsearch-py.
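As a rough illustration of what "document similarity" can mean, the sketch below computes cosine similarity between term-count vectors in plain Python. It is not taken from the original post and does not call Elasticsearch or elasticsearch-py; the function names and the sample sentences are assumptions made up for this example.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample documents, invented for this sketch.
docs = {
    "a": "Elasticsearch is a search engine based on Lucene",
    "b": "Lucene is the library behind the Elasticsearch search engine",
    "c": "Regression predicts median rent from household income",
}
vectors = {name: term_counts(text) for name, text in docs.items()}

# Documents sharing vocabulary score above zero; unrelated ones score zero.
print(cosine_similarity(vectors["a"], vectors["b"]))
print(cosine_similarity(vectors["a"], vectors["c"]))
```

A search engine would typically weight terms (for example by how rare they are across the index) rather than use raw counts, so treat this only as a way to see the basic idea.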



Added by Vozag on June 13, 2015 at 9:30pm — No Comments

Fast-Track, On-Demand, No-Fee Program to Become a Data Scientist

We have redesigned our online, accelerated data science apprenticeship: it is now available to anyone, at no cost and with no restrictions, and it requires neither an application nor deadlines. Data sets, a cheat sheet to get you started, real-life projects to work on, sample code, and tons of resources,…


Added by Vincent Granville on June 12, 2015 at 6:00pm — 3 Comments

The Quantified Workplace: Big Data or Big Brother?

Companies have more data on staff than ever before in history and big data analytics is making its way into HR practices fast. Analyzing staff performance is nothing new, but the extent to which we can now collect and analyze such data is going beyond all norms.

Sociometric Solutions puts sensors into employee name badges that can detect social dynamics in the workplace. The sensors report on how employees move around the workplace, with whom they speak, and even the tone of voice…


Added by Bernard Marr on June 12, 2015 at 4:30pm — No Comments

Big Data In The Amazing World of Gaming

Big Data is big in gaming. Take Zynga, the company behind FarmVille, Words With Friends, and Zynga Poker. Zynga positions itself as a maker of “social” games, which are generally played on social media platforms (rather than game consoles like Nintendo, Xbox or PlayStation) and take advantage of the connectivity with other users that those platforms offer.…


Added by Bernard Marr on June 12, 2015 at 4:30pm — 1 Comment

Tools to automate... more time to innovate

I absolutely loved the point made by Matthew Napleton: we must make tools easier to use!

That doesn't mean we eliminate the need for human interaction. As long as a decision has to be made, a machine can get you further, faster but the magic still happens when a person applies their intellect to the process.

Before leaping to the notion that automation is going…


Added by Cari Jaquet on June 12, 2015 at 1:00pm — 1 Comment

7 Amazing Big Data Myths

Here is my top 7 list of daft things that some people say about Big Data.

I think that Big Data does play a role in some businesses. I also think that some of the basic distributed file store and text search technologies can be usefully employed, in non-traditional indexing, counting and correlation. However, there is an awful lot of nonsense said about Big…


Added by Martyn Jones on June 11, 2015 at 12:26pm — 2 Comments

NewSQL - RDBMS on Steroids

Summary:  NewSQL is alive and well and under the right circumstances could be your best choice.

No, this is not a misprint. Yes, we mean NewSQL, not NoSQL. Recently a colleague asked me about NewSQL and I had to admit that I hadn’t kept…


Added by William Vorhies on June 11, 2015 at 8:00am — 1 Comment

Weekly Digest, June 15

The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday. The picture of the week is from the contribution marked with a +, where you will find the details.


How big is “Big Data?” Business analytics spending will grow to $89.6 billion by 2018. Banking, healthcare, energy and…


Added by Vincent Granville on June 10, 2015 at 7:00am — No Comments

Data is Immortal, but Not Immune to Decay

Data exists in a dangerous state of near-nonexistence. Few businesses would risk not having backups in place. With cloud computing becoming commonplace in the enterprise, we’ve come to accept that our data will be replicated and stored in duplicate.

Even data that is intentionally deleted can often be recovered. When Yahoo! purchased Geocities,…


Added by Martin Doyle on June 9, 2015 at 12:30am — 1 Comment

Case study: how much good traffic a scammer gets from one email spam

I got this in my mailbox this morning:

Subject: Support alert!

From: "LinkedIn Support" <[email protected]>

Date: Mon, June 8, 2015 2:46 pm

Hello [email protected]

It's very important you get verified. 

Please click here. [dangerous link deleted]

LinkedIn Security Team 

This message is sent to…


Added by Vincent Granville on June 8, 2015 at 1:00pm — No Comments

How is Big Data Changing the World?

When we talk about Big Data, many of the examples and use cases we share center around how Big Data is changing the way businesses must operate. But Big Data is changing the world on some exciting global levels as well. So let’s take a step back from the business impact of bigger data and…


Added by Larisa Bedgood on June 8, 2015 at 5:51am — No Comments

Naive Bayes for Dummies; A Simple Explanation

This blog post was originally published as part of an ongoing series, "Popular Algorithms Explained in Simple English" on the AYLIEN Text Analysis Blog.

Picture added by the…


Added by Mike Waldron on June 8, 2015 at 1:00am — 2 Comments

Regression Prediction using AWS Machine Learning

We wanted to be able to predict the median rent of a place given its median home price, its median household income, and the percentage of its homes that are vacant. The data can be downloaded from…


Added by Vozag on June 7, 2015 at 9:30pm — 1 Comment

Data Science: The numbers game Law almost lost.

On the face of it, Analytics and Law are manifestly divergent fields of practice. One need only consider the nature of Algorithms that require numerical attributes for their calculations and the textual rigidity of substantive law to realize this. The very first obstacle one will encounter in applying Analytics to Law is the absence of calculable numerical variables in raw legal data. No judicial precedent, statute or common law principle has ever been reduced to a mathematically sound…


Added by Mkhuseli Mthukwane on June 5, 2015 at 4:00am — 5 Comments

Top Five Data Science Masters Programs

Which top master's programs should you consider for a great career in data science?

A frequently cited study by McKinsey predicts that by 2018, the United States could face a shortage of 140,000 to 190,000 "people with deep analytic skills" as well as 1.5…


Added by Sudhanshu Ahuja on June 4, 2015 at 4:30pm — 3 Comments

Predicting records (highs or lows) - how to do it right (and without statistics)

While everyone talks about unusual extreme weather events (heat waves, cold spells, floods, droughts), very few, including scientists, have been able to make sound predictions for extreme events, be it weather or stock market extreme behavior, or any bubble. Here you will learn how to produce simple model-free confidence intervals for extreme events in Excel, how to generate (correlated) simulated stock market data and (uncorrelated) natural data such as air pollution index, understand why…


Added by Vincent Granville on June 3, 2015 at 8:30pm — No Comments

Weekly Digest, June 8

The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday. The picture of the week is from the contribution marked with a +, where you will find the details.



Added by Vincent Granville on June 3, 2015 at 9:30am — No Comments
