April 2015 Blog Posts (79)

Big data is big business!

What on earth makes the world go round?

  • Is Big Data the next President of the United States of America?
  • Can small people create Big Data?

Well, your guess is not far from the truth! Gone are the days when big machines were the gizmo for nerds and geeks! Today, there is a revolutionary paradigm shift in favor of small machines with capacity for big data!

Businesses are going online at a rate never imagined before in the entire history of the…

Continue

Added by Christopher Alvin Mokaya on April 15, 2015 at 10:00pm — No Comments

An Excel Tutorial on Analyzing Large Data Sets

Ever wanted to use Excel to examine big data sets? This tutorial will show you how to analyze over 300,000 items at one time. And what better topic than baby names? Want to see how popular your name was in 1910? You can do that. Want to find the perfect name for your baby? Here’s your chance to do it with data.

Picture:…

Continue

Added by Vincent Granville on April 14, 2015 at 3:00pm — No Comments

Weekly Digest - April 20

The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday.

Announcements

  • Think Big, a Teradata company, provides data science and engineering services that enable organizations to accelerate their time to value from big data.…

Continue

Added by Vincent Granville on April 14, 2015 at 12:30pm — No Comments

The 7 Most Unusual Applications of Big Data You’ve Ever Seen!

It’s all well and good to talk about customer experience and managing inventory flow, but what has big data done for me lately?

I’ve rounded up seven of the most interesting — and unique — applications for big data I’ve seen recently and how they may be impacting your life.

Big Data Billboards

Outdoor marketing company …

Continue

Added by Bernard Marr on April 13, 2015 at 3:30pm — 2 Comments

Enhance your CRM Data to Sharpen your Sales Pitch

In the old days, you knew who to contact to make…

Continue

Added by Steve Webster on April 13, 2015 at 1:30am — No Comments

Calculate Cosine Similarity Using Scipy – Data Sets & Sample Code

What is Cosine Similarity?

Cosine Similarity is a measure of similarity between two vectors that calculates the cosine of the angle between them. Similarity ranges from −1 meaning exactly opposite, to 1 meaning exactly the same, with 0 usually indicating independence, and in-between values indicating intermediate similarity or dissimilarity.…

Continue

Added by Gridlex on April 12, 2015 at 8:30pm — 1 Comment
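
The definition in the cosine similarity post above can be illustrated with a short sketch. It is not the code from the original post, which is truncated here: it uses only the Python standard library, and the function name `cosine_similarity` is a hypothetical helper chosen for illustration. For reference, SciPy's `scipy.spatial.distance.cosine` returns the cosine distance, that is, 1 minus the similarity computed below.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors.

    Hypothetical helper for illustration; not taken from the original post.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")

    # Dot product and Euclidean norms of the two vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    # The angle is undefined if either vector has zero length.
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")

    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(cosine_similarity([1, 2, 3], [2, 4, 6]))   # same direction: close to 1
    print(cosine_similarity([1, 0], [0, 1]))         # orthogonal: 0
    print(cosine_similarity([1, 2], [-1, -2]))       # opposite direction: close to -1
```

Because the result depends only on the angle between the vectors, scaling either vector by a positive constant leaves the similarity unchanged.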

It's Time to Bring Your Own Data!

I just got back from my vacation in Barcelona, Spain where I spent about 3 days, then rented a car and drove up north through the South of France. My last stop was Nice, France. The trip was a lot of fun and now I intend to find some data to help bring back great memories (hm...it sounds more geeky than I thought but anyway).

Barcelona is located in the Catalonia region of Spain, famous for its earthy dry reds as well as Cava - the world's most delicious bubbly drink. I am a big wine fan which…

Continue

Added by Tatiana Sorokina on April 12, 2015 at 8:00am — No Comments

Driving Behaviour as a Telematic Fingerprint

The objective of my final project at Metis, from weeks 9 to 12, is to categorize drivers based on their behaviour on the roads - their driving style and the type of roads that they follow.

The challenge associated with this objective is to uniquely identify a driver (and hence his proper “driving…

Continue

Added by DAGHER Philippe N. on April 11, 2015 at 11:49am — 2 Comments

Evaluating RF for Survival Analysis Using Prediction Error Curves

Ulla B. Mogensen, Hemant Ishwaran, Thomas A. Gerds (2012). Evaluating Random Forests for Survival Analysis Using Prediction Error Curves. Journal of Statistical Software, 50(11), 1-23.



Abstract

Prediction error curves are increasingly used to assess and compare predictions in survival analysis. This article surveys the R package pec which provides a set of functions for efficient computation of prediction error…

Continue

Added by Diego Marinho de Oliveira on April 10, 2015 at 12:21am — No Comments

40 Excel Tricks

First, let's start with an article featuring many great Excel functions, entitled 11 Advanced Excel Tricks That Will Help You Get An Instant Raise At Work. It describes the following Excel functions:

  • VLOOKUP: You can use the VLOOKUP function to search the first column of a range…

Continue

Added by Mirko Krivanek on April 9, 2015 at 10:00pm — 6 Comments

The 5 V's of Big Data by Bernard Marr

Nice infographics produced by the famous business management consultant and author Bernard Marr. Click on the picture, then click on it once more, to see an easy-to-read version.

DSC Resources

Continue

Added by Bernard Marr on April 9, 2015 at 7:30pm — No Comments

That’s Data Science: Airbus Puts 10,000 Sensors in Every Single Wing!

In a meeting with Airbus last week I found out that their forthcoming A380-1000 – the supersized airliner capable of carrying up to 1,000 passengers – will be equipped with 10,000 sensors in each wing.

The current A350 model has a total of close to 6,000 sensors across the entire plane and generates 2.5 Tb of data per day, while the newer model – expected to take…

Continue

Added by Bernard Marr on April 9, 2015 at 7:00pm — 3 Comments

A Data Scientist's Advice to Business Schools

I assist enterprises by driving data-driven approaches into their operations, developing market-aware products that learn from data, and encouraging data-smart cultures among the c-suite of executives. I have had the privilege to work with many talented professionals looking to disrupt their industries…

Continue

Added by Sean McClure on April 8, 2015 at 8:00am — 2 Comments

Top 5 Disruptive Technologies that Will Change the World

To remain competitive today, a business must be willing to embrace new technologies. Using old or outdated technology can leave a business trailing in the dust of newer businesses that have moved to the forefront of the industry and are reaping the benefits that new technology affords them. Of course, this means that a business must also be aware of new technologies and how they might benefit it, which is not always easy to do. In fact, there is a term…

Continue

Added by Shezagary on April 7, 2015 at 9:44pm — 1 Comment

The Easy Way Big Data Can Be Accessed with Data-as-a-Service

Primed to make a huge entrance in 2015, Data-as-a-Service (DaaS) gives companies real-time data to overcome tough data challenges. DaaS allows companies to generate real-time insights and revenue from Big Data. Companies commonly report feeling overwhelmed by the sheer size of big data, not to mention the processes necessary to use it. This no longer has to be the case. With DaaS, using big data is no longer a process that takes a couple of months.

What is…

Continue

Added by Larisa Bedgood on April 7, 2015 at 12:30pm — No Comments

The Hype Around Graph Databases And Why It Matters

Organizations are struggling with a fundamental challenge – there’s far more data than they can handle. Sure, there’s a shared vision to analyze structured and unstructured data in support of better decision making, but is this a reality for most companies? The big data tidal wave is transforming the database management industry, employee skill sets, and business strategy as organizations race to unlock meaningful connections between disparate sources of…

Continue

Added by Tony Agresta on April 7, 2015 at 6:45am — 4 Comments

Tuning Machine Learning Models Using the Caret R Package

Machine learning algorithms are parameterized so that they can be best adapted for a given problem. A difficulty is that configuring an algorithm for a given problem can be a project in and of itself.

As with selecting ‘the best’ algorithm for a problem, you cannot know beforehand which algorithm parameters will be best for that problem. The best thing to do is to investigate empirically with controlled experiments.

The caret R package was designed to make finding…

Continue

Added by Diego Marinho de Oliveira on April 7, 2015 at 6:41am — No Comments
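
The idea of investigating parameters "empirically with controlled experiments", mentioned in the caret post above, can be sketched as a small grid search with cross-validation. This is not caret and not R: it is a minimal, standard-library Python illustration under stated assumptions. The toy one-dimensional k-nearest-neighbour classifier, the data set, and the names `knn_predict`, `cv_accuracy`, and `grid_search` are all hypothetical and do not come from the original post.

```python
from statistics import mean


def knn_predict(train, x, k):
    """Predict a label for x by majority vote among the k nearest training points."""
    neighbours = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    votes = [label for _, label in neighbours]
    return max(set(votes), key=votes.count)


def cv_accuracy(data, k, folds=5):
    """Estimate accuracy for one value of k using simple interleaved k-fold CV."""
    scores = []
    for f in range(folds):
        # Every folds-th point (starting at f) is held out; the rest is training data.
        test = data[f::folds]
        train = [p for i, p in enumerate(data) if i % folds != f]
        hits = sum(knn_predict(train, x, k) == y for x, y in test)
        scores.append(hits / len(test))
    return mean(scores)


def grid_search(data, grid, folds=5):
    """Evaluate every candidate in the grid and return the best one with all scores."""
    results = {k: cv_accuracy(data, k, folds) for k in grid}
    best = max(results, key=results.get)
    return best, results


# Toy data (an assumption for illustration): label is 1 when x >= 10,
# with two deliberately flipped labels acting as noise.
data = [(x, int(x >= 10)) for x in range(20)]
data[4] = (4, 1)
data[15] = (15, 0)

if __name__ == "__main__":
    best_k, scores = grid_search(data, grid=[1, 3, 5])
    print("accuracy per k:", scores)
    print("best k:", best_k)
```

Each candidate value is scored on exactly the same folds, which is what makes the comparison a controlled experiment. Odd values of k are used so that a two-class vote cannot tie.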

Four Techniques to Apply in the Design of Data-heavy Applications

Guest blog post.

Big data makes a noteworthy contribution to the usefulness of an application, but its presence can make the design of a clean and usable interface rather difficult. Today, many web applications are built on the platform of big cloud-based data, which leads to the question: how can a designer deliver all the necessary data in an application without making a train-wreck of everything?

Creating a balance between complex data requirements and a simplified…

Continue

Added by Vincent Granville on April 7, 2015 at 3:50am — No Comments

Building an NCAA Men’s Basketball Predictive Model

Authors:  / Gregory J. Matthews.

Journal of Quantitative Analysis in Sports. Volume 11, Issue 1, Pages 5–12.



Abstract

Computing and machine learning advancements have led to the creation of many cutting-edge predictive algorithms, some of which have been demonstrated to provide more accurate forecasts…

Continue

Added by Diego Marinho de Oliveira on April 7, 2015 at 12:46am — No Comments

The Hot Hand Rises Again

Next month marks the 100th anniversary of Babe Ruth’s first home run.

This year, opening day in baseball signals the “closing day” for one of the classic truisms among sports statisticians: the belief that there…

Continue

Added by Peter Bruce on April 6, 2015 at 10:40am — No Comments

© 2019 Data Science Central ®