September 2015 Blog Posts (118)

There is Something New Under the Sun: Uplift Modeling

Summary:  Just when you thought there was nothing new under the sun in response modeling, surprise, along comes Uplift Modeling.

Tools and approaches to response modeling are so fully explored and…


Added by William Vorhies on September 21, 2015 at 10:00am — No Comments

The Data Analytics Landscape - A Crystal Ball

It wasn’t so long ago that the universe of data analytics companies was much easier to describe.  In the year 2000 there were some clearly defined categories of companies that were all engaged in some part of the data and analytics world, although data analytics was not really recognized as a discrete sector at the time.  Companies typically fit into a specific industry category, such as software (Oracle, Siebel, Epiphany), consulting (Accenture, Bain), data (credit bureaus, IMS, IRI); or a…


Added by Gregory Thompson on September 21, 2015 at 7:35am — No Comments

How many analytics companies are there?

I set out a structure for the data analytics industry in a previous post.  That framework provides a way to categorize and evaluate companies: it defines four sectors, each containing a number of segments.  On the one hand there is a group of global leaders that provide a comprehensive suite of analytics software and services to all comers, as well as a group of companies that have deep expertise and offerings focused on…


Added by Gregory Thompson on September 21, 2015 at 7:30am — 2 Comments

Client regulatory data management challenges and opportunities

Stricter rules are being imposed on the financial sector in a variety of areas following the financial crisis. Regulators are more involved than ever in scrutinising Know Your Customer (KYC) processes and are imposing large fines on non-compliant firms. Anti-money laundering (AML) practices have become deeper and more advanced to counter the financing of terrorism, corruption and crime. EMIR and Dodd-Frank legislation was also put into place with the aim of improving transparency and reducing…


Added by Zygimantas Jacikevicius on September 21, 2015 at 2:47am — No Comments

5 Text Classification Case Studies Using SciKit Learn

Scikit is an open source machine learning…


Added by Baglom on September 20, 2015 at 11:56pm — 2 Comments

100 top data science presentations

We've already published the top big data presentations on SlideShare, as well as a great GitHub list of public data sets, or…


Added by L.V. on September 20, 2015 at 1:30pm — No Comments

Wearable Technology: Bringing Digital Disruption to our Lives !!

Do you remember Captain James Kirk using his wrist watch to communicate with the crew of the Starship Enterprise back in 1966?
Today, almost 50 years later, it has finally become a reality!
Digital disruption is occurring in all business functions all around the world. Wearables are becoming mainstream and disrupting almost every industry, with the…

Added by Sandeep Raut on September 20, 2015 at 6:18am — No Comments

Recurrent neural networks, Time series data and IoT – Part One



In this series of exploratory blog posts, we explore the relationship between recurrent neural networks (RNNs) and IoT data.  The article is written by Ajit Jaokar, Dr Paul Katsande and Dr Vinay Mehendiratta  as part of the…


Added by ajit jaokar on September 19, 2015 at 11:30am — 1 Comment

Data Science In The Cloud With DataJoy

DataJoy is an unbelievably fantastic way for a working data scientist to have their favorite tools at hand. I am a minimalist when it comes to being mobile, whether working on the road, traveling for leisure, and sometimes both. I do not like to keep files on my laptop and I do not, for the most part, like to worry about keeping updated applications on my laptop. I have tried as much as possible to push my life into the cloud. Yes, I travel with a chromebook. Yes, I use…


Added by Dr. William Tribbey on September 19, 2015 at 10:00am — No Comments

My data science glossary

Guest blog by Bob DuCharme

Complete with a dot org domain name.

Lately I've been studying up on the…


Added by William Vorhies on September 19, 2015 at 10:00am — No Comments

50+ Free Data Science Books

Very interesting compilation published here, with a strong machine learning flavor (maybe machine learning book authors - usually academics - are more prone to making their books available for free). Many are O'Reilly books freely available. Here we display those most relevant to data science. I haven't checked all the sources, but they seem legit. If you find some issue, let us know in the…


Added by L.V. on September 19, 2015 at 9:00am — 4 Comments

Book: Creating Value with Big Data Analytics - Making Smart Marketing Decisions

Companies around the world are struggling with a vast amount of data, and can’t make sense of it all. Big Data has the promise of providing firms with significant new information about their markets, their products, their brands, and their customers – but currently, there’s often a great divide between big data and truly usable insights that create value for…


Added by Natasha Walk on September 18, 2015 at 12:00am — 1 Comment

The Real Reason for House Price Inflation in New Zealand

Money Money Money

Simon Knudsen - Sept 2015

Population growth, shortages in housing supply, internal migration, immigration, cheap money, and foreign investors are just a few of the claimed causes of House Price Inflation (HPI) in New Zealand in recent years. The notorious example of HPI in action is NZ's largest city - Auckland. The city has experienced double-digit HPI of late…


Added by Simon Knudsen on September 17, 2015 at 5:13pm — 2 Comments

How Shell Uses Analytics To Drive Business Success

The oil and gas industries are facing major challenges - the costs of extraction are rising and the turbulent state of international politics adds to the difficulties of exploration and drilling for new reserves.

In the face of big problems, its key players are turning to Big Data in the hope of finding innovative solutions to these pressing issues.

Big Data is the name used to describe the theory and practice of applying advanced computer analysis to the ever-growing amount of…


Added by Bernard Marr on September 17, 2015 at 12:00pm — 1 Comment

How to Choose Between Learning Python or R First

If you're starting out in Data Science, this is a good question to ask yourself.  After all, you want to be immediately employable and also efficient with your own time.

Cheng Hang Lee took on this question in an article of the same name earlier this year and has a fairly comprehensive discussion of the pros and cons.  Some highlights:

The Case for R

R has a long and trusted history and a robust supporting community in the data…


Added by William Vorhies on September 17, 2015 at 9:44am — 2 Comments

An introduction to Apache Drill and why it is useful

With the rapid growth of data and the shift towards rapid development solutions, much data is being stored in NoSQL stores such as Hadoop and MongoDB. The infrastructure built upon relational databases that have been used for decades cannot keep up with the volume and scope of data being captured. At the same time, SQL remains a very widely used and effective method for extracting and analysing data.  In short, it will not be replaced by hierarchical query techniques…


Added by Zygimantas Jacikevicius on September 17, 2015 at 6:06am — No Comments

No cost training to becoming a data scientist

Statistical analysis and data mining were the top skills that got people hired in 2014 based on LinkedIn analysis of 330 million LinkedIn member profiles. We live in an increasingly data-driven world, and businesses are aggressively hiring experts in data storage, retrieval, and analysis. Across the globe, statistics and data analysis skills were highly valued. In the US, India, and France, those skills are in particularly high demand.

What is data science?

Data scientist…


Added by Marina Mitrashov on September 17, 2015 at 3:30am — 1 Comment

15 Books every Data Scientist Should Read

With all this talk of terabytes and petabytes of digital information zipping around the world at the speed of light, it’s sometimes easy to forget about the humble book!

After all, pretty much all you could ever practically need to know is probably conveniently available on a blog, Google Hangout or SlideShare presentation somewhere.

But to many of us, books are special – and whether you are so attached to the feel of turning paper pages between your fingers that you would never…


Added by Bernard Marr on September 16, 2015 at 5:00pm — 4 Comments

All Businesses Are Data Businesses

By Brad Kolarov, co-founder and Managing Partner at B23, a boutique Big Data and Cloud Computing software development and implementation company


Have you seen the Audi commercial with the autonomous mail delivery drones stalking the employees leaving their office? If not, it's below.

It’s a bit of a stretch, but the commercial is pretty funny and a good example of the technological advances that are on the horizon.  To manage a fleet of autonomous mail delivery…


Added by Brad Kolarov on September 16, 2015 at 10:00am — 1 Comment

Weekly Digest, September 21

The weekly digest now has 6 sections: (1) Featured Articles and Case Studies, (2) Featured Resources and Technical Contributions, (3) From our Sponsors, (4) News, Events, Books, Training, Forum Questions, (5) Picture of the Week, and (6) Syndicated Content.

The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday.…


Added by Vincent Granville on September 16, 2015 at 10:00am — No Comments
