Subscribe to DSC Newsletter

All Blog Posts (6,653)

Forecasting with the Baum-Welch Algorithm and Hidden Markov Models

Leonard Baum and Lloyd Welch designed a probabilistic modelling algorithm to detect patterns in Hidden Markov Processes. They built upon the theory of probabilistic functions of a …


Added by Michael Walker on February 24, 2014 at 10:02pm — 1 Comment

R resource guide for quantitative Data scientist

This is a great introductory resource for those interested in R. but have not got a simplified version yet with strong  programming functions of R.  It makes statistical concepts easy to understand and visualize
complex statistical concepts.

Added by Emanuel Baisire on February 24, 2014 at 12:19pm — No Comments

Big data: the door to co-operation and communication in telecommunications

Responsiveness and clarity, perhaps more than in any other industry are crucial to Telecommunications.

Challenged by the advancing communications demands of a ‘smartphone generation’ over the last few years, the role of communications service providers (CSP’s) and the data they offer is increasingly valuable, owing to the sheer quantity and quality of the unstructured data they produce.

Think about it. From mobile network to Internet providers and more, CSP’s are uniquely…


Added by Jed Mole on February 24, 2014 at 3:30am — No Comments

Best Online Educational Programs in U.S.

Source: U.S. News

See Big Data Studio

Added by Fari Payandeh on February 23, 2014 at 10:00pm — 2 Comments

Risks Posed by Commodified Labour in Complex Fields

The commodification of labour coincides with technological advancements in production: it is perhaps most noticeable in relation to factories.  Factory processes replaced the labour once done by skilled tradespeople. It might not be obvious how this trend has continued to this day and is now affecting professionals in complex fields including those in the data sectors. I am talking about the "made to order" and "off the shelf" acquisition of labour commodities. What I describe as commodities…


Added by Don Philip Faithful on February 22, 2014 at 7:05am — No Comments

Big Data Vendor Revenue and Market Forecast 2013-2017

Originating Author: Jeff Kelly. Originally published on…


Added by Vincent Granville on February 19, 2014 at 9:00pm — 1 Comment

Using Analytics to Solve the Problem of Employability

A warm welcome to our guest blogger- Arun Prabhu, Founder and CEO of inTouch Analytics. inTouch analytics delivers ‘everyday analytics’ to clients in India, United States and Europe while focusing on leveraging customer and employee relationships to deliver business value. Their forte being Talent Analytics (among other things), Arun interestingly outlines how they took on the challenge of modeling the problem of…


Added by Gaurav Vohra on February 17, 2014 at 2:58am — No Comments

20 short tutorials all data scientists should read (and practice)

The new, completed version of this Data Science Cheat Sheet can be found here.

We are now at 20, up from 17. I hope I find the time to write a one-page survival guide for UNIX, Python and Perl.…


Added by Vincent Granville on February 15, 2014 at 7:00am — 13 Comments

Interview with Dr. Roy Marsten, the Man Shaping Big Data

By Vincent Granville

Dr. Roy Marsten, author of more than 30 papers on computational optimization in academic journals, has been a professor at MIT, Northwestern, University of Arizona, and the Georgia Institute of Technology before becoming a Big Data entrepreneur, founding several companies. Today, he has taken his…


Added by Vincent Granville on February 14, 2014 at 3:30pm — 3 Comments

Proposal for bulk email processing

Bulk email represents one of the largest portions of legitimate email (spam is not included in this category). Sending bulk email requires a lot of bandwidth, and technical expertize to obtain high delivery rates. Newsletters that you are subscribed to, are typically sent via newsletter management companies, such as Vertical Response, MailChimp, Constant Contact or iContact. It is also expensive, with $10,000 per…


Added by Vincent Granville on February 14, 2014 at 12:30pm — No Comments

The top 1% data users consume 99% of all the data being produced. True or false?

True or false? What would be your numbers, in your opinion? And how do you define data user, even data? Is most of the data dormant and getting deleted even before being processed or summarized to feed some reports, actions or decisions?

Also, not all data is equal, comparing sensor data (very big) with…


Added by Vincent Granville on February 14, 2014 at 10:30am — No Comments

Interesting cartoons

Here are a few ones:…


Added by Vincent Granville on February 14, 2014 at 10:30am — No Comments

Weekly Digest - February 17

Sponsored Announcement

Predictive Analytics World, March 16-21, 2014 in San Francisco is the business event for predictive analytics professionals, managers and commercial practitioners, covering today's commercial deployment of predictive analytics, across industries and across software vendors. The…


Added by Vincent Granville on February 13, 2014 at 12:30pm — No Comments

A Method for Predicting Fishing Activity Based on Geospatial Motion Behaviors - Summarized from an Analyze Technical Report

Illegal fishing is a significant economic and environmental challenge for countries around the world.  Up to 40% of fishing catch in certain parts of the world is unlawful or unregulated, resulting in approximately $10B to $20B in economic losses and significantly depleting…


Added by Analyze on February 13, 2014 at 5:08am — No Comments

Good paper on multidimensional outlier detection on time series


Market analysis is a representative data analysis process with many applications. In such an analysis, critical numerical measures, such as pro¯t and sales, °uctuate over time and form time-series data. Moreover, the time series data correspond to market segments, which are described by a set of…


Added by Romeo Kienzler on February 12, 2014 at 10:30pm — 4 Comments

Big data is cheap and easy

Big data is not expensive. You can process 10 terabytes of data per year on collocated servers using open source tools (Python - I do it in Perl), using your own home-made Hadoop system if needed, to score 100 billion transactions, all for less than $1,000 per year. It requires a bit of…


Added by Vincent Granville on February 12, 2014 at 9:00am — 1 Comment

Data Silos Obstruct Quest for Competitive Advantage

Data and information silos are a significant problem for organizations getting full value from data. Data silos are separate databases or data files that are not part of an organization's enterprise-wide…


Added by Michael Walker on February 11, 2014 at 8:00pm — 2 Comments

Internship in data scientist or statitic


I would like to know how can student find an internship within data scientist or statistic.

I am majoring in statistic and modeling in a top engineering school in france and I am looking for an internship in data mining within I will be able to performe my computional skills ( R,C++,JAVA) and also I will be able to bring my knowledge in statistic.

I am looking for all contries and regions so I am expecting advices or orientations from the members of big data…


Added by Moussa Camara on February 11, 2014 at 12:00pm — 4 Comments

Blog Topics by Tags

Monthly Archives












  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service