Subscribe to DSC Newsletter

March 2016 Blog Posts (102)

Safely sharing data through an api with alliance partners.

The following use case shows how an exchange of clean blended data can simplify the implementation of commercial and strategic alliances between businesses with a common interest.

dlc-3

USE CASE 1

The company is a Professional Sports Club. They operate their own sports stadium which is well…

Continue

Added by Bruce Robbins on March 21, 2016 at 6:00am — No Comments

Biased vs Unbiased: Debunking Statistical Myths

Anyone who attended statistical training at the college level has been taught the four rules that you should always abide by, when developing statistical models and predictions:

  1. You should only use unbiased estimates
  2. You should use estimates that have minimum variance
  3. In any optimization problem (for instance to compute an estimate from a maximum likelihood function, or to detect the best, most predictive subset of variables), you should always shoot for a…
Continue

Added by Vincent Granville on March 19, 2016 at 4:30pm — 3 Comments

A (rather funny) Data Science story.

(At Lana's apartment) 

  • Lana: I like Roger. I think he could make me happy. (Hypothesis) 
  • Steph: By no chance. He’s a professional cheater, I am pretty sure about it. (Another hypothesis) 
  • Lana: I am so confused… What's more important: His loyalty to me or his ability to make me happy? (Data Science’s uttermost matter: Are we asking the right…
Continue

Added by Elías De La Rosa on March 19, 2016 at 12:43pm — No Comments

Machine Learning Algorithm Identifies Tweets Sent Under the Influence of Alcohol

Interesting article posted recently in MIT Technology Reviews. What kind of metrics would help detect such tweets? We think the following might be useful:

  1. Local time (like late at night)
  2. Whether a picture or not is associated with the tweet
  3. Whether a link or not is associated with the tweet
  4. Number of typos for the tweet in question, compared with average for the user in question 
  5. Frequency of tweets (sudden spike) for user in…
Continue

Added by L.V. on March 18, 2016 at 6:30am — 1 Comment

How to Represent Data with Intelligent Use of the Coordinate System

This is a guest repost by Jacob Joseph from CleverTap.

The most widely used coordinate system to represent data is the…

Continue

Added by Shawnee Swarengin on March 18, 2016 at 12:00am — No Comments

Implementing data science projects – the five essential skill sets

The future of business, it is argued, is digital. At the core of this digital transformation is the ability to harness data in enabling better business decisions. Typically, organizations have teams of experts who work on existing data sets to apply diverse analytic tools and techniques to make sense of the data. The more statistically advanced among these teams work on typical ‘data science’ problems. Data science problems are where you need to apply sophisticated algorithms on large data…

Continue

Added by Archisman Majumdar on March 17, 2016 at 11:30pm — No Comments

The Future is What Happens When People Embrace Open Data

Open data is a thing, an idea, and an ideal. Open data is one of those "superhero words" alongside its cousins the Cloud and Big Data. I like to call them superhero words because they are supernatural forces that seemingly defy definition and can't be seen. Yet…

Continue

Added by Jay Gendron on March 17, 2016 at 7:07pm — 1 Comment

Mars Craters: An Interesting Stochastic Geometry Problem

Impact craters are distributed randomly on Mars and many other celestial bodies. Their radius most likely follow an exponential distribution. By estimating the mean of the exponential distribution in question, selecting 100 random locations, and determining how many lie in (at least) one crater, you can determine the age of the celestial body. 

This…

Continue

Added by Vincent Granville on March 17, 2016 at 5:00pm — 2 Comments

The Art of War Applied To Software Development

If you work in the software industry, it’s likely that you have heard about the divide and conquer design paradigm, which basically consists of recursively splitting a problem into two or more sub-problems (divide), until these become simple enough to be solved directly (conquer).

What you might not know is that this paradigm originates from an old political strategy (the name is derived from the Latin saying divide et impera) that suggests it is…

Continue

Added by Irina Papuc on March 17, 2016 at 4:30am — No Comments

Data Science Projects: behind the scene

Organizations empty a considerable measure of exertion into the Business Intelligence and Analytics (BI/A). In late 2013, Gartner anticipated that the significance of BI/An activities for CIOs will keep on developing great into 2017 and past. Notwithstanding all the exertion and consideration on investigation however, organizations are as yet attempting to succeed with Big Data. In a late 2014 review, it was found that just 27% of Big Data ventures succeed, while just 13%…

Continue

Added by Rai Mirrow on March 17, 2016 at 4:30am — No Comments

Data Scientist in a Machine ? Part I

       

This series is written for you the business user, CXO, business owner, who does not care for complex jargon but believes in seeing tangible impact on business. It is to introduce this novel concept of cognitive data-products that are set to storm the business world. I have lost count of the number of times where customer conversations veer towards “Yes; the algorithm sounds good but what decision can I take on the basis of this?”

           …

Continue

Added by Adurthi Ashwin Swarup on March 17, 2016 at 4:00am — No Comments

How the perfect Data Science workers look like

I know you are an examiner and all you think about is numbers. However, what separates a wonderful business expert from normal information examiner? It's their capability to comprehend business. You ought to attempt to comprehend business even before you take up your first venture. Here are a couple of things you should investigate:

  • Client data: Total number of dynamic clients, month on month client wearing down, fragments characterized by business on…
Continue

Added by Rai Mirrow on March 17, 2016 at 4:00am — No Comments

Weekly Digest, March 21

Starred articles are new additions or updated content, posted between Thursday and Sunday. The weekly digest has six sections: (1) Featured Resources and Technical Contributions, (2) Featured Articles and Case Studies, (3) From our Sponsors, (4) News, Events, Books, Training, Forum Questions, (5) Picture of the Week, and (6) Syndicated Content.

The full version is always published Monday.…

Continue

Added by Vincent Granville on March 16, 2016 at 3:00pm — No Comments

Analytics: Going Beyond the Numbers

In the years that MIT Sloan Management Review has been studying the effects of data analytics in companies, we’ve noticed the reality of its value and staying power—and how best to use it—has started to come into sharp focus. In a new research study we conducted with SAS, we determined that most companies are not prepared for the strategic changes required to achieve success with analytics. In fact, the key failing among…

Continue

Added by Deb Gallagher on March 16, 2016 at 11:30am — No Comments

Learn the Concept of linearity in Regression Models

This Tutorial talks about basics of Linear regression by discussing in depth about the concept of Linearity and Which type of linearity is desirable.

What is the meaning of the term Linear ?



In Linear Regression the term linear is understood in 2 ways -

  1. Linearity in variables
  2. Linearity in parameters…



Continue

Added by Shantanu Deo on March 16, 2016 at 4:30am — No Comments

Increasing Number of Computer Devices to Drive Global Internet of Nano Things market

A new market research report by Transparency Market Research, titled “Internet of Nano Things (IoNT) Market - Global Industry Analysis, Size, Share, Growth, Trends and Forecast 2014 – 2020,” provides a comprehensive analysis of the global market for the internet of nano things (IoNT). The study provides insights into the market’s overview, including the current market trends, drivers and restraints, product segmentation, major geographical segments, and competitive landscape…

Continue

Added by Alina John on March 16, 2016 at 3:00am — No Comments

CFP3 - The 10th International Web Rule Symposium (RuleML) 2016 Call for Papers

The 10th International Web Rule Symposium (RuleML) 2016 Call for Papers



Stony Brook, NY, 6-9 July, 2016



http://2016.ruleml.org/calls



Breaking News:



* Best papers of RuleML 2016 will be invited to submit a revised and extended version to the "Rapid Publications" category of the journal TPLP (Theory and Practice of Logic Programming).



* Best papers of RuleML 2016…

Continue

Added by Ольга Сушкова on March 16, 2016 at 1:34am — No Comments

Predictive Modelling and Precautions

Predictive analysis is a journey of refinement of data over time using a predictive model. Right from choosing the right model to refine your data to the amount of time and effort invested, there are many thing which may go wrong in your way. 

Predictive analytics is also the enabler of Big Data; businesses collect vast amounts of real-time customer data and predictive analytics uses this historical data, combined with customer…

Continue

Added by Abhishek Srivastava on March 16, 2016 at 12:00am — No Comments

Analytics in Aviation Industry

The aviation industry is a sector involving high cost and security concerns. Analytics in this sector has huge potential, as varied data can be collected at each touch point showcasing customer interests. Crucial factors such as weather forecast should be critically analyzed using sophisticated tools to ensure passenger safety. A lot of logistics complexity also lies straight from building an aircraft to safe take–off and landing. Also, since customers pay the highest prices in this form of…

Continue

Added by Tanmay Bhandari on March 15, 2016 at 7:30pm — No Comments

14 Timeless Reference Books

These books have been published and re-published in the last 10 year. Most of them are encyclopedias, yet they are extremely useful resources for the data science beginner or expert. Just like top restaurants, they come with a steep price. I did not include two encyclopedias (each with 10+ volumes) that sell for over $5,000 because I felt they were truly overpriced. Also, I have just published a new book, entitled…

Continue

Added by Vincent Granville on March 15, 2016 at 7:00pm — 1 Comment

Blog Topics by Tags

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service