In writing this book my primary aim is to guide the beginner to the essential elements of associations and correlations and avoid the most common pit-falls.
Here, I discuss a holistic method of discovering the story of all…
Added by William Vorhies on February 17, 2016 at 8:09am — 2 Comments
AI was very popular 30 years ago, then disappeared, and is now making a big come back because of new robotic technologies: driver-less cars, automated diagnostic, IoT (including vacuum cleaning and other household robots), automated companies with zero employee, soldier robots, and much…
ContinueAdded by Vincent Granville on February 16, 2016 at 9:30pm — 6 Comments
Summary: It’s official, Data Scientist is the number 1 best job in America.
It’s official, Data Scientist is the number 1 best job in America. Glassdoor, the site that rates employers and gives job seekers the inside scoop on what to expect just published its 2016 results and Data Scientist is number one.
We got a job score of 4.7…
ContinueAdded by William Vorhies on February 16, 2016 at 11:58am — 2 Comments
By Greta Roberts.
It’s exciting to watch advances in predictive and prescriptive employee solutions. A public HR technology company recently announced the release of an application enabling employers to “identify which employee is likely to quit, and what options need to be considered to retain that…
ContinueAdded by Mike Kennedy on February 16, 2016 at 4:00am — No Comments
NoSQL also know as not only SQL (structured query language) is a tool used for data design and management of large quantity of distributed data. NoSQL database comprise a wide range of architecture and technologies designed to overcome problems of scalability and big data performance problems.…
ContinueAdded by Aman on February 16, 2016 at 3:00am — No Comments
I recently shared an article on how airlines are using information to individually price airfares and extract maximum value from passengers which was thought provoking and insightful for many airline industry and big data…
ContinueAdded by Mark Ross-Smith on February 16, 2016 at 2:00am — 1 Comment
Added by Patnab on February 16, 2016 at 1:48am — 5 Comments
In this blog I will be covering the basic understanding of a data science problem. To begin with I think before starting with any data science problem you should know about 2 key things:
Let’s look at the each of the…
ContinueAdded by Sharayu Rane on February 15, 2016 at 11:30pm — 5 Comments
Reading the academic literature Text Analytics seems difficult. However, applying it in practice has shown us that Text Classification is much easier than it looks. Most of the Classifiers consist of only a few lines of code.In this three-part blog series we will examine the three well-known Classifiers; the Naive Bayes, Maximum Entropy and Support Vector Machines. From the…
ContinueAdded by Ahmet Taspinar on February 15, 2016 at 10:00pm — No Comments
In the last post, we talked about how the open sourcing of machine learning algorithms and hardware architecture gives rise to the latest phenomenon of “Data is king”. As companies compete to get developers to use their libraries, we continue to see this ongoing arms race like trend in open sourcing. In addition, we are also seeing how complicated the big data landscape is becoming. This is the link to the most recent …
ContinueAdded by Srividya Kannan Ramachandran on February 15, 2016 at 4:00am — No Comments
Originally written by Nigel Higgs on LinkedIn Pulse.
We who have been in the data sphere a while and in and around Data Governance will have seen the pitch-decks, watched the webinars, read the blogs and attended the conferences. Some of us will have…
ContinueAdded by Zygimantas Jacikevicius on February 15, 2016 at 3:00am — No Comments
One of the hot topics on Machine Learning is, with no doubts, feature engineering. In fact, it comes before the buzz on this topic, simple when we talk about Data Mining. Remembering the CRISP-DM process, feature engineering (and, consequently, feature selection) is the core of a great data mining project – it comes to life on the Data Preparation phase, that is the task to have constructive data preparation operations such as the production of derived attributes or entire new records, or…
ContinueAdded by Leandro Guerra on February 15, 2016 at 12:30am — No Comments
Feature selection is one of the core topics in machine learning. In statistical science, it is called variable reduction or selection. Our scientist published a methodology to automate this process and efficiently handle la large number of features (called variables by statisticians). Click here for details.
Here, we mention an article published by Isabelle Guyon…
ContinueAdded by L.V. on February 14, 2016 at 4:00pm — No Comments
This is the first in a series about cross format data modeling principles.
In the data modeling realm, there is perhaps no example that is as ubiquitous as modelling personal names. After all, things don’t get much simpler than a name:
Simple, right? Well, not so fast. This isn’t really a model, but rather an instance of a model - an actual example that proves out the model. There are, however, a number of ways that…
ContinueAdded by Kurt A Cagle on February 14, 2016 at 1:19pm — No Comments
This article focuses on cases such as Facebook and protein interaction networks. The article was written by By Paul Scherer (paulmorio) and submitted as a research paper to HackCambridge. What makes this article interesting is the fact that it compares five clustering techniques for this type of problems:
Added by L.V. on February 13, 2016 at 8:00am — No Comments
Added by Eduardo Siman on February 13, 2016 at 3:21am — No Comments
Guest blog post.
Associations and correlations are perhaps the most used of all statistical techniques. As a consequence, they are possibly also the most mis-used.
In writing this book my primary aim is to guide the beginner to the essential elements of associations and correlations and avoid the most common pit-falls.
Here, I discuss a holistic method of discovering the story of all…
Added by Vincent Granville on February 12, 2016 at 7:00pm — 2 Comments
From querying your data and visualizing it all in one place, to documenting your work and building interactive charts and dashboards, to running machine learning algorithms on top of your data and sharing the results with your team, there are very few limits to what one can do with the Jupyter + Redshift stack. However, setting everything up and resolving all the package dependencies can be a painful experience.
In this blog post I will walk…
ContinueAdded by Yevgeniy Slutskiy Meyer on February 12, 2016 at 1:30am — No Comments
Ember Data (a.k.a ember-data or ember.data) is a library for robustly managing model data in Ember.js applications. The developers of Ember Data state that it is designed to be agnostic to the underlying persistence mechanism, so it works just as well with JSON APIs over HTTP as it does with streaming WebSockets or local IndexedDB storage. It provides many of the facilities you’d…
ContinueAdded by Irina Papuc on February 11, 2016 at 5:30am — No Comments
A data-driven organization will use the data as critical evidence to help inform and influence strategy. To be data-driven means cultivating a mindset throughout the business to continually use data and analytics to make fact-based business decisions. Becoming a data-driven organization is no longer a choice, but a necessity. Making decisions based on data-driven approaches not only increases the accuracy of results but also provides consistency in how the results are interpreted and fed…
ContinueAdded by Raghavan Madabusi on February 11, 2016 at 5:00am — No Comments
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
1999
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles