What is the profession of data science really about? How does one best become a data scientist or grow a career as one? What does the Data Science Central community think about these questions? (Please chime in!)
We’ve all read about the shortage of data scientists from McKinsey, heard about…Continue
Very interesting list of algorithm, data science, machine learning, and computer science keywords. To check the definition for any keyword, go to xlinux.nist.gov. For whatever reasons, and like in many similar lists, the top three letters have more entries than subsequent letters, as if the editor suddenly became lazy when hitting letter D (maybe product developers create products that start with letter A, B, or C, to show up at the…Continue
Added by Mirko Krivanek on April 23, 2015 at 5:00pm — No Comments
The key asset of any company is its customers. It is therefore very important to identify their needs and preferences as well as to know the factors affecting their behavior. The collected customer data allows predicting customer behavior and creating appropriate marketing offers, sales plans, and retention programs that match customers’ needs.
Data mining tools are used to create models that predict customer behavior by using historical data. These methods can be…Continue
Added by Algolytics on April 23, 2015 at 5:00pm — No Comments
Broken down in eight categories.
1. Algorithms and Data Structures
Added by Mirko Krivanek on April 23, 2015 at 12:30pm — No Comments
We are living in a world dominated by data. Data overload and fatigue is drowning most corporations and individuals. There was a time when we had an opinion and we would speak our mind to our friends and families. Now we instantaneously write our minds on Facebook, Instagram, Twitter and LinkedIn. Everyone has a strong opinion and we don't hesitate to take our opinions public. In the last five years, we have generated as much as data as we have never done…Continue
Added by Rupen Shah on April 23, 2015 at 7:30am — No Comments
Retailers know they need Big Data and are charging forward to get in the game. But many retailers continue to face challenges. What type of data should be collected? How should the data be used to generate insights? How do I measure ROI?
101data recently surveyed US retailers, across a range of sizes. When asked about which processes would be most impacted by Big Data technology, 50% stated that targeted offers and promotions would have the biggest…Continue
Added by Larisa Bedgood on April 23, 2015 at 7:30am — No Comments
Added by Khosrow Hassibi on April 22, 2015 at 11:00am — No Comments
Creating interactive visualizations of your data for web is a cakewalk using silk.co, all you need to do is to import your spreadsheet and start generating your interactive visualizations
You need to sign in to start creating visualizations on silk.co. If you dont have an account, signing up is easy. All you need is a name, an email id and a password, and the account is created immediately
Add your data…Continue
RethinkDB is an open source noSQL database that stores JSON documents. This can be great for open ended data analytics. The company officially provides drivers for Ruby, Python and NodeJS and community supported drivers and ORMs are available in around a dozen languages.
The production ready version 2.0 was released very recently on April…Continue
Added by Pansop on April 21, 2015 at 2:30am — No Comments
The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday.
Added by Vincent Granville on April 20, 2015 at 9:30am — No Comments
It’s happened to all of us sooner or later: The hypothesis seemed plausible, the data was clean, the conclusions sound. Our recommendation was damn near foolproof. Yet when put into practice, the result was anything but favorable.
How could that happen? Data science is about, well, DATA. And science, which implies a reliable method. We have the information, we have the models, we aren’t just shooting in the dark here. Where did it go wrong?
The truth is there are lots of things…Continue
Most of us would probably like to work in a profession recognised for its legality, decency and honesty. At least I hope so. In my line of work, what we have right now is palpable evidence that the IT industry lacks a reliable moral compass.
Imagine this. A major sensationalist tabloid pulls together a team of diverse journalists who are set to work on…Continue
Added by Martyn Jones on April 19, 2015 at 10:30pm — No Comments
This was a great question posted on Quora.com, and attracted many comments. Here we summarize the must interesting contributions for you.
Source for picture: …Continue
Many data set resources have been published on DSC, both big and little data. Some associated with our data science apprenticeship. A list can be found here. Below is a repository published on Github, originally posted here. …Continue
So there’s a lot of data out there. Now what? How is that going to make a difference to you? That depends on what you want it to do, how many resources you have to devote, and how much effort you want to put into maintaining your data driven approach over the long term. Make no…Continue
Added by Anne Russell on April 19, 2015 at 8:58am — No Comments
Interesting Infographics received from Verbalink, regarding a complex machine learning problem: automated translations.
Added by Vincent Granville on April 18, 2015 at 3:00pm — No Comments
There are some things that are so big that they have implications for everyone, whether we want them to or not. Big Data is one of those concepts, and is completely transforming the way we do business and is impacting most other parts of our lives.
It’s such an important idea that everyone from your grandma to your CEO needs to have a basic understanding of what it is and why it’s important.…Continue
Some companies really get big data. Not only do they realise size matters – they understand you also have to know what to do with it. Here’s a list of seven companies I think are at the top of the game, when it comes to cutting-edge use of data to strategically achieve business goals. If you run a business yourself and are interested in big data projects, there is…Continue
"Measurement owes its existence to Earth; estimation of quantity to measurement; calculation to estimation of quantity; balancing of chances to calculation; and victory to balancing of chances." - Sun Tzu, The Art of War (Translated by L. Giles)
The quote from Sun Tzu seems to suggest how a military leader gathers data; adapts to different situations; and makes decisions weighing the circumstances. It says that the balancing of chances depends on "calculation." I…Continue
This guide was originally posted on the AYLIEN Blog. It was written as a how-to guide for using RapidMiner and AYLIEN to scape and Analyze online content.
One of the major challenges with mining the Web and Social Media for insights is trying to get all of your…Continue
Added by Mike Waldron on April 17, 2015 at 5:17am — No Comments