All Blog Posts (447)

What Do You Need to Know Before Getting Starting with Reporting Service?

The reporting tool is the software to present the table and statistics chart dynamically. It is the basic requirements for business, a basic means of running business, and the basis for boosting competitiveness. It is the most common tools for business personnel. However, using the traditional reporting tool requires a relatively stronger technical background. The average business personnel cannot handle it independently,…

Continue

Added by Jim King on March 11, 2013 at 11:53pm — No Comments

Unstructured Data Really Isn't

Unstructured Data Really Isn’t

Bradley S. Fordham, PhD (www.linkedin.com/in/drbradleyfordham)

The (ART+DATA) Institute

The term “unstructured data”, is truly an oxymoron.  All data has structure, and in fact most data has multiple structures…

Continue

Added by Zach Piester on March 10, 2013 at 2:18pm — No Comments

Automatically deskew before machine learning in R

I found it odd there was no way to automatically deskew data in R, so I wrote a short little function to do it.  It noticeably improves the peformance of linear models and linear support vector machines.
http://technicaltidbit.blogspot.com/2013/03/automatically-deskew-before-machine.html

Added by Michael Malak on March 9, 2013 at 2:00pm — 1 Comment

Unleashing intelligence through natural language (Part 1 - Autonomously generated questions about gaps in the knowledge)

This is the first of a series of blogs in which I will reveal and explain rules of intelligence contained within grammar, that can be utilized to unleash intelligence in software. These rules are extremely simple, but still undiscovered by scientists.



Current state:

Systems generating questions already exists. However, their questions are useless, because the original sentence - from which the question is derived - already holds to the answer.



A…

Continue

Added by Menno Mafait on March 8, 2013 at 11:30pm — No Comments

Big Data means to support Buzz Marketing actions

 

What is buzz marketing? In the strict sense of the term, buzz marketing is creating noise around a product, service, company or brand. For example you can recruit consumers, preferably proactive volunteers who influence their peers, and help them to try your products in good condition before pushing them to talk about their experience.



The buzz is one of the most powerful forces in the market, and knowing how to master this important marketing channel is critical. Word of…

Continue

Added by Michel Bruley on March 7, 2013 at 4:21am — No Comments

Open for Comment: Proposed "Data Science Code of Professional Conduct"

We are seeking comments and suggestions on a proposed "Data Science Code of Professional Conduct".

Data science is an independent profession. Data scientists have a higher calling than just technical skills. We have a duty to use data science to make life, business and government better.…





Continue

Added by Michael Walker on March 6, 2013 at 8:17pm — 2 Comments

Quick! Hire a Data Scientist! Not so fast, perhaps...

Contrary to the prevailing sentiments, I counsel patience with regards to hiring a data scientist. Firstly, I mean no disrespect to my data science colleagues, and many will likely agree with me because no one likes to enter an environment where sub-optimal results or failure are probable. Yes, it’s likely you need to add one or even a few data scientists to the team, but not as your first step into the wide, wide world of Big Data. Give me a few minutes and hopefully you’ll see it my…

Continue

Added by Alan Nugent on March 5, 2013 at 4:01am — No Comments

How to Integrate the Work of the Data Scientist into the Company's Strategy

This blog entry continues the topic of how a Data Scientist can convince colleagues to become more data driven.  The previous blog covered office politics.  This entry covers integration with the strategy and, more specifically, the process that creates the company's strategy.

Every company is unique and consequently so is its strategy process. At first glance, research on how companies develop strategies is complex and contradictory. There are simply too many ways to go about it:…

Continue

Added by Stephen Penn, DM, PMP on March 5, 2013 at 2:40am — 2 Comments

Weekly Digest - March 4

Sponsored

Featured on…

Continue

Added by Vincent Granville on March 3, 2013 at 2:30pm — No Comments

PAW San Francisco Packs a Punch with Keynotes in 2013

This year, Predictive Analytics World San Francisco (April 14-19) features an incredible agenda filled with awesome keynotes and 35 keynotes from leading organizations..

Check out some of the headliners at PAW SF this year:…

Continue

Added by Vincent Granville on March 3, 2013 at 1:00pm — No Comments

SQL Visualization in the Spreadsheet

SQL is a database query and programming language for retrieving, updating, and managing the data from relational database. SQL was certified to meet ANSI in 1986, and became an international standard in 1987. Nowadays, SQL becomes a basic requirement for every programmer. However, the advantages cannot obscure the disadvantage. SQL is especially designed for technical personnel. SQL syntax is highly abstract, the logic is hard to understand, and only those with strong technical background…

Continue

Added by Jim King on February 28, 2013 at 6:00pm — No Comments

EMC Greenplum Brings SQL to Hadoop | Forbes

By Ben Woo, Forbes Contributor.…
Continue

Added by Vincent Granville on February 28, 2013 at 3:12pm — No Comments

Spark, Shark and Mesos Data Analytics Stack

The Berkeley Data Analytics Stack (BDAS) is an open source, next-generation data analytics stack under development at the…

Continue

Added by Michael Walker on February 27, 2013 at 10:08am — No Comments

The Data Supply Chain and Master Data Management

The recent TDWI Keynote by Evan Levy…

Continue

Added by Michael Walker on February 21, 2013 at 9:00am — No Comments

Truth behind Ticket Purchasing Rush: Statistical Analysis Works

Getting train tickets is always very difficult during holiday, especially in Spring Festival, the most important Chinese holiday for family reunions. The transport peak spans from Jan. 26, 2013 to March 6. During this period, buying tickets at railway station or from the official website of MOR (China’s Ministry of Railways) or through phones seem to become an extravagant expectation. What are the reasons? How to take effective measures?…



Continue

Added by Jessica Qiu on February 20, 2013 at 11:56pm — No Comments

Weekly Digest - Feb 18

Sponsored

Continue

Added by Vincent Granville on February 17, 2013 at 9:00am — No Comments

30 great articles from top news outlets

Vincent's selection for the last 30 days. Enjoy the reading, share with colleagues!

  1. Big Data: Making Complex Things Simpler…
Continue

Added by Vincent Granville on February 16, 2013 at 3:36pm — No Comments

Google Spanner : The Future Of NoSQL

Quite often, while working with Hbase, I used to feel how cool it would be to have a database that can replicate my data to datacenters across the world consistently. So that I can take the pleasure of global availability and geographic locality. And also which will save my data even in case of some catastrophe or…

Continue

Added by Mohammad Tariq Iqbal on February 15, 2013 at 9:21am — No Comments

New York City’s $15M Big Data Partner

New York City Mayor Michael Bloomberg in a recent press conference detailed a new $15M partnership  to tackle the increasing volume of big data to increase New York’s capacity for applied sciences and potential for economic growth.  The agreement will create 440,00 square feet of new space dedicated to big data education and training, as well as the addition of 75 big data research specialists.

Dr. Kathleen McKeown and Dr. Patricia Culligan were selected to lead the new…

Continue

Added by J. David Morris on February 15, 2013 at 7:04am — No Comments

Follow us

© 2013   Data Science Central

Badges  |  Report an Issue  |  Terms of Service