I am excited to find this nice website. I'm an engineer focusing on Big Data (Hadoop/Hive/HBase/Spark/Shark, etc.). I have found a lot of resources on this website. I'd like to learn more and share with others here.
Added by Zhu Guangbin on June 28, 2013 at 8:38pm — No Comments
This could be a good project in connection with our data science apprenticeship. Two websites, GreatSchools.org and Zillow.com, provide two great indicators: school scores for more than 100,000 schools…Continue
Added by Vincent Granville on June 28, 2013 at 8:00pm — No Comments
Note: The domain RonCloud.com seems dead now. Read the comment section below for alternatives.
You have to see it to believe it. Go to roncloud.com and you can enter R commands in the browser-embedded console. Wondering how easy it would be…Continue
Added by Vincent Granville on June 27, 2013 at 10:00am — No Comments
Added by Vincent Granville on June 20, 2013 at 5:00pm — No Comments
Before building a predictive model, you have to prep the data, and there is plenty of room for mistakes in both phases of the modeling process. After building many predictive models in the Rapid Insight office and helping our customers build many more models outside of the office, I have a mental list of data preparation mistakes that could fill a room. Here are some of…Continue
New genres of Captcha have arrived that truly unleash the power of Captchas. These Captchas enable powerful advertising via 3D and video Captchas. Captchas can also be used for analytics and sentiment analysis in a wide range of domains such as advertisements, news, art, places, movies, and sports. Measure the effectiveness of your ads via sentiment analysis to predict your advertising results! All that effort on branding, messages, and color schemes can finally be validated! These…Continue
Added by Suhas Aggarwal on June 20, 2013 at 1:51am — No Comments
Added by Michael Walker on June 19, 2013 at 9:39am — No Comments
My new blog post on what I have coined "sparkgrams". Included is an implementation in YUI3 for custom website presentations of data, but I wish R and IPython Notebook had similar functionality.
Added by Michael Malak on June 18, 2013 at 5:17am — No Comments
Or something else. I could not resist posting this chart:
How can organizations use data visualization, visual analytics, and visual data discovery to improve decision-making, collaboration, and operational execution? We present three key insights from the latest TDWI research.
Guest blog by …Continue
Imagine being able to do anything your mind can imagine. Well, now you can.
The 6th Normal Form is not a TERM. It is a GOAL: the "Holy Grail", if you will, of data management and, more importantly, "Information" management. Imagine being able to store IDEAS as opposed to disconnected bits of data.
A brief "INCORRECT" comment on WIKI: "A relvar R [table] is in sixth normal form (abbreviated…
Interesting topic posted by InfiniteLoop on PerlMonks, back in 2008!
Added by Vincent Granville on June 15, 2013 at 3:30pm — No Comments
Next-best offer refers to the use of predictive analytics solutions to identify the products or services your customers are most likely to be interested in for their next purchase.
To explore this topic, I did some personal research and put together a synthesis, which helped me clarify some ideas. The attached presentation is not intended to be exhaustive on the subject, but could perhaps bring you some useful insights:…Continue
Added by Michel Bruley on June 14, 2013 at 3:22am — No Comments
"They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety." – Benjamin Franklin…
Added by Michael Walker on June 12, 2013 at 3:30pm — No Comments
Added by Vincent Granville on June 12, 2013 at 12:00pm — No Comments
I have two databases on Microsoft SQL Server (daily business activities performed), and also databases on Peachtree and on Orange human resources software. I want to build a data warehouse with these databases. My questions are:
i. Where can I integrate all these databases together?
ii. After I integrate, how can I mine this data?
iii. What is the best software to use and mine this data?
iv. Can combining all these databases produce insight…Continue
Added by Adetula Oluwabunmi on June 12, 2013 at 4:51am — No Comments
There's a lot of talk these days about how governments use all the data they can get their hands on to monitor every individual in the world. The capabilities offered by big data storage and analytic processing are immense in the hands of professional, capable data scientists. Last week the National Security Agency was under the spotlight; a month ago it was the IRS (Internal Revenue Service) for a biased auditing …Continue