Subscribe to DSC Newsletter

Featured Blog Posts (5,870)

Cloud Integration Issues

What's your cloud integration strategy? If you're like most IBM i shops, much of your data interchange is handled via good old EDI or flat file transfers. But the rapid spread of cloud services is hastening the move to more sophisticated forms of data and application integration and interchange. According to EXTOL which develops integration broker software for IBM i and other platforms, the day is fast approaching when companies will need new techniques for integrating cloud…

Continue

Added by Pearse William on December 7, 2011 at 11:01am — No Comments

Healthcare fraud detection still uses cave-man data mining techniques

The Washington Education Association (WEA, in Washington State) is partnering with Aon Hewitts (Illinois), a verification company, to eliminate a specific type of health insurance fraud: teachers reporting non-qualifying people as dependents, such as an unemployed friend with no health insurance. The fraud is used by "nice" people (teachers) to provide health insurance to people who would otherwise have none, by reporting them as spouse or kids.

Interestingly, I saw the letter sent to…

Continue

Added by Vincent Granville on December 7, 2011 at 5:16pm — No Comments

Big Data Analytics - Visualization Tools

Jaspersoft announced a second-generation native connector to MongoDB, an open source database. 10gen, the company behind MongoDB, and Jaspersoft have teamed together to deliver an enhanced tool for companies to provide easier reporting, analytics, and visualization of Big Data. Jaspersoft is a sponsor of the upcoming MongoSV, to be held in Santa Clara CA today, December 9, 2011 and will be showcasing the combined solution there.

Building on the popularity of the first generation…

Continue

Added by Stan Mason on December 9, 2011 at 7:00am — No Comments

Only One-Third of Companies Making Effective Use of Data

HOPKINTON, Mass., Dec. 5, 2011 -- EMC Corporation today unveiled the findings of the largest-ever global survey of the data science community. Spanning the United States, the United Kingdom, France, Germany, India and China, the EMC Data Science Study reveals and quantifies a rampant scarcity across the globe for the prerequisite skills necessary for a company to capitalize on the opportunities found at the intersection of Big Data and data analytics. Only one-third of companies are…

Continue

Added by Vincent Granville on December 12, 2011 at 8:33am — No Comments

Big Data might just mean Big Traffic

Big Data, housed in new and disruptive technologies, is expected to account for more than 50 percent of the world’s data in the next five years, according to a a new study. While it offers huge and untapped value, the inevitable result is stress and strain on the world’s Interent infrastructure as companies seek to manage this explosion of information.

The new study, released jointly by Internet Research Group and Infineta Systems a provider of WAN optimization systems, examines how…

Continue

Added by Stan Mason on December 14, 2011 at 7:35am — No Comments

Analysts' Joyful anticipation for 2012 - Cool Tools

This year has seen consolidation and engineering around improving the basic storage and data processing engines of NoSQL and Hadoop. That will doubtless continue, as we see the unruly menagerie of the Hadoop universe increasingly packaged into distributions, appliances and on-demand cloud services. Hopefully it won’t be long before that’s dull, yet necessary, infrastructure.

Looking up the stack, there’s already a line up of cool tools for data scientists and not to be left out,…

Continue

Added by Pearse William on December 15, 2011 at 8:46am — No Comments

Big Data Retention for Structured and Unstructured Data

Caringo announced a partnership to provide a solution for retaining and accessing structured data in Caringo object storage software powered by CAStor. The combination allows users to store increasing volumes of structured and semi-structured data from databases, logs and other machine-generated data sources along with unstructured content such as audio, documents, e-mails, images, and videos in CAStor.

The combined solution provides a horizontal platform that enterprises…

Continue

Added by Stan Mason on December 19, 2011 at 8:17am — No Comments

Social Networks: How to Measure User Engagement?

Here are a few interesting KPI's:

  • Average session duration
  • Proportion of active versus passive behavior (active = posting a blog, inviting a thread; passive = browsing, reading articles)
  • New members per week, with breakdown US vs. non US
  • Average number of groups per member
  • Average number of friends per member
  • Number of page views per user (per member) per month
  • Time to last visit (average; per user; per…
Continue

Added by Vincent Granville on December 20, 2011 at 8:17am — No Comments

New Zealand's Plan to Realize Big Data Opportunities

Informatica Corporation provider of data integration software, announced that Westpac Life New Zealand Limited, New Zealand’s largest full service financial institution, has standardized on the Informatica Platform to help power its strategic initiatives around Big Data.

This includes Westpac’s ambitious Insurance Project, a new enterprise business intelligence environment that is helping the…

Continue

Added by Pearse William on December 21, 2011 at 2:09pm — No Comments

Sales forecasts: how to improve accuracy while simplifying models?

The solution is simple: leverage external data, and simplify your predictive model.

Back in 2000 I was working with GE's analytic team to improve sales forecasts for NBC Internet, a web portal owned by NBC. The sales / finance people were using a very basic formula to predict next month sales, based mostly on sales from previous month. With GE, we started to develop more…

Continue

Added by Vincent Granville on January 3, 2012 at 10:54am — No Comments

Data Analytics for Smart Grid Development and Deployment

The smart grid is leading the power industry into a data and analytics boom as its implementation phase gives way to its value phase, according to Christine Richards, senior analyst with Energy Central's Utility Analytics Institute.

Richards defined the decade from 2000 to 2010 as the smart grid's development phase, the five-year period from 2007 to 2012 as the infrastructure, or implementation phase and the decade from 2010 to 2020 as the value phase.

In the value phase, grid…

Continue

Added by Stan Mason on January 4, 2012 at 8:17am — No Comments

Data Integration...Might be getting a bit easier thanks to SAP

According to the Gartner report, "Demand trends have given rise to new challenges in 2011. Fresh needs arising from contemporary challenges are presenting new opportunities in this market as buyers seek to address data integration as a critical aspect of a coherent information management capability, and to integrate disparate data sources (including emerging sources such as 'big data') and new data types into a cohesive and usable set of information."(1) SAP is helping its customers address…

Continue

Added by Pearse William on January 9, 2012 at 1:00pm — No Comments

5 Big Data Startups That Matter: Platfora, DataStax, Visual.ly, Domo, and Metamarkets

First, let me say how pleased I am to be a part of the Data Science Central community. While I was the VP of Marketing at EMC Greenplum, I had a front row seat to the dynamic and very exciting “Big Data” and “Data Science” shows. EMC Greenplum has played a pivotal role in helping organizations of all sizes start the process of orienting their businesses and decision-making around data. It has been said, and I’ll say it again –the companies that embrace the power of data and live that life…

Continue

Added by Richard Snee on January 10, 2012 at 7:34am — 1 Comment

Missing Values Imputation with Bayesian Networks

A New Approach to Missing Values Processing with Bayesian Networks

Download the white paper here (39.5 MB).

With the abundance of “big data” in the field of analytics, and all the challenges today’s immense data volume is causing, it may not be particularly fashionable or pressing to discuss missing values. After all, who cares…

Continue

Added by Vincent Granville on January 12, 2012 at 1:46pm — No Comments

Big Data in the Enterprise

Companies are capturing and digitizing more information than ever before. According to IDC, the world produced one zettabyte (1,000,000,000,000 gigabytes) of data annually. Fueling this data explosion are over five billion mobile phones, 30 billion pieces of content shared on Facebook per month, 20 billion Internet searches per month, and millions of networked sensors connected to mobile phones, energy meters, automobiles, shipping containers, retail packaging and more. Big Data is a…

Continue

Added by Stan Mason on January 16, 2012 at 9:30am — No Comments

News from the Wharton Customer Analytics Initiative

Wharton Customer Analytics Initiative
Continue

Added by Vincent Granville on January 20, 2012 at 7:32am — No Comments

Another idea to help Amazon increase ROI

When you do a book search on Amazon using keywords, in the search result page you see 16 results. If you've bought one (or more) of these books, it still shows up on the search result page. It would be easy to eliminate that book that you already purchased (e.g. by showing it at the bottom of the page), and replacing it by another book that you haven't purchased yet. Sounds to me like an obvious, simple optimization trick. Maybe Amazon's statisticians use very complex models that change…

Continue

Added by Vincent Granville on January 25, 2012 at 2:50pm — No Comments

Ooyala Video Index Report Provides Big Data Insight

I am a big fan of Ooyala, even though I cannot pronounce the name of the company properly! Here is a link to a video that has helped me with my pronunciation ailment. Ooyala Video

Ooyala has a great report available called "Video Index Report" where "analytics meets big data."  Executive Summary and key findings below:

Ooyala’s technology…

Continue

Added by Richard Snee on January 26, 2012 at 4:12pm — No Comments

DataScienceCentral.com Announces Content Partnership with Visual.ly to Expand Big Data Ecosystem

The online hub for big data practitioners also adds industry thought leader, Richard Snee, as advisor and guest editor

LOS ANGELES, Feb. 1, 2012 /PRNewswire via…

Continue

Added by Vincent Granville on February 1, 2012 at 8:50am — No Comments

[Job] Data Scientist, Analytics - Twitter (San Francisco, CA)

About this Job

We're looking for highly motivated individuals to help us extract meaning from Twitter's massive dataset. As a Data Scientist for the Analytics team, you'll use statistical analysis and data mining techniques to help us better understand how users engage with Twitter, determine whether new and experimental features should be launched, and measure Twitter's success across the entire organization. You should be passionate about finding insights in data and using…

Continue

Added by Vincent Granville on February 9, 2012 at 8:20am — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service