Subscribe to DSC Newsletter

All Blog Posts Tagged 'Big' (284)

Big Data Platforms as a Service

Big Data Platforms as a Service (PaaS) lets an organization take advantage of a service providers compute power, analytical tools, store as much data as needed and pay only for resources used. Data…

Continue

Added by Michael Walker on January 2, 2013 at 8:46am — 1 Comment

Big Data Analytics Infrastructure

Recent surveys suggest the number one investment area for both private and public organizations is the design and building of a modern data warehouse (DW) / business intelligence (BI) / data analytics architecture that provides…

Continue

Added by Michael Walker on December 26, 2012 at 8:11am — 2 Comments

Seek the grail up the Knowledge Pyramid, not down

In following the big data 'buzz' and trends, it appears that there is a disconnect between our analytical goals (i.e., the types of questions our customers are trying to answer) and the computational substrate on which we build in order to answer them.

NoSQL technologies, while being far more scaleable than relational databases, are fundamentally a 'data level'…

Continue

Added by John Fairweather on December 10, 2012 at 10:42am — No Comments

The Big Data Analytics Landscape

As a technologist and evangelist working in the big data marketplace it is certainly exciting. I am excited by the new products we are bringing to market and how this new functionality really helps to bridge the gap for Enterprises adoption. It is also surreal, in terms of the number of blog posts, tweets on Big Data and there seems to be a new big data conference cropping up on a weekly basis across Europe :-)



It is interesting to monitor other vendors in the…

Continue

Added by Donal Daly on December 10, 2012 at 7:31am — No Comments

8 TESTS TO DECODE BUSINESS ACCUMEN OF A DATA SCIENTIST

A data scientist at Flutura has to wear multiple hats in order to deliver next generation analytical solutions in the sectors we operate in namely energy, telecom, digital and health care industry. In order to do that he/she has to wear 3 hats

-         The BUSINESS  hat

-         The MATH hat

-         The DATA hat

Most of the time it’s easy to fathom the depth of the data scientists math / algorithmic knowledge and the depth of…

Continue

Added by derick.jose on December 8, 2012 at 9:41pm — No Comments

What Big Data means at LinkedIn?

LinkedIn was founded in 2003, is currently revenue 243 million and employs 1797 people. This is not what we call a large company. However, LinkedIn has 175 million members in 200 countries including 50% outside the U.S., two new members join the network every second, and analysts said that all "executive" of the Global 500 are members. Under these conditions, LinkedIn is facing a high volume of data to process. Indeed their information system must support 2 billion a year of research carried…

Continue

Added by Michel Bruley on December 3, 2012 at 3:16am — No Comments

Data Veracity

 

Data Veracity, uncertain or imprecise data, is often overlooked yet may be as important as the 3 V's of Big Data: Volume, Velocity and…

Continue

Added by Michael Walker on November 28, 2012 at 3:00pm — No Comments

S3 as Input or Output for Hadoop MR jobs

How to use s3 (s3 native) as input / output for hadoop MapReduce job. In this tutorial we will first try to understand what is s3, difference between s3 and s3n and how to set s3n as Input and output for hadoop map reduce job. Configuring s3n as I/O may be useful for local map reduce jobs (ie MR run on local cluster), But It has significant importance when we run elastic map reduce job (ie when we run job on cloud). When we run job on cloud we need to specify storage location for input as…

Continue

Added by Rahul Patodi on November 11, 2012 at 8:00am — No Comments

Hadoop:- A soft Introduction



What is Hadoop:

Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System and of MapReduce. HDFS is a highly fault-tolerant distributed file system and like…
Continue

Added by Rahul Patodi on November 11, 2012 at 8:00am — No Comments

32 Big data "gotchas" from the trenches in exactly 3 words



1.Usecase ! Usecase ! Usecase



2.Decode intent proxies!



3.Think "20-100 X Scalability" blindspots



4.Actions not insights



5.Frame unanswered questions !



6.Embedd MachineLearning processes !



7.Humanize analytical output !



8.Ingest unstructured data



9.Quantify $ impact !…

Continue

Added by derick.jose on October 9, 2012 at 9:54pm — No Comments

Big Data Analytics Maturity Model

See: http://bit.ly/KQ1bsS

Added by Michael Walker on October 3, 2012 at 10:18am — 1 Comment

Big Data: Opinions & Sentiments Analysis

Analyzes of texts put lights in two main types of information “facts and opinions”. Most current treatment methods of textual information aim to extract and use factual information, this is the case for example of research we do on the web. Analysis of opinions is concerned about feelings and emotions expressed in the texts, it has grown much today because of the space taken from the web in our society, and the very large volume of daily comments expressed by consumers with the advent of the…

Continue

Added by Michel Bruley on October 2, 2012 at 10:43pm — 5 Comments

ACM Data Mining Talk: Representing Predictive Solutions with PMML

Talk on PMML and Predictive Analytics to the ACM Data Mining Bay Area/SF group at the LinkedIn auditorium in Sunnyvale, CA.



Abstract: 



Data mining scientists work hard to analyze historical data and to build the best predictive solutions out of it. IT engineers, on the other hand, are usually responsible for bringing these solutions to life, by recoding them into a format suitable for operational deployment. Given that data mining scientists and engineers…

Continue

Added by Alex Guazzelli on October 2, 2012 at 8:12am — No Comments

Big Data Vendor Landscape

Big Data Vendor Landscape

Companies, products, and technologies included in the Big Data Landscape:

-…

Continue

Added by Michael Walker on August 30, 2012 at 2:58pm — No Comments

Hadoop Technology Stack

The Hadoop stack includes more than a dozen components, or subprojects, that are complex to deploy and manage. Installation, configuration and production deployment at scale is challenging.

The main components…

Continue

Added by Michael Walker on August 22, 2012 at 9:40am — No Comments

11 Core Big Data Workload Design Patterns

As big data use cases proliferate in telecom, health care, government, Web 2.0, retail etc there is a need to create a library of big data workload patterns. . We have created a big data workload design pattern to help map out common solution constructs. There are 11 distinct workloads showcased which have common patterns across many business use cases.

  1. Synchronous streaming real time event sense and respond workload
  2. Ingestion of High velocity…
Continue

Added by derick.jose on August 13, 2012 at 10:51pm — No Comments

Interesting big data opportunity

Interested in using your skills for a good cause, a great challenge and a $25,000 prize? How about predicting the future of ALS patients?

 

 Prize4Life is proud to announce the launch of a computational challenge to predict the future progression of disease in Amyotrophic Lateral Sclerosis (ALS), also known as Lou Gehrig’s disease. The …

Continue

Added by Neta Zach on August 13, 2012 at 2:52pm — No Comments

Which IT infrastructure for Big Data?

A Big Data decision support system requires particular capabilities in terms of volume, variety of data and processing speed.



Today companies to improve their knowledge models and forecasts, do not hesitate to take into account hundreds of factors, and do not hesitate to bring up new means of analysis that can handle large volumes of data. But the processing of large volumes of data is a challenge for traditional BI infrastructure. Storing large volumes is not a problem, but…

Continue

Added by Michel Bruley on July 31, 2012 at 11:03pm — No Comments

3 Game Changing Big Data Use Cases in Telecom

Like many industries the Infrastructure/Security/Compliance function within large telecom companies is becoming more data driven. Here are 3 powerful use cases which vividly bring out new possibilities in Telecom big data

 

Telecom use case-1 : Contact centre text mining and Telecom Bandwidth throttling…

Continue

Added by derick.jose on July 17, 2012 at 2:11am — No Comments

Data Scientists... and the Rest of Us

data-scientist-banner

Recently, I’ve been feeling like I’ve stepped through a looking glass to another similar-but-very-different world. I’m steeped in 20+ years in corporate data warehousing and business intelligence practice. Throughout that time, there have been big and small technology improvements, but nothing truly disruptive (although new…

Continue

Added by Stan Mason on April 13, 2012 at 8:30am — No Comments

Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service