All Blog Posts Tagged 'Big' (175)

9 Key Benefits of Data Lake

A Data Lake has a flexible definition. To back up this statement, the dataottam team took the initiative and released an eBook called “The Collective Definition of Data Lake by Big Data Community”, which contains many definitions from various business-savvy people and technologists.

In a nutshell, a Data Lake is a data store and data processing system, where an…

Added by Kumar Chinnakali on January 28, 2016 at 6:30pm — No Comments

Big Data Helps Businesses Identify Prospective Customers and Increase Sales

Big data is a term that has attracted the attention of nearly every senior manager in today's digital arena, as it offers them a treasure trove of prospective leads. A massive amount of data is available through websites, CRM solutions, social networks, analytics and news feeds. But this data comes in structured, semi-structured and unstructured forms, and therefore requires sophisticated tools to make…

Added by Brian Jones on January 25, 2016 at 2:00am — No Comments

Celebrate the Big Data Problems – #2

How to identify the number of buckets for a Hive table while executing HiveQL DDL?

The dataottam team has come up with a blog-sharing initiative called “Celebrate the Big Data Problems”. In this series of blogs we will share our big data problems using the CPS (Context, Problem, Solutions) framework.

Context:

Bucketing is another…

Added by Kumar Chinnakali on January 21, 2016 at 7:41pm — No Comments

Self-Learn Yourself IoT in 21 Blogs – #1

In this blog we will look at what IoT is, why we need it, and its significance and impact on modern life.

Time to greet the new clone that is set to rule the world!

Hello all! This is my first blog for dataottam, so I welcome your valuable feedback, which in turn will help me deliver better!

Alright, what is the Internet of Things (IoT)? How does it differ from the Internet of Everything? What is M2M?

All…

Added by Kumar Chinnakali on January 21, 2016 at 8:30am — No Comments

Increasing role of Analytics in Sports

The most popular example of the application of analytics in sports is from the Hollywood movie Moneyball, which is based on the true story of a baseball general manager who assembles a stellar team despite having a very limited budget. With the help of an economics graduate, he uses data models to identify players who have the potential to perform outstandingly but are undervalued in the market. Fast forward to today, and this has become a best practice in team building across nations,…

Added by Tanmay Bhandari on January 18, 2016 at 3:30pm — No Comments

Celebrate the Big Data Problems – #1

Every day we face many big data problems in production, in PoCs, and elsewhere. Do we have a common repository to collect and share them? No, as far as we know, we don’t. As always, dataottam is looking forward to sharing the…

Added by Kumar Chinnakali on January 15, 2016 at 11:30pm — No Comments

Just 3 clicks to get your Apache Hadoop installed!

Big Data is a problem statement, and it can be solved with tools such as Apache Hadoop. But setting up Apache Hadoop as infrastructure for proofs of concept and proofs of value is a little challenging. Hence we came up with a 3-click approach to get your Apache Hadoop installed.

What are the prerequisites?

  • Ubuntu 14.04
  • Internet Connection

Can I have the Script? Yes

How…

Added by Kumar Chinnakali on January 12, 2016 at 9:53am — No Comments

Self-Learn Yourself Apache Spark in 21 Blogs – #4

In Blog 4, we will look at Apache Spark Core and its ecosystem, and at Apache Spark on the AWS cloud. Click for a quick read of blogs 1-3 in this learning series.

Apache Spark has many components, including Spark Core, which is responsible for task scheduling, memory management, fault recovery, and interacting with storage…

Added by Kumar Chinnakali on January 12, 2016 at 8:00am — No Comments

Self-Learn Yourself Apache Spark in 21 Blogs – #3

In Blog 3, we will look at Apache Spark’s history and its unified platform for big data. You may also like a quick read of blog 1 and blog 2.

Spark was started by Matei Zaharia at UC Berkeley's AMPLab in 2009 and open sourced in 2010…

Added by Kumar Chinnakali on January 9, 2016 at 9:00pm — No Comments

All About Data Science And Big Data

Data Science is the system used to extract insights from data that’s mined from various sources. Using various techniques including predictive modeling, Data Science helps to analyze and interpret vast amounts of data. The people who apply Data Science to manage large amounts of…

Added by Vaishnavi Agrawal on January 8, 2016 at 11:30pm — No Comments

Self-Learn Yourself Apache Spark in 21 Blogs – #2

In this blog we will share the titles in this Apache Spark learning series, the basics of Hadoop, which is one of the big data tools, and the motivations for Apache Spark, which is not a replacement for Apache Hadoop but its companion in big data.

Blog 1 – Introduction to Big Data

Blog 2 – Hadoop, Spark’s Motivations

Blog 3 – Apache Spark’s History and Unified Platform for Big Data

Blog 4 – Apache Spark’s First Step – AWS, Apache Spark

Blog 5 – Apache Spark Languages with basic…

Added by Kumar Chinnakali on January 8, 2016 at 9:00pm — No Comments

5 Reasons Apache Spark is So Awesome

Those who follow big data technology news probably know about Apache Spark, and how it’s popularly known as the Hadoop Swiss Army Knife. For those not so familiar, Spark is a cluster computing framework for data analytics designed to speed up and simplify common data-crunching and analytics tasks. Spark is certainly creating buzz in the big data world, but why? What’s so special about this…

Added by Ritesh Gujrati on January 8, 2016 at 2:30am — No Comments

The Collective Definition of Data Lake by Big Data Community

The term Data Lake has been gaining popularity recently, as most enterprises have incorporated it into their analytics software. Every word and phrase used to describe the Data Lake has given us useful information about how we interpret it.

So we at dataottam decided to understand the various ways a Data Lake could be defined. We conducted a survey and found very interesting thoughts, words and phrases used for defining Data Lake, from developers to founders, to…

Added by Kumar Chinnakali on December 30, 2015 at 4:00am — 4 Comments

Self-Learn Yourself Apache Spark in 21 Blogs - #1

We have received many requests from friends who regularly read our blogs to provide them with a complete guide to sparkle in Apache Spark. So we have come up with a learning initiative called “Self-Learn Yourself Apache Spark in 21 Blogs”.

We have drilled down into various sources and archives to provide a perfect learning path for you to understand and excel in Apache Spark. These 21 blogs, which will be written over time, will be a complete guide for you to understand and…

Added by Kumar Chinnakali on December 30, 2015 at 3:00am — No Comments

Global Big Data Market to Develop Rapidly by 2018

The global big data market is expected to develop rapidly by 2018, with a major contribution from the North American regional market. In the past few years, there has been growth in ‘big data’, which is generated in several sectors across the globe. Growth in the amount of data has led to the development of…

Added by James White on December 20, 2015 at 8:30pm — No Comments

Where & Why Do You Keep Big Data & Hadoop?

I am back! Yes, I am back on my learning track. Sometimes it is really necessary to take a break and reflect on why we learn before learning. Ah! It was a 9-month safe refuge to learn how Big Data & Analytics can contribute to a Data Product.

DataLake

Data strategy has always been expected to generate revenue. As Big Data and Hadoop enter the enterprise data strategy, big data infrastructure is also expected to add revenue.…

Added by Manish Bhoge on December 12, 2015 at 9:53am — No Comments

The Role of Big Data Analytics in the Petabyte Age

The data flood that you are witnessing every minute is trying to tell you hidden secrets about your business growth. Are you listening? In previous years, we may have taken a glass-half-empty view of the analytics sector, but this is definitely about to change. Big data analytics is indeed one of the fastest growing markets and is expected to mature in 2016 and…

Added by Aureus Analytics on December 3, 2015 at 9:30pm — No Comments

6 Often Forgotten ROI Factors In Data Analytics



Added by David Lefkowich on December 2, 2015 at 8:30am — No Comments

The Collective Definition of Data Lake by Big Data Community

Yes, we are marching towards New Year 2016! What happened to the resolutions of 2014 and 2015? Quit habits? Practice habits? The road ahead? I was into all of them, but I was not able to keep them up. Hence, for New Year 2016 there are no more resolutions, just implementing the plan.

Beyond that, as we know, big data is bringing more business value to the enterprise by leveraging the data lake. Data Lake... What is that? Data Lake is a loosely defined term, and its definition changes during implementation…

Added by Kumar Chinnakali on December 2, 2015 at 5:00am — No Comments

6 Ways Big Data Analytics Can Drive Smarter Customer Service

Big data analytics finds immense application across the entire business. One area which can have direct, measurable and visible impact is the area of customer service. Data has been used since time immemorial to improve customer service, but it is only recently that the full power of predictive analytics is being applied in this function. Organizations, both large and small, are using big data…

Added by Aureus Analytics on November 24, 2015 at 1:00am — No Comments

© 2016 Data Science Central
