A Data Lake has flexible definition, to make this statement true the dataottam team took initiative and released a eBook called “The Collective Definition of Data Lake by Big Data Community”, which contains many definitions from various business savvy and technologist.
And in nutshell Data Lake is a data store and processing data system, where an…Continue
Added by Kumar Chinnakali on January 28, 2016 at 6:30pm — No Comments
Big data is a term that has attracted the attention of nearly every chief manager in current digital arena as it offers them a treasure trove of prospective leads. A massive amount of data is available through websites, CRM solutions, social networks, analytics and news feeds. But this data is present in structured, semi-structured and unstructured form, therefore requires deployment of sophisticated tools to make…Continue
Added by Brian Jones on January 25, 2016 at 2:00am — No Comments
Celebrate the Big Data Problems – #2
How to identify the no of buckets for a Hive table while executing the HiveQL DDLs ?
The dataottam team has come up with blog sharing initiative called “Celebrate the Big Data Problems”. In this series of blogs we will share our big data problems using CPS (Context, Problem, Solutions) Framework.
Bucketing is another…Continue
Added by Kumar Chinnakali on January 21, 2016 at 7:41pm — No Comments
Self-Learn Yourself IoT in 21 Blogs – #1 – In this we will be seeing What is IoT ? Why do we need it? Significance & Impact on Modern life?
Time to Greet the New Clone that it set to rule the world !
Hello All ! Well, this is my first blog for dataottam and so henceforth I welcome your valuable feedback and that in return helps me to deliver better!
Alright, what is Internet of Things (IoT) ? How does it differ from Internet of Everything ? What is M2M ?
Added by Kumar Chinnakali on January 21, 2016 at 8:30am — No Comments
The most popular example of application of analytics in sports is from the Hollywood movie Moneyball, which is based on the true story of a baseball coach who assembles a stellar team despite having a very limited budget. He uses the help of an economics student to use data models to identify players who have the potential to perform outstandingly, but are under-valued in the market. Fast forward today, and this has become a best-practice in team-building across nations,…Continue
Added by Tanmay Bhandari on January 18, 2016 at 3:30pm — No Comments
Celebrate the Big Data Problems – #1
Daily we are facing many big data problems in production, PoC, and more perspective. Do we have any common repo to collect and share? No, as we know we don’t have any. As always dataottam is looking forward to share the…Continue
Added by Kumar Chinnakali on January 15, 2016 at 11:30pm — No Comments
Big Data is problem statement and it can be solved with one of the tools like Apache Hadoop. But having Apache Hadoop as infra to do our proof of concepts, proof of values is little challenging. Hence we brought 3 click ideas to have your Apache Hadoop installed.
What is Perquisite?
Can I have the Script? Yes
Added by Kumar Chinnakali on January 12, 2016 at 9:53am — No Comments
Apache Spark has many components including Spark Core which is responsible for Task Scheduling, Memory Management, Fault Recovery, and Interacting with storage…Continue
Added by Kumar Chinnakali on January 12, 2016 at 8:00am — No Comments
Spark was initially started by Matei at UC Berkeley AMPLab in 2009, and open sourced in 2010…Continue
Added by Kumar Chinnakali on January 9, 2016 at 9:00pm — No Comments
Data Science is the system used to extract insights from data that’s mined from various sources. Using various techniques including predictive modeling, Data Science helps to analyze and interpret vast amounts of data. The people who apply Data Science to manage large amounts of…Continue
Added by Vaishnavi Agrawal on January 8, 2016 at 11:30pm — No Comments
By this blog we will share the titles for learning Apache Spark, Basics on Hadoop which is one of the big data tool, and motivations for Apache Spark which is not replacement of Apache Hadoop, but its friend of big data.
Blog 1 – Introduction to Big Data
Blog 2 – Hadoop, Spark’s Motivations
Blog 3 – Apache Spark’s History and Unified Platform for Big Data
Blog 4 – Apache Spark’s First Step – AWS, Apache Spark
Blog 5 – Apache Spark Languages with basic…Continue
Added by Kumar Chinnakali on January 8, 2016 at 9:00pm — No Comments
Those who follow big data technology news probably know about Apache Spark, and how it’s popularly known as the Hadoop Swiss Army Knife. For those not so familiar, Spark is a cluster computing framework for data analytics designed to speed up and simplify common data-crunching and analytics tasks. Spark is certainly creating buzz in the big data world, but why? What’s so special about this…Continue
Added by Ritesh Gujrati on January 8, 2016 at 2:30am — No Comments
The term Data Lake has been gaining popularity recently as most of the enterprises have incorporated it into their analytics software’s. Every word and phrase that is used to describe Data Lake have provided us much useful information about how we interpret it.
So we at dataottam decided to understand the various ways Data Lake could be defined. So we conducted a survey and found very interesting thoughts, words and phrases used for defining Data Lake, from developers to founders, to…Continue
We have received many requests from friends who are constantly reading our blogs to provide them a complete guide to sparkle in Apache Spark. So here we have come up with learning initiative called “Self-Learn Yourself Apache Spark in 21 Blogs".
We have drilled down various sources and archives to provide a perfect learning path for you to understand and excel in Apache Spark. These 21 blogs which will be written over a course of time will be a complete guide for you to understand and…Continue
Added by Kumar Chinnakali on December 30, 2015 at 3:00am — No Comments
The global big data market is expected to develop rapidly by 2018, with major contribution from the North America regional market. In the past few years, there has been a growth in ‘big data’, which is generated in several sectors across the globe. Growth in the amount of data has led to the development of…Continue
Added by James White on December 20, 2015 at 8:30pm — No Comments
I am Back ! Yes, I am back (on the track) on my learning track. Sometime, it is really necessary to take a break and introspect why do we learn, before learning. Ah ! it was 9 months safe refuge to learn how Big Data & Analytics can contribute to Data Product.
Data strategy has always been expected to be revenue generation. As Big data and Hadoop entering into the enterprise data strategy it is also expected from big data infrastructure to be revenue addition.…Continue
Added by Manish Bhoge on December 12, 2015 at 9:53am — No Comments
The data flood that you are witnessing every minute is trying to tell you hidden secrets about your business growth, are you listening? In the previous years, we may have taken a glass half empty perspective for the analytics sector but this definitely is about to change now. Big data analytics is indeed one of the fastest growing markets and is expected to mature in 2016 and…Continue
Added by Aureus Analytics on December 3, 2015 at 9:30pm — No Comments
Added by David Lefkowich on December 2, 2015 at 8:30am — No Comments
Yes, we are marching towards New Year 2016! What happened to Resolution of 2014, 2015? Quit Habits? Practice Habits? Road ahead? Am into all, but i could not able to keep it up. Hence this New Year 2016 is no more resolutions, just implement the plan.
Extend to that, as we know big data is bringing more business value to enterprise by leveraging the data lake. Data Lake..... What is that? Data Lake is loosely defined word and the definition gets changed during implementation…Continue
Added by Kumar Chinnakali on December 2, 2015 at 5:00am — No Comments
Big data analytics finds immense application across the entire business. One area which can have direct, measurable and visible impact is the area of customer service. Data has been used since time immemorial to improve customer service, but it is only recently that the full power of predictive analytics is being applied in this function. Organizations, both large and small, are using big data…Continue
Added by Aureus Analytics on November 24, 2015 at 1:00am — No Comments