Featured Blog Posts – August 2017 Archive (112)

Data Science Simplified Part 1: Principles and Process

In 2006, Clive Humby, UK mathematician and architect of Tesco’s Clubcard, coined the phrase “Data is the new oil.” He said the following:



Added by Pradeep Menon on August 5, 2017 at 2:10am — 5 Comments

Apache Spark Streaming

Introduction to Apache Spark Streaming

A data stream is an unbounded sequence of data arriving continuously. Streaming divides continuously flowing input data into discrete units for further processing. Stream processing is the low-latency processing and analysis of streaming data.
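As a rough illustration of dividing a continuous stream into discrete units, the following is a minimal plain-Python sketch. It does not use the Spark API; the function name `micro_batches` and the count-based batch size are assumptions made for this example (Spark Streaming itself groups data by time interval rather than by item count).

```python
from itertools import count, islice


def micro_batches(stream, batch_size):
    """Yield lists of up to batch_size items from a possibly unbounded iterable.

    Hypothetical helper for illustration only; not part of any Spark API.
    """
    iterator = iter(stream)
    while True:
        # Pull the next discrete unit of work off the stream.
        batch = list(islice(iterator, batch_size))
        if not batch:
            return  # the stream ended (never happens for an unbounded source)
        yield batch


# A finite stream is split into full batches plus one partial batch.
finite_batches = list(micro_batches(range(7), 3))

# An unbounded stream can still be consumed lazily, one batch at a time.
first_two = list(islice(micro_batches(count(), 2), 2))
```

Because the generator is lazy, each batch can be processed as soon as it is complete, which is the property that keeps latency low.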

Spark Streaming, added to Apache Spark in 2013, is an extension of the core Spark API that provides scalable, high-throughput and fault-tolerant stream…


Added by Shreya Gupta on August 5, 2017 at 2:00am — No Comments

Data Science Simplified Part 3: Hypothesis Testing

Edward Teller, the famous Hungarian-American physicist, once said:

“A fact is a simple statement that everyone believes. It is innocent, unless found guilty. A hypothesis is a novel suggestion that no one wants to believe. It is guilty, until…

Added by Pradeep Menon on August 5, 2017 at 2:00am — 2 Comments

R code and reproducible model development with DVC

DVC is an open source tool that can help achieve code simplicity, readability and faster model development. The idea is to track file and data dependencies during model development in order to facilitate reproducibility and to version data files. DVC is a language-agnostic tool and can be used with any programming language. Here we will describe how we can…


Added by Marija Zoldin on August 4, 2017 at 12:30am — No Comments

Deep Learning with TensorFlow in Python: Convolution Neural Nets

The following problems appeared in the assignments in the Udacity course Deep Learning (by Google). The descriptions of the problems are taken from the assignments (continued from the last post).

Classifying the letters with notMNIST dataset with Deep Network

Here is what some sample images from the dataset look like:


Let’s try to get the best performance using a multi-layer model!…


Added by Sandipan Dey on August 3, 2017 at 10:30pm — No Comments

Dummy errors when using neuralnet package in R

Ok, so you read a bunch of stuff on how to do Neural Networks, how many layers or nodes you should add, and so on. But when you start to implement the actual Neural Network, you face a ton of dummy errors that stop your beautiful inspirational programming.

This post talks about some errors you might face when using the neuralnet package in R.…


Added by Renata Ghisloti Duarte Souza Gra on August 2, 2017 at 10:30am — No Comments

3 Ways To Drive Effective HR Insights Using Data Analytics

Data analytics has become the buzzword in the world of business today. Effective human resource (HR) management is one of the most important corporate responsibilities for ensuring smooth business operations. Therefore, the growing role of data analytics in HR cannot be ignored. Broadly…


Added by Shantanu Chaturvedi on August 2, 2017 at 1:30am — No Comments

Architectural Primer: Conversation as a Platform (CAAP) on Azure

According to a McKinsey report, artificial intelligence (AI) is poised to unleash digital disruption, and companies need to start preparing…


Added by Pradeep Menon on August 1, 2017 at 5:00am — No Comments

Machine Learning Translation and the Google Translate Algorithm

Years ago, it was very time-consuming to translate text from an unknown language. Using simple vocabularies with word-for-word translation was hard for two reasons: 1) the reader had to know the grammar rules and 2) the reader needed to keep in mind all language versions while translating the whole sentence.

Now, we…


Added by Luba Belokon on August 1, 2017 at 5:00am — No Comments

Demystifying Data Lake Architecture

According to Gartner, 80% of successful CDOs will have value creation or revenue generation as their Number 1 priority through 2021.

To create the maximum value out of the organization’s data landscape, traditional decision support system architectures are no longer adequate. New architectural patterns need to be developed to harness the…


Added by Pradeep Menon on August 1, 2017 at 5:00am — 3 Comments

Modeling the growth of a sunflower with golden angle and Fibonacci numbers in R

  • In this article, a mathematical model for the growth of a sunflower (shown below) will be described (reference: the video lectures of Prof. Jeffrey R. Chasnov from the Coursera course on Fibonacci numbers).


  • New florets are created close to the center.
  • Florets move radially out with constant speed as the sunflower grows.
  • Each new…
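
The growth rules listed above can be sketched in a few lines. The original post works in R; this is a minimal Python sketch that assumes, going by the post's title, that each new floret is rotated by the golden angle relative to the previous one. The function name `sunflower_florets` and the `speed` parameter are hypothetical and chosen only for illustration.

```python
import math

GOLDEN_RATIO = (1 + math.sqrt(5)) / 2
# Golden angle in radians, about 137.5 degrees.
GOLDEN_ANGLE = 2 * math.pi * (1 - 1 / GOLDEN_RATIO)


def sunflower_florets(n_florets, speed=1.0):
    """Return (x, y) positions of n_florets florets after n_florets time steps.

    Assumption: floret k is created at step k, rotated by k golden angles,
    and then moves radially outward at constant speed.
    """
    positions = []
    for k in range(n_florets):
        age = n_florets - k        # time steps since floret k was created
        radius = speed * age       # constant radial speed
        angle = k * GOLDEN_ANGLE   # assumed rotation between successive florets
        positions.append((radius * math.cos(angle), radius * math.sin(angle)))
    return positions


florets = sunflower_florets(200)
```

Plotting the returned points as a scatter plot should show the oldest florets at the rim and the newest ones near the center.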

Added by Sandipan Dey on August 1, 2017 at 1:30am — No Comments

Three Stages Of Effective Data Governance In Healthcare

  • Is data governance critical for you?
  • Would you want to build a flexible data governance strategy, iteratively realizing the benefits as it evolves?
  • Do you have big-data-oriented programs generating a vast amount of data?
  • Would you want a governance strategy around individual programs or rolled up into some logical grouping?
  • Would you want to improve on your existing governance strategy?
  • Would you have…

Added by Vijay Kumar on August 1, 2017 at 12:30am — No Comments
