Subscribe to DSC Newsletter
.

Divya Singh's Blog – May 2019 Archive (11)

Apache Spark Streaming Tutorial for Beginners

Introduction

In a world where we generate data at an extremely fast rate, the correct analysis of the data and providing useful and meaningful results at the right time can provide helpful solutions for many domains dealing with data products. We can apply this in Health Care and Finance to Media, Retail, Travel Services and etc. some solid examples include Netflix providing personalized recommendations at real-time, Amazon tracking your interaction with different products on its…

Continue

Added by Divya Singh on May 30, 2019 at 8:00pm — No Comments

Basic Statistics Concepts Every Data Scientist Should know

Introduction

Data science is a multidisciplinary blend of data inference, algorithm development, and technology in order to solve analytically complex problems. At the core is data. Troves of raw information, streaming in and stored in enterprise data warehouses. Much to learn by mining it. Advanced capabilities we can build with it. Data science is ultimately about using this data in creative ways to generate business value

The broader fields of understanding what data…

Continue

Added by Divya Singh on May 29, 2019 at 8:00pm — No Comments

10 Areas of Expertise in Data Science

The analytics market is booming, and so is the use of the keyword – Data Science. Professionals from different disciplines are using data in their day to day activities, and feel the need to master the start-of-the-art technology in order to get maximum insights from the data, and subsequently help the business to grow.

Moreover, there are professionals who want to keep them updated with this latest skills such as Machine Learning, Deep Learning, Data Science, and so either to elevate…

Continue

Added by Divya Singh on May 28, 2019 at 10:19pm — No Comments

Why R Programming in Data Science?

Data Science is everyone’s word of the mouth in the current analytical eco-space. The study of Data Science which encompasses various subjects like Machine Learning, Deep Learning, Artificial Intelligence, Natural Language Processing, and so on has made tremendous advancement in the recent past.

Data Science is not something that emerged recently. It was there since computers were invented as the first Data Science application was classifying an email as Spam or Not Spam based on…

Continue

Added by Divya Singh on May 27, 2019 at 8:00pm — No Comments

How to Visualize AWS Cost and Usage Data Using Amazon Athena and QuickSight

Introduction

One of the major reasons organizations migrate to the AWS cloud is to gain the elasticity that can grow and shrink on demand, allowing them to pay only for resources they use. But the freedom to provide on-demand resources can sometimes lead to very high costs if they aren’t carefully monitored. Cost Optimization is one of the five pillars of the AWS Well-Architected Framework, and with good reason. When you optimize your costs, you build a more efficient cloud that…

Continue

Added by Divya Singh on May 27, 2019 at 8:00pm — No Comments

How to Install and Run Hadoop on Windows for Beginners

Introduction

Hadoop is a software framework from Apache Software Foundation that is used to store and process Big Data. It has two main components; Hadoop Distributed File System (HDFS), its storage system and MapReduce, is its data processing framework. Hadoop has the capability to manage large datasets by distributing the dataset into smaller chunks across multiple machines and performing parallel computation on it…

Continue

Added by Divya Singh on May 23, 2019 at 8:30pm — No Comments

What is Data Lake and How to Improve Data Lake Quality

Introduction

Building data pipelines is a core component of data science at a startup. In order to build data products, you need to be able to collect data points from millions of users and process the results in near real-time. Today, many organizations nowadays are struggling with the quality of their data. Data quality (DQ) problems can arise in various ways. Here are common causes of bad data quality:

  • Multiple data sources:…
Continue

Added by Divya Singh on May 22, 2019 at 9:00pm — No Comments

An Introduction to Python Virtual Environment

Data Science, Machine Learning, Deep Learning, and Artificial Intelligence are some of the most heard about buzzwords in the modern analytical eco-space. The exponential growth of technology in this regard has simplified our lives and made us more machine dependent. The astonishing hype surrounding such technologies has prompted professionals from various disciples to hop on to the ship and consider analytics as their career option.

To master Data Science or Artificial Intelligence in…

Continue

Added by Divya Singh on May 21, 2019 at 9:30pm — No Comments

Prediction of Customer Churn with Machine Learning

Machine Learning is the word of the mouth for everyone involved in the analytics world. Gone are those days of the traditional manual approach of taking key business decisions. Machine Learning is the future and is here to stay.

However, the term Machine Learning is not a new one. It was there since the advent of computers but has grown tremendously in the last decade due to the massive amounts of data that’s getting generated, and the enormous computational power that modern-day…

Continue

Added by Divya Singh on May 20, 2019 at 10:30pm — No Comments

The Upcoming Revolution in Predictive Analytics (And Data Science)

The Next Generation of Data Science

Quite literally, I am stunned.

I have just completed my survey of data (from articles, blogs, white papers, university websites, curated tech websites, and research papers all available online) about predictive analytics.

And I have a reason to believe that we are standing on the brink of a revolution that will transform everything we know about data science and predictive analytics.

But before we go there,…

Continue

Added by Divya Singh on May 9, 2019 at 10:30pm — No Comments

How Apache Spark can Boost Your Value?

Introduction

Technological progress and the development of infrastructure has increased the popularity of Big Data immensely. Businesses have started to realize that data can be used to accurately predict the needs of customers which can increase profits significantly. The growing use of Big Data can be…

Continue

Added by Divya Singh on May 6, 2019 at 9:00pm — No Comments

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service