
Divya Singh's Blog (43)

How to Discover and Classify Metadata using Apache Atlas on Amazon EMR

Introduction

The boundaries of the enterprise are becoming diffused. You have data on the network, on the endpoint, and on the cloud. Enabling visibility into your data flows is a critical first step to understanding which data is at risk for theft or misuse. You need to know what data you have, where it’s located, and why that data exists in order to properly protect it. This is where data discovery and data classification come into…

Continue

Added by Divya Singh on December 2, 2019 at 1:30am — 1 Comment

Challenges Faced by a Data Scientist and How to Overcome Them?

Data Scientist is regarded as the sexiest job of the 21st century. It is a high-paying, lucrative job that comes with a lot of responsibility and commitment. Any professional needs to master state-of-the-art skills and technologies to become a Data Scientist in the modern world. It is a profession where people from different disciplines can fit in, as a plethora of specialties are embedded in the Data Scientist role.

Data Science is not a present-day…

Continue

Added by Divya Singh on December 1, 2019 at 8:00pm — No Comments

What is Map Reduce Programming and How Does it Work

Introduction

Data Science is the study of extracting meaningful insights from data using various tools and techniques for the growth of the business. Although it dates back to the time when computers came into the picture, the recent hype is a result of the huge amount of unstructured data being generated and the unprecedented computational capacity that modern computers possess.

However, there is a lot of misconception among the masses about the true meaning of…

Continue

Added by Divya Singh on November 23, 2019 at 8:30pm — No Comments

A Guide to Predictive Analysis in R

Predictive analysis is heavily used today to gain insights at a level that the human eye cannot detect, and R is an extremely powerful and easy-to-use tool for it. In this piece, we will explore how to predict the status of breast cancer using predictive modeling in fewer than 30 lines of code.

Who should read this blog?

  • Someone who wants to get started with R…
Continue

Added by Divya Singh on November 21, 2019 at 9:00pm — 1 Comment

Exploratory Data Analysis with Python

In addition to being the sexiest job of the twenty-first century, Data Science is the new electricity, as quoted by Andrew Ng. A lot of professionals from various disciplines and domains are looking to make a transition into the field of analytics and use Data Science to solve various problems across multiple channels. Being an inter-disciplinary study, one could easily mine data for various operations and help decision-makers make relevant conclusions to achieve…

Continue

Added by Divya Singh on July 2, 2019 at 8:00pm — No Comments

A Comprehensive Guide to Data Science With Python

A Hearty Welcome to You!

I am so thrilled to welcome you to the absolutely awesome world of data science. It is an interesting subject, sometimes difficult, sometimes a struggle but always hugely rewarding at the end of your work. While data science is not as tough as, say, quantum mechanics, it is not high-school algebra either.

It requires knowledge of Statistics, some Mathematics (Linear Algebra,…

Continue

Added by Divya Singh on June 26, 2019 at 9:00pm — No Comments

Deep Learning Data Sets for Every Data Scientist

Machine Learning has seen a tremendous rise in the last decade, and one of its sub-fields that has contributed largely to its growth is Deep Learning. The large volumes of data and the huge computation power that modern systems possess have enabled Data Scientists, Machine Learning Engineers, and others to achieve ground-breaking results in Deep Learning and to continue bringing new developments to this field.

In this blog post, we will cover the deep learning data sets that you…

Continue

Added by Divya Singh on June 19, 2019 at 8:00pm — No Comments

What is Quantum Computing and How is it Useful for Artificial Intelligence?

Introduction

After decades of a heavy slog with no promise of success, quantum computing is suddenly buzzing! Nearly two years ago, IBM made a quantum computer available to the world: the 5-quantum-bit (qubit) resource they now call the IBM Q Experience. It was more like a toy for researchers than a way of getting any serious number crunching done. But 70,000 users worldwide have registered for it, and the qubit count in this…

Continue

Added by Divya Singh on June 13, 2019 at 8:00pm — No Comments

Linear Regression Analysis – Part 1

Who should read this blog:

  • Someone who is new to linear regression.
  • Someone who wants to understand the jargon around Linear Regression

Code Repository:

https://github.com/DhruvilKarani/Linear-Regression-Experiments

Linear regression is generally the first step into anyone’s Data Science journey. When you hear the words Linear and Regression,…

Continue

Added by Divya Singh on June 12, 2019 at 10:00pm — No Comments

Basics of Hive and Impala for Beginners

Data Science is the field of study in which large volumes of data are mined and analysed to build predictive models that help the business in the process. The data used here is often unstructured and huge in quantity. Such data, characterized by volume, velocity, veracity, and variety, is known as Big Data.

Hadoop and Spark are two of the most popular open-source frameworks used to deal with big data. The Hadoop architecture includes the following…

Continue

Added by Divya Singh on June 6, 2019 at 9:30pm — No Comments

How to Make Machine Learning Models for Beginners

Introduction

Data science is one of the hottest topics of the 21st century because we are generating data at a rate much higher than we can actually process. A lot of businesses and tech firms are now gaining key advantages by harnessing data science. Due to this, data science is booming right now.

In this blog, we will deep dive into the world of machine learning. We will walk you…

Continue

Added by Divya Singh on June 4, 2019 at 8:30pm — No Comments

Apache Spark Streaming Tutorial for Beginners

Introduction

In a world where we generate data at an extremely fast rate, analyzing the data correctly and providing useful and meaningful results at the right time can offer helpful solutions for many domains dealing with data products. This applies to domains from Health Care and Finance to Media, Retail, and Travel Services. Some solid examples include Netflix providing personalized recommendations in real time, Amazon tracking your interaction with different products on its…

Continue

Added by Divya Singh on May 30, 2019 at 8:00pm — No Comments

Basic Statistics Concepts Every Data Scientist Should Know

Introduction

Data science is a multidisciplinary blend of data inference, algorithm development, and technology in order to solve analytically complex problems. At the core is data. Troves of raw information, streaming in and stored in enterprise data warehouses. Much to learn by mining it. Advanced capabilities we can build with it. Data science is ultimately about using this data in creative ways to generate business value.

The broader fields of understanding what data…

Continue

Added by Divya Singh on May 29, 2019 at 8:00pm — No Comments

10 Areas of Expertise in Data Science

The analytics market is booming, and so is the use of the keyword – Data Science. Professionals from different disciplines are using data in their day-to-day activities, and feel the need to master state-of-the-art technology in order to get maximum insights from the data and subsequently help the business grow.

Moreover, there are professionals who want to keep themselves updated with the latest skills such as Machine Learning, Deep Learning, Data Science, and so on, either to elevate…

Continue

Added by Divya Singh on May 28, 2019 at 10:19pm — No Comments

How to Visualize AWS Cost and Usage Data Using Amazon Athena and QuickSight

Introduction

One of the major reasons organizations migrate to the AWS cloud is to gain the elasticity that can grow and shrink on demand, allowing them to pay only for resources they use. But the freedom to provide on-demand resources can sometimes lead to very high costs if they aren’t carefully monitored. Cost Optimization is one of the five pillars of the AWS Well-Architected Framework, and with good reason. When you optimize your costs, you build a more efficient cloud that…

Continue

Added by Divya Singh on May 27, 2019 at 8:00pm — No Comments

How to Install and Run Hadoop on Windows for Beginners

Introduction

Hadoop is a software framework from the Apache Software Foundation that is used to store and process Big Data. It has two main components: Hadoop Distributed File System (HDFS), its storage system, and MapReduce, its data processing framework. Hadoop can manage large datasets by distributing the dataset into smaller chunks across multiple machines and performing parallel computation on them…
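
The chunk-and-compute idea described in this excerpt can be illustrated with a minimal Python sketch. This is not Hadoop code and does not use any Hadoop API: the function names (`split_into_chunks`, `map_chunk`, `word_count`), the sample lines, and the use of a local thread pool in place of cluster machines are all assumptions made purely for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_chunks(records, chunk_size):
    """Split the dataset into fixed-size chunks (a stand-in for storage blocks)."""
    return [records[i:i + chunk_size] for i in range(0, len(records), chunk_size)]


def map_chunk(chunk):
    """Map step: count the words in one chunk, independently of other chunks."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def word_count(records, chunk_size=2, workers=4):
    """Process chunks in parallel, then merge the partial results."""
    chunks = split_into_chunks(records, chunk_size)
    # Hypothetical stand-in for worker machines: a local pool of threads.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(map_chunk, chunks))
    # Reduce step: combine the per-chunk counts into one total.
    return reduce(lambda left, right: left + right, partial_counts, Counter())


# Illustrative sample data, not taken from the original post.
lines = [
    "big data needs big storage",
    "hadoop stores data in blocks",
    "mapreduce processes data in parallel",
]
totals = word_count(lines)
print(totals.most_common(3))
```

Each chunk is counted without any knowledge of the others, which is what allows the work to be spread across machines; only the final merge needs to see every partial result.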

Continue

Added by Divya Singh on May 23, 2019 at 8:30pm — No Comments

What is Data Lake and How to Improve Data Lake Quality

Introduction

Building data pipelines is a core component of data science at a startup. In order to build data products, you need to be able to collect data points from millions of users and process the results in near real-time. Today, many organizations are struggling with the quality of their data. Data quality (DQ) problems can arise in various ways. Here are common causes of bad data quality:

  • Multiple data sources:…
Continue

Added by Divya Singh on May 22, 2019 at 9:00pm — No Comments

An Introduction to Python Virtual Environment

Data Science, Machine Learning, Deep Learning, and Artificial Intelligence are some of the most talked-about buzzwords in the modern analytical eco-space. The exponential growth of technology in this regard has simplified our lives and made us more machine dependent. The astonishing hype surrounding such technologies has prompted professionals from various disciplines to jump on board and consider analytics as a career option.

To master Data Science or Artificial Intelligence in…

Continue

Added by Divya Singh on May 21, 2019 at 9:30pm — No Comments

Prediction of Customer Churn with Machine Learning

Machine Learning is on the lips of everyone involved in the analytics world. Gone are the days of the traditional manual approach to making key business decisions. Machine Learning is the future and is here to stay.

However, the term Machine Learning is not a new one. It has been around since the advent of computers but has grown tremendously in the last decade due to the massive amounts of data being generated and the enormous computational power that modern-day…

Continue

Added by Divya Singh on May 20, 2019 at 10:30pm — No Comments

The Upcoming Revolution in Predictive Analytics (And Data Science)

The Next Generation of Data Science

Quite literally, I am stunned.

I have just completed my survey of data (from articles, blogs, white papers, university websites, curated tech websites, and research papers all available online) about predictive analytics.

And I have reason to believe that we are standing on the brink of a revolution that will transform everything we know about data science and predictive analytics.

But before we go there,…

Continue

Added by Divya Singh on May 9, 2019 at 10:30pm — No Comments


© 2019 Data Science Central ®
