Subscribe to DSC Newsletter

Featured Blog Posts – May 2018 Archive (84)

7 Reasons Why Java Developers Should Learn Hadoop

Imagine there are two girls standing in front of you – The first girl is cute, beautiful, interesting and has the smile that any guy would die for. And the other girl is average-looking, quiet, not-so-impressive… no different from the ones that you usually see in the restaurant cash counter. Which girl will you call out for a date? If you’re like me, you will choose the attractive girl. You see, life is full of options and making the right choice is what matters the…

Continue

Added by Venkatesan M on May 16, 2018 at 12:00am — No Comments

Free eBooks on Hadoop, Deep Learning and DataViz by Packt

R Deep Learning Essentials

By Joshua F. Wiley

Get everything you need to know to enter the world of deep learning when it comes to R with this book. Get started from the packages you need to have for your side,…

Continue

Added by Packt Publishing on May 15, 2018 at 10:00pm — No Comments

Big Data and Cyber Security - A double whammy for hackers

Businesses are growing more digitized today. As this happens, cybersecurity threats are rising as well. Companies are placed at an increasing risk, which is why they need help from big data analysis. In fact, KuppingerCole conducted a study entitled “Big Data and Information Security.” study looks in-depth at current deployment levels and the benefits of big data…

Continue

Added by Evan Morris on May 15, 2018 at 8:30pm — 1 Comment

19 Interesting Articles About Excel

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, dataviz, AI and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on May 15, 2018 at 6:00pm — No Comments

Bill Vorhies Retrospective: Part 1

Bill is the Editorial Director for Data Science Central, and President and Chief Data Scientist at Data-Magnum, providing predictive analytics and big data infrastructure projects as a service. Bill has been an active commercial predictive modeler since 2001.

Bill Vorhies

In this…

Continue

Added by Vincent Granville on May 15, 2018 at 12:00pm — No Comments

Demystifying Black-Box Models with SHAP Value Analysis

As an Applied Data Scientist at Civis, I implemented the latest data science research to solve real-world problems. We recently worked with a global tool manufacturing company to reduce churn among their most loyal customers. A newly proposed tool, called SHAP (SHapley Additive ex…

Continue

Added by Civis Analytics on May 15, 2018 at 9:00am — No Comments

Everybody (Still) Loves a Data Scientist

Summary:  The annual Burtch Works study of Data Science salaries and employment statistics is just out and things continue to look great.  There are some interesting trends here you’ll want to know about.

 

This week the annual Burtch…

Continue

Added by William Vorhies on May 15, 2018 at 7:24am — 1 Comment

Why is Data Missing from the Balance Sheet?

I have been asked many times is data a business asset?. Why is it that an intangible asset like data is not in the company’s balance sheet - a statement of the assets, liabilities, and capital of a business at a particular point in time? Technically, an intangible asset is a non-physical asset that has a multi-period useful life. Examples of intangible business assets are patents, copyrights, customer lists, trademarks, brand names, logo, and data. While data in the recent years…

Continue

Added by Prashanth Southekal, PhD on May 15, 2018 at 5:00am — 6 Comments

How Artificial Intelligence Has Influenced E-Commerce – The Retailer’s Story

Artificial Intelligence

Never before have customers been more in control of the retail trade than today. But are they really? Or has the retailer wrested control of the exchange? Let’s revisit this in the light of new technologies and sensors deployed in this “game”.

In the sixties through the eighties, the Sears, Walmart and K-mart kind of super stores aggregated purchase information to decide what to buy and stock their shelves. Improving…

Continue

Added by Hemant Warudkar on May 14, 2018 at 7:00pm — No Comments

Top 7 Data Science Use Cases in Finance

In recent years, the ability of data science and machine learning to cope with a number of principal financial tasks has become an especially important point at issue. Companies want to know more what improvements the technologies bring and how they can reshape their business strategies.

To help you answer these questions, we…

Continue

Added by Igor Bobriakov on May 14, 2018 at 4:30am — No Comments

Six Core Aspects of Semantic AI

  1. Hybrid approach: Semantic AI is the combination of methods derived from symbolic AI and statistical AI. Virtuously playing the AI piano means that for a given use case various stakeholders, not only data scientists, but also process owners or subject matter experts, choose from available methods and tools, and collaboratively develop workflows that are most likely a good fit to tackle the underlying problem. For example, one can combine entity extraction based on…
Continue

Added by Andreas Blumauer on May 14, 2018 at 4:30am — 2 Comments

What is Bitcoin ?

Bitcoin is known as the very first decentralized digital currency, they’re basically coins that can send through the Internet. 2009 was the year where bitcoin was born. The creator’s name is unknown, however the alias Satoshi Nakamoto was given to this person.

Advantages of Bitcoin.

Bitcoin transactions are made directly from person to person trough the internet. There’s no need of a bank or clearinghouse to act as the middle…

Continue

Added by Venkatesan M on May 13, 2018 at 9:30pm — 1 Comment

Weekly Digest, May 14

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Announcements
  • SQL + Notebooks + Charts. All in one platform. …
Continue

Added by Vincent Granville on May 13, 2018 at 5:00am — No Comments

Selection of Great Data Science Articles still Worth Reading

These articles are between 3 and 5 year old, but are still valuable today. The methodology used in these articles is modern, and still state-of-the-art today. Some discuss immense data sets still available to the public, and that resulted in designing new machine learning techniques to handle them. 

I am in the process of organizing these articles (written by myself) to eventually self-publish data science tutorials, in a few separate booklets, that are easy to understand for the…

Continue

Added by Vincent Granville on May 12, 2018 at 4:30pm — No Comments

Machine Learning and Its Algorithms to Know – MLAlgos

Describing and picturing MLAlgos and Machine Learning is the main idea of this post. I will attempt to answer few basic questions as well. Though these questions have been answered many a times in the past and are widely available. Answering them again here from my very own experience on the ground may makes the difference though rather then simply answering from phd or scholar books material prospective.…

Continue

Added by Vinod Sharma on May 10, 2018 at 10:30pm — No Comments

What is DevOps ?

In earlier days, solutions were associated with getting the technology right. The key was technology, the solution was technology and the business expected and paid for technology. Times have changed. Well, at least for those of us taking notice. Today technology is hardly ever a significant problem. Technically, we have a less complicated world. Over the years we have come to understand that technology…

Continue

Added by Venkatesan M on May 10, 2018 at 8:30pm — No Comments

Deep Dive into Polynomial Regression and Overfitting

In this article, we show that the issue with polynomial regression is not over-fitting, but numerical precision. Even if done right, numerical precision still remains an insurmountable challenge. We focus here on step-wise polynomial regression, which is supposed to be more stable than the traditional model. In step-wise regression, we estimate one coefficient at a time, using the classic least square technique. …

Continue

Added by Vincent Granville on May 9, 2018 at 5:30pm — 4 Comments

An Intro to Natural Language Processing in Python: Framing Text Classification in Familiar Terms

Natural language processing (NLP) is a broad field encompassing many different tasks such as text search, translation, named entity recognition, and topic modeling. On a daily basis, we use NLP whenever we search the internet, ask a voice assistant to tell us the weather forecast, or translate web pages written in another language. Businesses use NLP to understand how their…

Continue

Added by Civis Analytics on May 9, 2018 at 1:30pm — No Comments

A guide to manipulating, analyzing, and visualizing data in R

R has spread deep into the private sector and can be found in the production pipelines at some of the most advanced and successful enterprises. 

Learn the fundamentals of data analysis in the second edition of Data Analysis with R, authored by data scientist…

Continue

Added by Packt Publishing on May 8, 2018 at 10:30pm — No Comments

Deep Learning Infrastructure for Extreme Scale with 
the Apache Kafka Open Source Ecosystem

I had a new talk presented at "Codemotion Amsterdam 2018" this week. I discussed the relation of Apache Kafka and Machine Learning to build a Machine Learning infrastructure for extreme scale.

Long version of the title:

"Deep Learning at Extreme Scale (in the Cloud) 
with the Apache Kafka Open Source Ecosystem - How to Build a Machine Learning Infrastructure with Kafka, Connect, Streams, KSQL, etc."

As always, I want to share the slide deck. The talk was…

Continue

Added by Kai Waehner on May 8, 2018 at 9:30pm — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2020   TechTarget ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service