
July 2018 Blog Posts (88)

Top 10 Data Science Use Cases in Retail

Data has become a powerful driving force in industry. Large companies across diverse sectors seek to extract value from it.

Data has thus become essential for anyone who wants to make profitable business decisions. Moreover, a…


Added by Igor Bobriakov on July 26, 2018 at 8:00am — No Comments

Data Scientist's Book of Quotes

Insights and Advice from Data Science Leaders and Key Influencers. Paperback – July 13, 2018. By Matt Corey.

The Data Scientist’s Book of Quotes includes over 300 insightful and inspiring quotes from leading Data Science thought leaders and key influencers across the world, including Andrew Ng, Bernard Marr, Vincent Granville, Carla Gentry, Cathy O’Neil and Hilary Mason. The Data Scientist role is one of the most pivotal and disruptive roles in today’s global…


Added by Vincent Granville on July 26, 2018 at 7:30am — 1 Comment

Deep learning based object classification model for Autonomous vehicles and Advanced Driver Assist System

Autonomous driving is the way to go forward.

Here is my article based on some of the work we have done in this field.

- Python/TensorFlow-based deep learning model for object classification, trained on a novel dataset

- Trained and deployed on an embedded computing platform for real-time object detection…


Added by Rajshekhar Mukherjee on July 25, 2018 at 3:00pm — No Comments

Practical Apache Spark in 10 minutes. Part 4 - MLlib

The vast possibilities of artificial intelligence are of increasing interest in modern information technology. One of its most promising and fastest-evolving directions is machine learning (ML), which is becoming an essential part of many aspects of our lives. ML has found successful applications in Natural Language Processing, Face…


Added by Igor Bobriakov on July 24, 2018 at 10:12pm — No Comments

Bill Vorhies Retrospective: Part 6

Bill is the Editorial Director for Data Science Central, and President and Chief Data Scientist at Data-Magnum, providing predictive analytics and big data infrastructure projects as a service. Bill has been an active commercial predictive modeler since 2001.

In this series consisting of six…


Added by Vincent Granville on July 24, 2018 at 7:00pm — No Comments

Using Design to Drive Business Outcomes, or Uncovering What You Don’t Know That You Already Know

Okay, that may be the most complicated title that I’ve ever read, but here’s the point of that title: most people don’t know what they already know. Put another way, people don’t realize how much they really know, because much of it is buried behind general rules, rules of thumb and simple heuristics. But those heuristics get in the way of one’s real knowledge. And that’s the real secret to effective…


Added by Bill Schmarzo on July 24, 2018 at 9:54am — 1 Comment

Comparing AI Strategies – Systems of Intelligence

Summary:  The fourth and final AI strategy we’ll review is Systems of Intelligence (SOI).  This is getting nearly as much attention as the Vertical strategy we previously reviewed.  It’s appealing because it seems to offer the financial advantages of a Horizontal strategy but its ability to create a defensible moat requires some fine tuning.



Added by William Vorhies on July 24, 2018 at 9:00am — No Comments

R Code for Friedman Test

Below is R code for the Friedman test, including post-hoc tests for the case where the null hypothesis is rejected.

Feel free to use the code by copying and pasting it into your R workspace.

friedman.test <- function(data, alpha=0.05) {


# Author : Okan OYMAK, MS in Operations Research at the NPS Monterey, CA, USA

# Date : 12 March 2015


# Data:

# The…


Added by Okan OYMAK on July 24, 2018 at 8:00am — No Comments
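
For readers who want to see the core computation, here is a minimal sketch of the Friedman statistic in plain Python. It is not the author's R code, which is truncated in the excerpt above. The function names average_ranks and friedman_statistic are hypothetical, rows are assumed to be blocks and columns treatments, and neither a tie correction nor the post-hoc tests are included.

```python
def average_ranks(values):
    """Rank values from 1 to k, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based, ranks 1-based
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


def friedman_statistic(blocks):
    """Friedman chi-square statistic (assumption: rows are blocks, columns are treatments).

    No tie correction is applied, so this is only a sketch of the basic formula.
    """
    n = len(blocks)       # number of blocks (subjects)
    k = len(blocks[0])    # number of treatments
    rank_sums = [0.0] * k
    for row in blocks:
        # Rank the treatments within each block, then accumulate per treatment.
        for j, rank in enumerate(average_ranks(row)):
            rank_sums[j] += rank
    sum_sq = sum(s * s for s in rank_sums)
    return 12.0 / (n * k * (k + 1)) * sum_sq - 3.0 * n * (k + 1)
```

For four blocks that all rank three treatments identically, friedman_statistic([[1, 2, 3]] * 4) returns 8.0, the maximum n(k - 1) for n = 4 and k = 3. Under the null hypothesis the statistic is approximately chi-square distributed with k - 1 degrees of freedom.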

Academia to Industry: Applying Your PhD In Business

After years of working on your PhD and being in academia, transitioning into a Data Science career can be a culture shock.


It’s important to understand how the roles you want to apply for vary. What industry is the role in? What is the main focus of the projects? Who will you be working alongside? What end goals is the team working toward? Does your chosen industry have anything to do with your PhD and academic research?


‘Data Scientist’…


Added by Matt Reaney on July 24, 2018 at 6:17am — 2 Comments

Importance of Metadata in a Big Data World

Updated from the original post of April 17, 2014

The importance of metadata only continues to grow as organizations realize that fully exploiting the business and operational potential of machine learning, deep learning and artificial intelligence requires that the raw data be enhanced with metadata. And while we have growing volumes of actual data, there is even more data, or metadata, around the usage and source of the actual data.

Metadata is…


Added by Bill Schmarzo on July 23, 2018 at 4:30am — No Comments

Why Data Analytics is Heavy on Data Engineering?

While many companies have embarked on data analytics initiatives, only a few have been successful. Studies have shown that over 70% of data analytics programs fail to realize their full potential and over 80% of digital transformation initiatives fail. While many factors affect the successful deployment of data analytics, one fundamental reason is a lack of good-quality data. However, many business enterprises realize this and invest considerable…


Added by Prashanth Southekal, PhD on July 22, 2018 at 4:00pm — No Comments

Time Series Analysis and forecasting - Tutorial

This tutorial presents a short introduction to time series modeling and forecasting. Time series appear in many industries today that rely on predicting and balancing supply and demand (e-commerce, retail, ride-sharing, etc.).

Hence, a good understanding of the underlying model generating the data can significantly help in predicting future…


Added by M Baddar on July 22, 2018 at 12:24am — No Comments

Weekly Digest, July 23

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions


Added by Vincent Granville on July 21, 2018 at 7:30am — No Comments

How Can Project Managers Use Data Science?

Big data, Data Science, and smart insights! These are terms we commonly hear and read when discussing business decisions, industry competition, and customer needs. As the years pass by, our world is becoming a highly connected global village, which has only boosted the business industry. The more connected we become, the more data there is. Therefore, we only have more opportunities to convert it and provide better solutions to the world.

While the steadily growing field of Data…


Added by VAMSI NELLUTLA on July 20, 2018 at 1:41pm — No Comments

Feature Selection For Unsupervised Learning

This is my presentation for the IBM data science day, July 24.


After reviewing popular techniques used in supervised, unsupervised and semi-supervised machine learning, we focus on feature selection methods in these different contexts, especially the metrics used to assess the value of a feature or set of features, be it binary, continuous or categorical…


Added by Vincent Granville on July 20, 2018 at 9:34am — 2 Comments

Thursday News: Blockchain, AI, NLP, Python, R, SQL, Spark, Regression...

Here is our selection of featured articles and resources posted since Monday.



Added by Vincent Granville on July 19, 2018 at 7:30am — No Comments

How Data Science Transforms Document Management

Document management is an unavoidable part of every business. Handling business documents efficiently and neatly is strongly recommended. Every aspect of document management is important, whether you need to extract data from a document, to secure PDF files, to merge or split multiple documents, to cluster…


Added by Cindy Pearson on July 19, 2018 at 5:20am — No Comments

Practical Apache Spark in 10 minutes. Part 3 - DataFrames and SQL

Spark SQL is the part of the Apache Spark big data framework designed for processing structured and semi-structured data. It provides a DataFrame API that simplifies and accelerates data manipulation. A DataFrame is a special type of object, conceptually similar to a table in a relational database. It represents a distributed…


Added by Igor Bobriakov on July 18, 2018 at 10:01pm — No Comments


I recently bought a new PC and set up NGC (NVIDIA GPU Cloud); here are my notes on the setup process.

1. Registration for NGC

   Go to the NGC website and click the Register Now button.

   Enter the required information and read the Terms of Use.

   Click the Sign-up button.

   Open the e-mail sent after registration and reset the password.

2. Get API key

   Access to…


Added by Kei Yonezawa on July 18, 2018 at 11:41am — No Comments

Staying Sane and Optimistic amid the AI hype

At NIPS 2016, an unprecedented story was building up, one that had every AI enthusiast agog about an unknown AI startup, ‘Rocket AI’.

The names associated with the hot startup were pioneers in the AI field, and the media were told that a major announcement was coming soon. There was even a workshop, where one of the researchers explained the concept of…


Added by Ashish kumar on July 18, 2018 at 9:18am — No Comments
