Subscribe to DSC Newsletter

Featured Blog Posts – March 2017 Archive (103)

Goodbye Age of Hadoop – Hello Cambrian Explosion of Deep Learning

Summary:  Some observations about new major trends and directions in data science drawn from the Strata+Hadoop conference in San Jose last week.

 

I’m fresh off my annual field trip to the Strata+Hadoop conference in San Jose last week.  This is always exciting, enervating, and exhausting but it remains the single best place to pick up on…

Continue

Added by William Vorhies on March 20, 2017 at 4:48pm — No Comments

Free Machine Learning eBooks - March 2017

Here are three eBooks available for free.

MACHINE LEARNING

Edited by Abdelhamid Mellouk and Abdennacer Chebira

Machine Learning can be defined in various ways related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some Human Like intelligent behaviour.

Machine Learning addresses more specifically the ability to…

Continue

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — 5 Comments

Little Bee books: Tough topics simply explained

This is a nice collection of free eBooks to learn the ropes on topics covering Hadoop, machine learning, Spark, analytics, and more.

The Little Bee series of books provides an overview of the hot topics in data and analytics, giving you a snapshot of each technology and its potential benefit to your organisation. These books will not make you an expert, but they will improve your understanding and open the door to new ideas.

The subject of data and…

Continue

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — 2 Comments

How Airline Loyalty Programs Use Big Data To Drive Record Revenues

It’s often thought that loyalty programs are designed to reward customers with special offers, treats, and discounts in the hope of retaining their business and encouraging customers to spend more frequently. While there is some…

Continue

Added by Mark Ross-Smith on March 20, 2017 at 11:30am — No Comments

Difference of Data Science, Machine Learning and Data Mining

Data is almost everywhere. The amount of digital data that currently exists is now growing at a rapid pace. The number is doubling every two years and it is completely transforming our basic mode of existence. According to a paper from IBM, about 2.5 billion gigabytes of data had been generated on a daily basis in the year 2012. Another article from Forbes informs us…

Continue

Added by Leonard Heiler on March 20, 2017 at 10:30am — 2 Comments

The Four Stages of a Chatbot’s Business Intelligence Evolution

The Four Stages of a Chatbot’s Business Intelligence Evolution

I see four stages in the progression of chatbot-like AIs interacting with business systems for the purpose of providing actionable business intelligence.

Stage 1) Single Numeric Response

Question :…

Continue

Added by Eduardo Siman on March 20, 2017 at 7:00am — No Comments

What makes a great data scientist?

A data scientist is an umbrella term that describes people whose main responsibility is leveraging data to help other people (or machines) making more informed decisions. The spectrum of data scientist roles is so broad that I will keep this discussion for my next post. What I really want to focus is on what are the distinctive characteristics of a great data scientist.

Over the years that I have worked with data and analytics I have found that this has almost nothing to do with…

Continue

Added by Karolis Urbonas on March 20, 2017 at 12:00am — No Comments

What Happens When Data, Visuals and Emotions Intersect: Stories That Delight

Can data bring the best in humanity? Can it evoke emotions? Can it speak to us?

Every time I walk by a memorial wall, I am filled with visuals. Some folks run their fingers on the wall and when they spot a dear one’s name, a feeling of acknowledgement envelops their face. Many others stand in front of the wall to capture their own memories with their camera.

 As I watch the wall, I stand in awe. The combination of symbolism of the wall in totality and a tribute…

Continue

Added by Karthik Rajan on March 19, 2017 at 11:30am — No Comments

NEXT Machine Learning Paradigm: “DYNAMICAL"​ ML

Dynamical ML is machine learning that can adapt to variations over time; it requires “real-time recursive” learning algorithms and time-varying data models such as the ones described in the blog,…

Continue

Added by PG Madhavan on March 18, 2017 at 2:30pm — 1 Comment

Weekly Digest, March 20

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

Added by Vincent Granville on March 18, 2017 at 8:30am — No Comments

Influencing Behaviour Using Persuasive Data

I came across the story of a manager who felt that the best way to encourage desirable behaviours was through reward and humiliation.  This encouragement occurred indirectly through what I would describe as “persuasive data”:  a table of data went out each week showing the best and worst performing employees.  Everyone in the team could see the stats plainly along with the names of coworkers.  They were encouraged to make comparisons.  This represents an aggressive use of data.  From my…

Continue

Added by Don Philip Faithful on March 18, 2017 at 5:51am — No Comments

Numerous reasons why Digital Transformation fails

Many organizations today have realized that digital transformation is essential to their success.

But many of them forget that focus of a digital transformation is not digitization or even technology, it is the Customer!

Digital Transformation is not easy or small endeavor for any business. Several levers will need to be turned in unison just to ensure…

Continue

Added by Sandeep Raut on March 18, 2017 at 5:30am — No Comments

Understanding deep learning requires rethinking generalization

Recent scientific paper by  Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. Published here

Abstract

Despite their massive size, successful deep artificial neural networks can exhibit a remarkably small difference…

Continue

Added by Vincent Granville on March 17, 2017 at 1:11pm — No Comments

Finding Influencers on Twitter

Contributed by Oamar Gianan. He enrolled in the NYC Data Science Academy 12-week full time Data Science Bootcamp program taking place between September 23, 2016 and December 23, 2016. The original article can be found here.

Have you been followed on Twitter or Instagram by someone you don't know? I get this a lot. And so to avoid being thought of as rude, I follow back.…

Continue

Added by NYC Data Science Academy on March 17, 2017 at 12:36pm — No Comments

Scraping NSF Awards to Create Database of Active STEM Researchers

Contributed by Nathan Stevens. He enrolled in the NYC Data Science Academy 12-week full time Data Science Bootcamp program taking place between September 23, 2016 and December 23, 2016. The original article can be found here.

Introduction



There a numerous use cases for having a searchable database of active STEM (Science Technology…

Continue

Added by NYC Data Science Academy on March 17, 2017 at 11:30am — No Comments

Pokemon tracker in selected places using inferential statistic

Contributed by Lei Zhang. 

Recently one phone game spread through the whole world, and cause a lot of interesting topic about the technique behind the game. One of the hot topic is: How to predict the pokemon spawning position?  

Here I developed an app to predict the pokemon spawning position and the probability of spawning at that position. This is the exciting news for the fans of the game, who can easily find the possible spawning position…

Continue

Added by NYC Data Science Academy on March 17, 2017 at 11:30am — No Comments

Email Spam Filtering : A python implementation with scikit-learn

This article was written by ML bot2 on Machine Learning in Action.

Text mining (deriving information from text) is a wide field which has gained popularity with the huge text data being generated. Automation of a number of applications like sentiment analysis, document classification, topic classification, text summarization, machine translation, etc has been done using machine learning…

Continue

Added by Emmanuelle Rieuf on March 17, 2017 at 9:15am — No Comments

Python & JSON: Working with large datasets using Pandas

This article was posted by Vik Paruchuri. 

Introduction

Working with large JSON datasets can be a pain, particularly when they are too large to fit into memory. In cases like this, a combination of command line tools and Python can make for an efficient way to explore and analyze the data. In this post, we’ll look at how to leverage tools like Pandas to explore and map out police activity in Montgomery County, Maryland. We’ll start with a look at the…

Continue

Added by Emmanuelle Rieuf on March 17, 2017 at 9:00am — 1 Comment

Python for Big Data in One Picture

This picture originally posted here covers the following topics:

  1. Basic stack
  2. Newer packages
  3. Integrated platforms
  4. Visualization
  5. Data formats
  6. MapReduce
  7. Glue
  8. GPU
  9. Parallel
  10. Efficiency
  11. Packages

To zoom in, view picture in the original article, or click on picture. The…

Continue

Added by Vincent Granville on March 17, 2017 at 8:16am — No Comments

Squeezing Deep Learning into Mobile Phones - A Practitioner's guide

This is a slideshare presentation by Anirudh Koul. Anirudh is deep learning data scientist at Microsoft AI & Research. He earned a master of computational data science at Carnegie Mellon University, and a graduate certificate in data mining from Stanford University. He currently lives in the Bay Area. Anirudh is leading projects like Seeing…
Continue

Added by Vincent Granville on March 17, 2017 at 7:30am — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service