John Thuma's Blog – August 2015 Archive (3)

Assembling The Data Team: Part 2: Traits to Avoid

In part one of this blog post, we discussed the traits to look for when building a data science team. Technical, statistical, programming, and mathematical skills are needed, of course, but it takes much more than that. We covered the following traits: The Pioneer, The Cattle Herder, The Muscle, and The Story Teller. See below for that blog post:



Added by John Thuma on August 22, 2015 at 11:00pm — No Comments

VIDEO: Aster Tango - SQL-MR/GR Scripting Tool

Writing SQL-MR and SQL-GR statements is made much easier by a tool I wrote while on vacation. As an Aster Data Scientist, I needed a tool that would let me focus on the 'WHAT' and not the 'HOW'. I needed a tool to write the code for me, so I wrote Aster Tango.

Added by John Thuma on August 17, 2015 at 4:12am — No Comments

Aster and Text Analysis (TextChunker, Vector Distance, Levenshtein Distance, Text_Parser, and TF_IDF)

Some of you may not know that Aster provides deep capabilities in text analysis. These functions are easy to use, and they let you perform text analysis at scale. What does this mean? It means that I can take billions of customer service notes from a CRM system and analyze their text. Here is a sample of the Aster text analytic functions: TextChunker, Vector Distance, Levenshtein Distance, Text_Parser, and TF_IDF.
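For readers who have not met two of these techniques before, here is a minimal sketch of Levenshtein distance and TF-IDF in plain Python. It is not Aster SQL-MR code and says nothing about how Aster implements these functions; the names levenshtein and tf_idf, the whitespace tokenizer, and the sample notes are all assumptions made for illustration.

```python
import math
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes, or substitutions
    needed to turn string a into string b (hypothetical helper)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string on the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        previous = current
    return previous[-1]


def tf_idf(docs):
    """Return one {term: score} dict per document, where
    score = (term count / document length) * ln(N / document frequency).
    Tokenization is a naive lowercase whitespace split (an assumption)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


if __name__ == "__main__":
    # Made-up customer service notes, for illustration only.
    notes = ["refund arrived late", "refund item broken"]
    print(levenshtein("kitten", "sitting"))
    print(tf_idf(notes)[0])
```

In this sketch a term that appears in every note, such as "refund", scores zero, while terms unique to one note score higher. That is the basic idea behind using TF-IDF to surface what is distinctive about each note.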



Added by John Thuma on August 3, 2015 at 7:30am — No Comments

