Subscribe to DSC Newsletter

All Blog Posts Tagged 'text' (13)

Summarizing Economic Bulletin Documents with TF-IDF

A key strength of NLP (natural language processing) is being able to process large amounts of texts and then summarise them to extract meaningful insights.

In this example, a selection of economic bulletins in PDF format from 2018 to 2019 are analysed in order to gauge economic sentiment. The bulletins in question are sourced from the European Central Bank website. tf-idf is used to rank…

Continue

Added by Michael Grogan on July 11, 2019 at 12:28pm — No Comments

Text Encoding: A Review

The key to perform any text mining operation, such as topic detection or sentiment analysis, is to transform words into numbers, sequences of words into sequences of numbers. Once we have numbers, we are back in the well-known game of data analytics, where machine learning algorithms can help us with classifying and clustering.

We will focus here exactly on that part of the analysis that transforms words…

Continue

Added by Rosaria Silipo on February 11, 2019 at 3:09pm — No Comments

Natural Language Understanding (NLU) in Fraud Risk Management – a case study

I.  Introduction

This is a continuation of my previous blog, “Natural Language Understanding – Application Notes with Context Discriminant”. 

Background:

Natural Language Understanding (NLU) is a subtopic of Natural Language Processing (NLP). Successful implementations of NLU are difficult because of limitations in prevailing technology. SiteFocus solved these limitations with a new approach to NLU. This approach has been successfully…

Continue

Added by Sing Koo on April 10, 2018 at 1:30pm — 1 Comment

Ux + AI = Cognitive Ergonomics

Summary:  The addition of AI capabilities to our personal devices, applications, and even self-driving cars has caused us to take a much deeper look at what we call ‘User Experience’ (Ux).  A more analytical framework identified as Cognitive Ergonomics is becoming an important field for data scientists to understand and implement.

 

I…

Continue

Added by William Vorhies on October 31, 2017 at 9:51am — No Comments

Natural Language Understanding (NLU) in Enterprise – Digesting IPO Prospectus

Introduction

Business ventures based on existing or disruptive business models taking on the route of Initial Public Offering are always a challenge to investors who want to profit from early investment into those would be “unicorn IPO”. A good investment may get worse before it gets better. Others may get worse and never recover. Aside from the macroeconomics and consumer trends that could affect the outcome of such investments, the fundamentals of these new public offerings…

Continue

Added by Sing Koo on October 31, 2017 at 2:30am — No Comments

Natural Language Understanding – Application Notes with Context Discriminant

Introduction

Deep Learning can be used to automate just about every repetitive task that is currently or formerly performed by humans. Factory robots, autonomous cars, Internet of Things are example of these automations. Yet, mentally challenging tasks such as conducting research or strategic planning with natural language textual documents remain a daunting task for automation. We look into the root cause of this challenge and have implemented a solution to automate these…

Continue

Added by Sing Koo on October 6, 2017 at 1:00pm — No Comments

Book: Text Analytics with Python

Text Analytics with Python -- A Practical Real-World Approach to Gaining Actionable Insights from your Data

Text analytics can be a bit overwhelming and frustrating at times with the unstructured and noisy nature of textual data and the vast amount of information available. "Text Analytics with Python" published by Apress\Springer, is a book packed with 385 pages of useful information based on techniques, algorithms,…

Continue

Added by Dipanjan Sarkar on July 14, 2017 at 4:00am — No Comments

Who Made the News? Text Analysis using R, in 7 steps

This post covers the following tasks using R programming:

  • cleans the texts,
  • sorts and aggregates by publisher names
  • creates word clouds and word…
Continue

Added by Ann Rajaram on November 26, 2016 at 3:30am — 5 Comments

Context Matters When Text Mining

Context Matters When Text Mining

Many times the most followed approach can result in failure.  The reason has more to do with thinking that one approach works in all cases.  This is specially true in text mining.  For instance, a common approach in clustering documents is to create tf-idf matrix for all documents, use SVD or other dimension reduction algorithm and then use a clustering.  In most cases, this will work; However, as I will present here,  there are instances…
Continue

Added by Dalila Benachenhou on October 27, 2016 at 5:30pm — 2 Comments

Applications of Deep Learning

This post highlights a number of important applications found for deep learning so far. It is well known that 80% of data is unstructured. Unstructured data is the messy stuff every quantitative analyst tries to traditionally stay away from. It can include images of accidents, text notes of loss adjusters, social media comments, claim documents and review of medical doctors etc. Unstructured data has massive potential but has never been traditionally considered as a source of insight before.…

Continue

Added by Syed Danish Ali on June 26, 2016 at 5:00am — No Comments

Big Data, Meet Dynamic Semantic Publishing: Altering the Media & Publishing Landscape

How many times a day do we ourselves, or hear someone else, utter the phrase “Google it”? It’s hard to imagine that a phrase so ubiquitous and universally understood has been around for less than two decades. The word “Google” has become synonymous with online search, and when we think about why this, it’s because Google yields the most relevant, comprehensive results, quickly. Essentially, it has changed the way we find and interact with content and information.

We’ve seen the…

Continue

Added by Tony Agresta on April 28, 2015 at 3:29am — No Comments

The Hype Around Graph Databases And Why It Matters

Organizations are struggling with a fundamental challenge – there’s far more data than they can handle.  Sure, there’s a shared vision to analyze structured and unstructured data in support of better decision making but is this a reality for most companies?  The big data tidal wave is transforming the database management industry, employee skill sets, and business strategy as organizations race to unlock meaningful connections between disparate sources of…

Continue

Added by Tony Agresta on April 7, 2015 at 6:45am — 4 Comments

Types and Uses of Predictive Analytics, What they are and Where You Can Put Them to Work

Summary: Gartner says that predictive analytics is a mature technology yet only one company in eight is currently utilizing this ability to predict the future of sales, finance, production, and virtually every other area of the…

Continue

Added by William Vorhies on August 13, 2014 at 10:54am — No Comments

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service