All Blog Posts Tagged 'analysis' (100)

Causality – The Next Most Important Thing in AI/ML

Summary:  Finally there are tools that let us transcend ‘correlation is not causation’ and identify true causal factors and their relative strengths in our models.  This is what prescriptive analytics was meant to be.

 

Just when I thought we’d figured it all out, something comes along to make…

Added by William Vorhies on April 22, 2019 at 8:47am — 4 Comments

Why BigQuery is The Next Big Thing With Example

BigQuery is Google’s serverless, highly scalable, enterprise data warehouse designed to make all your data analysts productive at an unmatched price-performance. Because there is no infrastructure to manage, you can focus on analyzing data to find meaningful insights using familiar SQL without the need for a database administrator.

Analyze all your data by…

Added by satyajit maitra on March 22, 2019 at 3:49am — No Comments

A guide for using the Wavelet Transform in Machine Learning

In a previous blog-post we have seen how we can use Signal Processing techniques for the classification of time-series and signals.

A very short summary of that post is: We can use the Fourier Transform to  transform a signal from its time-domain to its frequency domain. The peaks in the frequency spectrum indicate the most…

Added by Ahmet Taspinar on December 20, 2018 at 9:30pm — No Comments
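
As a rough illustration of the Fourier-transform summary in the wavelet post above, the sketch below builds a synthetic signal and reads its dominant frequencies off the peaks of the magnitude spectrum. It is not taken from the post itself: the helper names `dft_magnitudes` and `dominant_frequencies`, the 64 Hz sample rate, and the 5 Hz and 12 Hz test components are all assumptions chosen for the example, and the naive DFT stands in for the FFT routine one would normally use.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Naive O(n^2) discrete Fourier transform.

    Returns normalized magnitudes for frequency bins 0..n//2.
    """
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        total = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(samples)
        )
        magnitudes.append(abs(total) / n)
    return magnitudes


def dominant_frequencies(samples, sample_rate, count=2):
    """Return the `count` strongest non-DC frequencies in Hz, ascending."""
    magnitudes = dft_magnitudes(samples)
    n = len(samples)
    # Skip bin 0 (the constant offset) and rank the remaining bins by magnitude.
    ranked = sorted(range(1, len(magnitudes)),
                    key=lambda k: magnitudes[k], reverse=True)
    return sorted(k * sample_rate / n for k in ranked[:count])


# Hypothetical test signal: one second sampled at 64 Hz,
# containing a 5 Hz component and a weaker 12 Hz component.
SAMPLE_RATE = 64
signal = [
    math.sin(2 * math.pi * 5 * t / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 12 * t / SAMPLE_RATE)
    for t in range(SAMPLE_RATE)
]

peaks = dominant_frequencies(signal, SAMPLE_RATE)
print(peaks)
```

The two strongest bins correspond to the two sine components that were mixed into the signal, which is the sense in which peaks in the spectrum point at the frequencies present.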

Relationships, Geometry, and Artificial Intelligence

By Gunnar Carlsson

December 3, 2018

In their very provocative paper, Peter Battaglia and his colleagues posit that in order for artificial intelligence (AI) to achieve the capabilities of human intelligence, it must be…

Added by Jonathan Symonds on December 4, 2018 at 3:00pm — No Comments

Python Multi-Threading vs Multi-Processing

There is a library called threading in Python and it uses threads (rather than just processes) to implement parallelism. This may be surprising news if you know about Python's Global Interpreter Lock, or GIL, but it actually works well for certain instances without violating the GIL. And this is all done without any overhead -- simply define…

Added by Michael Li on November 29, 2018 at 6:30am — 1 Comment
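
To make the threading teaser above a little more concrete, here is a minimal sketch of the kind of case where threads help despite the GIL: tasks that spend their time waiting rather than computing. The function names `fake_io_task` and `run_in_threads` are hypothetical, and `time.sleep` is only a stand-in for a blocking I/O call such as a network request; the post's own example may look different.

```python
import threading
import time


def fake_io_task(task_id, results, delay=0.2):
    """Stand-in for an I/O-bound call.

    time.sleep releases the GIL while waiting, so other threads can run.
    """
    time.sleep(delay)
    results[task_id] = task_id * task_id


def run_in_threads(task_count):
    """Start one thread per task, wait for all of them, and time the batch."""
    results = {}
    threads = [
        threading.Thread(target=fake_io_task, args=(i, results))
        for i in range(task_count)
    ]
    start = time.perf_counter()
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    elapsed = time.perf_counter() - start
    return results, elapsed


results, elapsed = run_in_threads(5)
print(results, round(elapsed, 2))
```

Five 0.2-second waits run one after another would take about a second; run in threads they overlap, so the batch finishes in roughly the time of a single wait. CPU-bound work would not see this benefit, which is where multiprocessing comes in.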

Who are the Datascience Professionals? India vs US | Men vs Women

Introduction

This is an analysis of the Kaggle 2018 survey dataset. In my analysis I am trying to understand the similarities and differences between men and women users from the US and India, since these are the two biggest segments of the respondent population. The number of respondents who chose something other than Male/Female is quite low, so I excluded that subset as well.

The complete code is available as a …

Added by Ann Rajaram on November 6, 2018 at 2:39am — 1 Comment

Sentiment Analysis: Types, Tools, and Use Cases

What do you do before purchasing something that costs more than a pack of gum? Whether you want to treat yourself to new sneakers, a laptop, or an overseas tour, processing an order without checking out similar products or offers and reading reviews doesn’t make much sense anymore. Thanks to comment sections on eCommerce sites, social nets, review platforms, or dedicated forums, you can learn a ton about a product or service and evaluate whether it’s a good value for money. Other customers,…

Added by Kateryna Lytvynova on October 30, 2018 at 12:45am — No Comments

Ad hoc analysis: BI's fast food problem

(This article originally appeared on the…

Added by Matthew Gierc on October 4, 2018 at 7:00am — No Comments

Popularity of software programs for data science using recent reviews

In this article we discuss the popularity of various software programs used for data analysis that are mentioned in reviews published online between 2017 and 2018. We used 14 reviews listed in the article Popularity of software programs for data…

Added by jwork.ORG on September 6, 2018 at 6:00pm — No Comments

How the incorporation of prior information can accelerate the speed at which neural networks learn while simultaneously increasing accuracy

Deep neural nets typically operate on “raw data” of some kind, such as images, text, time series, etc., without the benefit of “derived” features. The idea is that because of their flexibility, neural networks can learn the features relevant to the problem at hand, be it a classification problem or an estimation problem.  Whether derived or learned, features are important. The challenge is in determining how one might use what one learned from the features in future work (staying…

Added by Jonathan Symonds on August 30, 2018 at 7:00am — No Comments

Going Deeper: More Insight Into How and What Convolutional Neural Networks Learn

In my earlier post I discussed how performing topological data analysis on the weights learned by convolutional neural nets (CNNs) can give insight into what is being learned and how it is being learned.

The significance of this work can be summarized as follows:

  1. It…

Added by Jonathan Symonds on August 9, 2018 at 11:30am — No Comments

5 major sensor data analytics challenges: deadly or curable?

A smoothly running sensor data analytics tool may be just as difficult to manage as a symphony orchestra. Because every musician in an orchestra – and every part of an IoT system – needs to work properly and ‘harmonize’ with the others. But how do conductors make their orchestras work so nicely and sound so heavenly instead of creating a mismanaged cacophony? Obviously, there’s a lot of practice involved. But besides that, they definitely know what pitfalls they need to avoid. Which is why,…

Added by imranali on July 7, 2018 at 4:30am — No Comments

Market Alignment - An Application of Systems Theory for Organizations

The main components of systems theory that readers might remember are “inputs,” “processes,” and “outputs.”  The part that tends to get neglected is “feedback mechanisms.”  These mechanisms tell the system the extent to which operations fit the environment.  If there is lack of fitness, there is stress.  One adaptive impulse is to make processes more complex and intelligent - i.e. sometimes described as the fight response.  Another impulse is to give up and run away - i.e. the flight…

Added by Don Philip Faithful on June 23, 2018 at 9:00am — 1 Comment

Using Topological Data Analysis to Understand the Behavior of Convolutional Neural Networks

TLDR: Neural Networks are powerful but complex and opaque tools. Using Topological Data Analysis, we can describe the functioning and learning of a convolutional neural network in a compact and understandable way. The implications of the finding are profound and can accelerate the development of a wide range of applications from self-driving everything to GDPR.

Introduction

Neural networks have demonstrated a great…

Added by Jonathan Symonds on June 21, 2018 at 9:30am — No Comments

A guide to manipulating, analyzing, and visualizing data in R

R has spread deep into the private sector and can be found in the production pipelines at some of the most advanced and successful enterprises. 

Learn the fundamentals of data analysis in the second edition of Data Analysis with R, authored by data scientist…

Added by Packt Publishing on May 8, 2018 at 10:30pm — No Comments

Reconciling Opposing Performance Metrics Using Operational Simulations

Sometimes when dealing with performance metrics, there are contradictory signals.  For instance, although both are desirable, it is common for efficiency and efficacy to be in opposition.  An agent in a call centre can handle lots of calls while at the same time getting few sales; this is especially true if the agent’s main objective is to do lots of calls.  This is a highly efficient person albeit unsuccessful in terms of expanding the business.  Conversely, another agent by spending a…

Added by Don Philip Faithful on May 6, 2018 at 3:30am — No Comments

The Real Facebook Controversy

Cambridge Analytica’s wholesale scraping of Facebook user data is big news now, and people are “shocked” that personal data is being shared and traded on a massive scale on the internet. But the real issue with social media is not harm to individual users whose information was shared, but sophisticated and sometimes subtle mass manipulation of social and political behavior by bad actors, facilitated by deceit, fraud, and amplification of lies that spread easily through societal…

Added by Peter Bruce on April 18, 2018 at 9:00am — 1 Comment

Technical Boundary Analysis

About a month ago, I posted a blog on “Technical Deconstruction.” I described this as a technique to break down aggregate data to distinguish between its contributing parts: these parts might contain unique characteristics compared to the aggregate.  For instance, I suggested that it can be helpful to break down data by workday - that is to say, maintaining separate data for each day of the week.  I said that the data could be further deconstructed perhaps by time period and employee: the…

Added by Don Philip Faithful on April 14, 2018 at 8:00am — No Comments
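
The workday breakdown described in the teaser above could be sketched roughly as follows. Everything here is illustrative: the `mean_by_weekday` helper, the dates, and the values are made up for the example and do not come from the post.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

WEEKDAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
                 "Friday", "Saturday", "Sunday"]


def mean_by_weekday(observations):
    """Group (date, value) pairs by day of the week and average each group."""
    groups = defaultdict(list)
    for day, value in observations:
        groups[WEEKDAY_NAMES[day.weekday()]].append(value)
    return {name: mean(values) for name, values in groups.items()}


# Hypothetical daily figures (for instance, calls handled) over two weeks.
observations = [
    (date(2018, 4, 9), 120),   # Monday
    (date(2018, 4, 10), 80),   # Tuesday
    (date(2018, 4, 16), 130),  # Monday
    (date(2018, 4, 17), 70),   # Tuesday
]

overall_mean = mean(value for _, value in observations)
weekday_means = mean_by_weekday(observations)
print(overall_mean, weekday_means)
```

In this made-up data the aggregate mean sits between two quite different weekday means, which is the kind of characteristic that only shows up once the aggregate is broken into its parts.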

Technical Deconstruction

The term “technical analysis” usually refers to the study of stock prices.  A technical analyst might use real-time or closing prices of stocks to predict future prices.  This is an interesting concept because of what is normally excluded from the analysis - namely, everything except prices.  Given that the approach doesn’t necessarily consider the health or profitability of the underlying companies, a purely technical approach seems to offer guidance that is disconnected from reality.  Yet…

Added by Don Philip Faithful on March 17, 2018 at 3:00am — No Comments

When Aggregates Fail

In general, any expression of performance that applies to a department can, if the data system is configured properly, be stated in relation to individual workers.  For instance, if # of sales contracts / # of customer enquiries = success rate, the success rate can be given for the entire dealership and also for each sales agent in that dealership.  Due to the differences in performance between agents, it can be problematic to only make use of the aggregate.  Some agents might be blamed for…

Added by Don Philip Faithful on February 25, 2018 at 7:30am — No Comments
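
The success-rate formula quoted in the teaser above (# of sales contracts / # of customer enquiries) can be computed both for a whole dealership and for each agent. The sketch below uses invented agents and counts purely for illustration; the names `success_rate` and `records` are assumptions, not anything from the post.

```python
def success_rate(contracts, enquiries):
    """Success rate = number of sales contracts / number of customer enquiries."""
    return contracts / enquiries if enquiries else 0.0


# Hypothetical (contracts, enquiries) counts per sales agent.
records = {
    "agent_a": (30, 100),
    "agent_b": (5, 100),
    "agent_c": (25, 100),
}

# Rate for each individual agent.
per_agent = {
    name: success_rate(contracts, enquiries)
    for name, (contracts, enquiries) in records.items()
}

# Rate for the dealership as a whole, from the pooled counts.
total_contracts = sum(contracts for contracts, _ in records.values())
total_enquiries = sum(enquiries for _, enquiries in records.values())
dealership_rate = success_rate(total_contracts, total_enquiries)

print(dealership_rate, per_agent)
```

With these numbers the dealership-wide rate hides a sixfold spread between the strongest and weakest agent, which is why relying on the aggregate alone can be problematic.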

© 2019 Data Science Central ®