Subscribe to DSC Newsletter

June 2017 Blog Posts (94)

Some NLP: Probabilistic Context Free Grammar (PCFG) and CKY Parsing in Python

This problem appeared as an assignment in the coursera course Natural Language Processing (by Stanford) in 2012. The following description of the problem is taken directly from the assignment description.

In this article, a probabilistic parser will be built by implementing the CKY parser. The Manually Annotated Sub-Corpus (MASC) from the American National Corpus (ANC): …

Continue

Added by Sandipan Dey on June 5, 2017 at 11:00pm — No Comments

A Guide for Applying Machine Learning Techniques in Finance

Does it sound familiar to you? In order to get an idea of how to choose a parameter for a given classifier, you have to cross reference to a number of papers or books, which often turn out to present competing arguments for or against a certain parameterization choice but with few applications to real-world problems.

For example, you may find a few papers discussing optimal selection of K in…

Continue

Added by Zhongmin Luo on June 5, 2017 at 7:30pm — 6 Comments

Artificial Intelligence 2017: Top 100 Influencers, Brands and Publications

This article was written by Thomas Legoff. 

Artificial intelligence – or AI – is a true part of our world, as well as a substantial hub of interest for science and business. Companies are ferociously investing in, engaging in and including artificial intelligence in their operations. It is a fascinating technology that enables new options for companies, from detecting security intrusions…

Continue

Added by Emmanuelle Rieuf on June 5, 2017 at 8:30am — No Comments

Image Segmentation using deconvolution layer in Tensorflow

This article was written by Koustuch on CV-Tricks. 
In this series of post, we shall learn the algorithm for image segmentation and implementation of the same using Tensorflow. This is the first part of the series where we shall focus on understanding and be implementing a deconvolutional/fractional-strided-convolutional layer in…
Continue

Added by Emmanuelle Rieuf on June 5, 2017 at 7:30am — 1 Comment

If Data is as Valuable as Gold, It’s Time to Polish Your Data Architecture

It speaks volumes of the world we live in today when headlines such as “The world’s most valuable resource is no longer oil, but data” and “Why Data May Be More Valuable Than Dollars” are commonplace. With the explosion of IoT and with that 2.5 quintillion bytes of data being created per day, the underlying power of this data comes as no surprise.

Unlike gold however, data is ubiquitous and being created at an exponential rate. So where’s the value in something that is everywhere?…

Continue

Added by Amy Flippant on June 5, 2017 at 12:30am — No Comments

Cybersecurity in Digital Age

You must have heard about the global cyberattack of WannaCry ransomware in over 200 countries. It encrypted all the files on the machine and asked for payment. Ransomware, which demands payment after launching a cyber-attack, has become a rising trend among hackers looking for a quick payout. Every day it seems another news breaks about cyber-criminals hacking in and…
Continue

Added by Sandeep Raut on June 4, 2017 at 7:30pm — No Comments

Weekly Digest, June 5

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Announcements

  • Samsung is looking for team members who are passionate about data! Our Data Science team is made up of diverse individuals who fearlessly explore the uncharted. You will have access to a…
Continue

Added by Vincent Granville on June 3, 2017 at 9:00am — No Comments

Introduction to R: The Statistical Programming Language

Introduction to R Programming Language

R is an intense dialect utilized broadly for information investigation and measurable registering. It was created in the mid 90s. It is a standout amongst the most well known dialects utilized by analysts, information experts, scientists, and advertisers to recover, clean, dissect,…

Continue

Added by Johny Basha on June 2, 2017 at 8:30pm — No Comments

Choice of K in K-fold Cross Validation for Classification in Financial Market

Cross Validation is often used as a tool for model selection across classifiers. As discussed in detail in the following paper https://ssrn.com/abstract=2967184, Cross Validation is typically performed in the following steps:

  • Step 1: Divide the original sample into K sub samples; each subsample typically has equal sample size and is referred to as one fold, altogether,…
Continue

Added by Zhongmin Luo on June 2, 2017 at 7:00pm — No Comments

Artificial Intelligence in Enterprise – Using Meta-Vision to Disrupt Business Models for Products and Service

Introduction

Enterprise business models evolve over time for many reasons. The Internet has been a key factor driving enterprise business model change in recent years. The recent popularity of smartphones has disrupted consumer habits in travel, investment, entertainment, communication, social engagement, dining, shopping and many daily activities. Consequently, enterprises have been forced to change their business models. While some changes are progressive, others are…

Continue

Added by Sing Koo on June 1, 2017 at 4:30pm — No Comments

Select Important Variables using Boruta Algorithm

This article explains how to select important variables using boruta package in R. Variable Selection is an important step in a predictive modeling project. It is also called 'Feature Selection'. Every private and public agency has started tracking data and collecting information of various attributes. It results to access to too many predictors for a predictive model. But not every variable is important for prediction of a particular task. Hence it is essential to…

Continue

Added by Deepanshu Bhalla on June 1, 2017 at 9:00am — 1 Comment

Thursday News: R, Python, Feature Selection, Deep Learning, Quantum Computing

Here is our selection of featured articles and resources published since Monday.

Continue

Added by Vincent Granville on June 1, 2017 at 8:59am — No Comments

The 6 Biggest Internet Problems We Need to Solve

Over the course of the last two decades, the internet has become nearly ubiquitous. From the ages of rare dial-up connections, our relatively reliable 4G network and endless options for free Wi-Fi at public establishments is its own kind of digital paradise. However, the internet isn’t perfect, and if we want to build a world where our entire population has fast,…

Continue

Added by Larry Alton on June 1, 2017 at 7:00am — No Comments

Random Forests explained intuitively

Random Forests algorithm has always fascinated me. I like how this algorithm can be easily explained to anyone without much hassle. One quick example, I use very frequently to explain the working of random forests is the way a company has multiple rounds of interview to hire a candidate. Let me elaborate. 



Say, you appeared for the position of Statistical analyst at WalmartLabs. Now like most of the companies, you don't just have one round of interview. You…

Continue

Added by Manish Kumar Barnwal on June 1, 2017 at 12:30am — 4 Comments

Blog Topics by Tags

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service