Subscribe to DSC Newsletter

All Blog Posts Tagged 'r' (15)

Visualizing New York City WiFi Access with K-Means Clustering

Visualization has become a key application of data science in the telecommunications industry.

Specifically, telecommunication analysis is highly dependent on the use of geospatial data. This is because telecommunication networks in themselves are geographically dispersed, and analysis of such dispersions can yield valuable insights regarding network structures, consumer demand and availability.

Data



To illustrate this point, a k-means clustering algorithm is used…

Continue

Added by Michael Grogan on February 19, 2019 at 3:44am — No Comments

Data Science Glossary

There has been much hype surrounding deep learning and data science learning in recent times, and one of the cornerstones of deep learning is the neural network. In this article, we will look at what a neural network is and get familiar with the relevant terminologies.

In simplest terms, a neural network is an interconnection of neurons. Now the question arises, what is a neuron? To understand neurons in deep learning, we first…

Continue

Added by Divya Singh on September 20, 2018 at 4:00am — No Comments

Creating functions in R

Functions are used to simplify a series of calculations.

For instance, let us suppose that there exists an array of numbers which we wish to add to another variable. Instead of carrying out separate calculations for each number in the array, it would be much easier to simply create a function that does this for us automatically.

A function in R generally works by:

(a) Defining the variables to include in the function and the calculation. e.g. to add two…

Continue

Added by Michael Grogan on August 12, 2017 at 5:30am — No Comments

How to write good tests in R

Testing is an often overlooked yet critical component of any software system. In some ways this is more true of models than traditional software. The reason is that computational systems must function correctly at both the system level and the model level. This article provides some guidelines and tips to increase the certainty around the correctness of your models.



Guiding Principles

One of my mantras is that a good tool extends our ability and never gets in…

Continue

Added by Brian Rowe on July 7, 2016 at 4:30am — No Comments

Export Regression results from R to MS Word

In this post I will present a simple way how to export your regression results (or output) from R into Microsoft Word. Previously, I have written a tutorial how to create Table 1 with study characteristics and to export into Microsoft Word. These posts are especially useful for researchers who prepare their manuscript for publication in peer-reviewed journals.

Get the results…

Continue

Added by Klodian on June 9, 2016 at 11:43am — No Comments

Table 1 and the Characteristics of Study Population (rstats)

In research, especially in medical research, we describe characteristics of our study populations through Table 1. The Table 1 contain information about the mean for continue/scale variable, and proportion for categorical variable. For example: we say that the mean of systolic blood pressure in our study population is 145 mmHg, or 30% of participants are smokers. Since is called Table 1, means that is the first table in the manuscript.

To create the Table 1…

Continue

Added by Klodian on May 29, 2016 at 6:46am — No Comments

Identify, describe, plot, and remove the outliers from the dataset with R (rstats)

In statistics, a outlier is defined as a observation which stands far away from the most of other observations. Often a outlier is present due to the measurements error. Therefore, one of the most important task in data analysis is to identify and (if is necessary) to remove the outliers.

There are different methods to detect the outliers, including standard deviation approach and Tukey’s method which use interquartile (IQR) range approach. In this post I will use…

Continue

Added by Klodian on May 24, 2016 at 11:07pm — No Comments

Curated Lists of Data Science, Machine Learning, Deep Learning and NLP resources

Here are three useful resources for learning about Data Science:

Continue

Added by Ujjwal Karn on May 18, 2016 at 8:59am — No Comments

xda: R package for exploratory data analysis (plotting, univariate, bivariate)

I created an R package for exploratory data analysis. You can read about it and install it here.  

The package contains several tools to perform initial exploratory analysis on any input dataset. It includes custom functions for plotting the data as well as performing different kinds of analyses such as univariate, bivariate and multivariate investigation which is the first step of any…

Continue

Added by Ujjwal Karn on May 18, 2016 at 8:30am — No Comments

5 Reasons you should take part in Hackathons

Data is growing, and hence is the requirement for perfect decision making by companies. In this era where analytics is the new buzzword, companies are looking for able and determined Data Scientists.

But as always, the quest to get the sexiest and the highest paying job is not an easy task. For acquiring such a demanding position one must be well skilled and experienced.

Don’t freak out! Because now a days a lot of resources and tips are available to be perfect in what you are…

Continue

Added by Sukanya Mohapatra on April 15, 2016 at 4:30am — No Comments

The Data-Driven Weekly #1.7

0116-toptech16-cyborg-f1-620px-1450292659286

Photo: Nathaniel Welch

It turns out I’m not the only one who thinks AI alarmism is a bit out of hand. The ITIF Luddite Award nominations include “alarmists, even including respected luminaries such as Elon Musk and Stephen Hawking, touting an artificial intelligence…

Continue

Added by Brian Rowe on December 23, 2015 at 8:25am — No Comments

The Data-Driven Weekly #1.6

Right on cue, this past week heralded in an announcement of OpenAI, a new non-profit started by a number of tech luminaries to spearhead AI research that is publicly accessible. The motivation is that apparently these scions of capitalism lose faith in Adam Smith’s invisible hand…

Continue

Added by Brian Rowe on December 16, 2015 at 7:00am — No Comments

Data Science with Python & R: Dimensionality Reduction and Clustering

This tutorial was authored by Jose A Dianes, PhD.


Introduction

An important step in data analysis is data exploration and representation. In this tutorial we will see how by combining a technique called Principal Component Analysis (PCA) together with Cluster…

Continue

Added by Codementor on September 1, 2015 at 4:32pm — No Comments

Super Bowl Analytics: A Closer Look at #WhosGonnaWin

Excerpt reprinted with permission from ckmadvisors.com

This weekend sees Super Bowl XLVIII come to New York (yes, we're well aware that the stadium is technically in New Jersey). Earlier this week one of our data scientists noticed the Empire State Building lights putting on quite a show. A quick search revealed that the iconic building's…

Continue

Added by Nicholas Hartman on February 2, 2014 at 1:29pm — No Comments

Twitter Weather Radar - Test Data for Language Analytics

By: Nicholas Hartman, Director at CKM Advisors

Today we'd like to share with you some fun charts that have come out of our internal linguistics research efforts. Specifically, studying weather events by analyzing social media traffic from Twitter. 

We do not specialize in social media and most of our data analytics work focuses on the…

Continue

Added by Nicholas Hartman on December 15, 2013 at 7:24pm — No Comments

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service