Subscribe to DSC Newsletter

August 2017 Blog Posts (125)

Predicting the next Eclipse

Here we describe a rudimentary method, using basic trigonometry, to predict lunar and solar eclipses. The purpose is to get people interested in the mathematics behind these events. We assume here that the Sun, the Earth and the Moon are just points. Thus we do not predict where on Earth eclipses take place, nor whether they are full of partial. But we derive some interesting results, such as

  • Solar eclipses are as frequent as lunar eclipses
  • On average, a solar eclipse…

Added by Vincent Granville on August 21, 2017 at 11:00pm — No Comments

Quick Guide to R and Statistical Programming

Guest blog by Rob Kabacoff. Rob is Professor of Quantitative Analytics at Wesleyan University.

R is an elegant and comprehensive statistical and graphical programming language. Unfortunately, it can also have a steep learning curve. I created this website for both current R users, and experienced users of other statistical packages (e.g., SASSPSSStata) who…


Added by L.V. on August 21, 2017 at 10:00am — No Comments

Evolution of Machine Learning - Infographics

Interesting infographics produced by PwC. To view the original article, download the infographics in PDF format, and read the comments, click here

DSC Resources


Added by L.V. on August 21, 2017 at 10:00am — No Comments

New Book: Data Science: Mindset, Methodologies, and Misconceptions

From the author of the bestsellers, Data Scientist and Julia for Data Science, this book covers four foundational areas of data science. The first area is the data science pipeline including methodologies and the data scientist's toolbox. The second are essential practices needed in understanding the data including questions and hypotheses. The third are pitfalls to avoid in the data science process. The fourth is an awareness of future trends…


Added by L.V. on August 21, 2017 at 10:00am — No Comments

Book: R for Data Science

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible.…


Added by L.V. on August 21, 2017 at 10:00am — No Comments

How Big Is A Terabyte of Data

How Big Is a Terabyte of Data?

By JINAG Buxing

It seems that one mile distance isn’t long, and that a cubic mile isn’t that big if compared with the size of the earth. You may be surprised if I tell you the entire world’s population could all fit in a cubic mile of space. Hendrik Willem van Loon, a Dutch-American writer, once wrote the similar thing in one of his books.

Teradata is a famous provider of database…


Added by JIANG Buxing on August 20, 2017 at 9:30pm — No Comments

R Linear Regression

Basics of Linear Regression

Regression analysis is a statistical tool to determine relationships between different types of variables. Variables that remain unaffected by changes made in other variables are known as independent variables, also known as a predictor or explanatory variables while those that are affected are known as dependent variables also known as the response variable.

Linear regression is a statistical procedure…


Added by Shreya Gupta on August 20, 2017 at 9:00pm — No Comments

Digital Camera Market Trends, Key Vendors Analysis, Import & Export, Revenue by Forecast to 2022

Market Highlights:

Digital camera produce digital image records cinematic image in digital appearance. Dissimilar to film photographs which relatively have enormous resolution, digital camera accumulate images digitally in a memory chip which is an alternative to recording them on the film. Once the photograph is taken it can be initialized to a computer system displayed on the screen and printed. Digital cameras shares optical structure, commonly using lens with a fluctuating…


Added by kiran Sonawane on August 20, 2017 at 8:30pm — No Comments

Robust Attacks on Machine Learning Models

This is a nightmare! Tadayoshi Kohno, Professor at Department of Computer Science and Engineering, University of Washington, manipulated a STOP sign in a typical "graffiti way", that it was recognized as 45 mph SPEED limit by typical AI software, such as built into a Tesla S. It's very likely, that it will become a sport to send Tesla drivers to hell. …


Added by Vincent Granville on August 20, 2017 at 2:00pm — No Comments

Why Deep Learning is Taking off? Season 1 : Part 1

First it was Machine Learning, and now all of a sudden Deep Learning is taking all the thunder even from Machine Learning. So what's the difference, and why all of a sudden Deep Learning has become the most buzzing new Technology of our Era? Is Deep Learning a false idol being…

Added by Ammar A. Raja on August 20, 2017 at 10:00am — 1 Comment

Are you drowning in Data Lake?


Added by Sandeep Raut on August 19, 2017 at 8:00am — No Comments

Data Science Simplified Part 8: Qualitative Variables in Regression Models

The last few blog posts of this series discussed regression models. Fernando has selected the best model. He has built a multivariate regression model. The model takes the following shape:

price = -55089.98 + 87.34 engineSize + 60.93…


Added by Pradeep Menon on August 19, 2017 at 6:30am — No Comments

Top Graphical Models Applications in Real World

1. Objective

Now we are going to explain the various Graphical Models Applications in real life such as – Manufacturing, finance, Steel Production, Handwriting Recognition etc. At last, we will discuss the case study about the use of Graphical Models in the Volkswagen.…


Added by Shreya Gupta on August 18, 2017 at 7:00pm — No Comments

Weekly Digest, August 21

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.


  • Looking to further your experience and knowledge in business analytics, and set yourself apart from colleagues and competition? Michigan State University – ranked in the top 25 U.S.…

Added by Vincent Granville on August 18, 2017 at 5:30pm — No Comments

Machine Learning Vs. Statistics

This article was written by Aatash Shah.

Many people have this doubt, what’s the difference between statistics and machine learning? Is there something like machine learning vs. statistics?

From a traditional data…


Added by Amelia Matteson on August 18, 2017 at 11:30am — 1 Comment

Data Cleansing with Apache Spark and Optimus

Outdated, inaccurate, or duplicated data won’t drive optimal data driven solutions. When data is inaccurate, leads are harder to track and nurture, and insights may be flawed. The data on which you base your big data strategy must be accurate, up-to-date, as complete as possible, and should…


Added by Favio Vázquez on August 18, 2017 at 8:00am — No Comments

Data Analytics and your Small Business

Acquiring, converting, engaging, and retaining customers is the most important challenge facing businesses of any size. But the proliferation of data, which has been made possible by significant advances in technology, have allowed businesses to capture information on their customers and helped to level the playing field between businesses big and small. Analyzing this data can give you visibility into every facet of your business, helping to inform data-driven and actionable…


Added by Gavin Graham on August 18, 2017 at 5:00am — No Comments

Top 25 Hadoop Interview Questions Prepared by Experts

1) Compare Hadoop & Spark


Criteria                                           Hadoop                                                   Spark

Dedicated storage                           HDFS                                                     None

Speed of processing                        average                                              …


Added by Venkatesan M on August 18, 2017 at 12:00am — 2 Comments

Real-Life Applications of Support Vector Machines

Applications of SVM in Real World 

SVMs depends on supervised learning algorithms. The aim of using SVM is to correctly classify unseen data. SVMs have a number of applications in several fields.

Some common applications of SVM are-

  • Face detection – SVMc classify parts of the image as a face and non-face and create a square boundary around the face.
  • Text and hypertext…

Added by Sheetal Sharma on August 17, 2017 at 11:30pm — No Comments

Contingency Tables in R

1. Objective

This R tutorial is all about Contingency tables in R. First of all, we will discuss the introduction to R Contingency tables, different ways to create Contingency tables in R. This tutorial also covers the Complex Tables in R / Flat Tables in R, Cross Tabulation in R, Recreating original data from contingency tables in R, and everything related to R contingency tables.…


Added by Shreya Gupta on August 17, 2017 at 7:00pm — No Comments

Blog Topics by Tags

Monthly Archives











© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service