Subscribe to DSC Newsletter

Capri Granville's Blog (75)

Tutorial: Statistical Tests of Hypothesis

This article is a solid introduction to statistical testing, for beginners, as well as a reference for practitioners. It includes numerous examples as well as illustrations and definitions for concepts such as rejecting the null hypothesis, one sample hypothesis testing, P-values, critical values, and Bayesian hypothesis testing. It has references to additional topics, such as 

  • What is Ad Hoc Testing?
  • What is a Rejection Region?
  • What is a Two Tailed…

Added by Capri Granville on February 19, 2019 at 9:30am — No Comments

Infographic: The Typical Data Scientist Profile in 2019

This infographic was produced by 365DataScience. Last year they completed a research on 1,001 data scientists to get a profile of the ‘typical’ data scientist in 2018. They replicated the study with new data. Below are the findings.

Here are some of our key findings:…


Added by Capri Granville on February 9, 2019 at 10:30am — 1 Comment

Probability Cheat Sheet - Harvard University

Below is an extract of a 10-page cheat sheet about probability, compiled by William Chen ( and Joe Blitzstein, with contributions from Sebastian Chiu, Yuan Jiang, Yuqi Hou, and Jessy Hwang. Material based on Joe Blitzstein’s Harvard's introductory probability course (@stat110 - ( and Blitzstein / Hwang’s Introduction to Probability textbook (…


Added by Capri Granville on February 3, 2019 at 8:00am — No Comments

3 Types of Regression in One Picture

Interesting picture comparing linear, logistic and Poisson regression, found here. For more about regression, read our other articles on this subject, here. For other ML concepts summarized in one picture,…


Added by Capri Granville on February 3, 2019 at 8:00am — No Comments

Top 10 Technology Trends of 2019

Guest blog by Igor Bobriakov.

First days after celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year.…


Added by Capri Granville on January 27, 2019 at 9:30am — No Comments

[New Book] Unblocked: How Blockchains Will Change Your Business

Imagine it's 1994 and the dawn of the internet. In many waysit is. Entrepreneurs are once again laying the rails for a new digital world. And, just like the first digital revolution, this one will again transform the way we live, work and play. The technology known as blockchain is poised to disrupt entrenched industries and shatter today's business models. With so much at stake, how do you prepare?…


Added by Capri Granville on January 23, 2019 at 9:30am — No Comments

Book: Statistics for Managers Using Microsoft Excel

For one- or two-semester business statistics courses. Not a new book, but a popular one (8th edition.)

This text is the gold standard for learning how to use Excel in business statistics, helping students gain the understanding they need to be successful in their careers. The authors present statistics in the context of specific business fields; full chapters on business analytics further prepare students for success in their professions. Current data throughout the text…


Added by Capri Granville on January 11, 2019 at 10:30am — No Comments

New Book: Machine Learning with R

Independently published (November 20, 2018). 78 pages.

This book intends to provide an overview of Machine Learning and its algorithms & models with help of R software. Machine learning forms the basis for Artificial Intelligence which will play a crucial role in day to day life of human beings in the near future. A basic understanding of machine learning is required, as its application is widely seen in different fields such as banks and financial sectors,…


Added by Capri Granville on November 25, 2018 at 6:00am — No Comments

The Hundred-Page Machine Learning Book

New book, in progress. By Andriy Burkov, Machine Learning Team Leader at Gartner.

The following chapters are currently available:


Chapter 1: Introduction

Part I: Supervised Learning

Chapter 2: Notation and Definitions

Chapter 3: Fundamental Algorithms

Chapter 4: Anatomy of a Learning Algorithm

Chapter 5: Basic Practice

Chapter 6: Neural Networks and Deep…


Added by Capri Granville on November 25, 2018 at 6:00am — 5 Comments

Free Book: The Definitive Guide to Pandas

Pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis…


Added by Capri Granville on September 30, 2018 at 6:30am — No Comments

Free Book: Process Improvement Using Data

This book, initially written for chemical engineers, is actually very interesting for data scientists and machine learning engineers alike. For more free books, visit this page.




Added by Capri Granville on September 22, 2018 at 9:30am — No Comments

Free Book: Introduction to Statistics

Online Statistics Education: A Multimedia Course of Study.  Project Leader: David M. Lane, Rice University.


  • Introduction
  • Graphing Distributions
  • Summarizing Distributions
  • Describing Bivariate Data
  • Probability
  • Research Design
  • Normal Distributions
  • Advanced Graphs
  • Sampling…

Added by Capri Granville on September 19, 2018 at 1:30pm — No Comments

Machine Learning Process Summarized in Two Pictures

These pictures were posted on Quora by Oleg Sergeykin, former Structural Analysis Engineer at Boeing. His philosophy is that Data science is actually an iterative processes. It is never possible to complete a DS project in a single pass. A data scientist constantly tries new ideas and changes steps of his pipeline.…


Added by Capri Granville on May 20, 2018 at 1:00pm — No Comments

15 Data Science and Machine Learning Courses from Top Schools

Many are free. They are available online. They are offered by Princeton, Georgia Tech, Harvard, Columbia, Stanford, and Penn State. 


Added by Capri Granville on May 20, 2018 at 1:00pm — 2 Comments

Free New Book by Andrew Ng: Machine Learning Yearning

This is the new book by Andrew Ng, still in progress. Andrew Yan-Tak Ng is a computer scientist and entrepreneur. He is one of the most influential minds in Artificial Intelligence and Deep Learning. Ng founded and led Google Brain and was a former VP & Chief Scientist at Baidu, building the company's Artificial Intelligence Group into several thousand people. He is an adjunct professor (formerly associate professor and Director of the AI Lab) at Stanford University. Ng is also an early…


Added by Capri Granville on May 20, 2018 at 9:00am — No Comments

Competition: Explaining black box machine learning models

The Explainable Machine Learning Challenge is a collaboration between Google, FICO and academics at Berkeley, Oxford, Imperial, UC Irvine and MIT, to generate new research in the area of algorithmic explainability. Teams will be challenged to create machine learning models with both high accuracy and explainability; they will use a real-world financial dataset provided by FICO. Designers and end users of machine learning algorithms will both benefit from more interpretable and…


Added by Capri Granville on May 19, 2018 at 11:00am — No Comments

New Book: Principles of Database Management

The Practical Guide to Storing, Managing and Analyzing Big and Small Data -- Cambridge University Press.

This comprehensive textbook teaches the fundamentals of database design, modeling, systems, data storage, and the evolving world of data warehousing, governance and more. Written by experienced educators and experts in big data, analytics, data quality, and data integration, it provides an up-to-date approach to database management. This full-color, illustrated text has a…


Added by Capri Granville on May 19, 2018 at 11:00am — 2 Comments

Book: Blockchain Basics: A Non-Technical Introduction in 25 Steps

In 25 concise steps, you will learn the basics of blockchain technology. No mathematical formulas, program code, or computer science jargon are used. No previous knowledge in computer science, mathematics, programming, or cryptography is required. Terminology is explained through pictures, analogies, and metaphors.

This book bridges the gap that exists between purely technical books about the blockchain and purely business-focused books. It does so by explaining both the technical…


Added by Capri Granville on April 12, 2018 at 6:30am — 1 Comment

Book: Artificial Intelligence with Python

Build real-world Artificial Intelligence applications with Python to intelligently interact with the world around you

About This Book

  • Step into the amazing world of intelligent apps using this comprehensive guide
  • Enter the world of Artificial Intelligence, explore it, and create your own applications
  • Work through simple yet insightful examples that will get you up and running with Artificial Intelligence in…

Added by Capri Granville on April 6, 2018 at 5:30pm — 1 Comment

The Browser of a Data Scientist

Yet another funny cartoon about data scientists. More cartoons about data science, can be found here.

This one was originally posted…


Added by Capri Granville on March 27, 2018 at 4:30am — No Comments


  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service