By Avrim Blum, John Hopcroft, and Ravindran Kannan (2018).
Computer science as an academic discipline began in the 1960s. Emphasis was on programming languages, compilers, operating systems, and the mathematical theory that supported these areas. Courses in theoretical computer science covered finite automata, regular expressions, context-free languages, and computability. In the 1970s, the study of algorithms was added as an important component of…Continue
Added by Capri Granville on May 23, 2019 at 9:00am — No Comments
A free online version of the second edition of the book based on Stat 110, Introduction to Probability by Joe Blitzstein and Jessica Hwang, is now available here. Print copies are available via CRC Press, Amazon, and…Continue
Interesting cartoon featuring the decision tree used in medical diagnosis. To see other cartoons about data science, follow this link.…Continue
Lecture notes for the Statistical Machine Learning course taught at the Department of Information Technology, University of Uppsala (Sweden.) Updated in March 2019. Authors: Andreas Lindholm, Niklas Wahlström, Fredrik Lindsten, and Thomas B. Schön.
Source: page 61 in these lecture notes…Continue
Added by Capri Granville on April 25, 2019 at 9:00am — No Comments
Guest blog by Stephan Loyd.
Hello everybody, this is my first post here, so forgive me if I screw it up.
Let me firstly introduce background of my work. Several years ago I landed onto a Perl job. It also involves some other languages like Python and R, but it was mainly Perl, until last year focus of my role switched and I still do some Perl but much less since then. I was a little…Continue
Added by Capri Granville on April 14, 2019 at 6:30pm — No Comments
This 12-page document lists hundreds of Excel functions and formulas, covering
Added by Capri Granville on April 7, 2019 at 6:30am — No Comments
This is the fourth article in a DeepTech Series by Margaretta Colangelo and Dmitry Kaminskiy. Dmitry Kaminskiy, General Partner at Deep Knowledge Ventures, is based in London. Dmitry is Managing Trustee…Continue
Added by Capri Granville on April 4, 2019 at 10:30am — No Comments
Creating Info We Trust is a craft that puts the world into forms that are strong and true. It begins with maps, diagrams, and charts — but must push further than dry defaults to be truly effective. How do we attract attention? How can we offer audiences valuable experiences worth their time? How can we help people access complexity?…Continue
Added by Capri Granville on April 3, 2019 at 8:00am — No Comments
I was comparing home prices in San Francisco between 1994 and 2018, and I noticed that it has increased by a factor 4 over 25 years. In the meanwhile, the inflation index increased by a factor 1.7 (see here.) I am not saying here that my sources are correct or wrong -- entire books have been written on the subject -- but instead, my purpose here is to show how some visualizations can be…Continue
Here is one of the most viral videos about data science posted in the last few months, with over 500,000 views. I could not locate the original copy; I found it in a re-tweet by Kirk Borne. See link to the video below the picture. However, I was able to find who created it (Welcome.ai) and it turns out that they have a YouTube channel with plenty of even more interesting videos. To search these videos by keyword, click…Continue
Added by Capri Granville on February 27, 2019 at 3:30pm — No Comments
Below is an extract of a 10-page cheat sheet about data science, compiled by Maverick Lin. This cheatsheet is currently a reference in data science that covers basic concepts in probability, statistics, statistical learning, machine learning, deep learning, big data frameworks and SQL. The cheatsheet is loosely based off of The Data Science Design Manual by Steven S. Skiena and An Introduction to Statistical Learning by…Continue
Added by Capri Granville on February 26, 2019 at 7:30pm — No Comments
This article is a solid introduction to statistical testing, for beginners, as well as a reference for practitioners. It includes numerous examples as well as illustrations and definitions for concepts such as rejecting the null hypothesis, one sample hypothesis testing, P-values, critical values, and Bayesian hypothesis testing. It has references to additional topics, such as
Added by Capri Granville on February 19, 2019 at 9:30am — No Comments
This infographic was produced by 365DataScience. Last year they completed a research on 1,001 data scientists to get a profile of the ‘typical’ data scientist in 2018. They replicated the study with new data. Below are the findings.
Here are some of our key findings:…Continue
Below is an extract of a 10-page cheat sheet about probability, compiled by William Chen (http://wzchen.com) and Joe Blitzstein, with contributions from Sebastian Chiu, Yuan Jiang, Yuqi Hou, and Jessy Hwang. Material based on Joe Blitzstein’s Harvard's introductory probability course (@stat110 - (http://stat110.net) and Blitzstein / Hwang’s Introduction to Probability textbook (…Continue
Added by Capri Granville on February 3, 2019 at 8:00am — No Comments
Interesting picture comparing linear, logistic and Poisson regression. For more about regression, read our other articles on this subject, here. For other ML concepts summarized in one picture, follow this link. …Continue
Added by Capri Granville on February 3, 2019 at 8:00am — No Comments
Guest blog by Igor Bobriakov.
First days after celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year.…Continue
Added by Capri Granville on January 27, 2019 at 9:30am — No Comments
Imagine it's 1994 and the dawn of the internet. In many ways, it is. Entrepreneurs are once again laying the rails for a new digital world. And, just like the first digital revolution, this one will again transform the way we live, work and play. The technology known as blockchain is poised to disrupt entrenched industries and shatter today's business models. With so much at stake, how do you prepare?…Continue
Added by Capri Granville on January 23, 2019 at 9:30am — No Comments
For one- or two-semester business statistics courses. Not a new book, but a popular one (8th edition.)
This text is the gold standard for learning how to use Excel in business statistics, helping students gain the understanding they need to be successful in their careers. The authors present statistics in the context of specific business fields; full chapters on business analytics further prepare students for success in their professions. Current data throughout the text…Continue
Added by Capri Granville on January 11, 2019 at 10:30am — No Comments
Independently published (November 20, 2018). 78 pages.
This book intends to provide an overview of Machine Learning and its algorithms & models with help of R software. Machine learning forms the basis for Artificial Intelligence which will play a crucial role in day to day life of human beings in the near future. A basic understanding of machine learning is required, as its application is widely seen in different fields such as banks and financial sectors,…Continue
Added by Capri Granville on November 25, 2018 at 6:00am — No Comments
The following chapters are currently available:
Chapter 1: Introduction
Part I: Supervised Learning
Chapter 2: Notation and Definitions
Chapter 3: Fundamental Algorithms
Chapter 4: Anatomy of a Learning Algorithm
Chapter 5: Basic Practice
Chapter 6: Neural Networks and Deep…