Capri Granville's Blog (72)

Traveling to Other Planets with Google Maps

An interesting application leveraging very large datasets: you can zoom in to view fairly detailed landscapes on a few nearby planets and satellites. It would be even better if it allowed you to simulate space travel. At 32 million times the speed of light, it would take you about 4 seconds to reach the closest star system (Alpha Centauri): that's how fast you need to travel to see the stars really moving in the sky. By contrast, space travel at just the speed of light would make stars…
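The 4-second figure can be checked with a few lines of Python. This is only a back-of-the-envelope sketch: the distance of 4.37 light-years to Alpha Centauri is an assumption not stated in the post, and the function name is made up for illustration.

```python
# Rough check of the travel time quoted above.
# Assumption (not from the post): Alpha Centauri is about 4.37 light-years away.
SECONDS_PER_YEAR = 365.25 * 24 * 3600   # Julian year, in seconds
DISTANCE_LIGHT_YEARS = 4.37             # assumed distance
SPEED_FACTOR = 32e6                     # multiples of the speed of light (from the post)


def travel_time_seconds(distance_ly, speed_factor):
    """Time to cover distance_ly light-years at speed_factor times light speed."""
    # At light speed, one light-year takes one year; faster speeds divide that time.
    return distance_ly * SECONDS_PER_YEAR / speed_factor


t = travel_time_seconds(DISTANCE_LIGHT_YEARS, SPEED_FACTOR)
print(f"Travel time: {t:.1f} seconds")
```

Under that assumed distance the result is a little over 4 seconds, consistent with the rounded figure in the text.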

Continue

Added by Capri Granville on January 4, 2020 at 9:30am — No Comments

Machine Learning Technical Landscape

The picture below is from the article Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey, posted here.

Authors: Giang Nguyen, Stefan Dlugolinsky, Martin Bobák, Viet…

Continue

Added by Capri Granville on December 26, 2019 at 12:00pm — No Comments

List of Time Series Methods in One Picture

The picture below was found in some tweets posted by top data science influencers, though its origin is somewhat obscure. 

Many of these methods are described in Wikipedia. Many are also described on Data Science Central, see for instance…

Continue

Added by Capri Granville on December 8, 2019 at 12:30pm — No Comments

Alternatives to the Gradient Descent Algorithm

Gradient descent can get stuck in local minima. Below is a summary of the alternatives suggested on Cross Validated, originally posted here. …
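To make the local-minimum problem concrete, here is a minimal sketch in plain Python. The test function, learning rate, and starting points are all illustrative assumptions, not taken from the original post, and the sketch shows only the problem itself, not any of the alternatives discussed there.

```python
# Illustrative sketch: plain gradient descent on a non-convex 1-D function.
# f has a global minimum near x = -1.30 and a local minimum near x = 1.13.
# The function, step size, and starting points are assumptions for illustration.

def f(x):
    return x**4 - 3 * x**2 + x


def grad_f(x):
    # Derivative of f
    return 4 * x**3 - 6 * x + 1


def gradient_descent(x0, learning_rate=0.01, steps=2000):
    """Run fixed-step gradient descent from x0 and return the final point."""
    x = x0
    for _ in range(steps):
        x -= learning_rate * grad_f(x)
    return x


x_left = gradient_descent(-2.0)   # ends in the global minimum
x_right = gradient_descent(2.0)   # gets stuck in the local minimum

print(f"start -2.0 -> x = {x_left:.3f}, f(x) = {f(x_left):.3f}")
print(f"start +2.0 -> x = {x_right:.3f}, f(x) = {f(x_right):.3f}")
```

The two runs differ only in their starting point, yet the second one settles at a noticeably higher value of f. That sensitivity to initialization is what motivates looking at alternatives.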

Continue

Added by Capri Granville on November 14, 2019 at 10:00am — No Comments

Free 2,900-page Manual about Pandas

Pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

Hexagonal binning with Hexbin: extract from the Visualization chapter

This reference is available…

Continue

Added by Capri Granville on November 3, 2019 at 10:00am — 1 Comment

Best 2019 Paper Awards in Computer Vision

The IEEE International Conference on Computer Vision (ICCV) received 4,303 papers and accepted 1,075 for its 2019 edition, an acceptance rate of about 25%. Below is the best paper award.

Source: see paper listed below…

Continue

Added by Capri Granville on November 3, 2019 at 10:00am — No Comments

Math of Deep Learning: Free Chapter from 832-page eBook

The title of the eBook is Dive into Deep Learning. Below I list the contents of chapter 16, which deals with the math of deep learning. But the whole book (entirely free) is worth reading. This is an interactive deep learning book with code, math, and discussions. It is based on the NumPy interface.

Authors

  • Aston Zhang, Amazon Senior Scientist, UIUC Ph.D.
  • Zack C. Lipton, Amazon Scientist, CMU Assistant Professor, UCSD Ph.D.
  • Mu Li,…
Continue

Added by Capri Granville on November 3, 2019 at 10:00am — No Comments

Jupyter Notebooks: Fundamentals of Machine Learning and Deep Learning

Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2. 

Source: from the Support Vector Machines chapter,…

Continue

Added by Capri Granville on October 27, 2019 at 6:30am — No Comments

Machine Learning and Deep Learning Textbook - Cornell University

Another free book to learn Machine Learning. It also comes with a YouTube video series, available here.

Content

  • Machine Learning Setup
  • k-Nearest Neighbors / Curse of…
Continue

Added by Capri Granville on October 27, 2019 at 6:30am — No Comments

40+ Modern Tutorials Covering All Aspects of Machine Learning

This list of lists contains books, notebooks, presentations, cheat sheets, and tutorials covering all aspects of data science, machine learning, deep learning, statistics, math, and more, with most documents featuring Python or R code and numerous illustrations or case studies. All this material is available for free, and consists of content mostly created in 2019 and 2018, by various top experts in their respective fields. A few of these documents are available on LinkedIn: see last section…

Continue

Added by Capri Granville on October 12, 2019 at 7:30am — 1 Comment

GIF Image Featuring a Beautiful Visualization

Interesting GIF visualization. We do not endorse any political opinion, and the picture below is provided only for its visual value, not for its political content.

It was originally posted here. Other visualizations can…

Continue

Added by Capri Granville on October 11, 2019 at 6:30am — No Comments

Free Book: A Comprehensive Guide to Machine Learning (Berkeley University)

By Soroush Nasiriany, Garrett Thomas, William Wang, Alex Yang. Department of Electrical Engineering and Computer Sciences, University of California, Berkeley. Dated June 24, 2019. This is not the same book as The Math of Machine Learning, also published by the same department at Berkeley, in 2018, and also authored by Garret…

Continue

Added by Capri Granville on October 3, 2019 at 8:30am — 1 Comment

Artificial Neural Networks in a Nutshell

According to Wikipedia, an ANN is based on a collection of connected units or nodes called artificial neurons, which loosely model the neurons in a biological brain. Each connection, like the synapses in a biological brain, can transmit a signal to other neurons. An artificial neuron that receives a signal then processes it and can signal neurons connected to it.

In ANN implementations, the "signal" at a connection is a real number, and the output of each neuron is computed by some…
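As a rough illustration of the description above, the following sketch models one artificial neuron in plain Python. The excerpt is cut off before it names the function applied to the inputs, so the choice here (a sigmoid of the weighted sum plus a bias) is an assumption, and the weights and inputs are made-up numbers.

```python
import math

# Sketch of a single artificial neuron.
# Assumption: output = sigmoid(weighted sum of inputs + bias). The excerpt does
# not specify the function, so this is one common choice, not the only one.


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def neuron_output(inputs, weights, bias):
    """Combine real-valued input signals into one real-valued output signal."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return sigmoid(z)


# Two incoming signals feed one neuron, whose output then feeds a second neuron.
hidden = neuron_output(inputs=[0.5, -1.0], weights=[2.0, 1.0], bias=0.0)
output = neuron_output(inputs=[hidden], weights=[1.5], bias=-0.75)

print(f"hidden signal: {hidden:.3f}")
print(f"output signal: {output:.3f}")
```

Chaining the two calls mirrors the idea that a neuron receiving a signal processes it and passes the result on to the neurons connected to it.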

Continue

Added by Capri Granville on September 23, 2019 at 4:30am — No Comments

Stylometric Feature Engineering Techniques in Authorship Analysis

Technical paper, published in IEEE Xplore.

Abstract:

Authorship analysis (AA) is the study of unveiling the hidden properties of authors from textual data. It extracts an author's identity and sociolinguistic characteristics based on the reflected writing styles in the text. The process is essential for various areas,…
Continue

Added by Capri Granville on September 22, 2019 at 1:30pm — No Comments

AI Trading the Market

Written by Jun Wu.

Every year since I worked on Wall Street, traditional trading-is-an-art-form traders have been leaving and retiring.

Their jobs are replaced by a new breed of experts who are savvy with numbers, systems, and the market. These experts don’t sleep, eat or drink. They are the AI Systems that can run on a thousand machines to…

Continue

Added by Capri Granville on September 22, 2019 at 1:30pm — No Comments

The problem that Google Solved with Quantum Supremacy

The problem has to do with sampling, random numbers and probability distributions, so it is of interest to our community. As Scott Aaronson describes it in his blog, here is the problem:

You can read more here, including answers to…

Continue

Added by Capri Granville on September 22, 2019 at 1:30pm — No Comments

The Math of Machine Learning - Berkeley University Textbook

This document is an attempt to provide a summary of the mathematical background needed for an introductory class in machine learning, which at UC Berkeley is known as CS 189/289A.

Our assumption is that the reader is already familiar with the basic concepts of multivariable calculus and linear algebra (at the level of UCB Math 53/54). We emphasize that this document is not a replacement for the prerequisite classes. Most subjects presented here are covered rather minimally; we intend…

Continue

Added by Capri Granville on September 22, 2019 at 11:30am — No Comments

What are the Typical Data Scientist Profiles on LinkedIn? Survey Results

The charts below represent the main findings of a recent analysis of 1,000 data scientist LinkedIn profiles, collected using a web scraper. The analysis was limited to Singapore and to people having "data scientist" on their profile. Of course, many have a different job title, especially in fields such as Fintech (quant engineer) or Healthcare (biostatistician), but the findings are interesting nevertheless and seem to apply to other locales as well.

The first chart features the educational…

Continue

Added by Capri Granville on September 12, 2019 at 10:00am — 1 Comment

R Users’ Salaries from the 2019 Stack Overflow Survey

Interesting analysis done in R, about salaries of R developers broken down by country, featuring salary range and median salary. 

The dataset consists of survey answers from nearly 90,000 respondents. About 5,000 of them reported using R for “extensive development work over the past year”. The first filter used reduces the dataset from 88,883 respondents to 5,048. The second filter…

Continue

Added by Capri Granville on September 5, 2019 at 7:00am — No Comments

Interesting Charts and Maps Obtained Using Census Data

I found an interesting website featuring hundreds of charts derived from US census data. It shows contrasts between states and cities regarding education, jobs, languages spoken, and salaries, and even discrepancies between men and women or Asians and Caucasians, on various metrics broken down by location, education, or other criteria. I selected four of these charts.…

Continue

Added by Capri Granville on June 10, 2019 at 5:00pm — No Comments

© 2020 Data Science Central ®