This notebook was written by Dr. Randal S. Olson from GitHub. In this notebook, Randal is going to go over a basic Python data analysis pipeline from start to finish to show you what a typical data science workflow looks like. In addition to providing code examples, he also hopes to imbue in you a sense of good practices so you can be a more effective — and more collaborative — data scientist. Randal will be following along with the data analysis checklist from The Elements of Data Analytic Style, which he strongly recommends reading as a free and quick guidebook to performing outstanding data analysis.
In the time it took you to read this sentence, terabytes of data have been collectively generated across the world — more data than any of us could ever hope to process, much less make sense of, on the machines we're using to read this notebook.In response to this massive influx of data, the field of Data Science has come to the forefront in the past decade. Cobbled together by people from a diverse array of fields — statistics, physics, computer science, design, and many more — the field of Data Science represents our collective desire to understand and harness the abundance of data around us to build a better world.