For those of you who wish to begin learning Python for Data Science, here is a list of resources that will get you up and running. Included are online tutorials, short interactive courses, MOOCs, newsletters, books, useful tools and more.
Python Tips decided to put this together so that you can…
This notebook was written by Dr. Randal S. Olson from GitHub. In this notebook, Randal goes over a basic Python data analysis pipeline from start to finish to show you what a typical data science workflow looks like. In addition to providing code examples, he also hopes to imbue in you a sense of good practices so you can be a more effective, and more collaborative, data scientist. Randal will be following along with the data analysis checklist from …
Added by Emmanuelle Rieuf on July 29, 2016 at 3:30pm — No Comments
Under growing pressure to report accurate findings as they interpret increasingly larger amounts of data, researchers are finding it more important than ever to follow sound statistical practices.
For that reason, a team of statisticians including Carnegie Mellon University's Robert E. Kass wrote "Ten Simple Rules for Effective Statistical Practice." Published in PLOS…
Added by Emmanuelle Rieuf on July 28, 2016 at 5:00pm — No Comments
Here are some white papers about Tamr, Lavastorm, Teradata, Rapidminer, Looker, Thingworx, and DataRobot:
Added by Emmanuelle Rieuf on July 28, 2016 at 3:30pm — No Comments
Written by sought-after speaker, designer, and researcher Stephanie D. H. Evergreen, Effective Data Visualization shows readers how to create Excel charts and graphs that best communicate data findings. This comprehensive how-to guide functions as a set of blueprints, supported by research and the author’s extensive experience with clients in industries all over the world, for conveying data in an impactful way. Delivered in Evergreen’s humorous and approachable style, the book covers the…
Added by Emmanuelle Rieuf on July 27, 2016 at 7:00am — No Comments
Added by Emmanuelle Rieuf on July 24, 2016 at 11:30am — No Comments
This article introduces Mahout, a library for scalable machine learning, and studies potential applications through two Mahout projects. It was written by Linda Terlouw. Linda is a computer scientist who works on Data Science (Data Analysis, Data Visualization, Process Mining).
Added by Emmanuelle Rieuf on July 8, 2016 at 4:47am — No Comments
This article was written by Bhavani Raskutti. Bhavani joined the ANZ Teradata Advanced Analytics team in 2014. She is internationally recognised as a data mining thought leader and is regularly invited to present at international conferences on Mining Big Data. She is passionate about transforming businesses to make better decisions using their data capital.
The term “data science” was first used by the statistician William S. Cleveland in his 2001 paper entitled, “…
Added by Emmanuelle Rieuf on July 2, 2016 at 11:30pm — No Comments
This article on the 10-dimensional burrito was posted by Scott Cole. Scott is a PhD student studying neuroscience at UC San Diego, currently researching the functions and measures of neural oscillations in the Voytek lab.
In his work, Scott and his team have developed a 10-dimensional system for rating the burritos…
Added by Emmanuelle Rieuf on July 2, 2016 at 11:26pm — No Comments