Home » Uncategorized

Upcoming Book: Foundations of Data science

Not to be confused with this free Microsoft book with same title.

Data science underlies Amazon’s product recommender, LinkedIn’s People You Know feature, Pandora’s personalized radio stations, Stripe’s fraud detectors, and the incredible insights arising from the world’s increasingly ubiquitous sensors. In the future, the world’s most interesting and impactful problems will be solved with data science. But right now, there’s a shortage of data scientists in every industry, traditional schools can’t teach students fast enough, and much of the knowledge data scientists need remains trapped in large tech companies.

 2739583311

This comprehensive, practical tutorial is the solution. Drawing on his experience building Zipfian Academy’s immersive 12-week data science training program, Jonathan Dinu brings together all you need to teach yourself data science, and successfully enter the profession.

First, Dinu helps you internalize the data science “mindset”: that virtually anything can be quantified, and once you have data, you can harvest amazing insights through statistical analysis and machine learning. He illuminates data science as it really is: a holistic, interdisciplinary process that encompasses the collection, processing, and communication of data: all that data scientists do, say, and believe.

With this foundation in place, he teaches core data science skills through hands-on Python and SQL-based exercises integrated with a full book-length case study. Step by step, you’ll learn how to leverage algorithmic thinking and the power of code, gain intuition about the power and limitations of current machine learning methods, and effectively apply them to real business problems. You’ll walk through:

  • Building basic and advanced models
  • Performing exploratory data analysis
  • Using data analysis to acquire and retain users or customers
  • Making predictions with regression
  • Using machine learning techniques
  • Working with unsupervised learning and NLP
  • Communicating with data
  • Performing social network analyses
  • Working with data at scale
  • Getting started with Hadoop, Spark and other advanced tools
  • Recognizing where common approaches break down, and how to overcome real world constraints
  • Taking your next steps in your study and career

Well-crafted appendices provide reference material on everything from the basics of Python and SQL to the essentials of probability, statistics, and linear algebra — even preparing for your data science job interview!

The book is available (pre-order) on Amazon, here