Apache Spark™ has rapidly emerged as the de facto standard for big data processing across all industries and use cases — from providing recommendations based on user behavior to analyzing millions of genomic sequence data to accelerate drug innovation and development for personalized medicine.
This eBook offers a collection of the most popular technical blog posts that provide an introduction to machine learning on Apache Spark™, and highlights many of the major developments around Spark MLlib and GraphX.
Whether you are just getting started with Spark or are already a Spark power user, it will arm you with the knowledge to be successful on your next Spark project. Read the eBook to learn:
An introduction to machine learning in Apache Spark™
Using Spark for advanced topics such as clustering, trees, and graph processing
ow you can use SparkR to analyze data at scale with the R language
Sincerely, Databricks Team
Databricks: 160 Spear Street, 13th Floor, San Francisco, CA 94105 US