Subscribe to DSC Newsletter

Parallelize R Code Using Apache® Spark™

Event Details

Parallelize R Code Using Apache® Spark™

Time: August 15, 2017 from 9am to 10am
Location: Online
Website or Map:
Event Type: dsc, webinar
Organized By: Bill Vorhies, Editorial Director -- Data Science Central
Latest Activity: Aug 1

Export to Outlook or iCal (.ics)

Event Description

Space is limited.

Reserve your Webinar seat now

R is the latest language added to Apache Spark, and the SparkR API is slightly different from PySpark. SparkR’s evolving interface to Apache Spark offers a wide range of APIs and capabilities to Data Scientists and Statisticians. With the release of Spark 2.0, and subsequent releases, the R API officially supports executing user code on distributed data. This is done primarily through a family of apply() functions.

In this Data Science Central webinar, we will explore the following:

  • Provide an overview of this new functionality in SparkR.
  • Show how to use this API with some changes to regular code with dapply().
  • Focus on how to correctly use this API to parallelize existing R packages.
  • Consider performance and examine correctness when using the apply family of functions in SparkR.

SpeakerHossein Falaki, Software Engineer -- Databricks Inc.

Hosted by: 
Bill VorhiesEditorial Director -- Data Science Central

Again, Space is limited so please register early:

Reserve your Webinar seat now


After registering you will receive a confirmation email containing information about joining the Webinar.

Comment Wall


RSVP for Parallelize R Code Using Apache® Spark™ to add comments!

Join Data Science Central

Attending (3)

Not Attending (1)

Follow Us


  • Add Videos
  • View All


© 2017   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service