Subscribe to DSC Newsletter

Jonathan Symonds's Blog Posts Tagged 'Spark' (1)

Running Peta-Scale Spark Jobs on Object Storage Using S3 Select

When one looks at the amazing roster of talks for most data science conferences what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to — ultimately if you want to run your Spark or Presto job on peta-scale data sets and have it be available to your applications in the public or private cloud — this would be the logical storage architecture.

While logical, there has been a catch, at least historically, and that is object storage…

Continue

Added by Jonathan Symonds on June 25, 2019 at 9:00am — No Comments

Monthly Archives

2020

2019

2018

2017

2016

2015

Videos

  • Add Videos
  • View All

© 2020   TechTarget ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service