Subscribe to DSC Newsletter

I have been having some bitter experiences in processing compressed data (>1 TB) residing on Hadoop. Simplest way to accomplish is to sampling with Pig-DataFu and analyzing with R. Scaling R code is tedious using RHadoop. Appreciate if you can share your experience and any alternative efficient ways to accomplish the analysis. Thanks!

Views: 430

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2017   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service