Home » Uncategorized

Python for Big Data in One Picture

This picture originally posted here covers the following topics:

  1. Basic stack
  2. Newer packages
  3. Integrated platforms
  4. Visualization
  5. Data formats
  6. MapReduce
  7. Glue
  8. GPU
  9. Parallel
  10. Efficiency
  11. Packages

To zoom in, view picture in the original article, or click on picture. The original article also provides a detailed listing of all the 100+ entities listed in the picture, broken down in categories and sub-categories, some items belonging to multiple categories. Anyone interested in creating a clickable link for each of these entities? For instance, entity 1.1 (in the original article) is numpy, while 4.1 is matplotlib

2808329843

Other interesting pictures worth checking out:

Top DSC Resources

Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge