A new 191-page PDF eBook published by the National Academies of Sciences Press is available, "Frontiers in Massive Data Analysis," and can be downloaded for free (after free website registration):
The first 9 of the 10 chapters offer a comprehensive survey of state-of-the-art big data architectures, machine learning, and analysis techniques.
Chapter 10 really shines as it offers a new framework for evaluating systems and techniques intended to conduct massive data analysis. Called the "seven giants," it's patterned after the "seven dwarfs" of evaluating high-performance computing systems. The seven giants are:
Generalized N-body problem,
Linear algebraic computations,
For each of these problem domains, the chapter describes what it is, and what the challenges and examples of notable approaches are.