This document discusses 7 computational problems that arise in massive data analysis:
1. Basic statistics - calculating descriptive statistics like mean, variance on large datasets. Challenges include high dimensionality and noise. Solutions include new statistical methods and computational infrastructure.
2. Generalized N-body problem - predicting interactions between large numbers of objects. Challenges are its computational intensity as it scales as O(N^2). Solutions include tree-based algorithms that reduce complexity to O(N log N).
3. Linear algebraic computations - operations on large matrices. Challenges are matrices with slowly decaying spectra and noise. Solutions include truncated SVD, regularization, and distributed computation.