Run I vs. Run II
Bottleneck: I/O, very difficult to put data into the CPU
solution: bring the CPU to the data, build powerful cluster
Beware these numbers. Very difficult to make good predictions. Hope that conclusions do not change if numbers are a bit wrong.