(i) the three most important things the paper says
One of the most important things that this paper says is the fact that, at the volume of requests that Google receives, aggregate throughput is much more important than single request latency. This reasoning follows quite well with reasoning of a typical scenario: does it matter more that a single user receives a response in .1 seconds vs 1 second, or does it matter more that a particular server/cluster served 10x more users at 1 second versus just 1x at .1 seconds. This observation allows Google to save tons of money in server hardware by purchasing with the most economical (performance/price) mindset. Another important idea that the paper demonstrates is that hardware replication (redundancy) is much easier/cost-effective when handling failures than writing software that will handle those failures gracefully. Hardware replication, at the price-point of commodity hardware is extremely cheap, while software developer time is much more expensive. Also, this type of software might require frequent changes depending on the type of hardware used, which would require even more engineering time. A third important observation made in the paper detailed the price/performance disadvantage of concentrating on low-power hardware versus standard commodity hardware (typically). The paper says that the power and cooling cost savings must outweigh the cost of the hardware itself (while factoring in how long that hardware will last). When this paper was written, commodity hardware won this battle.
(ii) the most glaring problem with the paper
One of the biggest problems in this paper is that it is devoid of any alternate storage analysis. We’re expected to take the analysis that hard disks are the way to go without any explanations. Alternate memory technologies are much more prevalent now and should be included in such a justification, as many of them provide great latency, power, and durability advantages over commodity disks.
(iii) the future research directions of the work
It would be interesting to see some analysis numbers on how switching to lower-power servers would impact the power usage of Google as a whole (and how that impact would translate to power generation companies, and thereby the environment as a whole). It would also be interesting to see how CMPs and SMT would help power usage (versus single processor commodity hardware). It may be the case that a mix of low-power hardware with CMP or SMT technology might save money overall.