It is important to examine the long latency tails of systems, even when they appear fast (I take this seriously even when on vacation).

+Luiz André Barroso and I wrote an article called "The Tail at Scale" about managing latency variability in large-scale distributed systems.  The article was just published in this month's CACM: