This is an interesting story. When faced with a monumental monitoring task with a huge data set (big data probably isn't an adequate term), Facebook engineers developed a tool to help them monitor and keep up with the data in real time -- and most importantly help them understand why they were seeing whatever performance problems they were detecting.
Read my full post on the Real User Monitoring blog.























