Various embodiments include methods, apparatus, and systems for detecting causes of application latency degradations and/or other types of abnormalities, such as error rates, in large-scale distributed computing systems, and for generating appropriate alerts to system administrators or other individuals or entities that have an interest in the status of the application and computing system.