Performance Anomaly Detection and Bottleneck Identification

Olumuyiwa Ibidunmoye, Francisco Hernández-Rodriguez, Erik Elmroth
2015 ACM Computing Surveys  
In order to meet stringent performance requirements, system administrators must effectively detect undesirable performance behaviours, identify potential root causes and take adequate corrective measures. The problem of uncovering and understanding performance anomalies and their causes (bottlenecks) in different system and application domains is well studied. In order to assess progress, research trends and identify open challenges, we have reviewed major contributions in the area and present
findings in this survey. Our approach provides an overview of anomaly detection and bottleneck identification research as it relates to the performance of computing systems. By identifying fundamental elements of the problem, we are able to categorize existing solutions based on multiple factors such as the detection goals, nature of applications and systems, system observability, and detection methods.
© ACM, 2015. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Computing Surveys, Vol. 48, Iss. 1, June 2015, http://dx.
doi:10.1145/2791120 fatcat:yajvevdzl5h6tc6ii5sjd7qzlm