IEEE Transactions on Parallel and Distributed Systems | 2019

Hytrace: A Hybrid Approach to Performance Bug Diagnosis in Production Cloud Infrastructures

 
 
 
 
 

Abstract


Server applications running inside production cloud infrastructures are prone to various performance problems (e.g., software hang, performance slowdown). When those problems occur, developers often have little clue to diagnose those problems. In this paper, we present Hytrace, a novel hybrid approach to diagnosing performance problems in production cloud infrastructures. Hytrace combines rule-based static analysis and runtime inference techniques to achieve higher bug localization accuracy than pure-static and pure-dynamic approaches for performance bugs. Hytrace does not require source code and can be applied to both compiled and interpreted programs such as C/C++ and Java. We conduct experiments using real performance bugs from seven commonly used server applications in production cloud infrastructures. The results show that our approach can significantly improve the performance bug diagnosis accuracy compared to existing diagnosis techniques.

Volume 30
Pages 107-118
DOI 10.1145/3127479.3132562
Language English
Journal IEEE Transactions on Parallel and Distributed Systems

Full Text