ACM Home Page
Please provide us with feedback. Feedback
DARC: dynamic analysis of root causes of latency distributions
Full text PdfPdf (457 KB)
Source
Joint International Conference on Measurement and Modeling of Computer Systems archive
Proceedings of the 2008 ACM SIGMETRICS international conference on Measurement and modeling of computer systems table of contents
Annapolis, MD, USA
SESSION: Systems table of contents
Pages 277-288  
Year of Publication: 2008
ISBN:978-1-60558-005-0
Also published in ...
Authors
Avishay Traeger  Stony Brook University, Stony Brook, NY, USA
Ivan Deras  Stony Brook University, Stony Brook, NY, USA
Erez Zadok  Stony Brook University, Stony Brook, NY, USA
Sponsors
SIGMETRICS: ACM Special Interest Group on Measurement and Evaluation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 91,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1375457.1375489
What is a DOI?

ABSTRACT

OSprof is a versatile, portable, and efficient profiling methodology based on the analysis of latency distributions. Although OSprof has offers several unique benefits and has been used to uncover several interesting performance problems, the latency distributions that it provides must be analyzed manually. These latency distributions are presented as histograms and contain distinct groups of data, called peaks, that characterize the overall behavior of the running code. By automating the analysis process, we make it easier to take advantage of OSprof's unique features.

We have developed the Dynamic Analysis of Root Causes system (DARC), which finds root cause paths in a running program's call-graph using runtime latency analysis. A root cause path is a call-path that starts at a given function and includes the largest latency contributors to a given peak. These paths are the main causes for the high-level behavior that is represented as a peak in an OSprof histogram. DARC performs PID and call-path filtering to reduce overheads and perturbations, and can handle recursive and indirect calls. DARC can analyze preemptive behavior and asynchronous call-paths, and can also resume its analysis from a previous state, which is useful when analyzing short-running programs or specific phases of a program's execution.

We present DARC and show its usefulness by analyzing behaviors that were observed in several interesting scenarios. We also show that DARC has negligible elapsed time overheads for normal use cases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
W. Cohen. Gaining insight into the Linux kernel with Kprobes. RedHat Magazine, March 2005.
 
6
7
 
8
9
 
10
 
11
 
12
A. V. Mirgorodskiy and B. P. Miller. CrossWalk: A tool for performance profiling across the user-kernel boundary. In Proc. of PARCO 2003, pp. 745--752, Dresden, Germany, September 2003. Elsevier.
 
13
A. Nataraj, A. Malony, S. Shende, and A. Morris. Kernel-level measurement for integrated parallel performance views: the ktau project. In Proc. of the 2006 IEEE Conference on Cluster Computing, Barcelona, Spain, September 2006
 
14
C. Nentwich and M. Tiihonen. Intel2gas. http://www.niksula.hut.fi/~mtiihone/intel2gas/, 2000.
 
15
V. Patkov. Hacker disassembler engine. http://vx.netlux.org/vx.php?id=eh04, 2007.
 
16
 
17
 
18
 
19
 
20


Collaborative Colleagues:
Avishay Traeger: colleagues
Ivan Deras: colleagues
Erez Zadok: colleagues