ACM Home Page
Please provide us with feedback. Feedback
DEP: detailed execution profile
Full text PdfPdf (566 KB)
Source PACT archive
Proceedings of the 15th international conference on Parallel architectures and compilation techniques table of contents
Seattle, Washington, USA
SESSION: Performance profiling and tuning table of contents
Pages: 154 - 163  
Year of Publication: 2006
ISBN:1-59593-264-X
Authors
Qin Zhao  Singapore-MIT Alliance
Joon Edward Sim  National University of Singapore
Weng-Fai Wong  Singapore-MIT Alliance and National University of Singapore
Larry Rudolph  Singapore-MIT Alliance and Massachusetts Institute of Technology
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 35,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1152154.1152180
What is a DOI?

ABSTRACT

In many areas of computer architecture design and program development, the knowledge of dynamic program behavior can be very handy. Several challenges beset the accurate and complete collection of dynamic control flow and memory reference information. These include scalability issues, runtime-overhead, and code coverage. For example, while Tallam and Gupta's work on extending WPP (Whole Program Paths) showed good compressibility, their profile requires 500MBytes of intermediate memory space and an average of 23 times slowdown to be collected.To address these challenges, this paper presents DEP (Detailed Execution Profile). DEP captures the complete dynamic control flow, data dependency and memory reference of a whole program's execution. The profile size is significantly reduced due to the insight that most information can be recovered from a tightly coupled record of control flow and register value changes. DEP is collected in an infrastructure called Adept (A dynamic execution profiling tool), which uses the DynamoRIO binary instrumentation framework to insert profile-collecting instructions within the running application. DEP profiles user-level code execution in its entirety, including interprocedural paths and the execution of multiple threads.The framework for collecting DEP has been tested on real, large and commercial applications. Our experiments show that DEP of Linux SPECInt 2000 benchmarks and Windows SysMark benchmarks can be collected with an average of 5 times slowdown while maintaining competitive compressibility. DEP's profile sizes are about 60% that of traditional profiles.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Standard Performance Evaluation Corporation. SPEC CPU2000 benchmark suite, 2000. http://www.spec.org/osg/cpu2000/.
 
2
Sysmark 2004 SE. SYSmark benchmark Second Edition, 2004. http://www.bapco.com/.
3
 
4
 
5
6
 
7
8
 
9
Nicholas Nethercote. Dynamic Binary Analysis and Instrumentation. PhD thesis, University of Cambridge, November 2004. http://valgrind.org/.
 
10
Craig G. Nevill-Manning and Ian H. Witten. Identifying hierarchical structure in sequences: A linear-time algorithm. Journal of Artificial Intelligence Research, 7:67--82, 1997.
11
12
 
13
Julian Seward. bzip2. http://www.bzip.org/.
 
14
 
15
16


Collaborative Colleagues:
Qin Zhao: colleagues
Joon Edward Sim: colleagues
Weng-Fai Wong: colleagues
Larry Rudolph: colleagues