ACM Home Page
Please provide us with feedback. Feedback
Efficient, Unified, and Scalable Performance Monitoring for Multiprocessor Operating Systems
Full text PdfPdf (250 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2003 ACM/IEEE conference on Supercomputing table of contents
Page: 3  
Year of Publication: 2003
ISBN:1-58113-695-1
Authors
Robert W. Wisniewski  IBM T. J. Watson Research Center
Bryan Rosenburg  IBM T. J. Watson Research Center
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 45,   Citation Count: 8
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Programming, understanding, and tuning the performance of large multiprocessor systems is challenging. Experts have difficulty achieving good utilization for applications on large machines. The task of implementing a scalable system such as an operating system or database on large machines is even more challenging. And the importance of achieving good performance on multiprocessor machines is increasing as the number of cores per chip increases and as the size of multiprocessors increases. Crucial to achieving good performance is being able to understand the behavior of the system. We have developed an efficient, unified, and scalable tracing infrastructure that allows for correctness debugging, performance debugging, and performance monitoring of an operating system. The infrastructure allows variable-length events to be logged without locking and provides random access to the event stream. The infrastructure allows cheap and parallel logging of events by applications, libraries, servers, and the kernel. The infrastructure was designed for K42, a new open-source research kernel designed to scale near perfectly on large cache-coherent 64-bit multiprocessor systems. The techniques are generally applicable, and many of them have been integrated into the Linux Trace Toolkit. In this paper, we describe the implementation of the infrastructure, how we used the facility, e.g., analyzing lock contention, to understand and achieve K42's scalable performance, and the lessons we learned. The infrastructure has been invaluable to achieving great scalability.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
[1] Jonathan Appavoo, Marc Auslander, David Edelsohn, Dilma da Silva, Orran Krieger, Michal Ostrowski, Bryan Rosenburg, Robert W. Wisniewski, and Jimi Xenidis. Providing a Linux API on the scalable K42 kernel. In Freenix, pages 323-336, San Antonio, TX, June 9-14 2003.
 
2
[2] Marc Auslander, David Edelsohn, Dilma da Silva, Orran Krieger, Michal Ostrowski, Bryan Rosenburg, Robert W. Wisniewski, and Jimi Xenidis. K42 Overview. IBM Research, http://www.research.ibm.com/K42, August 2002.
 
3
[3] Marc Auslander, David Edelsohn, Dilma da Silva, Orran Krieger, Michal Ostrowski, Bryan Rosenburg, Robert W. Wisniewski, and Jimi Xenidis. K42's Performance Monitoring and Tracing. IBM Research, http://www.research.ibm.com/K42, August 2002.
 
4
[4] IBM Linux Technology Center. Dynamic probes. http://www- 124.ibm.com/developerworks/oss/linux/projects/dprobes/.
 
5
[5] IBM Corporation. Aix version 3.1 for risc system/6000 performance monitoring and tuning guide. Technical Report SC23-2365- 00, IBM Corporation.
 
6
[6] Dyninst. An application program interface (api) for runtime code generation. http://www.dyninst.org/.
 
7
[7] D. Kohr, X. Zhang, M. Rahman, and D. Reed. A performance study of an object-oriented parallel operating system. In Proceedings of the 27th Hawaii International Conference on System Sciences , November 27 2000.
 
8
 
9
[9] Daniel A. Reed, James Arendt, Ruth Aydt, Thomas Birkett, David Jensen, Tara Madhyastha, Bobby Nazief, Ted Nelson, Robert Olson, and Brian Totty. Scalable performance environments for parallel systems. In Sixth Distributed Memory Computing Conference , pages 562-569, Portland OR, April-May 1991.
 
10
[10] Craig A. N. Soules, Jonathan Appavoo, Kevin Hui, Robert W. Wisniewski, Dilma da Silva, Gregory R. Ganger, Orran Krieger, Michael Stumm, Marc Auslander, Michal Ostrowski, Bryan Rosenburg, and Jimi Xenidis. System support for online reconfiguration. In USENIX, pages 141-154, San Antonio, TX, June 9-14 2003.
 
11
[11] John Stasko, John Domingue, Marc H. Brown, and Blaine A. Price. Software Visualization, volume 1, chapter 20 Visualization of Dynamics in Real World Software Systems, Doug Kimelman, Bryan Rosenburg, and Tova Roth, pages 293-314. MIT Press, 1998.
 
12
 
13
14
 
15
 
16
[16] Karim Yaghmour. Ltt web page. http://www.opersys.com/LTT/index.html.
 
17
 
18
[18] Tom Zanussi, Karim Yaghmour, Robert W. Wisniewski, Michel Dagenais, and Richard Moore. An efficient unified approach for trasmitting data from kernel to user space. In OLS 2003 - Ottawa Linux Symposium, page to appear, July 23-26 2003.

CITED BY  9
Collaborative Colleagues:
Robert W. Wisniewski: colleagues
Bryan Rosenburg: colleagues