|
ABSTRACT
Modern microprocessors contain small TLBs that maintain a cache of recently used translations. A TLB's coverage is the sum of the number of bytes mapped by each entry. Applications with working sets larger than the TLB coverage will perform poorly due to high TLB miss rates. Superpages have been proposed as a mechanism for increasing TLB coverage. A superpageis a virtual memory page with size and alignment that are a power of two multiple of the system's base page size. In this paper, we describe online policies for superpage management that monitor TLB miss traffic to decide when a superpage should be constructed. Our policies take into account both the benefit of a superpage promotion (potential for preventing future misses) and the cost (page copying). Although our approach increases the cost of each TLB miss, the net effect is to improve total execution time by eliminating a large number of misses without significantly increasing memory usage, thereby improving system performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
Appel & Li 91
|
|
 |
Babaoglu & Joy 81
|
|
| |
Bala et al 94
|
Bala, K., Kaashoek, E, and Weihl, W. Software Prefetching and Caching forTranslanon Buffers. In Proceedings of the 1st USENIX Symposium on Operating System Design and Implementation, pages 243-254, November 1994.
|
| |
Blanck & Krueger 92
|
|
| |
Cao et al 94
|
Cao, P, Felten, E, and Li, K Implementation and Performance of Apphcation-ControlIed Fde Caching. In Proceedings of the 1st USENIX Symposium on Operating System Design andlmplementatton, pages 165- 177, November 1994.
|
 |
Carter et al 91
|
John B. Carter , John K. Bennett , Willy Zwaenepoel, Implementation and performance of Munin, Proceedings of the thirteenth ACM symposium on Operating systems principles, p.152-164, October 13-16, 1991, Pacific Grove, California, United States
|
| |
Chambers 93
|
Chambers, C. The Cecil Language: Speclficanon and Rationale. Technical Report 93-03-05, University of Washington, March 1993.
|
 |
Chen et al. 92
|
|
| |
Dig 92
|
Digital Eqmpment Corporation. DECchip 21064-AA Microproce,~sor, Hardware Reference Manual, 1992. Order Number: EC-N0079-72.
|
| |
Dutton et al. 92
|
Dutton, T., Eiref, D., Kurth, H., Reisert, j., and Stewart, R. The Design of the DEC 3000 AXP Systems, Two High-Performance Workstations. Digital Technical Journal, 4(4) 66-81, 1992 Special Issue.
|
| |
Hauck & Borriello 95
|
Hauck, S. and Borriello, G An Evaluation of Bipartitioning Techniques Submitted for pubhcation to IEEE Transactaons on Computer-Aided Design of Integrated Clrcmts and Systems, 1995.
|
 |
Hosking & Moss 93
|
|
| |
Kane & Heinrich 92
|
|
| |
Karlin et al. 88
|
Karlin, A., Manasse, M, Rudolph, L, and Sleator, D Competitive Snoopy Caching. Algorithmica, 3(1 ):70-1 t 9, 1988.
|
 |
Karlin et al. 91
|
Anna R. Karlin , Kai Li , Mark S. Manasse , Susan Owicki, Empirical studies of competitve spinning for a shared-memory multiprocessor, Proceedings of the thirteenth ACM symposium on Operating systems principles, p.41-55, October 13-16, 1991, Pacific Grove, California, United States
|
| |
Khalidi et al 93
|
Khali&, Y A, Talluri, M, Nelson, M, and Williams, D Virtual Memory Support for Multiple Page Sizes In Proceedings of the Fourth Workshop on Workstation Operating Systems, pages 104-109, October 1993
|
| |
Mogul 93
|
Mogul, J Big Memories on the Desktop In Proceedings oJ the Fourth Workshop on Workstatton Operatmg S)wtems, pages 110-115, October 1993
|
 |
Ramakrishnan et al. 93
|
Raghu Ramakrishnan , Divesh Srivastava , S. Sudarshan , Praveen Seshadri, Implementation of the CORAL deductive database system, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.167-176, May 25-28, 1993, Washington, D.C., United States
|
 |
Rashid et al 87
|
Richard Rashid , Avadis Tevanian , Michael Young , David Golub , Robert Baron, Machine-independent virtual memory management for paged uniprocessor and multiprocessor architectures, Proceedings of the second international conference on Architectual support for programming languages and operating systems, p.31-39, October 1987, Palo Alto, California, United States
|
 |
Sleator & Tarjan 85
|
|
 |
Srivastava & Eustace 94
|
|
 |
Talluri & Hill 94
|
|
 |
Talluri et al 92
|
Madhusudhan Talluri , Shing Kong , Mark D. Hill , David A. Patterson, Tradeoffs in supporting two page sizes, Proceedings of the 19th annual international symposium on Computer architecture, p.415-424, May 19-21, 1992, Queensland, Australia
|
 |
Young et al 87
|
M. Young , A. Tevanian , R. Rashid , D. Golub , J. Eppinger, The duality of memory and communication in the implementation of a multiprocessor operating system, Proceedings of the eleventh ACM Symposium on Operating systems principles, p.63-76, November 08-11, 1987, Austin, Texas, United States
|
| |
Zekaukas et al 94
|
Zekaukas, M., Sawdon, W, and Bershad, B. Software Write Detection for Dlsmbuted Shared Memory In Proceedings of the 1st USENIX Symposium on Operating System Design and lmpIementatton, pages 87- 100, November 1994.
|
CITED BY 27
|
|
|
|
|
|
|
|
Hervé A. Jamrozik , Michael J. Feeley , Geoffrey M. Voelker , James Evans, II , Anna R. Karlin , Henry M. Levy , Mary K. Vernon, Reducing network latency using subpages in a global memory environment, ACM SIGPLAN Notices, v.31 n.9, p.258-267, Sept. 1996
|
|
|
|
|
|
|
|
|
|
|
|
Jeffrey Dean , James E. Hicks , Carl A. Waldspurger , William E. Weihl , George Chrysos, ProfileMe: hardware support for instruction-level profiling on out-of-order processors, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.292-302, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
|
|
|
|
|
B. N. Bershad , S. Savage , P. Pardyak , E. G. Sirer , M. E. Fiuczynski , D. Becker , C. Chambers , S. Eggers, Extensibility safety and performance in the SPIN operating system, ACM SIGOPS Operating Systems Review, v.29 n.5, p.267-283, Dec. 3, 1995
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lixin Zhang , Zhen Fang , Mide Parker , Binu K. Mathew , Lambert Schaelicke , John B. Carter , Wilson C. Hsieh , Sally A. McKee, The Impulse Memory Controller, IEEE Transactions on Computers, v.50 n.11, p.1117-1132, November 2001
|
|
|
|
|
|
|
|
|
|
|
|
Juan Navarro , Sitaram Iyer , Peter Druschel , Alan Cox, Practical, transparent operating system support for superpages, Proceedings of the 5th symposium on Operating systems design and implementation Due to copyright restrictions we are not able to make the PDFs for this conference available for downloading, December 09-11, 2002, Boston, Massachusetts
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Indira Subramanian , Cliff Mather , Kurt Peterson , Balakrishna Raghunath, Implementation of multiple pagesize support in HP-UX, Proceedings of the Annual Technical Conference on USENIX Annual Technical Conference, 1998, p.9-9, June 15-19, 1998, New Orleans, Louisiana
|
|
|
Norman W. Paton , Jorge Buenabad-Chavez , Mengsong Chen , Vijayshankar Raman , Garret Swart , Inderpal Narang , Daniel M. Yellin , Alvaro A. Fernandes, Autonomic query parallelization using non-dedicated computers: an evaluation of adaptivity options, The VLDB Journal — The International Journal on Very Large Data Bases, v.18 n.1, p.119-140, January 2009
|
|