ACM Home Page
Please provide us with feedback. Feedback
Dynamic switching of coherent cache protocols and its effects on Doacross loops
Full text PdfPdf (983 KB)
Source International Conference on Supercomputing archive
Proceedings of the 7th international conference on Supercomputing table of contents
Tokyo, Japan
Pages: 328 - 337  
Year of Publication: 1993
ISBN:0-89791-600-X
Authors
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 10,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/165939.166004
What is a DOI?

ABSTRACT

In multiprocessor systems, overheads caused by interprocessor communication and synchronization have been one of the largest obstacles for efficient execution of parallel programs. To reduce these overheads in shared-memory/shared-bus multiprocessors, we have proposed two hardware mechanisms: the Inter-Cache Snoop Control Mechanism (ICSCM), which dynamically switches snoop-protocols for improving shared-bus utilization, and the Mechanism for Integrated Synchronization and Communication (MISC), which extends ICSCM to support producer-consumer type synchronization efficiently. We have developed an execution-driven multiprocessor simulator for evaluating performance with these mechanisms. Simulation experiments on doacross loops show remarkable speed-ups by ICSCM/MISC mechanisms. Although the proposed mechanisms are originally implemented on a single shared-bus system, they are easily applicable to a clustered multiprocessing systems. The methods used in a clustered system are discussed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Matsumoto, T.: Fine-Grain Support Mechanisms (in Japanese). IPS Japan SIC Reports, Vol.89, No.60, ARC-77-12, pp. 91-98 (Jul. 1989).
 
3
Matsumoto, T. et al.: A Mechanism for Integrating Communication and Synchroniza~on using Snoopy Cache (in Japanese). IEICE Japan SIG Report, Vol.90, No.144, CPSY 90-42, pp. 25-30 (Jul. 1990).
 
4
 
5
Matsumoto, T., et al.: MISC: A Mechanism for Integrated Synchronization and Commun~ation U~ng Snoop Caches. Proc. of the 1991 Int. Conf. on Parallel Proces~n~ Vol. 1, pp. 161-170 (Aug. 1991).
6
7
 
8
Advanced Micro Devices, Inc.: Am29000 32-Bit S~eam~ned Ins{ruction Processor Users Manual Advanced Micro Devices, Inc., 1988.
9
10
 
11
NEC Co~p.: ~PD30300 (VR3000) 32 bit Microprocessor User~ Manual-Architecture-(in Japanese). NEC Corp. ( 19 89).
12
 
13
Midkiff, S. P. and Padua, D. A.: Compiler Generated Synchronization for Do Loops. Proc. 1986 Int. Conf. on Parallel Processin~ pp. 544-551 (Aug. 1986).
 
14
Torrellas, J., et al.: Measurement, Analysis and ~mprovement of the Cache Behavior of Shared Data in Coherent Mulhprocessors, Technical Report CSL-TR- 90-412, Stanford Univ., (Feb. 1990).
 
15
Jones, A. K. and Gehringer, E. F.: The Cm* Multiprocessor Project: Research Rev~w. Technical Report CMU-CS-80-131, Carnegie-Mellon Univ., (1980).
16
 
17
Li, K.: IVY: A Shared Virtual Memory System for Parariel Computing. Proc. t988 Int. Conf. on ParaHel Processin9, St. Char~, IL, pp. 94-101 (Aug. 1988).
 
18
Lenoski, D., et al.: Design of Stanford DASH Multiprocessor. Technical Report CSL-TR-89-403, Stanibrd Univ. (Dec. 1989).
 
19
Matsumoto, T.: A Multiprocessor System with Memory-Based Processors and Register-Based Processors (in Japanese). Proc. of 43th Annual Convention of IPS Japan, Vol.6, 6Q-3, pp.115-116 (Oct. 1991).
 
20
Matsumoto, T." and Hiraki, K.: A Shared-Memory Architecture for Massively Parallel Computer Systems (in Japanese). IEICE Japan SIC Reports, Vol.92 No.173, CPSY 92-26, pp.47-55 (Aug. 1992).
 
21
Matsumoto, T. and Hiraki, K.: Distributed Shared- Memory Architecture Using Memory-Based Processors (in Japanese). Proc. of Joint Syrup. on Parallel Processing '93, IPSJ/IEICE/JSSST (May 1993).
 
22
Matsumoto, T.: A Study of FGSM: Fine-Grain Support Multiprocessor (in Japanese). IEICE Japan SIC Reports, Vol.89 No.167, CPSY 89-37, pp. 37-42 (Aug. 1989).


Collaborative Colleagues:
Takashi Matsumoto: colleagues
Kei Hiraki: colleagues