ACM Home Page
Please provide us with feedback. Feedback
Aggressive snoop reduction for synchronized producer-consumer communication in energy-efficient embedded multi-processors
Full text PdfPdf (164 KB)
Source
International Conference on Hardware Software Codesign archive
Proceedings of the 5th IEEE/ACM international conference on Hardware/software codesign and system synthesis table of contents
Salzburg, Austria
SESSION: Embedded systems architecture table of contents
Pages: 245 - 250  
Year of Publication: 2007
ISBN:978-1-59593-824-4
Authors
Chenjie Yu  University of Maryland: College Park, College Park, MD
Peter Peter Petrov  University of Maryland: College Park, College Park, MD
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 48,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1289816.1289876
What is a DOI?

ABSTRACT

Snoop-based cache coherence protocols are typically used when multiple processor cores share memory through a common bus. It is well known, however, that these coherence protocols introduce an excessive power overhead.To help alleviate this problem, we propose an application-driven customization technique where application knowledge regarding data sharing in producer-consumer relationships is used in order to aggressively eliminate unnecessary and predictable snoop-induced cache tag lookups even for references to shared data, thus, achieving significant power reduction with minimal hardware cost. Snoop-induced cache tag lookups for accesses to both shared and private data are eliminated when it is ensured that such lookups will not result in extra knowledge regarding the cache state in respect to the other caches and memories.The proposed methodology relies on the combined support from the compiler, the operating system, and the hardware architecture. Our experiments show average power reductions of more than 80% compared to a general-purpose snoop protocol.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
5
6
 
7
 
8
 
9
 
10
D. Tarjan, S. Thoziyoor and N. Jouppi, "CACTI 4.0: An Integrated Cache Timing, Power and Area Model", Technical report, HP Laboratories Palo Alto, June 2006.
11


Collaborative Colleagues:
Chenjie Yu: colleagues
Peter Peter Petrov: colleagues