Three-dimensional memory vectorization for high bandwidth media memory systems
Source: International Symposium on Microarchitecture
Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Istanbul, Turkey
SESSION: Register file and memory system design
Pages: 149 - 160
Year of Publication: 2002
ISSN: 1072-4451, ISBN: 0-7695-1859-1
Authors
Jesus Corbal  Universitat Politècnica de Catalunya--Barcelona, Spain
Roger Espasa  Universitat Politècnica de Catalunya--Barcelona, Spain
Mateo Valero  Universitat Politècnica de Catalunya--Barcelona, Spain
Sponsors
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
IEEE TC-uArch
Publisher
IEEE Computer Society Press  Los Alamitos, CA, USA

ABSTRACT

Vector processors offer good performance, cost and adaptability when targeting multimedia applications. However, for a significant number of media programs, conventional memory configurations fail to deliver enough memory references per cycle to feed the SIMD functional units. This paper addresses this memory bandwidth problem. We propose a novel mechanism suitable for 2-dimensional vector architectures and targeted at providing high effective bandwidth for SIMD memory instructions. The basis of this mechanism is the extension of the scope of vectorization at the memory level, so that 3-dimensional memory patterns can be fetched into a second-level register file. By fetching long blocks of data and by reusing 2-dimensional memory streams at this second-level register file, we obtain a significant increase in the effective memory bandwidth. As side benefits, the new 3-dimensional load instructions provide high robustness to memory latency and a significant reduction in cache activity, thus reducing power and energy requirements. At the cost of 50% more area than a regular SIMD register file, we have measured an average speed-up of 13% and potential power savings of 30% in the L2 cache.
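
The reuse idea in the abstract can be illustrated with a small software model. The sketch below is not the paper's instruction set or hardware. It assumes, purely for illustration, a sliding-window pattern in which several overlapping 2-dimensional blocks are read from a row-major array. The function names (`load_2d`, `naive_sliding_blocks`, `load_3d`) and the parameters (`pitch`, `step`, `count`) are hypothetical. It compares how many elements are read from memory when each 2-dimensional block is loaded separately against one long fetch into a buffer standing in for the second-level register file.

```python
def load_2d(mem, base, rows, width, pitch):
    """Read a rows x width block from flat memory; return (block, elements_read)."""
    block = [mem[base + r * pitch: base + r * pitch + width] for r in range(rows)]
    return block, rows * width


def naive_sliding_blocks(mem, base, rows, width, pitch, count, step):
    """Load `count` overlapping 2D blocks, each with its own trip to memory."""
    blocks, reads = [], 0
    for k in range(count):
        block, n = load_2d(mem, base + k * step, rows, width, pitch)
        blocks.append(block)
        reads += n
    return blocks, reads


def load_3d(mem, base, rows, width, pitch, count, step):
    """Fetch one long block per row, then slice the 2D blocks out of the buffer.

    The buffer is a stand-in (assumption) for a second-level register file.
    """
    span = width + (count - 1) * step      # elements per row covering all blocks
    buffer, reads = load_2d(mem, base, rows, span, pitch)
    blocks = [[row[k * step: k * step + width] for row in buffer]
              for k in range(count)]
    return blocks, reads


if __name__ == "__main__":
    pitch = 64                              # assumed row length of the image
    mem = list(range(pitch * pitch))        # toy memory contents
    naive, naive_reads = naive_sliding_blocks(mem, 0, 8, 8, pitch, 8, 1)
    reused, reuse_reads = load_3d(mem, 0, 8, 8, pitch, 8, 1)
    print(naive == reused, naive_reads, reuse_reads)
```

For eight 8x8 blocks that each shift by one element, the separate loads read 512 elements while the single long fetch reads 120 and yields the same blocks. The model counts element reads only and ignores latency, cache behavior, area and power, so it says nothing about the speed-up or power figures reported in the abstract.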


