|
ABSTRACT
Microprocessors and memory systems suffer from a growing gap in performance. We introduce Active Pages, a computation model which addresses this gap by shifting data-intensive computations to the memory system. An Active Page consists of a page of data and a set of associated functions which can operate upon that data. We describe an implementation of Active Pages on RADram (Reconfigurable Architecture DRAM), a memory system based upon the integration of DRAM and reconfigurable logic. Results from the SimpleScalar simulator [BA97] demonstrate up to 1000X speedups on several applications using the RADram system versus conventional memory systems. We also explore the sensitivity of our results to implementations in other memory technologies.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
A+96
|
|
| |
AA95
|
|
| |
Ash90
|
Peter J. Ashenden. The VHDL cookbook, 1st ed. Dept of CS, U of Adelaide, S Australia, July 1990.
|
| |
B+96
|
D. Buell et al. Splash 2: FPGAs in a Custom Computing Machine. IEEE Computer Society, 1996.
|
| |
B+97a
|
N. Bowman et al. Evaluation of existing architectures in IRAM systems. In Workshop on Mixing Logic and DRAM, Denver, CO, June 1997'.
|
| |
B+97b
|
A. Brown et al. Using MML to simulate multiple dualported SRAMs: Parallel routing lookups in an ATM switch controller. In Workshop on Mixing Logic and DRAM, Denver, CO, June 1997.
|
 |
BA97
|
|
| |
Bat74
|
K.E. Batcher. STARAN parallel processor system hardware. AFIPS Conf Proceedings, pages 405-410, 1974.
|
| |
Bee96
|
Nelson H. F. Beebe. A bibliography of publications about the Linux operating system. Technical report, Ctr for Scientific Comp, Dept of Math, U of Utah, Salt Lake City, UT, May 1996.
|
 |
BGK96
|
Doug Burger , James R. Goodman , Alain Kägi, Memory bandwidth limitations of future microprocessors, Proceedings of the 23rd annual international symposium on Computer architecture, p.78-89, May 22-24, 1996, Philadelphia, Pennsylvania, United States
|
| |
CG86
|
|
| |
CLR96
|
|
| |
D+92
|
I. Duff et ai. User's guide for the Harwell-Boeing sparse matrix collection. Technical Report TR/PA/92/86, CER- FACS, 42 Ave G. Coriolis, 31057 Toulouse Cedex, France, October 1992.
|
| |
DeH95
|
A. DeHon. Notes on integrating reconfigurable logic with DRAM arrays. Transit Note 120, MIT, AI Lab. 545 Tech Sq. Cambridge NIA 02139, March 1995.
|
 |
DeH96a
|
|
| |
DeH96b
|
|
| |
EJ73
|
A. Evensen and J.Troy. Introduction to the architecture of a 288-element PEPE. In Proc. 1973 Sagamore Con}. on Par Processing, pages 162-169, 1973.
|
| |
Gus97
|
|
| |
GVNG94
|
|
| |
GW96
|
|
| |
H+96
|
Mary W. Hall , Jennifer M. Anderson , Saman P. Amarasinghe , Brian R. Murphy , Shih-Wei Liao , Edouard Bugnion , Monica S. Lam, Maximizing Multiprocessor Performance with the SUIF Compiler, Computer, v.29 n.12, p.84-89, December 1996
[doi> 10.1109/2.546613]
|
| |
HJ88
|
|
| |
HS86
|
|
| |
HT72
|
R. Hintz and D. Tote. Control data STAR-100 processor design. In COMPCON, pages i-4, 1972.
|
| |
HW97
|
|
| |
I+97
|
K. itch et al. Limitations and challenges of multigigabit DRAM chip design. IEEE Journal of Solid-State Circuits, 32(5):624-634, 1997.
|
| |
K+96
|
W . King et al. Using MORPH in an industrial machine vision system. In K. L. Pocek and J. Arnold, editors, Proceedings of IEEE Workshop on FPGAs /or Custom Computing Machines, pages 18-26, Napa, CA, april 1996.
|
 |
L+92
|
Charles E. Leiserson , Zahi S. Abuhamdeh , David C. Douglas , Carl R. Feynman , Mahesh N. Ganmukhi , Jeffrey V. Hill , Daniel Hillis , Bradley C. Kuszmaul , Margaret A. St. Pierre , David S. Wells , Monica C. Wong , Shaw-Wen Yang , Robert Zak, The network architecture of the Connection Machine CM-5 (extended abstract), Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures, p.272-285, June 29-July 01, 1992, San Diego, California, United States
[doi> 10.1145/140901.141883]
|
| |
M+96
|
|
| |
NM65
|
J. Nelder and R. Mead. A simplex method for function minimization. Computer Journal, 7:308-313, 1965.
|
| |
Pat95
|
David Patterson. Microprocessors in 2020. Scientific American, September 1995.
|
| |
Prz97
|
Steven Przybylski. Embedded DRAMs: Today and toward system-level integration. Technical report, Verdande Group, Inc., 3281 Lynn Oaks Drive, San Jose, CA, September 1997.
|
| |
R+93
|
|
 |
RS94
|
|
| |
RW92
|
|
| |
Sem94
|
Semiconductor Industry Association. The national technology roadmap for semicond uctors. http: / /www.sematech.org/public/roadmap/, 1994.
|
| |
SH90
|
|
| |
SKS97
|
|
| |
WC96
|
R. Wittig and P. Chow. OneChip: An FPGA processor with reconfigurable logic. In Symposium on FPGAs for Custom Computing Machines, pages 126-135, Napa Valley, California, April 1996.
|
| |
WH96
|
|
 |
WM95
|
|
CITED BY 40
|
|
George S. Almasi , Călin Caşcaval , José G. Castaños , Monty Denneau , Wilm Donath , Maria Eleftheriou , Mark Giampapa , Howard Ho , Derek Lieber , José E. Moreira , Dennis Newns , Marc Snir , Henry S. Warren, Jr., Demonstrating the Scalability of a Molecular Dynamics Application on a Petaflops Computer, International Journal of Parallel Programming, v.30 n.4, p.317-351, August 2002
|
|
|
|
|
|
|
|
|
|
|
|
Mark Oskin , Justin Hensley , Diana Keen , Frederic T. Chong , Matthew Farrens , Aneet Chopra, Exploiting ILP in page-based intelligent memory, Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture, p.208-218, November 16-18, 1999, Haifa, Israel
|
|
|
Chao Huang , Srivaths Ravi , Anand Raghunathan , Niraj K. Jha, High-level synthesis of distributed logic-memory architectures, Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design, p.564-571, November 10-14, 2002, San Jose, California
|
|
|
Lixin Zhang , Zhen Fang , Mide Parker , Binu K. Mathew , Lambert Schaelicke , John B. Carter , Wilson C. Hsieh , Sally A. McKee, The Impulse Memory Controller, IEEE Transactions on Computers, v.50 n.11, p.1117-1132, November 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
George Almási , Cǎlin Caşcaval , José G. Castaños , Monty Denneau , Derek Lieber , José E. Moreira , Henry S. Warren, Jr., Dissecting Cyclops: a detailed analysis of a multithreaded architecture, ACM SIGARCH Computer Architecture News, v.31 n.1, March 2003
|
|
|
|
|
|
Karthikeyan Sankaralingam , Ramadass Nagarajan , Haiming Liu , Changkyu Kim , Jaehyuk Huh , Nitya Ranganathan , Doug Burger , Stephen W. Keckler , Robert G. McDonald , Charles R. Moore, TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP, ACM Transactions on Architecture and Code Optimization (TACO), v.1 n.1, p.62-93, March 2004
|
|
|
|
|
|
Mary Hall , Peter Kogge , Jeff Koller , Pedro Diniz , Jacqueline Chame , Jeff Draper , Jeff LaCoss , John Granacki , Jay Brockman , Apoorv Srivastava , William Athas , Vincent Freeh , Jaewook Shin , Joonseok Park, Mapping irregular applications to DIVA, a PIM-based data-intensive architecture, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.57-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jay B. Brockman , Shyamkumar Thoziyoor , Shannon K. Kuntz , Peter M. Kogge, A low cost, multithreaded processing-in-memory system, Proceedings of the 3rd workshop on Memory performance issues: in conjunction with the 31st international symposium on computer architecture, p.16-22, June 20-20, 2004, Munich, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jaffrey Draper , J. Tim Barrett , Jeff Sondeen , Sumit Mediratta , Chang Woo Kang , Ihn Kim , Gokhan Daglikoca, A Prototype Processing-In-Memory (PIM) Chip for the Data-Intensive Architecture (DIVA) System, Journal of VLSI Signal Processing Systems, v.40 n.1, p.73-84, May 2005
|
|
|
|
|
|
|
|
|
|
|
|
John B. Carter , Wilson C. Hsieh , Leigh B. Stoller , Mark Swanson , Lixin Zhang , Sally A. McKee, Impulse: Memory system support for scientific applications, Scientific Programming, v.7 n.3-4, p.195-209, August 1999
|
|
|
|
|
|
|
|
|
Zhen Fang , Lixin Zhang , John B. Carter , Ali Ibrahim , Michael A. Parker, Active memory operations, Proceedings of the 21st annual international conference on Supercomputing, June 17-21, 2007, Seattle, Washington
|
|
|
|
|
|
Jeff Draper , Jacqueline Chame , Mary Hall , Craig Steele , Tim Barrett , Jeff LaCoss , John Granacki , Jaewook Shin , Chun Chen , Chang Woo Kang , Ihn Kim , Gokhan Daglikoca, The architecture of the DIVA processing-in-memory chip, Proceedings of the 16th international conference on Supercomputing, June 22-26, 2002, New York, New York, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|