|
ABSTRACT
The Niagara processor implements a thread-rich architecture designed to provide a high-performance solution for commercial server applications. The hardware supports 32 threads with a memory subsystem consisting of an on-board crossbar, level-2 cache, and memory controllers for a highly integrated design that exploits the thread-level parallelism inherent to server applications, while targeting low levels of power consumption.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S.R. Kunkel, et al., "A Performance Methodology for Commercial Servers," <i>IBM J. Research and Development,</i> vol. 44, no. 6, 2000, pp. 851-872.
|
| |
2
|
|
 |
3
|
Kunle Olukotun , Basem A. Nayfeh , Lance Hammond , Ken Wilson , Kunyung Chang, The case for a single-chip multiprocessor, Proceedings of the seventh international conference on Architectural support for programming languages and operating systems, p.2-11, October 01-04, 1996, Cambridge, Massachusetts, United States
|
 |
4
|
James Laudon , Anoop Gupta , Mark Horowitz, Interleaving: a multithreading technique targeting multiprocessors and workstations, Proceedings of the sixth international conference on Architectural support for programming languages and operating systems, p.308-318, October 05-07, 1994, San Jose, California, United States
|
 |
5
|
Luiz André Barroso , Kourosh Gharachorloo , Robert McNamara , Andreas Nowatzyk , Shaz Qadeer , Barton Sano , Scott Smith , Robert Stets , Ben Verghese, Piranha: a scalable architecture based on single-chip multiprocessing, Proceedings of the 27th annual international symposium on Computer architecture, p.282-293, June 2000, Vancouver, British Columbia, Canada
|
| |
6
|
S. Kapil H. McGhan and J. Lawrendra, "A Chip Multithreaded Processor for Network-Facing Workloads," <i>IEEE Micro,</i> vol. 24, no. 2, Mar.-Apr. 2004, pp. 20-30.
|
| |
7
|
J. Hart, et al., "Implementation of a 4th-Generation 1.8 GHz Dual Core Sparc V9 Microprocessor," <i>Proc. Int'l Solid-State Circuits Conf.</i> (ISSCC 05), IEEE Press, 2005, http://www.isscc.org/isscc/2005/ap/ISSCC2005AdvanceProgram.pdf.
|
CITED BY 131
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wei Huang , Mircea R. Stant , Karthik Sankaranarayanan , Robert J. Ribando , Kevin Skadron, Many-core design from a thermal perspective, Proceedings of the 45th annual conference on Design automation, June 08-13, 2008, Anaheim, California
|
|
|
|
|
|
Engin Ipek , Sally A. McKee , Karan Singh , Rich Caruana , Bronis R. de Supinski , Martin Schulz, Efficient architectural design space exploration via predictive modeling, ACM Transactions on Architecture and Code Optimization (TACO), v.4 n.4, p.1-34, January 2008
|
|
|
|
|
|
|
|
|
|
|
|
Mark Aiken , Manuel Fähndrich , Chris Hawblitzel , Galen Hunt , James Larus, Deconstructing process isolation, Proceedings of the 2006 workshop on Memory system performance and correctness, October 22-22, 2006, San Jose, California
|
|
|
Taeho Kgil , Shaun D'Souza , Ali Saidi , Nathan Binkert , Ronald Dreslinski , Trevor Mudge , Steven Reinhardt , Krisztian Flautner, PicoServer: using 3D stacking technology to enable a compact energy efficient chip multiprocessor, ACM SIGPLAN Notices, v.41 n.11, November 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Brian D. Carlstrom , Austen McDonald , Michael Carbin , Christos Kozyrakis , Kunle Olukotun, Transactional collection classes, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, March 14-17, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
Brian D. Carlstrom , Austen McDonald , Hassan Chafi , JaeWoong Chung , Chi Cao Minh , Christos Kozyrakis , Kunle Olukotun, The Atomos transactional programming language, ACM SIGPLAN Notices, v.41 n.6, June 2006
|
|
|
Matteo Monchiero , Gianluca Palermo , Cristina Silvano , Oreste Villa, Power/performance hardware optimization for synchronization intensive applications in MPSoCs, Proceedings of the conference on Design, automation and test in Europe: Proceedings, March 06-10, 2006, Munich, Germany
|
|
|
|
|
|
Brian D. Carlstrom , JaeWoong Chung , Hassan Chafi , Austen McDonald , Chi Cao Minh , Lance Hammond , Christos Kozyrakis , Kunle Olukotun, Executing Java programs with transactional memory, Science of Computer Programming, v.63 n.2, p.111-129, 1 December 2006
|
|
|
Songqing Chen , Xinyuan Wang , Lei Liu , Xinwen Zhang, WormTerminator: an effective containment of unknown and polymorphic fast spreading worms, Proceedings of the 2006 ACM/IEEE symposium on Architecture for networking and communications systems, December 03-05, 2006, San Jose, California, USA
|
|
|
Dan Wallin , Henrik Löf , Erik Hagersten , Sverker Holmgren, Multigrid and Gauss-Seidel smoothers revisited: parallelization on chip multiprocessors, Proceedings of the 20th annual international conference on Supercomputing, June 28-July 01, 2006, Cairns, Queensland, Australia
|
|
|
|
|
|
Matthew Curtis-Maury , James Dzierwa , Christos D. Antonopoulos , Dimitrios S. Nikolopoulos, Online power-performance adaptation of multithreaded programs using hardware event-based prediction, Proceedings of the 20th annual international conference on Supercomputing, June 28-July 01, 2006, Cairns, Queensland, Australia
|
|
|
|
|
|
Jarek Nieplocha , Andrès Márquez , John Feo , Daniel Chavarría-Miranda , George Chin , Chad Scherrer , Nathaniel Beagley, Evaluating the potential of multithreaded platforms for irregular scientific computations, Proceedings of the 4th international conference on Computing frontiers, May 07-09, 2007, Ischia, Italy
|
|
|
Bratin Saha , Ali-Reza Adl-Tabatabai , Anwar Ghuloum , Mohan Rajagopalan , Richard L. Hudson , Leaf Petersen , Vijay Menon , Brian Murphy , Tatiana Shpeisman , Eric Sprangle , Anwar Rohillah , Doug Carmean , Jesse Fang, Enabling scalability and performance in a large scale CMP environment, ACM SIGOPS Operating Systems Review, v.41 n.3, June 2007
|
|
|
Shigeru Kusakabe , Mitsuhiro Aono , Masaaki Izumi , Satoshi Amamiya , Yoshinari Nomura , Hideo Taniguchi , Makoto Amamiya, Scalability of continuation-based fine-grained multithreading in handling multiple I/O requests on FUCE, Proceedings of the 4th international conference on Computing frontiers, May 07-09, 2007, Ischia, Italy
|
|
|
Salvatore Carta , Andrea Acquaviva , Pablo G. Del Valle , David Atienza , Giovanni De Micheli , Fernando Rincon , Luca Benini , Jose M. Mendias, Multi-processor operating system emulation framework with thermal feedback for systems-on-chip, Proceedings of the 17th great lakes symposium on Great lakes symposium on VLSI, p.311-316, March 11-13, 2007, Stresa-Lago Maggiore, Italy
|
|
|
|
|
|
Shailender Chaudhry , Robert Cypher , Magnus Ekman , Martin Karlsson , Anders Landin , Sherman Yip , Håkan Zeffer , Marc Tremblay, Simultaneous speculative threading: a novel pipeline architecture implemented in sun's rock processor, ACM SIGARCH Computer Architecture News, v.37 n.3, June 2009
|
|
|
|
|
|
Lisa R. Hsu , Steven K. Reinhardt , Ravishankar Iyer , Srihari Makineni, Communist, utilitarian, and capitalist cache policies on CMPs: caches as a shared resource, Proceedings of the 15th international conference on Parallel architectures and compilation techniques, September 16-20, 2006, Seattle, Washington, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Satoshi Amamiya , Masaaki Izumi , Takanori Matsuzaki , Ryuzo Hasegawa , Makoto Amamiya, Fuce: the continuation-based multithreading processor, Proceedings of the 4th international conference on Computing frontiers, May 07-09, 2007, Ischia, Italy
|
|
|
|
|
|
David W. Oehmke , Nathan L. Binkert , Trevor Mudge , Steven K. Reinhardt, How to Fake 1000 Registers, Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture, p.7-18, November 12-16, 2005, Barcelona, Spain
|
|
|
|
|
|
Feihui Li , Chrysostomos Nicopoulos , Thomas Richardson , Yuan Xie , Vijaykrishnan Narayanan , Mahmut Kandemir, Design and Management of 3D Chip Multiprocessors Using Network-in-Memory, ACM SIGARCH Computer Architecture News, v.34 n.2, p.130-141, May 2006
|
|
|
|
|
|
Jack Sampson , Ruben Gonzalez , Jean-Francois Collard , Norman P. Jouppi , Mike Schlansker , Brad Calder, Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, p.235-246, December 09-13, 2006
|
|
|
|
|
|
Karthikeyan Sankaralingam , Ramadass Nagarajan , Robert McDonald , Rajagopalan Desikan , Saurabh Drolia , M. S. Govindan , Paul Gratz , Divya Gulati , Heather Hanson , Changkyu Kim , Haiming Liu , Nitya Ranganathan , Simha Sethumadhavan , Sadia Sharif , Premkishore Shivakumar , Stephen W. Keckler , Doug Burger, Distributed Microarchitectural Protocols in the TRIPS Prototype Processor, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, p.480-491, December 09-13, 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Srinivasan Murali , Almir Mutapcic , David Atienza , Rajesh Gupta , Stephen Boyd , Luca Benini , Giovanni De Micheli, Temperature control of high-performance multi-core platforms using convex optimization, Proceedings of the conference on Design, automation and test in Europe, March 10-14, 2008, Munich, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mikhail Smelyanskiy , Victor W Lee , Daehyun Kim , Anthony D Nguyen , Pradeep Dubey, Scaling performance of interior-point method on large-scale chip multiprocessor system, Proceedings of the 2007 ACM/IEEE conference on Supercomputing, November 10-16, 2007, Reno, Nevada
|
|
|
Vicenç Beltran , Jordi Torres , Eduard Ayguadé, Improving disk bandwidth-bound applications through main memory compression, Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture, p.57-63, September 16-16, 2007, Brasov, Romania
|
|
|
|
|
|
Ana Bosque , Pablo Ibañez , Víctor Viñals , Per Stenström , Jose M. Llabería, Characterization of Apache web server with Specweb2005, Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture, p.65-72, September 16-16, 2007, Brasov, Romania
|
|
|
Aamer Jaleel , William Hasenplaugh , Moinuddin Qureshi , Julien Sebot , Simon Steely, Jr. , Joel Emer, Adaptive insertion policies for managing shared caches, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, October 25-29, 2008, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
Dana Vantrease , Robert Schreiber , Matteo Monchiero , Moray McLaren , Norman P. Jouppi , Marco Fiorentino , Al Davis , Nathan Binkert , Raymond G. Beausoleil , Jung Ho Ahn, Corona: System Implications of Emerging Nanophotonic Technology, ACM SIGARCH Computer Architecture News, v.36 n.3, p.153-164, June 2008
|
|
|
|
|
|
Dongkook Park , Soumya Eachempati , Reetuparna Das , Asit K. Mishra , Yuan Xie , N. Vijaykrishnan , Chita R. Das, MIRA: A Multi-layered On-Chip Interconnect Router Architecture, ACM SIGARCH Computer Architecture News, v.36 n.3, p.251-261, June 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Michael Engel , Olaf Spinczyk, Aspects in hardware: what do they look like?, Proceedings of the 2008 AOSD workshop on Aspects, components, and patterns for infrastructure software, p.1-6, March 31-April 08, 2008, Brussels, Belgium
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Larry Seiler , Doug Carmean , Eric Sprangle , Tom Forsyth , Michael Abrash , Pradeep Dubey , Stephen Junkins , Adam Lake , Jeremy Sugerman , Robert Cavin , Roger Espasa , Ed Grochowski , Toni Juan , Pat Hanrahan, Larrabee: a many-core x86 architecture for visual computing, ACM Transactions on Graphics (TOG), v.27 n.3, August 2008
|
|
|
Qiong Cai , José González , Ryan Rakvic , Grigorios Magklis , Pedro Chaparro , Antonio González, Meeting points: using thread criticality to adapt multicore hardware to parallel regions, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, October 25-29, 2008, Toronto, Ontario, Canada
|
|
|
José González , Qiong Cai , Pedro Chaparro , Grigorios Magklis , Ryan Rakvic , Antonio González, Thread fusion, Proceeding of the thirteenth international symposium on Low power electronics and design, August 11-13, 2008, Bangalore, India
|
|
|
|
|
|
|
|
|
Taeho Kgil , Ali Saidi , Nathan Binkert , Steve Reinhardt , Krisztian Flautner , Trevor Mudge, PicoServer: Using 3D stacking technology to build energy efficient servers, ACM Journal on Emerging Technologies in Computing Systems (JETC), v.4 n.4, p.1-34, October 2008
|
|
|
Li Zhao , Ravi Iyer , Jaideep Moses , Ramesh Illikkal , Srihari Makineni , Don Newell, Exploring Large-Scale CMP Architectures Using ManySim, IEEE Micro, v.27 n.4, p.21-33, July 2007
|
|
|
Lan Dong , Yang Yang, An approach on distributed and shared dynamic cache partition, Proceedings of the 7th conference on Data networks, communications, computers, p.155-157, November 07-09, 2008, Bucharest, Romania
|
|
|
|
|
|
|
|
|
|
|
|
Jason Cong , Karthik Gururaj , Guoling Han , Adam Kaplan , Mishali Naik , Glenn Reinman, MC-Sim: an efficient simulation tool for MPSoC designs, Proceedings of the 2008 IEEE/ACM International Conference on Computer-Aided Design, November 10-13, 2008, San Jose, California
|
|
|
|
|
|
|
|
|
|
|
|
Seung Woo Son , Mahmut Kandemir , Mustafa Karakoy , Dhruva Chakrabarti, A compiler-directed data prefetching scheme for chip multiprocessors, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|
|
Christos D. Antonopoulos , Filip Blagojevic , Andrey N. Chernikov , Nikos P. Chrisochoides , Dimitrios S. Nikolopoulos, Algorithm, software, and hardware optimizations for Delaunay mesh generation on simultaneous multithreaded architectures, Journal of Parallel and Distributed Computing, v.69 n.7, p.601-612, July, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ryan Johnson , Ippokratis Pandis , Nikos Hardavellas , Anastasia Ailamaki , Babak Falsafi, Shore-MT: a scalable storage manager for the multicore era, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|
|
|
|
|
|
|
|
Christos D. Antonopoulos , Filip Blagojevic , Andrey N. Chernikov , Nikos P. Chrisochoides , Dimitrios S. Nikolopoulos, A multigrain Delaunay mesh generation method for multicore SMT-based architectures, Journal of Parallel and Distributed Computing, v.69 n.7, p.589-600, July, 2009
|
|
|
|
|
|
|
|
|
Andrew Herdrich , Ramesh Illikkal , Ravi Iyer , Don Newell , Vineet Chadha , Jaideep Moses, Rate-based QoS techniques for cache/memory in CMP platforms, Proceedings of the 23rd international conference on Supercomputing, June 08-12, 2009, Yorktown Heights, NY, USA
|
|
|
|
|
|
Martin Thuresson , Magnus Själander , Magnus Björk , Lars Svensson , Per Larsson-Edefors , Per Stenstrom, FlexCore: Utilizing Exposed Datapath Control for Efficient Computing, Journal of Signal Processing Systems, v.57 n.1, p.5-19, October 2009
|
|
|
|
|
|
|
|
|
Hui Liu , Zili Shao , Meng Wang , Junzhao Du , Chun Jason Xue , Zhiping Jia, Combining Coarse-Grained Software Pipelining with DVS for Scheduling Real-Time Periodic Dependent Tasks on Multi-Core Embedded Systems, Journal of Signal Processing Systems, v.57 n.2, p.249-262, November 2009
|
|
|
Amin Firoozshahian , Alex Solomatnikov , Ofer Shacham , Zain Asgar , Stephen Richardson , Christos Kozyrakis , Mark Horowitz, A memory system design framework: creating smart memories, ACM SIGARCH Computer Architecture News, v.37 n.3, June 2009
|
|