ACM Home Page
Please provide us with feedback. Feedback
Larrabee: a many-core x86 architecture for visual computing
Full text MovMov (28:05),  PdfPdf (2.21 MB)
Source
ACM Transactions on Graphics (TOG) archive
Volume 27 ,  Issue 3  (August 2008) table of contents
Proceedings of ACM SIGGRAPH 2008
SESSION: Parallelism table of contents
Article No. 18  
Year of Publication: 2008
ISSN:0730-0301
Also published in ...
Authors
Larry Seiler  Intel® Corporation
Doug Carmean  Intel® Corporation
Eric Sprangle  Intel® Corporation
Tom Forsyth  Intel® Corporation
Michael Abrash  RAD Game Tools
Pradeep Dubey  Intel® Corporation
Stephen Junkins  Intel® Corporation
Adam Lake  Intel® Corporation
Jeremy Sugerman  Stanford University
Robert Cavin  Intel® Corporation
Roger Espasa  Intel® Corporation
Ed Grochowski  Intel® Corporation
Toni Juan  Intel® Corporation
Pat Hanrahan  Stanford University
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 341,   Downloads (12 Months): 3190,   Citation Count: 29
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1360612.1360617
What is a DOI?

ABSTRACT

This paper presents a many-core visual computing architecture code named Larrabee, a new software rendering pipeline, a manycore programming model, and performance analysis for several applications. Larrabee uses multiple in-order x86 CPU cores that are augmented by a wide vector processor unit, as well as some fixed function logic blocks. This provides dramatically higher performance per watt and per unit of area than out-of-order CPUs on highly parallel workloads. It also greatly increases the flexibility and programmability of the architecture as compared to standard GPUs. A coherent on-die 2nd level cache allows efficient inter-processor communication and high-bandwidth local data access by CPU cores. Task scheduling is performed entirely with software in Larrabee, rather than in fixed function logic. The customizable software graphics rendering pipeline for this architecture uses binning in order to reduce required memory bandwidth, minimize lock contention, and increase opportunities for parallelism relative to standard GPUs. The Larrabee native programming model supports a variety of highly parallel applications that use irregular data structures. Performance analysis on those applications demonstrates Larrabee's potential for a broad range of parallel computation.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Aila, T., Laine, S. 2004. Alias-Free Shadow Maps. In Proceedings of Eurographics Symposium on Rendering 2004, Eurographics Association. 161--166.
 
3
 
4
AMD. 2007. Product description web site: ati.amd.com/products/Radeonhd3800/specs.html.
 
5
Bader, A., Chhugani, J., Dubey, P., Junkins, S., Morrison T., Ragozin, D., Smelyanskiy. 2008. Game Physics Performance On Larrabee Architecture. Intel whitepaper, available in August, 2008. Web site: techresearch.intel.com.
6
 
7
8
 
9
Bookout, D. July, 2007. Shadow Map Aliasing. Web site: www.gamedev.net/reference/articles/article2376.asp.
10
 
11
 
12
13
 
14
Chen, Y., Chhugani, J., Dubey, P., Hughes, C., Kim, D., Kumar, S., Lee, V., Nguyen A., Smelyanskiy, M. 2008. Convergence of Recognition, Mining, and Synthesis Workloads and its Implications. In Procedings of IEEE, v. 96, n. 5, 790--807.
 
15
Chuvelev, M., Greer, B., Henry, G., Kuznetsov, S., Burylov, I., Sabanin, B. Nov. 2007. Intel Performance Libraries: Multicore ready Software for Numeric Intensive Computation. Intel Technology Journal, v. 11, i. 4, 1--10.
16
 
17
 
18
19
 
20
Ghuloum, A., Smith, T., Wu, G., Zhou, X., Fang, J., Guo, P., So, B., Rajagopalan, M., Chen, Y., Chen, B. November 2007. Future-Proof Data Parallel Algorithms and Software on Intel Multi-Core Architectures. Intel Technology Journal, v. 11, i. 04, 333--348.
 
21
Gilbert, E., Johnson, D., Keerthi, S. 1988. A fast procedure for computing the distance between complex objects in three-dimensional space. IEEE Journal of Robotics and Automation, 4, 2, 193--203.
 
22
GPGPU. 2007. GPGPU web site: www.gpgpu.org.
23
 
24
 
25
Gwennap, L. 1995. Intel's P6 Uses Decoupled Superscalar Design. Microprocessor Report. v. 9, n. 2, Feb. 16, 1995.
 
26
27
 
28
IEEE Std. 1003.1, 2004 Edition. Standard for Information Technology - Portable Operating System Interface (POSIX) System Interfaces. The Open Group Technical Standard Base Specifications. Issue 6.
 
29
Jacobsen, T. 2001. Advanced Character Physics. Proc. Game Developers Conference 2001, 1--10.
30
31
32
 
33
Kessenich, J., Baldwin, D., Rost, R. The OpenGL Shading Language. Version 1.1. Sept. 7, 2006. Web site: www.opengl.org/registry/doc/GLSLangSpec.Full.1.20.8.pdf
 
34
 
35
 
36
Lake, A. 2005. Intel Graphics Media Accelerator Series 900 Developer's Guide. Version 2.0. Web site:download.intel.com/ids/gma/Intel_915G_SDG_Feb05.pdf.
 
37
38
 
39
Microsoft. 2007. Microsoft Reference for HLSL. Web site: msdn2.microsoft.com/en-us/library/bb509638.aspx.
 
40
41
 
42
Morein, S. 2000. ATI Radeon HyperZ Technology. Presented at Graphics Hardware 2000. Web site:www.graphicshardware.org/previous/www_2000/presentations/ATIHot3D.pdf.
43
 
44
Nvidia. 2008. Product description web site:www.nvidia.com/object/geforce_family.html.
 
45
Owens, J., Luebke, D., Govindaraju, N., Harris, M., Kruger, J., Lefohn, A., Purcell, T. 2007. A Survey of General Purpose Computation on Graphics Hardware. Computer Graphics Forum. v.26, n. 1, 80--113.
 
46
Pham D., Asano, S., Bolliger, M., Day, M., Hofstee, H., Johns., C., Kahle, J., Kameyama, A., Keaty, J., Masubuchi, Y., Riley, M., Shippy, D., Stasiask, D., Suzuodi, M., Wang, M., Warnock, J., Weitzel, S., Wendel, D., Yamazaki, T., Yazawa, K. 2005. The Design and Implementation of a First Generation CELL Processor. IEEE International Solid-State Circuits Conference. 184--186.
 
47
Pharr, M. 2006. Interactive Rendering in the Post-GPU Era. Presented at Graphics Hardware 2006. Web site:www.pharr.org/matt/.
48
 
49
Power VR. 2008. Web site:www.imgtec.com/powervr/products/Graphics/index.asp.
 
50
 
51
52
 
53
 
54
Shevtsov, M., Soupikov, A., Kapustin, A. 2007. Ray-Triangle Intersection Algorithm for Modern CPU Architectures. In Proceedings of GraphiCon 2007, 33--39.
 
55
Stevens, A. 2006. ARM Mali 3D Graphics System Solution. Web site:www.arm.com/miscPDFs/16514.pdf.
56
57
58

CITED BY  29


REVIEW

"Hector Yee : Reviewer"

In the early years of computer graphics, software renderers were very popular on the personal computer. These renderers have been recently supplanted by graphics processing units (GPUs), which first took over fixed-function operations such as tria  more...

Collaborative Colleagues:
Larry Seiler: colleagues
Doug Carmean: colleagues
Eric Sprangle: colleagues
Tom Forsyth: colleagues
Michael Abrash: colleagues
Pradeep Dubey: colleagues
Stephen Junkins: colleagues
Adam Lake: colleagues
Jeremy Sugerman: colleagues
Robert Cavin: colleagues
Roger Espasa: colleagues
Ed Grochowski: colleagues
Toni Juan: colleagues
Pat Hanrahan: colleagues