ACM Home Page
Please provide us with feedback. Feedback
Optimizing techniques for saturated arithmetic with first-order linear recurrence
Full text PdfPdf (461 KB)
Source
Symposium on Applied Computing archive
Proceedings of the 2009 ACM symposium on Applied Computing table of contents
Honolulu, Hawaii
SESSION: Programming languages track table of contents
Pages 1883-1889  
Year of Publication: 2009
ISBN:978-1-60558-166-8
Authors
Weihua Zhang  Fudan University, Shanghai, China
Lili Liu  Fudan University, Shanghai, China
Chen Zhang  Fudan University, Shanghai, China
Hongjiong Zhang  Fudan University, Shanghai, China
Binyu Zang  Fudan University, Shanghai, China
Chuanqi Zhu  Fudan University, Shanghai, China
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 35,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1529282.1529704
What is a DOI?

ABSTRACT

Saturated arithmetic is a typical operation in multimedia applications, most multimedia extensions in the instruction set architecture (ISA) of modern processors provide saturation instructions for such operation. Therefore, extensive researches have focused on how to utilize saturation instructions to optimize programs. Previous algorithms mainly focus on purely saturated arithmetic, however saturated arithmetic is often mingled with first-order linear recurrence (FOLR) in real life applications. When FLOR pattern appears in the program, previous algorithms can not identify the saturated arithmetic as well.

In fact, the saturated arithmetic with FOLR (SAWF) is a new and significant pattern, especially, SAWF with one as coefficient is frequently used in multimedia applications. Hence, it is necessary to explore a method with which such pattern can be efficiently vectorized. This paper discusses how to vectorize SAWF, explores the efficient method to vectorize SAWF with one as coefficient and gives its evaluation and implement a library for the optimizing technique. Such an implementation manner can make compilers are able to exploit it more easily. The experimental results shows the optimizing technique can achieve a speedup of 1.19 to 1.46 on Pentium IV processor. At the same time, the optimizing techniques in this paper can also be used to develop a library for SAWF so a programmer can benefit even without changing the compiler.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Aart J. C. Bik, Milind Girkae, Paul M. Grey, Xinmin Tian. Automatic Detection of Saturation and Clipping Idioms. Proceedings of the 15th International Workshop on Languages and Compilers for parallel computers, July, 2002
 
3
 
4
 
5
Ren G, Wu P, Padua D. A Preliminary Study On the Vectorization of Multimedia Applications for Multimedia Extensions. Proc. Of the 16th Int'l WorkShop on Languages and Compilers for Parallel Computing. 2003
 
6
Weihua Jiang, Chao Mei, BoHuang, Jianhui Li, Jiahua Zhu, Binyu Zang, Chuanqi Zhu. Boosting the Performance of Multimedia Applications Using SIMD Instructions. Compiler Constructions. 2005
 
7
Jiahua Zhu, HongJiang Zhang, Hui Shi, Binyu Zang, Chuanqi Zhu "Overflow Controlled SIMD Arithmetic". The 17th International Workshop on Languages and Compilers for Parallel Computing (LCPC 04)
 
8
 
9
M. Nakamura, Y. Okabe, and T. Tsuda. New fast algorithms for first-order linear recurrences on vector computers. In 5th Workshop on Compilers for Parallel Computers, pp. 167C174, June 1995.
 
10
 
11
 
12
13
 
14

Collaborative Colleagues:
Weihua Zhang: colleagues
Lili Liu: colleagues
Chen Zhang: colleagues
Hongjiong Zhang: colleagues
Binyu Zang: colleagues
Chuanqi Zhu: colleagues