ACM Home Page
Please provide us with feedback. Feedback
A low-power parallel design of discrete wavelet transform using subthreshold voltage technology
Full text PdfPdf (596 KB)
Source
International Conference on Compilers, Architecture and Synthesis for Embedded Systems archive
Proceedings of the 2008 international conference on Compilers, architectures and synthesis for embedded systems table of contents
Atlanta, GA, USA
SESSION: Energy, power, and security table of contents
Pages 235-244  
Year of Publication: 2008
ISBN:978-1-60558-469-0
Authors
Michael B. Henry  Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
Syed I. Haider  Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
Leyla Nazhandali  Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 84,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1450095.1450130
What is a DOI?

ABSTRACT

The Discrete Wavelet Transform (DWT) is a means to analyze the frequency content of a signal and has extensive uses, including the JPEG2000 codec. Many portable and battery operated applications of DWT are expected in the near future that require a low power implementation of this transform. In this paper, a parallel VLSI implementation of a 2D lifting-based DWT processor is presented that is scalable from 2 to 256 parallel units. This design benefits from an efficient data distribution module to the parallel units, which constitutes a small overhead, and is able to significantly benefit from voltage scaling to achieve energy efficiency. In our design, the number of parallel units is increased and their speed is reduced through voltage scaling, while maintaining a constant throughput. Our results show that the optimal operating voltage of the parallel units, for a target throughput of 200MHz,4 is 386mV. This is below the threshold voltage, which is the voltage that turns the transistors on. Since operating a circuit in subthreshold voltage consumes 100+ times less power than running it at nominal voltage, our design is able to provide the same throughput as a reference pipelined implementation with 26 times less power consumption.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Oklahoma state university standard cell library
 
2
 
3
A. P. Chandrakasan, S. Sheng, and R. W. Brodersen. Low-power CMOS digital design. IEEE Journal of Solid-State Circuits, 27(4):473--484, Apr. 1992.
 
4
 
5
I. Daubechies The wavelet transform, time-frequency localization and signalanalysis Information Theory, IEEE Transactions on 36(5):961--1005, Sept. 1990.
 
6
I. Daubechies and W. Sweldens. Factoring wavelet transforms into lifting steps. J. Fourier Anal. Appl., 4(3):245--267, 1998.
 
7
8
9
10
11
 
12
C. H. I. Kim, H. Soeleman, and K. Roy. Ultra-low-power DLMS adaptive filter for hearing aid applications. IEEE Transactions on Very Large Scale Integration (VLSI) Systems,11(6):1058--1067, Dec. 2003.
13
 
14
J. D. Meindl and J. A. Davis. The fundamental limit on binary switching energy for terascale integration (TSI). In IEEE JSSCC vol. 35, Feb. 2002.
15
16
 
17
18
 
19
V. Sze, R. Blazquez, M. Bhardwaj, and A. Chandrakasan. An energy efficient sub-threshold baseband processor architecture for pulsed ultra-wideband communications. In Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, volume 3, Toulouse, 2006.
 
20
K. C. B. Tan and T. Arslan. Low power embedded extension algorithm for lifting{based discrete wavelet transform in JPEG2000. Electronics Letters, 37(22):1328--1330, Oct. 2001.
 
21
H. Varshney, M. Hasan, and S. Jain. Energy efficient novel architectures for the lifting-based discrete wavelet transform. IET Image Processing, 1:305--310, Sept. 2007.
 
22
A. Wang and A. Chandrakasan. A 180-mv subthreshold FFT processor using a minimum energy design methodology. IEEE Journal of Solid-State Circuits, 40(1):310--319, Jan. 2005.
 
23
S. Wilton and N. Jouppi. Cacti: An enhanced cache access and cycle time model, 1996.
24
25

Collaborative Colleagues:
Michael B. Henry: colleagues
Syed I. Haider: colleagues
Leyla Nazhandali: colleagues