Automatic text discovering through stroke-based segmentation and text string combination
Source
International Multimedia Conference
Proceedings of the 16th ACM International Conference on Multimedia
Vancouver, British Columbia, Canada
SESSION: Applications track short papers session 1
Pages 805-808
Year of Publication: 2008
ISBN: 978-1-60558-303-7
Authors
Lei Xu  Nokia Research Center, Beijing, and Beijing University of Posts and Telecommunications, China
Yingfei Liu  Nokia Research Center, Beijing, China
Kongqiao Wang  Nokia Research Center, Beijing, China
Hao Wang  Nokia Research Center, Beijing, China
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1459359.1459492

ABSTRACT

In this paper we present a novel framework for automatic text discovery in content-based multimedia applications. For a single image, stroke-based binarization and coarse-to-fine text extraction work together to generate a clean text image for recognition. For an image sequence, multi-frame text enhancement is adopted to increase the text/background contrast, and the recognition results are then refined by a text string combination algorithm to obtain more precise semantic information. Two prototype demos have been successfully developed on mobile phones. Experimental results on different platforms show the superior performance of the proposed method.
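
The abstract does not say how the multi-frame text enhancement is computed, so the following is only a minimal sketch of one common way to raise text/background contrast across frames, not the authors' method. It assumes the frames are already aligned, that the text is bright and static, and that the background changes between frames; under those assumptions a per-pixel minimum keeps the text bright while darkening the background. The names `enhance_by_temporal_min` and `contrast` are hypothetical helpers introduced for illustration.

```python
from typing import List

# Assumption: a frame is a grayscale image stored as rows of 0..255 intensities.
Frame = List[List[int]]


def enhance_by_temporal_min(frames: List[Frame]) -> Frame:
    """Per-pixel minimum over aligned frames (hypothetical helper).

    Assumes bright, static text over a changing background; this is an
    illustrative stand-in, not the enhancement described in the paper.
    """
    if not frames:
        raise ValueError("need at least one frame")
    height, width = len(frames[0]), len(frames[0][0])
    for f in frames:
        if len(f) != height or any(len(row) != width for row in f):
            raise ValueError("all frames must share the same size")
    return [
        [min(f[y][x] for f in frames) for x in range(width)]
        for y in range(height)
    ]


def contrast(frame: Frame, text_mask: List[List[bool]]) -> float:
    """Mean intensity of text pixels minus mean intensity of background pixels."""
    text = [v for row, mrow in zip(frame, text_mask)
            for v, m in zip(row, mrow) if m]
    back = [v for row, mrow in zip(frame, text_mask)
            for v, m in zip(row, mrow) if not m]
    return sum(text) / len(text) - sum(back) / len(back)


# Two toy 2x2 frames: the diagonal pixels are "text" (always 250),
# the off-diagonal pixels are background that varies between frames.
frames = [
    [[250, 200], [40, 250]],
    [[250, 30], [180, 250]],
]
mask = [[True, False], [False, True]]
enhanced = enhance_by_temporal_min(frames)
```

In this toy input the background pixels take their darkest observed value while the text pixels are unchanged, so the text/background contrast of `enhanced` is higher than that of either input frame. Dark text on a bright background would call for a per-pixel maximum instead.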


