| Automatic text discovering through stroke-based segmentation and text string combination |
| Full text |
Pdf
(373 KB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 16th ACM international conference on Multimedia
table of contents
Vancouver, British Columbia, Canada
SESSION: Applications track short papers session 1
table of contents
Pages 805-808
Year of Publication: 2008
ISBN:978-1-60558-303-7
|
|
Authors
|
|
Lei Xu
|
Nokia Research Center, Beijing, and Beijing University of Posts and Telecommunications, China
|
|
Yingfei Liu
|
Nokia Research Center, Beijing, China
|
|
Kongqiao Wang
|
Nokia Research Center, Beijing, China
|
|
Hao Wang
|
Nokia Research Center, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 64, Citation Count: 0
|
|
|
ABSTRACT
In this paper we present a novel framework of automatic text discovering for content-based multimedia application. For single image, the stroke-based binarization and the coarse-to-fine text extraction will collaborate to generate a clean text image for recognition. For image sequence, multi-frame text enhancement is adopted to increase the text/background contrast, and the recognition results are finally refined by the text string combination algorithm to get more precise semantic information. Two prototype demos have been successfully developed on mobile phones. The experimental results on different platforms show the superior performance of the proposed method.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
X. Chen and A. L. Yuille. Detecting and Reading Text in Natural Scenes. In Proceedings of CVPR, pages 366--373, 2004.
|
| |
2
|
|
| |
3
|
I. Haritaoglu. Scene Text Extraction and Translation for Handheld Devices. In Proceedings of CVPR, pages 408--413, 2001.
|
| |
4
|
X.-S. Hua, P. Yin, and H.-J. Zhang. Efficient Video Text Recognition Using Multiple Frame Integration. In Proceedings of ICIP, pages 397--400, 2002.
|
 |
5
|
Jinqiao Wang , Lingyu Duan , Lei Xu , Hanqing Lu , Jesse S. Jin, TV ad video categorization with probabilistic latent concept learning, Proceedings of the international workshop on Workshop on multimedia information retrieval, September 24-29, 2007, Augsburg, Bavaria, Germany
[doi> 10.1145/1290082.1290113]
|
| |
6
|
K. Wang and J. A. Kangas. Character Location in Scene Images from Digital Camera. Pattern Recognition, 36(10):2287--2299, 2003.
|
| |
7
|
|
| |
8
|
L. Xu and K. Wang. Extracting Text Information for Content-Based Video Retrieval. In Proceedings of MMM, pages 58--69, 2008.
|
 |
9
|
|
|