ACM Home Page
Please provide us with feedback. Feedback
Object fingerprints for content analysis with applications to street landmark localization
Full text PdfPdf (1.32 MB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C5: multimedia content analysis and applications table of contents
Pages 169-178  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Wen Wu  Carnegie Mellon University, Pittsburgh, PA, USA
Jie Yang  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 23,   Downloads (12 Months): 134,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459383
What is a DOI?

ABSTRACT

An object can be a basic unit for multimedia content analysis. Besides similarity among common objects, each object has its own unique characteristics which we cannot find in other surrounding objects in multimedia data. We call such unique characteristics object fingerprints. In this paper, we propose a novel approach to extract and match object fingerprints for multimedia content analysis. In particular, we focus on the problem of street landmark localization from images. Instead of modeling and matching a street landmark as a whole, our proposed approach extracts the landmark's object fingerprints in a given image and match to a new image or video in order to localize the landmark. We formulate matching the landmark's object fingerprints as a classification problem solved by a cascade of 1NN classifiers. We develop a street landmark localization system that combines salient region detection, segmentation, and object fingerprint extraction techniques for the purpose. To evaluate, we have compiled a novel dataset which consists of 15 U.S. street landmarks' images and videos. Our experiments on this dataset show superior performance to state-of-the-art recognition algorithms [20, 33]. The proposed approach can also be well generalized to other objects of interest and content analysis tasks. We demonstrate the feasibility through the application of our approach to refine web image search results and obtained encouraging results.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
G. E. Burnett. Turn right at the king's head: Drivers' requirements for route guidance information. In PhD thesis, Loughborough University, UK, 1998.
 
3
4
 
5
A. de la Escalera, L. E. Moreno, M. A. Salichs, and J. M. Armingol. Road traffic sign detection and classification. IEEE Transactions on Industrial Electronics, 1997.
 
6
 
7
 
8
 
9
 
10
X. Hou and L. Zhang. Saliency detection: a spectral residual approach. In CVPR, 2007.
11
12
13
14
 
15
S. Kim, X. Jin, and J. Han. Sparclus: Spatial relationship pattern-based hierarchical clustering. In SIAM Int. Conf. on Data Mining, 2008.
16
 
17
B. Leibe, N. Cornelis, K. Cornelis, and L. Van Gool. Dynamic 3d scene analysis from a moving vehicle. In CVPR, 2007.
18
19
 
20
 
21
T. Malisiewicz and A. A. Efros. Improving spatial support for objects via multiple segmentations. In British Machine Vision Conference, 2007.
22
 
23
G. Piccioli, E. De Micheli, P. Parodi, and M. Campani. Robust method for road sign detection and recognition. Image and Vision Computing, 1996.
 
24
J. Pilet, V. Lepetit, and P. Fua. Real-time non-rigid surface detection. In CVPR, 2005.
 
25
26
 
27
G. Schindler, M. Brown, and R. Szeliski. City-scale location recognition. In CVPR, 2007.
 
28
 
29
 
30
 
31
J. Willamowski, D. Arregui, G. Csurka, C. R. Dance, and L. Fan. Categorizing nine visual classes using local appearance descriptors. In IWLAVS, 2004.
32
 
33
34
35
 
36
C. L. Zitnick, J. Sun, R. Szeliski, and S. Winder. Object instance recognition using triplets of feature symbols. In Microsoft Research Technical Report, 2007.