ACM Home Page
Please provide us with feedback. Feedback
An approximate string matching approach for handling incorrectly typed urls
Full text PdfPdf (87 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session 1/information retrieval table of contents
Pages 1339-1340  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Mihai Stroe  Google Switzerland GmbH, Zurich, Switzerland
Radu Berinde  MIT, Cambridge, MA, USA
Cosmin Negruseri  Google Inc., Mountain View, CA, USA
Dan Popovici  Google Inc., Mountain View, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 123,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458268
What is a DOI?

ABSTRACT

In this paper we approach the problem of providing corrections for incorrectly typed URLs. This problem is significantly different from the classical spelling correction problem. We describe our contribution - building a custom data structure and a search algorithm that can find approximate matches for incorrect URLs. We evaluate the quality of our results through experiments with analysts. Our system is now being used in the Google search engine.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
V.I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8):707--710, 1966.
2
 
3
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University, 1998.

Collaborative Colleagues:
Mihai Stroe: colleagues
Radu Berinde: colleagues
Cosmin Negruseri: colleagues
Dan Popovici: colleagues