Gazetiki: automatic creation of a geographical gazetteer
Source
International Conference on Digital Libraries
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Pittsburgh, PA, USA
SESSION: Geography and trust on the web
Pages 85-93
Year of Publication: 2008
ISBN: 978-1-59593-998-2
Authors
Adrian Popescu  CEA, Fontenay aux Roses, France
Gregory Grefenstette  CEA, Fontenay aux Roses, France
Pierre Alain Moëllic  CEA, Fontenay aux Roses, France
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 120,   Citation Count: 2
DOI: http://doi.acm.org/10.1145/1378889.1378906

ABSTRACT

Geolocalized databases are becoming necessary in a wide variety of application domains. Thus far, creating such databases has been a costly, manual process. This drawback has stimulated interest in automating their construction, for example by mining geographical information from the Web. Here we present and evaluate a new automated technique for creating and enriching a geographical gazetteer, called Gazetiki. Our technique merges disparate information from Wikipedia, Panoramio, and web search engines in order to identify geographical names, categorize these names, find their geographical coordinates, and rank them. The information produced in Gazetiki enhances and complements the Geonames database, using a similar domain model. We show that our method provides a richer structure and improved coverage compared to another known attempt at automatically building a geographic database and, where possible, we compare Gazetiki to Geonames.
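
The four steps named in the abstract (identify names, categorize them, find coordinates, rank them) can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `Candidate` and `Entry` records, the `merge_candidates` function, the majority-vote category, the averaged coordinates, and the summed-mentions ranking are all hypothetical, and no real Wikipedia, Panoramio, or search-engine API is called.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from statistics import mean
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    """One observation of a place name from a single source (hypothetical record)."""
    name: str
    source: str                      # e.g. "wikipedia", "panoramio", "search"
    category: Optional[str] = None   # e.g. "bridge"; None if the source gives no type
    lat: Optional[float] = None
    lon: Optional[float] = None
    mentions: int = 1                # assumed popularity signal (photo or hit count)


@dataclass
class Entry:
    """One merged gazetteer entry (hypothetical, loosely Geonames-like fields)."""
    name: str
    category: Optional[str]
    lat: Optional[float]
    lon: Optional[float]
    score: int
    sources: tuple


def merge_candidates(candidates):
    """Group candidates by case-insensitive name and merge each group into one Entry."""
    grouped = defaultdict(list)
    for c in candidates:
        grouped[c.name.strip().lower()].append(c)

    entries = []
    for group in grouped.values():
        # Categorize: majority vote over the sources that supplied a category.
        categories = Counter(c.category for c in group if c.category)
        # Locate: naive average of every coordinate pair that is available.
        located = [c for c in group if c.lat is not None and c.lon is not None]
        entries.append(Entry(
            name=group[0].name.strip(),
            category=categories.most_common(1)[0][0] if categories else None,
            lat=mean(c.lat for c in located) if located else None,
            lon=mean(c.lon for c in located) if located else None,
            # Rank signal: total mentions across all sources (an assumed heuristic).
            score=sum(c.mentions for c in group),
            sources=tuple(sorted({c.source for c in group})),
        ))
    # Rank: highest score first.
    return sorted(entries, key=lambda e: e.score, reverse=True)
```

Calling `merge_candidates` on a list of `Candidate` records returns one `Entry` per distinct name, ordered by score. Averaging coordinates and summing mentions are deliberately simple stand-ins; a real system would need name disambiguation and a more careful ranking signal.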

