ACM Home Page
Please provide us with feedback. Feedback
A method of geographical name extraction from Japanese text for thematic geographical search
Full text PdfPdf (1.28 MB)
Source Conference on Information and Knowledge Management archive
Proceedings of the eighth international conference on Information and knowledge management table of contents
Kansas City, Missouri, United States
Pages: 46 - 54  
Year of Publication: 1999
ISBN:1-58113-146-1
Author
Yasusi Kanada  Central Research Laboratory, Hitachi Ltd., Higashi-Koigakubo 1-280, Kokubunji, Tokyo 185, Japan
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMIS: ACM Special Interest Group on Management Information Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 20,   Citation Count: 0
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319950.323229
What is a DOI?

ABSTRACT

A text retrieval method called the thematic geographical search method has been developed and applied to a Japanese encyclopedia called the World Encyclopædia. In this method, the user specifies a search theme using free words, then obtains a sorted list of excerpts and hyperlinks to encyclopedia sentences that contain geographical names. Using this list, the user can also open maps that indicate the locations of the names. To generate an index of names for this searching, a method of extracting geographical names has been developed. In this method, geographical names are extracted, matched to names in a geographical name database, and identified. Geographical names, however, often have several types of ambiguities. Ambiguities are resolved by using non-local context analysis, which uses a stack and several other techniques. As a result, the precision of extracted names is more than 96% on average. This method depends on features of the Japanese language, but the strategy and most of the techniques can be applied to texts in English or other languages.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
HDH 98
DVD/CD-ROM World Encyclopcedia, version 2, Hitachi Digital Heibonsha, 1998.
 
HDH 99
CD-ROM Mypcedia 99, Hitachi Digital Heibonsha, 1999.
 
His 97
Hisamitsu, T., and Niwa, Y.: Acquisition of Person Names from Newspaper Articles by Lexical Knowledge and Co-occurrence Analysis, SIG on Natural Language Processing, Information Processing Society of Japan, 118-1, pp. 1- 6, 1997 (in Japanese).
 
Ino96
Inoue, Y., et al.: Template-based Products Information Extraction from Newspaper Articles, SIG on Natural Language Processing, Information Processing Society of lapan, 96-NL-115, pp. 83-90, 1996 (in Japanese).
Kan 98
 
Kan 99
Kanada, Y.: Methods of Extracting Year References for Chronological-table-generating Text Searching, Int 'l Symposium, on Digital Libraries 1999, Univ. of Library and Information Sci., Tsukuba, 1999.
 
MUC 98
Proceedings of the Seventh Message Understanding Conference (MUC-7). SAIC, 1998.
 
Tak 99
Takao, Y., Nagai, H., Nakamura, S., and Nomura, H.: Information Extraction from Newspaper Articles of Multiple Products B classification of expression patterns -- SIG on Natural Language Processing, Information Processing Society of Japan, 129-17, pp. 117-124, 1999 (in Japanese).