ACM Home Page
Please provide us with feedback. Feedback
Generating diverse katakana variants based on phonemic mapping
Full text PdfPdf (83 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
POSTER SESSION: Posters group 3: multimedia and domain specific IR table of contents
Pages 793-794  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Kazuhiro Seki  Kobe University, Kobe, Japan
Hiroyuki Hattori  Google Inc., Shibuya, Japan
Kuniaki Uehara  Kobe University, Kobe, Japan
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 104,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390507
What is a DOI?

ABSTRACT

In Japanese, it is quite common for the same word to be written in several different ways. This is especially true for katakana words which are typically used for transliterating foreign languages. This ambiguity becomes critical for automatic processing such as information retrieval (IR). To tackle this problem, we propose a simple but effective approach to generating katakana variants by considering phonemic representation of the original language for a given word. The proposed approach is evaluated through an assessment of the variants it generates. Also, the impact of the generated variants on IR is studied in comparison to an existing approach using katakana rewriting rules.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
C. Kubomura and H. Kameda. Information retrieval system with abilities of processing katakana allographs. IEICE Trans. Inf. & Syst., J86-D-II(3):418--428, 2003. (In Japanese)
 
3
M. Shishibori, K. Tsuda, and J. Aoe. A method for generation and normalization of katakana variant notations. IEICE Trans. Info. & Syst., J77-D-II(2):380--387, 1994. (In Japanese)

Collaborative Colleagues:
Kazuhiro Seki: colleagues
Hiroyuki Hattori: colleagues
Kuniaki Uehara: colleagues