|
ABSTRACT
We propose a method for detecting related terms of a given term quickly using a conventional Web search engine. There are many kinds of related terms. For example, hypernyms and hyponyms are basic related terms that are treated in dictionaries. Synonyms and coordinate terms are also well defined related terms. Topic terms and description terms represent topics of the given term and they are vaguely defined. There are other related terms such as abbreviations and nicknames. The proposed method can be used these many kinds of related terms. It extracts related terms from text resources only from Web search results, which consist of titles, snippets, and URLs of Web pages. We use two different kind of lexico-syntactic patterns to extract related terms from the search results, and they are called bi-directional lexico-syntactic patterns. The proposed method can be applied to both languages where words are separated by a space such as English and Korean and ones where words are not separated by a space such as Japanese and Chinese. The proposed method does not need any advanced natural language processing such as morphological analysis or syntactic parsing. It works relatively fast with good precision.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Baroni and S. Bisi. Using cooccurrence statistics and the Web to discover synonyms in a technical language. In Proc. of the 4th International Conference on Language Resources and Evaluation (LREC 2004), pages 1725--1728, May 2004.
|
| |
2
|
D. Bollegala, Y. Matsuo, and M. Ishizuka. WWW sits the SAT-Measuring relational similarity on the Web. In Proc. of the 18th European Conference on Artificial Intelligence (ECAI 2008), pages 333--337, July 2008.
|
| |
3
|
Kenneth Ward Church , Patrick Hanks, Word association norms, mutual information, and lexicography, Proceedings of the 27th annual meeting on Association for Computational Linguistics, p.76-83, June 26-29, 1989, Vancouver, British Columbia, Canada
[doi> 10.3115/981623.981633]
|
| |
4
|
|
 |
5
|
Oren Etzioni , Michael Cafarella , Doug Downey , Stanley Kok , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Web-scale information extraction in knowitall: (preliminary results), Proceedings of the 13th international conference on World Wide Web, May 17-20, 2004, New York, NY, USA
[doi> 10.1145/988672.988687]
|
| |
6
|
Z. Ghahramani and K. Heller. Bayesian sets. In Proc. of the 19th Annual Conference on Neural Information Processing Systems (NIPS 2005), 2005.
|
 |
7
|
Eric Glover , David M. Pennock , Steve Lawrence , Robert Krovetz, Inferring hierarchical descriptions, Proceedings of the eleventh international conference on Information and knowledge management, November 04-09, 2002, McLean, Virginia, USA
[doi> 10.1145/584792.584876]
|
| |
8
|
|
| |
9
|
T. Hokama and H. Kitagawa. Extracting mnemonic names of people from the Web. In Proc. of the 9th International Conference on Asian Digital Libraries (ICADL 2006), pages 121--130, November 2006.
|
| |
10
|
M. Komachi and H. Suzuki. Minimally supervised learning of semantic knowledge from query logs. In Proc. of the 3rd International Joint Conference on Natural Language Processing (IJCNLP 2008), pages 358--365, January 2008.
|
| |
11
|
|
| |
12
|
G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography 3(4), pages 235--312, 1990.
|
| |
13
|
|
| |
14
|
K. Nakayama, T. Hara, and S. Nishio. Wikipedia mining for an association web thesaurus construction. In Proc. of the 8th International Conference on Web Information Systems Engineering (WISE 2007), pages 322--334, December 2007.
|
| |
15
|
|
| |
16
|
H. Ohshima, S. Oyama, and K. Tanaka. Searching coordinate terms with their context from the Web. In Proc. of the 7th International Conference on Web Information Systems Engineering (WISE 2006), pages 40--47, October 2006.
|
| |
17
|
S. Oyama and K. Tanaka. Query modification by discovering topic from Web page structures. In Proc. of the 6th Asia Pacific Web Conference (APWEB 2004), pages 553--564, 2004.
|
 |
18
|
|
| |
19
|
M. Pa3ca and B. V. Durme. What you seek is what you get: Extraction of class attributes from query logs. In Proc. of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), pages 2832--2837, February 2007.
|
 |
20
|
|
 |
21
|
|
| |
22
|
K. Shinzato and K. Torisawa. Acquiring hyponymy relations from Web documents. In Proc. of Human Language Technology Conference/North American chapter of the Association for Computational Linguistics annual meeting (HLT-NAACL 2004), pages 73--80, 2004.
|
| |
23
|
K. Shinzato and K. Torisawa. A simple WWW-based method for semantic word class acquisition. In Proc. of the Recent Advances in Natural Language Processing (RANLP 2005), pages 493--500, 2005.
|
| |
24
|
|
| |
25
|
P. D. Turney. Measuring semantic similarity by latent relational analysis. In Proc. of the 19th International Joint Conference on Artificial Intelligence (IJCAI 2005), pages 1136--1141, July 2005.
|
| |
26
|
|
| |
27
|
|
|