Semantic document engineering with WordNet and PageRank
Source
Symposium on Applied Computing
Proceedings of the 2005 ACM symposium on Applied computing
Santa Fe, New Mexico
SESSION: Document engineering (DE)
Pages: 782 - 786
Year of Publication: 2005
ISBN: 1-58113-964-0
ABSTRACT
This paper describes natural language processing techniques for document engineering, used in combination with graph algorithms and statistical methods. Google's PageRank and similar fast-converging recursive graph algorithms provide a practical means of statically ranking the vertices of large graphs such as the World Wide Web. By combining a fast Java-based PageRank implementation with a Prolog-based inferential layer, running on top of an optimized WordNet graph, we describe applications to word sense disambiguation and evaluate their accuracy on standard benchmarks.
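As an illustration of the kind of recursive ranking the abstract refers to, here is a minimal power-iteration sketch of PageRank in Python. It is not the paper's Java and Prolog implementation. The function name `pagerank`, the damping factor of 0.85, the uniform teleport vector, and the toy graph with its sense labels are all assumptions made for this example; none of them come from the paper or from WordNet.

```python
from typing import Dict, List


def pagerank(graph: Dict[str, List[str]], damping: float = 0.85,
             tol: float = 1e-10, max_iter: int = 200) -> Dict[str, float]:
    """Rank vertices of a directed graph by power iteration.

    `graph` maps every vertex to the list of vertices it links to.
    Every link target must also appear as a key.
    """
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}  # start from a uniform distribution
    for _ in range(max_iter):
        # Rank held by vertices without out-links is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not graph[v])
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {v: base for v in nodes}
        for v in nodes:
            out = graph[v]
            if out:
                share = damping * rank[v] / len(out)
                for w in out:
                    new_rank[w] += share
        # Stop once the total change between iterations is negligible.
        delta = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical toy graph: two candidate senses of "bank" linked to
# related words. The labels are illustrative, not WordNet synsets.
toy_graph = {
    "bank#finance": ["money", "loan"],
    "bank#river": ["water"],
    "money": ["bank#finance"],
    "loan": ["bank#finance", "money"],
    "water": ["bank#river"],
}

ranks = pagerank(toy_graph)
best_sense = max(("bank#finance", "bank#river"), key=ranks.get)
```

In this toy graph the financial sense receives links from more vertices, so it ends up with the higher rank. A real disambiguation system would build the graph from WordNet relations and the words of the document being processed.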