Semantic document engineering with WordNet and PageRank
Source: Symposium on Applied Computing
Proceedings of the 2005 ACM Symposium on Applied Computing
Santa Fe, New Mexico
SESSION: Document engineering (DE)
Pages: 782 - 786  
Year of Publication: 2005
ISBN: 1-58113-964-0
Authors
Paul Tarau  University of North Texas, Denton, Texas
Rada Mihalcea  University of North Texas, Denton, Texas
Elizabeth Figa  University of North Texas, Denton, Texas
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1066677.1066855

ABSTRACT

This paper describes natural language processing techniques for document engineering, used in combination with graph algorithms and statistical methods. Google's PageRank and similar fast-converging recursive graph algorithms provide a practical means to statically rank the vertices of large graphs such as the World Wide Web. By combining a fast Java-based PageRank implementation with a Prolog-based inferential layer, running on top of an optimized WordNet graph, we describe applications to word sense disambiguation and evaluate their accuracy on standard benchmarks.
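
As a rough illustration of the ranking step mentioned in the abstract, the following minimal Python sketch runs power-iteration PageRank over a small directed graph and picks the highest-ranked candidate sense of an ambiguous word. It is not the authors' Java and Prolog system: the function name `pagerank`, the damping factor of 0.85, the `word#sense` labels, and the edges of the toy graph are all assumptions made up for this example and are not taken from WordNet or from the paper.

```python
def pagerank(graph, damping=0.85, tol=1e-10, max_iter=200):
    """Power-iteration PageRank on a dict {vertex: [successor, ...]}."""
    # Collect every vertex, including ones that only appear as targets.
    nodes = set(graph)
    for targets in graph.values():
        nodes.update(targets)
    nodes = sorted(nodes)
    n = len(nodes)

    # Start from the uniform distribution.
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(max_iter):
        # Rank held by vertices without outgoing edges is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not graph.get(v))
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {v: base for v in nodes}
        # Every other vertex splits its damped rank among its successors.
        for v in nodes:
            targets = graph.get(v, [])
            if targets:
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
        delta = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical toy graph: vertices are invented sense labels and the edges
# stand in for semantic relations; none of this is real WordNet data.
toy_graph = {
    "bank#river": ["water#liquid"],
    "bank#finance": ["money#currency", "loan#debt"],
    "money#currency": ["bank#finance"],
    "loan#debt": ["bank#finance", "money#currency"],
    "water#liquid": [],
}

scores = pagerank(toy_graph)

# Disambiguate "bank" by choosing its highest-ranked candidate sense.
best_bank_sense = max(["bank#river", "bank#finance"], key=scores.get)
print(best_bank_sense)
```

In this toy graph the financial sense receives rank from two neighbouring vertices while the river sense receives none beyond the uniform baseline, so the financial sense is selected. A real system would build the graph from WordNet relations among the senses of the words in the surrounding text.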


