ACM Home Page
Please provide us with feedback. Feedback
Web-derived resources for web information retrieval: from conceptual hierarchies to attribute hierarchies
Full text PdfPdf (401 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Web Retrieval II table of contents
Pages 596-603  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Marius Paşca  Google Inc., Mountain View, CA, USA
Enrique Alfonseca  Google Inc., Zurich, Switzerland
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 55,   Downloads (12 Months): 166,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1572043
What is a DOI?

ABSTRACT

A weakly-supervised extraction method identifies concepts within conceptual hierarchies, at the appropriate level of specificity (e.g., Bank vs. Institution), to which attributes (e.g., routing number) extracted from unstructured text best apply. The extraction exploits labeled classes of instances acquired from a combination of Web documents and query logs, and inserted into existing conceptual hierarchies. The correct concept is identified within the top three positions on average over gold-standard attributes, which corresponds to higher accuracy than in alternative experiments.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. Banko and O. Etzioni. The tradeoffs between open and traditional relation extraction. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL-08), pages 28--36, Columbus, Ohio, 2008.
 
2
 
3
C. Fellbaum, editor. WordNet: An Electronic Lexical Database and Some of its Applications. MIT Press, 1998.
4
 
5
 
6
 
7
8
 
9
M. Paşca. Turning Web text and search queries into factual knowledge: Hierarchical class attribute extraction. In Proceedings of the 23rd National Conference on Artificial Intelligence (AAAI-08), pages 1225--1230, Chicago, Illinois, 2008.
 
10
M. Palmer, H. Dang, and C. Fellbaum. Making fine-grained and coarse-grained sense distinctions, both manually and automatically. Natural Language Engineering, 13(2):137--163, 2007.
 
11
S. Pradhan, E. Loper, D. Dligach, and M. Palmer. SemEval-2007 Task-17: English lexical sample, SRL and all words. In Proceedings of the 4th Workshop on Semantic Evaluations (SemEval-07), pages 87--92, Prague, Czech Republic, 2007.
 
12
M. Remy. Wikipedia: The free encyclopedia. Online Information Review, 26(6):434, 2002.
 
13
14
 
15
K. Tokunaga, J. Kazama, and K. Torisawa. Automatic discovery of attribute words from Web documents. In Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP-05), pages 106--118, Jeju Island, Korea, 2005.
 
16
17

Collaborative Colleagues:
Marius Paşca: colleagues
Enrique Alfonseca: colleagues