ACM Home Page
Please provide us with feedback. Feedback
METEOR: metadata and instance extraction from object referral lists on the web
Full text PdfPdf (176 KB)
Source International World Wide Web Conference archive
Special interest tracks and posters of the 14th international conference on World Wide Web table of contents
Chiba, Japan
POSTER SESSION: Posters table of contents
Pages: 1180 - 1181  
Year of Publication: 2005
ISBN:1-59593-051-5
Authors
Hasan Davulcu  Arizona State University, Tempe, AZ
Srinivas Vadrevu  Arizona State University, Tempe, AZ
Saravanakumar Nagarajan  Arizona State University, Tempe, AZ
Fatih Gelgi  Arizona State University, Tempe, AZ
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 20,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1062745.1062928
What is a DOI?

ABSTRACT

The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by the human eye, "meaningful information" is still largely inaccessible for the computer applications. In this paper we present the METEOR system which utilizes various presentation and linkage regularities from referral lists of various sorts to automatically separate and extract metadata and instance information. Experimental results for the university domain with 12 computer science department Web sites, comprising 361 individual faculty and course home pages indicate that the performance of the metadata and instance extraction averages 85%, 88% F-measure respectively. METEOR achieves this performance without any domain specific engineering requirement.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
G. Yang, W. Tan, S. Mukherjee, I.V.Ramakrishnan, and H. Davulcu. On the power of semantic partitioning of web documents. In Workshop on Information Integration on the Web, Acapulco, Mexico, 2003.

Collaborative Colleagues:
Hasan Davulcu: colleagues
Srinivas Vadrevu: colleagues
Saravanakumar Nagarajan: colleagues
Fatih Gelgi: colleagues