ACM Home Page
Please provide us with feedback. Feedback
Template-independent wrapper for web forums
Full text PdfPdf (467 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
POSTER SESSION: Posters table of contents
Pages 794-795  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Qi Zhang  Fundan University, Shanghai, China
Yang Shi  Fundan University, Shanghai, China
Xuanjing Huang  Fundan University, Shanghai, China
Lide Wu  Fundan University, Shanghai, China
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 60,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1572132
What is a DOI?

ABSTRACT

This paper presents a novel work on the task of extracting data from Web forums. Millions of users contribute rich information to Web forum everyday, which has become an important resource for manyWeb applications, such as product opinion retrieval, social network analysis, and so on. The novelty of the proposed algorithm is that it can not only extract the pure text but also distinguish between the original post and replies. Experimental results on a large number of real Web forums indicate that the proposed algorithm can correctly ex



Collaborative Colleagues:
Qi Zhang: colleagues
Yang Shi: colleagues
Xuanjing Huang: colleagues
Lide Wu: colleagues