ACM Home Page
Please provide us with feedback. Feedback
An effective statistical approach to blog post opinion retrieval
Full text PdfPdf (263 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
SESSION: IR: blog table of contents
Pages 1063-1072  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Ben He  University of Glasgow, Glasgow, United Kingdom
Craig Macdonald  University of Glasgow, Glasgow, United Kingdom
Jiyin He  University of Amsterdam, Amsterdam, Netherlands
Iadh Ounis  University of Glasgow, Glasgow, United Kingdom
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 36,   Downloads (12 Months): 438,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458223
What is a DOI?

ABSTRACT

Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. In this paper, we propose a novel and effective dictionary-based statistical approach, which automatically derives evidence for subjectivity from the blog collection itself, without requiring any manual effort. Our experiments show that the proposed approach is capable of achieving remarkable and statistically significant improvements over robust baselines, including the best TREC baseline run. In addition, with relatively little computational costs, our proposed approach provides an effective performance in retrieving opinionated blog posts, which is as good as a computationally expensive approach using Natural Language Processing techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
G. Amati. Probabilistic models for information retrieval based on Divergence from Randomness. PhD thesis, University of Glasgow, 2003.
 
2
G. Amati, E. Ambrosi, M. Bianchi, C. Gaibisso, and G. Gambosi. FUB, IASI-CNR and University of Tor Vergata at TREC 2007 Blog Track. In Proceedings of TREC 2007.
 
3
G. Amati, E. Ambrosi, M. Bianchi, C. Gaibisso, and G. Gambosi. Automatic Construction of an Opinion-Term Vocabulary for Ad Hoc Retrieval. In Proceedings of ECIR 2008.
 
4
 
5
B. Ernsting, W. Weerkamp, and M. de Rijke. Language Modeling Approaches to Blog Post and Feed Finding. In Proceedings of TREC 2007.
 
6
N. Godbole, M. Srinivasaiah, and S. Skiena. Large-Scale Sentiment Analysis for News and Blogs. In Proceedings of ICWSM 2006.
 
7
D. Hannah, C. Macdonald, J. Peng, B. He, and I. Ounis. Experiments in Blog and Enterprise Tracks with Terrier. In Proceedings of TREC 2007.
8
 
9
A. Java, P. Kolari, T. Finin, A. Joshi, and J. Martineau. The BlogVox Opinion Retrieval System. In Proceedings of TREC 2006.
 
10
A. Lenhart, and S. Fox. Bloggers : a portrait of the Internet's new storytellers. Pew Internet & American Life Project, 2006.
 
11
C. Lioma, C. Macdonald, V. Plachouras, J. Peng, B. He, I. Ounis. University of Glasgow at TREC 2006: Experiments in Terabyte and Enterprise Tracks with Terrier. In Proceedings of TREC 2006.
 
12
C. Macdonald, and I. Ounis. The TREC Blog06 Collection : Creating and Analysing a Blog Test Collection DCS Technical Report TR-2006-224. University of Glasgow. 2006.
 
13
C. Macdonald, I. Ounis, and I. Soboroff. Overview of the TREC 2007 Blog Track. In Proceedings of TREC 2007.
14
 
15
G. Mishne, and M. de Rijke. A Study of Blog Search. In Proceedings of ECIR 2006.
 
16
G. Mishne. Multiple Ranking Strategies for Opinion Retrieval in Blogs. In Proceedings of TREC 2006.
 
17
G. Mishne. Using Blog Properties to Improve Retrieval. In Proceedings of ICWSM 2006.
 
18
I. Ounis, M. de Rijke, C. Macdonald, G. Mishne, and I. Soboroff. Overview of the TREC 2006 Blog Track. In Proceedings of TREC 2006.
 
19
I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma. Terrier: A High Performance and Scalable Information Retrieval Platform. In Proceedings of OSIR 2006 Workshop.
 
20
I. Ounis, C. Macdonald, and I. Soboroff. On the TREC Blog track. In Proceedings of ISWSM 2008.
 
21
 
22
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, M. Gatford, and A. Payne. Okapi at TREC-4. In Proceedings of TREC 4.
 
23
O. Vechtomova. Using Subjective Adjectives in Opinion Retrieval from Blogs. In Proceedings of TREC 2007.
 
24
 
25
 
26
H. Yang, J. Callan, and L. Si. Knowledge Transfer and Opinion Detection in the TREC 2006 Blog Track. In Proceedings of TREC 2006.
 
27
K. Yang, N. Yu, A. Valerio, H. Zhang, and W. Ke. Fusion Approach to Finding opinions in Blogosphere. In Proceedings of ICWSM 2006.
 
28
K. Yang, N. Yu, and H. Zhang. WIDIT in TREC 2007 Blog Track: Combining Lexicon-Based Methods to Detect Opinionated Blogs. In Proceedings of TREC 2007.
29
 
30
W. Zhang, and C. Yu. UIC at TREC 2007 Blog Track. In Proceedings of TREC 2007.
 
31
G. Zhou, H. Joshi, and C. Bayrak. Topic Categorization for Relevance and Opinion Detection. In Proceedings of TREC 2007.


Collaborative Colleagues:
Ben He: colleagues
Craig Macdonald: colleagues
Jiyin He: colleagues
Iadh Ounis: colleagues