|
ABSTRACT
Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. In this paper, we propose a novel and effective dictionary-based statistical approach, which automatically derives evidence for subjectivity from the blog collection itself, without requiring any manual effort. Our experiments show that the proposed approach is capable of achieving remarkable and statistically significant improvements over robust baselines, including the best TREC baseline run. In addition, with relatively little computational costs, our proposed approach provides an effective performance in retrieving opinionated blog posts, which is as good as a computationally expensive approach using Natural Language Processing techniques.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G. Amati. Probabilistic models for information retrieval based on Divergence from Randomness. PhD thesis, University of Glasgow, 2003.
|
| |
2
|
G. Amati, E. Ambrosi, M. Bianchi, C. Gaibisso, and G. Gambosi. FUB, IASI-CNR and University of Tor Vergata at TREC 2007 Blog Track. In Proceedings of TREC 2007.
|
| |
3
|
G. Amati, E. Ambrosi, M. Bianchi, C. Gaibisso, and G. Gambosi. Automatic Construction of an Opinion-Term Vocabulary for Ad Hoc Retrieval. In Proceedings of ECIR 2008.
|
| |
4
|
|
| |
5
|
B. Ernsting, W. Weerkamp, and M. de Rijke. Language Modeling Approaches to Blog Post and Feed Finding. In Proceedings of TREC 2007.
|
| |
6
|
N. Godbole, M. Srinivasaiah, and S. Skiena. Large-Scale Sentiment Analysis for News and Blogs. In Proceedings of ICWSM 2006.
|
| |
7
|
D. Hannah, C. Macdonald, J. Peng, B. He, and I. Ounis. Experiments in Blog and Enterprise Tracks with Terrier. In Proceedings of TREC 2007.
|
 |
8
|
|
| |
9
|
A. Java, P. Kolari, T. Finin, A. Joshi, and J. Martineau. The BlogVox Opinion Retrieval System. In Proceedings of TREC 2006.
|
| |
10
|
A. Lenhart, and S. Fox. Bloggers : a portrait of the Internet's new storytellers. Pew Internet & American Life Project, 2006.
|
| |
11
|
C. Lioma, C. Macdonald, V. Plachouras, J. Peng, B. He, I. Ounis. University of Glasgow at TREC 2006: Experiments in Terabyte and Enterprise Tracks with Terrier. In Proceedings of TREC 2006.
|
| |
12
|
C. Macdonald, and I. Ounis. The TREC Blog06 Collection : Creating and Analysing a Blog Test Collection DCS Technical Report TR-2006-224. University of Glasgow. 2006.
|
| |
13
|
C. Macdonald, I. Ounis, and I. Soboroff. Overview of the TREC 2007 Blog Track. In Proceedings of TREC 2007.
|
 |
14
|
|
| |
15
|
G. Mishne, and M. de Rijke. A Study of Blog Search. In Proceedings of ECIR 2006.
|
| |
16
|
G. Mishne. Multiple Ranking Strategies for Opinion Retrieval in Blogs. In Proceedings of TREC 2006.
|
| |
17
|
G. Mishne. Using Blog Properties to Improve Retrieval. In Proceedings of ICWSM 2006.
|
| |
18
|
I. Ounis, M. de Rijke, C. Macdonald, G. Mishne, and I. Soboroff. Overview of the TREC 2006 Blog Track. In Proceedings of TREC 2006.
|
| |
19
|
I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma. Terrier: A High Performance and Scalable Information Retrieval Platform. In Proceedings of OSIR 2006 Workshop.
|
| |
20
|
I. Ounis, C. Macdonald, and I. Soboroff. On the TREC Blog track. In Proceedings of ISWSM 2008.
|
| |
21
|
|
| |
22
|
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, M. Gatford, and A. Payne. Okapi at TREC-4. In Proceedings of TREC 4.
|
| |
23
|
O. Vechtomova. Using Subjective Adjectives in Opinion Retrieval from Blogs. In Proceedings of TREC 2007.
|
| |
24
|
|
| |
25
|
Theresa Wilson , Paul Hoffmann , Swapna Somasundaran , Jason Kessler , Janyce Wiebe , Yejin Choi , Claire Cardie , Ellen Riloff , Siddharth Patwardhan, OpinionFinder: a system for subjectivity analysis, Proceedings of HLT/EMNLP on Interactive Demonstrations, p.34-35, October 07-07, 2005, Vancouver, British Columbia, Canada
[doi> 10.3115/1225733.1225751]
|
| |
26
|
H. Yang, J. Callan, and L. Si. Knowledge Transfer and Opinion Detection in the TREC 2006 Blog Track. In Proceedings of TREC 2006.
|
| |
27
|
K. Yang, N. Yu, A. Valerio, H. Zhang, and W. Ke. Fusion Approach to Finding opinions in Blogosphere. In Proceedings of ICWSM 2006.
|
| |
28
|
K. Yang, N. Yu, and H. Zhang. WIDIT in TREC 2007 Blog Track: Combining Lexicon-Based Methods to Detect Opinionated Blogs. In Proceedings of TREC 2007.
|
 |
29
|
|
| |
30
|
W. Zhang, and C. Yu. UIC at TREC 2007 Blog Track. In Proceedings of TREC 2007.
|
| |
31
|
G. Zhou, H. Joshi, and C. Bayrak. Topic Categorization for Relevance and Opinion Detection. In Proceedings of TREC 2007.
|
|