|
ABSTRACT
Opinion retrieval is a document retrieval process, which requires documents to be retrieved and ranked according to their opinions about a query topic. A relevant document must satisfy two criteria: relevant to the query topic, and contains opinions about the query, no matter if they are positive or negative. In this paper, we describe an opinion retrieval algorithm. It has a traditional information retrieval (IR) component to find topic relevant documents from a document set, an opinion classification component to find documents having opinions from the results of the IR step, and a component to rank the documents based on their relevance to the query, and their degrees of having opinions about the query. We implemented the algorithm as a working system and tested it using TREC 2006 Blog Track data in automatic title-only runs. Our result showed 28% to 32% improvements in MAP score over the best automatic runs in this 2006 track. Our result is also 13% higher than a state-of-art opinion retrieval system, which is tested on the same data set.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
H. Chernoff and E. Lehmann. The use of maximum likelihood estimates in χ2 tests for goodness-of-fit. The Annals of Mathematical Statistics. 1954.
|
| |
2
|
S. Das, and M. Chen. Yahoo! for Amazon: Extracting market sentiment from stock message boards. APFA, 2001.
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
N. Kobayashi, R. Iida, K. Inui, and Y. Matsumoto. Opinion mining on the Web by extracting subject-attribute-value relations. AAAI-CAAW'06, 2006.
|
 |
14
|
|
 |
15
|
|
 |
16
|
|
| |
17
|
I. Ounis, M. de Rijke, C. Macdonald, G. Mishne, and I. Soboroff. Overview of the TREC-2006 Blog Track. In Proceedings of TREC. 2006.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
S. Robertson, S. Walker Okapi/Keenbow at TREC-8, 1999.
|
| |
22
|
R. Tong. An operational system for detecting and tracking opinions in on-line discussions. SIGIR 2001 Workshop on Operational Text Classification, 2001.
|
| |
23
|
|
| |
24
|
Ellen M. Voorhees. Overview of the TREC 2005 Robust Retrieval Track. In Proceedings of TREC-2005.
|
 |
25
|
|
| |
26
|
|
| |
27
|
|
| |
28
|
|
| |
29
|
G. Mishne. Multiple ranking strategies for opinion retrieval in blogs. In TREC 2006, 2006.
|
| |
30
|
Gilad Mishne. Using Blog Properties to Improve Retrieval. In proceedings of ICWSM. 2007.
|
| |
31
|
Wei Zhang and Clement Yu. UIC at TREC 2006 Blog Track. In proceedings of 15th TREC. 2006.
|
| |
32
|
K. Yang, N. Yu, A. Valerio and H. Zhang. WIDIT in TREC 2006 Blog Track. In proceedings of 15th TREC. 2006.
|
| |
33
|
Douglas Oard, Tamer Elsayed, Jianqiang Wang, Yejun Wu, Pengyi Zhang, Eileen Abels, Jimmy Lin and Dagbert Soergel. TREC-2006 at Maryland: Blog, Enterprise, Legal and QA Tracks. In proceedings of 15th TREC. 2006.
|
 |
34
|
Wei Zhang , Shuang Liu , Clement Yu , Chaojing Sun , Fang Liu , Weiyi Meng, Recognition and classification of noun phrases in queries for effective retrieval, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, November 06-10, 2007, Lisbon, Portugal
[doi> 10.1145/1321440.1321540]
|
 |
35
|
|
CITED BY 3
|
|
Wei Zhang , Lifeng Jia , Clement Yu , Weiyi Meng, Improve the effectiveness of the opinion retrieval and opinion polarity classification, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Ben He , Craig Macdonald , Jiyin He , Iadh Ounis, An effective statistical approach to blog post opinion retrieval, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|