A holistic lexicon-based approach to opinion mining
Source
Web Search and Web Data Mining archive
Proceedings of the international conference on Web search and web data mining
Palo Alto, California, USA
SESSION: Social search
Pages 231-240  
Year of Publication: 2008
ISBN: 978-1-59593-927-9
Authors
Xiaowen Ding  University of Illinois at Chicago, Chicago, IL
Bing Liu  University of Illinois at Chicago, Chicago, IL
Philip S. Yu  University of Illinois at Chicago, Chicago, IL
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1341531.1341561
ABSTRACT

One important type of information on the Web is the opinions expressed in user-generated content, e.g., customer reviews of products, forum posts, and blogs. In this paper, we focus on customer reviews of products. In particular, we study the problem of determining the semantic orientations (positive, negative or neutral) of opinions expressed on product features in reviews. This problem has many applications, e.g., opinion mining, summarization and search. Most existing techniques utilize a list of opinion (bearing) words (also called an opinion lexicon) for this purpose. Opinion words are words that express desirable (e.g., great, amazing) or undesirable (e.g., bad, poor) states. These approaches, however, all have some major shortcomings. In this paper, we propose a holistic lexicon-based approach to solving the problem by exploiting external evidence and linguistic conventions of natural language expressions. This approach allows the system to handle opinion words that are context dependent, which cause major difficulties for existing algorithms. It also deals with many special words, phrases and language constructs that affect opinions based on their linguistic patterns, and it has an effective function for aggregating multiple conflicting opinion words in a sentence. A system, called Opinion Observer, based on the proposed technique has been implemented. Experimental results using a benchmark product review data set and some additional reviews show that the proposed technique is highly effective. It outperforms existing methods significantly.
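To make the idea of aggregating conflicting opinion words concrete, the following is a minimal sketch only. The abstract does not specify the aggregation function, so the inverse-distance weighting, the one-token negation rule, the toy lexicon, and the name `feature_orientation` are all assumptions made for illustration and should not be read as the Opinion Observer implementation.

```python
# Hypothetical toy opinion lexicon: +1 for desirable, -1 for undesirable states.
LEXICON = {"great": 1, "amazing": 1, "bad": -1, "poor": -1}
# Hypothetical negation words; a stand-in for the "special words" in the abstract.
NEGATIONS = {"not", "no", "never"}


def feature_orientation(tokens, feature):
    """Classify the opinion on `feature` in one tokenized sentence.

    Assumed scoring: each opinion word contributes its polarity divided by
    its token distance to the feature, so nearer words weigh more.
    """
    if feature not in tokens:
        return "neutral"
    f_pos = tokens.index(feature)
    score = 0.0
    for i, tok in enumerate(tokens):
        polarity = LEXICON.get(tok)
        if polarity is None or i == f_pos:
            continue
        # Flip the polarity when the preceding token is a negation word.
        if i > 0 and tokens[i - 1] in NEGATIONS:
            polarity = -polarity
        score += polarity / abs(i - f_pos)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


sentence = "the battery is great but the screen is poor".split()
print(feature_orientation(sentence, "battery"))
print(feature_orientation(sentence, "screen"))
```

In this sketch a sentence with one positive and one negative opinion word yields different orientations for different features, because each feature is scored against the opinion words nearest to it. Context-dependent opinion words, which the abstract singles out as a major difficulty, are not handled here.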

