ACM Home Page
Please provide us with feedback. Feedback
Mining and summarizing customer reviews
Full text PdfPdf (232 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
SESSION: Research track papers table of contents
Pages: 168 - 177  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Minqing Hu  University of Illinois at Chicago, Chicago, IL
Bing Liu  University of Illinois at Chicago, Chicago, IL
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 106,   Downloads (12 Months): 934,   Citation Count: 81
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014073
What is a DOI?

ABSTRACT

Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Boguraev, B., and Kennedy, C. 1997. Salience-Based Content Characterization of Text Documents. In Proc. of the ACL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization.
 
3
Bourigault, D. 1995. Lexter: A terminology extraction software for knowledge acquisition from texts. KAW'95.
 
4
 
5
Cardie, C., Wiebe, J., Wilson, T. and Litman, D. 2003. Combining Low-Level and Summary Representations of Opinions for Multi-Perspective Question Answering. 2003 AAAI Spring Symposium on New Directions in Question Answering.
 
6
 
7
Daille, B. 1996. Study and Implementation of Combined Techniques for Automatic Extraction of Terminology. The Balancing Act: Combining Symbolic and Statistical Approaches to Language. MIT Press, Cambridge
 
8
Das, S. and Chen, M., 2001. Yahoo! for Amazon: Extracting market sentiment from stock message boards. APFA'01.
9
 
10
DeJong, G. 1982. An Overview of the FRUMP System. Strategies for Natural Language Parsing. 149--176.
 
11
FASTR. http://www.limsi.fr/Individu/jacquemi/FASTR/
 
12
Fellbaum, C. 1998. WordNet: an Electronic Lexical Database, MIT Press.
 
13
Finn, A. and Kushmerick, N. 2003. Learning to Classify Documents according to Genre. IJCAI-03 Workshop on Computational Approaches to Style Analysis and Synthesis.
 
14
15
 
16
 
17
 
18
 
19
Hu, M., and Liu, B. 2004. Mining Opinion Features in Customer Reviews. To appear in AAAI'04, 2004.
 
20
Huettner, A. and Subasic, P., 2000. Fuzzy Typing for Document Management. In ACL'00 Companion Volume: Tutorial Abstracts and Demonstration Notes.
 
21
Jacquemin, C., and Bourigault, D. 2001. Term extraction and automatic indexing. In R. Mitkov, editor, Handbook of Computational Linguistics. Oxford University Press.
 
22
Justeson, J. S., and Katz, S.M. 1995. Technical Terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1(1):9--27.
 
23
 
24
25
 
26
Liu, B., Hsu, W., Ma, Y. 1998. Integrating Classification and Association Rule Mining. KDD'98, 1998.
 
27
Mani, I., and Bloedorn, E., 1997. Multi-document Summarization by Graph Search and Matching. AAAI'97.
 
28
 
29
Miller, G., Beckwith, R, Fellbaum, C., Gross, D., and Miller, K. 1990. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography (special issue), 3(4):235--312.
30
 
31
NLProcessor - Text Analysis Toolkit. 2000. http://www.infogistics.com/textanalysis.html
 
32
 
33
 
34
Reimer, U. and Hahn, U. 1997. A Formal Model of Text Summarization based on Condensation Operators of a Terminological Logic. In Proceedings of ACL'97 Workshop on Intelligent, Scalable Text Summarization.
 
35
36
 
37
Sparck J. 1993a. Discourse Modeling for Automatic Text Summarizing. Technical Report 290, University of Cambridge Computer Laboratory.
 
38
Sparck J. 1993b. What might be in a summary? Information Retrieval 93: 9--26.
 
39
Tait, J. 1983. Automatic Summarizing of English Texts. Ph.D. Dissertation, University of Cambridge.
 
40
 
41
Tong, R., 2001. An Operational System for Detecting and Tracking Opinions in on-line discussion. SIGIR 2001 Workshop on Operational Text Classification.
 
42
 
43
 
44

CITED BY  81