|
ABSTRACT
Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Boguraev, B., and Kennedy, C. 1997. Salience-Based Content Characterization of Text Documents. In Proc. of the ACL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization.
|
| |
3
|
Bourigault, D. 1995. Lexter: A terminology extraction software for knowledge acquisition from texts. KAW'95.
|
| |
4
|
|
| |
5
|
Cardie, C., Wiebe, J., Wilson, T. and Litman, D. 2003. Combining Low-Level and Summary Representations of Opinions for Multi-Perspective Question Answering. 2003 AAAI Spring Symposium on New Directions in Question Answering.
|
| |
6
|
|
| |
7
|
Daille, B. 1996. Study and Implementation of Combined Techniques for Automatic Extraction of Terminology. The Balancing Act: Combining Symbolic and Statistical Approaches to Language. MIT Press, Cambridge
|
| |
8
|
Das, S. and Chen, M., 2001. Yahoo! for Amazon: Extracting market sentiment from stock message boards. APFA'01.
|
 |
9
|
|
| |
10
|
DeJong, G. 1982. An Overview of the FRUMP System. Strategies for Natural Language Parsing. 149--176.
|
| |
11
|
FASTR. http://www.limsi.fr/Individu/jacquemi/FASTR/
|
| |
12
|
Fellbaum, C. 1998. WordNet: an Electronic Lexical Database, MIT Press.
|
| |
13
|
Finn, A. and Kushmerick, N. 2003. Learning to Classify Documents according to Genre. IJCAI-03 Workshop on Computational Approaches to Style Analysis and Synthesis.
|
| |
14
|
|
 |
15
|
Jade Goldstein , Mark Kantrowitz , Vibhu Mittal , Jaime Carbonell, Summarizing text documents: sentence selection and evaluation metrics, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.121-128, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312665]
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
Hu, M., and Liu, B. 2004. Mining Opinion Features in Customer Reviews. To appear in AAAI'04, 2004.
|
| |
20
|
Huettner, A. and Subasic, P., 2000. Fuzzy Typing for Document Management. In ACL'00 Companion Volume: Tutorial Abstracts and Demonstration Notes.
|
| |
21
|
Jacquemin, C., and Bourigault, D. 2001. Term extraction and automatic indexing. In R. Mitkov, editor, Handbook of Computational Linguistics. Oxford University Press.
|
| |
22
|
Justeson, J. S., and Katz, S.M. 1995. Technical Terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1(1):9--27.
|
| |
23
|
|
| |
24
|
|
 |
25
|
Julian Kupiec , Jan Pedersen , Francine Chen, A trainable document summarizer, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.68-73, July 09-13, 1995, Seattle, Washington, United States
[doi> 10.1145/215206.215333]
|
| |
26
|
Liu, B., Hsu, W., Ma, Y. 1998. Integrating Classification and Association Rule Mining. KDD'98, 1998.
|
| |
27
|
Mani, I., and Bloedorn, E., 1997. Multi-document Summarization by Graph Search and Matching. AAAI'97.
|
| |
28
|
|
| |
29
|
Miller, G., Beckwith, R, Fellbaum, C., Gross, D., and Miller, K. 1990. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography (special issue), 3(4):235--312.
|
 |
30
|
Satoshi Morinaga , Kenji Yamanishi , Kenji Tateishi , Toshikazu Fukushima, Mining product reputations on the Web, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, July 23-26, 2002, Edmonton, Alberta, Canada
[doi> 10.1145/775047.775098]
|
| |
31
|
NLProcessor - Text Analysis Toolkit. 2000. http://www.infogistics.com/textanalysis.html
|
| |
32
|
|
| |
33
|
|
| |
34
|
Reimer, U. and Hahn, U. 1997. A Formal Model of Text Summarization based on Condensation Operators of a Terminological Logic. In Proceedings of ACL'97 Workshop on Intelligent, Scalable Text Summarization.
|
| |
35
|
|
 |
36
|
Gerard Salton , Amit Singhal , Chris Buckley , Mandar Mitra, Automatic text decomposition using text segments and text themes, Proceedings of the the seventh ACM conference on Hypertext, p.53-65, March 16-20, 1996, Bethesda, Maryland, United States
[doi> 10.1145/234828.234834]
|
| |
37
|
Sparck J. 1993a. Discourse Modeling for Automatic Text Summarizing. Technical Report 290, University of Cambridge Computer Laboratory.
|
| |
38
|
Sparck J. 1993b. What might be in a summary? Information Retrieval 93: 9--26.
|
| |
39
|
Tait, J. 1983. Automatic Summarizing of English Texts. Ph.D. Dissertation, University of Cambridge.
|
| |
40
|
|
| |
41
|
Tong, R., 2001. An Operational System for Detecting and Tracking Opinions in on-line discussion. SIGIR 2001 Workshop on Operational Text Classification.
|
| |
42
|
|
| |
43
|
|
| |
44
|
|
CITED BY 78
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jian-Tao Sun , Xuanhui Wang , Dou Shen , Hua-Jun Zeng , Zheng Chen, CWS: a comparative web search system, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
Christopher Scaffidi , Allen Cypher , Sebastian Elbaum , Andhy Koesnandar , Brad Myers, Using scenario-based requirements to direct research on web macro tools, Journal of Visual Languages and Computing, v.19 n.4, p.485-498, August, 2008
|
|
|
|
|
|
Hongyan Liu , Hui Yang , Wenbo Li , Wei Wei , Jun He , Xiaoyong Du, CRO: a system for online review structurization, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
Theresa Wilson , Janyce Wiebe , Paul Hoffmann, Recognizing contextual polarity in phrase-level sentiment analysis, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.347-354, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ana-Maria Popescu , Bao Nguyen , Oren Etzioni, OPINE: extracting product features and opinions from reviews, Proceedings of HLT/EMNLP on Interactive Demonstrations, p.32-33, October 07-07, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
Sebastian Schmidt , Stefan Mandl , Bernd Ludwig , Herbert Stoyan, Product-advisory on the web: an information extraction approach, Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications, p.633-636, February 12-14, 2007, Innsbruck, Austria
|
|
|
|
|
|
|
|
|
|
|
|
Chao Zhou , Guang Qiu , Kangmiao Liu , Jiajun Bu , Mingcheng Qu , Chun Chen, SOPING: a Chinese customer review mining system, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, July 20-24, 2008, Singapore, Singapore
|
|
|
Christopher Scaffidi , Kevin Bierhoff , Eric Chang , Mikhael Felker , Herman Ng , Chun Jin, Red Opal: product-feature scoring from reviews, Proceedings of the 8th ACM conference on Electronic commerce, June 11-15, 2007, San Diego, California, USA
|
|
|
|
|
|
|
|
|
Qi Su , Xinying Xu , Honglei Guo , Zhili Guo , Xian Wu , Xiaoxun Zhang , Bin Swen , Zhong Su, Hidden sentiment association in chinese web opinion mining, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Adam Funk , Yaoyong Li , Horacio Saggion , Kalina Bontcheva , Christian Leibold, Opinion analysis for business intelligence applications, Proceedings of the first international workshop on Ontology-supported business intelligence, p.1-9, October 27-27, 2008, Karlsruhe, Germany
|
|
|
Ali Harb , Michel Plantié , Gerard Dray , Mathieu Roche , François Trousset , Pascal Poncelet, Web opinion mining: how to extract opinions from blogs?, Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology, October 28-31, 2008, Cergy-Pontoise, France
|
|
|
Chong Long , Xiaoyan Zhu , Ming Li , Bin Ma, Information shared by many objects, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Liang Zhang , Jiangqin Wu , Yueting Zhuang , Yin Zhang , Chenxing Yang, Review-oriented metadata enrichment: a case study, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Qi Zhang , Yuanbin Wu , Tao Li , Mitsunori Ogihara , Joseph Johnson , Xuanjing Huang, Mining product reviews based on shallow dependency parsing, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
|
|
|
|
|
|
Shen Huang , Dan Shen , Wei Feng , Yongzheng Zhang , Catherine Baudin, Discovering clues for review quality from author's behaviors on e-commerce sites, Proceedings of the 11th International Conference on Electronic Commerce, August 12-15, 2009, Taipei, Taiwan
|
|
|
Shen Huang , Dan Shen , Wei Feng , Catherine Baudin , Yongzheng Zhang, Improving product review search experiences on general search engines, Proceedings of the 11th International Conference on Electronic Commerce, August 12-15, 2009, Taipei, Taiwan
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|