|
ABSTRACT
The increasing pervasiveness of the Internet has dramatically changed the way that consumers shop for goods. Consumer-generated product reviews have become a valuable source of information for customers, who read the reviews and decide whether to buy the product based on the information provided. In this paper, we use techniques that decompose the reviews into segments that evaluate the individual characteristics of a product (e.g., image quality and battery life for a digital camera). Then, as a major contribution of this paper, we adapt methods from the econometrics literature, specifically the hedonic regression concept, to estimate: (a) the weight that customers place on each individual product feature, (b) the implicit evaluation score that customers assign to each feature, and (c) how these evaluations affect the revenue for a given product. Towards this goal, we develop a novel hybrid technique combining text mining and econometrics that models consumer product reviews as elements in a tensor product of feature and evaluation spaces. We then impute the quantitative impact of consumer reviews on product demand as a linear functional from this tensor product space. We demonstrate how to use a low-dimension approximation of this functional to significantly reduce the number of model parameters, while still providing good experimental results. We evaluate our technique using a data set from Amazon.com consisting of sales data and the related consumer reviews posted over a 15-month period for 242 products. Our experimental evaluation shows that we can extract actionable business intelligence from the data and better understand the customer preferences and actions. We also show that the textual portion of the reviews can improve product sales prediction compared to a baseline technique that simply relies on numeric data.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Berndt, E. R. The Practice of Econometrics: Classic and Contemporary. Addison-Wesley, 1996.
|
| |
2
|
Bickart, B., and Schindler, R. M. Internet forums as in?uential sources of consumer information. Journal of Interactive Marketing 15, 3 (2001), 31--40.
|
 |
3
|
|
| |
4
|
Chen, Y., and Xie, J. Online consumer review: A strategic analysis of an emerging type of word-of-mouth. University of Arizona, Working Paper, 2004.
|
| |
5
|
Chevalier, J. A., and Goolsbee, A. Measuring prices and price competition online: Amazon.com and BarnesandNoble.com. Quantitative Marketing and Economics 1, 2 (2003), 203--222.
|
| |
6
|
Chevalier, J. A., and Mayzlin, D. The effect of word of mouth on sales: Online book reviews. Journal of Marketing Research 43, 3 (Aug. 2006), 345--354.
|
| |
7
|
Das, S. R., and Chen, M. Yahoo! for Amazon: Sentiment extraction from small talk on the web. Working Paper, Santa Clara University. Available at http://scumis.scu.edu/~srdas/chat.pdf, 2006.
|
 |
8
|
|
 |
9
|
|
| |
10
|
Ghose, A., Ipeirotis, P.G., and Sundararajan, A. Opinion mining using econometrics: A case study on reputation systems,. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics (ACL 2007)(2007).
|
| |
11
|
Ghose, A., and Sundararajan, A. Evaluating pricing strategy using ecommerce data: Evidence and estimation challenges. Statistical Science 21, 2 (2006), 131--142.
|
| |
12
|
Greene, W. H. Econometric Analysis, 5th ed. Prentice Hall, 2002.
|
 |
13
|
Daniel Gruhl , R. Guha , Ravi Kumar , Jasmine Novak , Andrew Tomkins, The predictive power of online chatter, Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
[doi> 10.1145/1081870.1081883]
|
| |
14
|
Hastie, T., Tibshirani, R., and Friedman, J. H. The Elements of Statistical Learning. Springer Verlag, Aug. 2001.
|
 |
15
|
|
| |
16
|
Hu, M., and Liu, B. Mining opinion features in customer reviews. In Proceeding of the 2004 AAAI Spring Symposium Series: Semantic Web Services (2004), pp. 755--760.
|
| |
17
|
Lee, T. Use-centric mining of customer reviews. In Workshop on Information Technology and Systems (2004).
|
| |
18
|
Lewitt, S., and Syverson, C. Market distortions when agents are better informed: The value of information in real estate transactions. Working Paper, University of Chicago, 2005.
|
 |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
Rosen, S. Hedonic prices and implicit markets: Product differentiation in pure competition. The Journal of Political Economy 82, 1 (Jan.-Feb. 1974), 34--55.
|
| |
25
|
Samuelson, P. A., and Nordhaus, W. D. Economics, 18th ed. McGraw-Hill/Irwin, 2004.
|
| |
26
|
Scaffidi, C. Application of a probability-based algorithm to extraction of product features from online reviews. Tech. Rep. CMU-ISRI-06-111, Institute for Software Research, School of Computer Science, Carnegie Mellon University, June 2006.
|
| |
27
|
Snyder, B., and Barzilay, R. Multiple aspect ranking using the good grief algorithm. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL 2007) (2007).
|
| |
28
|
|
 |
29
|
|
| |
30
|
Wilson, T., Wiebe, J., and Hwa, R. Recognizing strong and weak opinion clauses. Computational Intelligence 22, 2 (May 2006), 73--99.
|
| |
31
|
Wooldridge, J. M. Econometric Analysis of Cross Section and Panel Data. The MIT Press, 2001.
|
INDEX TERMS
Primary Classification:
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.7
Natural Language Processing
Subjects:
Text analysis
Additional Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.4
Systems
Subjects:
Textual databases
H.2.8
Database applications
Subjects:
Data mining
General Terms:
Algorithms,
Economics,
Experimentation,
Measurement
Keywords:
consumer reviews,
e-commerce,
econometrics,
electronic commerce,
electronic markets,
hedonic analysis,
internet,
opinion mining,
product review,
sentiment analysis,
text mining,
user-generated content
|