| ARSA: a sentiment-aware model for predicting sales performance using blogs |
| Full text |
Pdf
(207 KB)
|
Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Amsterdam, The Netherlands
SESSION: Combination and fusion
table of contents
Pages: 607 - 614
Year of Publication: 2007
ISBN:978-1-59593-597-7
|
|
Authors
|
|
Yang Liu
|
York University, Toronto, ON, Canada
|
|
Xiangji Huang
|
York University, Toronto, ON, Canada
|
|
Aijun An
|
York University, Toronto, ON, Canada
|
|
Xiaohui Yu
|
York University, Toronto, ON, Canada
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 29, Downloads (12 Months): 210, Citation Count: 3
|
|
|
ABSTRACT
Due to its high popularity, Weblogs (or blogs in short) present a wealth of information that can be very helpful in assessing the general public's sentiments and opinions. In this paper, we study the problem of mining sentiment information from blogs and investigate ways to use such information for predicting product sales performance. Based on an analysis of the complex nature of sentiments, we propose Sentiment PLSA (S-PLSA), in which a blog entry is viewed as a document generated by a number of hidden sentiment factors. Training an S-PLSA model on the blog data enables us to obtain a succinct summary of the sentiment information embedded in the blogs. We then present ARSA, an autoregressive sentiment-aware model, to utilize the sentiment information captured by S-PLSA for predicting product sales performance. Extensive experiments were conducted on a movie data set. We compare ARSA with alternative models that do not take into account the sentiment information, as well as a model with a different feature selection method. Experiments confirm the effectiveness and superiority of the proposed approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of Royal Statistical Society B(39):1--38, 1977.
|
 |
5
|
|
| |
6
|
Walter Enders. Applied Econometric Time Series Wiley, New York, 2nd edition, 2004.
|
 |
7
|
Daniel Gruhl , R. Guha , Ravi Kumar , Jasmine Novak , Andrew Tomkins, The predictive power of online chatter, Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
[doi> 10.1145/1081870.1081883]
|
 |
8
|
Daniel Gruhl , R. Guha , David Liben-Nowell , Andrew Tomkins, Information diffusion through blogspace, Proceedings of the 13th international conference on World Wide Web, May 17-20, 2004, New York, NY, USA
[doi> 10.1145/988672.988739]
|
| |
9
|
Thomas Hofmann. Probabilistic latent semantic analysis. In UAI'99 1999.
|
 |
10
|
Wolfgang Jank , Galit Shmueli , Shanshan Wang, Dynamic, real-time forecasting of online auctions via functional models, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
[doi> 10.1145/1150402.1150471]
|
| |
11
|
Jaap Kamps and Maarten Marx. Words with attitude. In Proc. of the First International Conference on Global WordNet pages 332--341, 2002.
|
 |
12
|
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
 |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Technorati. URL:http://technorati. com/about/. Retrieved on January 27, 2007.
|
| |
22
|
B. L. Tseng, J. Tatemura, and Y. Wu. Tomographic clustering to visualize blog communities as mountain views. In Proc. of 2nd Annual Workshop on the Weblogging Ecosystem 2005.
|
| |
23
|
|
 |
24
|
|
 |
25
|
|
CITED BY 3
|
|
|
|
|
Munmun De Choudhury , Hari Sundaram , Ajita John , Dorée Duncan Seligmann, What makes conversations interesting?: themes, participants and consequences of conversations in online social media, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
Adam Jatowt , Kensuke Kanazawa , Satoshi Oyama , Katsumi Tanaka, Supporting analysis of future-related information in news archives and the web, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
|
|