ACM Home Page
Please provide us with feedback. Feedback
ARSA: a sentiment-aware model for predicting sales performance using blogs
Full text PdfPdf (207 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Amsterdam, The Netherlands
SESSION: Combination and fusion table of contents
Pages: 607 - 614  
Year of Publication: 2007
ISBN:978-1-59593-597-7
Authors
Yang Liu  York University, Toronto, ON, Canada
Xiangji Huang  York University, Toronto, ON, Canada
Aijun An  York University, Toronto, ON, Canada
Xiaohui Yu  York University, Toronto, ON, Canada
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 30,   Downloads (12 Months): 233,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1277741.1277845
What is a DOI?

ABSTRACT

Due to its high popularity, Weblogs (or blogs in short) present a wealth of information that can be very helpful in assessing the general public's sentiments and opinions. In this paper, we study the problem of mining sentiment information from blogs and investigate ways to use such information for predicting product sales performance. Based on an analysis of the complex nature of sentiments, we propose Sentiment PLSA (S-PLSA), in which a blog entry is viewed as a document generated by a number of hidden sentiment factors. Training an S-PLSA model on the blog data enables us to obtain a succinct summary of the sentiment information embedded in the blogs. We then present ARSA, an autoregressive sentiment-aware model, to utilize the sentiment information captured by S-PLSA for predicting product sales performance. Extensive experiments were conducted on a movie data set. We compare ARSA with alternative models that do not take into account the sentiment information, as well as a model with a different feature selection method. Experiments confirm the effectiveness and superiority of the proposed approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of Royal Statistical Society B(39):1--38, 1977.
5
 
6
Walter Enders. Applied Econometric Time Series Wiley, New York, 2nd edition, 2004.
7
8
 
9
Thomas Hofmann. Probabilistic latent semantic analysis. In UAI'99 1999.
10
 
11
Jaap Kamps and Maarten Marx. Words with attitude. In Proc. of the First International Conference on Global WordNet pages 332--341, 2002.
12
13
14
15
16
17
 
18
 
19
 
20
 
21
Technorati. URL:http://technorati. com/about/. Retrieved on January 27, 2007.
 
22
B. L. Tseng, J. Tatemura, and Y. Wu. Tomographic clustering to visualize blog communities as mountain views. In Proc. of 2nd Annual Workshop on the Weblogging Ecosystem 2005.
 
23
24
25


Collaborative Colleagues:
Yang Liu: colleagues
Xiangji Huang: colleagues
Aijun An: colleagues
Xiaohui Yu: colleagues