| Automatic scoring of online discussion posts |
| Full text |
Pdf
(115 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 2nd ACM workshop on Information credibility on the web
table of contents
Napa Valley, California, USA
SESSION: Analyzing social networks and discussion forums
table of contents
Pages 19-26
Year of Publication: 2008
ISBN:978-1-60558-259-7
|
|
Authors
|
|
Nayer Wanas
|
Cairo Microsoft Innovation Center, Abourawsh, Giza, Egypt
|
|
Motaz El-Saban
|
Cairo Microsoft Innovation Center, Abourawash, Giza, Egypt
|
|
Heba Ashour
|
Cairo Microsoft Innovation Center, Abourawash, Giza, Egypt
|
|
Waleed Ammar
|
Cairo Microsoft Innovation Center, Abourawash, Giza, Egypt
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 28, Downloads (12 Months): 201, Citation Count: 1
|
|
|
ABSTRACT
Online discussions forums, known as forums for short, are conversational social cyberspaces constituting rich repositories of content and an important source of collaborative knowledge. However, most of this knowledge is buried inside the forum infrastructure and its extraction is both complex and difficult. The ability to automatically rate postings in online discussion forums, based on the value of their contribution, enhances the ability of users to find knowledge within this content. Several key online discussion forums have utilized collaborative intelligence to rate the value of postings made by users. However, a large percentage of posts go unattended and hence lack appropriate rating. In this paper, we focus on automatic rating of postings in online discussion forums. A set of features derived from the posting content and the threaded discussion structure are generated for each posting. These features are grouped into five categories, namely (i) relevance, (ii) originality, (iii) forum-specific features, (iv) surface features, and (v) posting-component features. Using a non-linear SVM classifier, the value of each posting is categorized into one of three levels High, Medium, or Low. This rating represents a seed value for each posting that is leveraged in filtering forum content. Experimental results have shown promising performance on forum data.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Christian Borgs , Jennifer Chayes , Mohammad Mahdian , Amin Saberi, Exploring the community structure of newsgroups, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1016914]
|
| |
2
|
Chang, C., and Lin, C. 2001. LibSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
|
| |
3
|
Dikli, S., 2006. An Overview of Automatic Scoring of Essays. The Journal of Technology, Learning, and Assessment, Vol 5(1) August 2006, 3--35.
|
 |
4
|
Andrew T. Fiore , Scott Lee Tiernan , Marc A. Smith, Observed behavior and perceived value of authors in usenet newsgroups: bridging the gap, Proceedings of the SIGCHI conference on Human factors in computing systems: Changing our world, changing ourselves, April 20-25, 2002, Minneapolis, Minnesota, USA
[doi> 10.1145/503376.503434]
|
| |
5
|
|
 |
6
|
|
 |
7
|
Natalie Glance , Matthew Hurst , Kamal Nigam , Matthew Siegler , Robert Stockton , Takashi Tomokiyo, Deriving marketing intelligence from online discussion, Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
[doi> 10.1145/1081870.1081919]
|
 |
8
|
|
 |
9
|
|
| |
10
|
Lui, A., Li, S., and Choy, S. 2007. An Evaluation of Automatic Text Categorization in Online Discussion Analysis. In Proceedings of the Seventh IEEE International Conference on Advanced Learning Technologies (Niigata, Japan, July 18-20, 2007) ICALT 2007, IEEE Computer Society Press, New Jersey, NJ, 205--209
|
| |
11
|
Weimer, M., Gurevych, I., and Mühlhäuser, M. 2007. Automatically Assessing the Post Quality in Online Discussions on Software. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (Prague, Czech Republic, June 23-30, 2007). ACL2007 Volume P07-2, 125--128.
|
| |
12
|
Wu, Q., Burges, C. Svore, K, and Gao, J, 2008, Ranking, Boosting, and Model Adaptation, Technical Report, MSR-TR-2008-109, Microsoft Corporation, Redmond, WA, August 2008.
|
| |
13
|
|
|