|
ABSTRACT
We consider classification of email messages as to whether or not they contain certain "email acts", such as a request or a commitment. We show that exploiting the sequential correlation among email messages in the same thread can improve email-act classification. More specifically, we describe a new text-classification algorithm based on a dependency-network based collective classification method, in which the local classifiers are maximum entropy models based on words and certain relational features. We show that statistically significant improvements over a bag-of-words baseline classifier can be obtained for some, but not all, email-act classes. Performance improvements obtained by collective classification appears to be consistent across many email acts suggested by prior speech-act theory.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
V.R. Carvalho, W. Wu, W.W. Cohen and J. Kleinberg. Predicting Leadership Roles in Email Workgroups. Work in Progress, http://www.cs.cmu.edu/~vitor/publications.html.
|
| |
4
|
W.W. Cohen, V.R. Carvalho and T.M. Mitchell. Learning to Classify Email into "Speech Acts". Proceedings of the EMNLP, Barcelona, Spain, July 2004.
|
| |
5
|
W.W. Cohen. Minorthird: Methods for Identifying Names and Ontological Relations in Text using Heuristics for Inducing Regularities from Data. In http://minorthird.sourceforge.net, 2004.
|
 |
6
|
Soumen Chakrabarti , Byron Dom , Piotr Indyk, Enhanced hypertext categorization using hyperlinks, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.307-318, June 01-04, 1998, Seattle, Washington, United States
|
| |
7
|
S. Geman and D. Geman. Stochastic Relaxation, Gibbs Distributions and the Bayesian Restoration of Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, (6):721--741, 1984.
|
| |
8
|
David Heckerman , David Maxwell Chickering , Christopher Meek , Robert Rounthwaite , Carl Kadie, Dependency networks for inference, collaborative filtering, and data visualization, The Journal of Machine Learning Research, 1, p.49-75, 9/1/2001
|
 |
9
|
|
| |
10
|
R.E. Kraut, S.R. Fussell, F.J. Lerch,and A. Espinosa. A. Coordination in Teams: Evidence from a Simulated Management Game. To appear in the Journal of Organizational Behavior.
|
 |
11
|
|
| |
12
|
H. Murakoshi, A. Shimazu, and K. Ochimizu. Construction of Deliberation Structure in Email Communication. Pacific Association for Computational Linguistics, 1999.
|
| |
13
|
J. Neville and D. Jensen. Iterative Classification in Relational Data. AAAI-2000 Workshop on Learning Statistical Models from Relational Data. AAAI Press, 2000.
|
| |
14
|
J. Neville, D. Jensen, and J. Rattigan. Statistical Relational Learning: Four Claims and a Survey. Workshop on Learning Statistical Models from Relational Data, 18th IJCAI, 2003.
|
| |
15
|
|
| |
16
|
M. Schoop. A Language-Action Approach to Electronic Negotiations. Proc. of the Eighth Annual Working Conference on Language-Action Perspective on Communication Modelling, 2003.
|
| |
17
|
|
CITED BY 9
|
|
Jaime Arguello , Brian S. Butler , Elisabeth Joyce , Robert Kraut , Kimberly S. Ling , Carolyn Rosé , Xiaoqing Wang, Talk to me: foundations for successful individual-group interactions in online communities, Proceedings of the SIGCHI conference on Human Factors in computing systems, April 22-27, 2006, Montréal, Québec, Canada
|
|
|
|
|
|
Donghui Feng , Erin Shaw , Jihie Kim , Eduard Hovy, Learning to detect conversation focus of threaded discussions, Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, p.208-215, June 04-09, 2006, New York, New York
|
|
|
|
|
|
Lida Li , Michael J. Muller , Werner Geyer , Casey Dugan , Beth Brownholtz , David R. Millen, Predicting individual priorities of shared activities using support vector machines, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, November 06-10, 2007, Lisbon, Portugal
|
|
|
|
|
|
K. Selçuk Candan , Mehmet E. Dönderler , Terri Hedgpeth , Jong Wook Kim , Qing Li , Maria Luisa Sapino, SEA: Segment-enrich-annotate paradigm for adapting dialog-based content for improved accessibility, ACM Transactions on Information Systems (TOIS), v.27 n.3, p.1-45, May 2009
|
|
|
|
|
|
Qiankun Zhao , Prasenjit Mitra , Bi Chen, Temporal and information flow based event detection from social text streams, Proceedings of the 22nd national conference on Artificial intelligence, p.1501-1506, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|