ACM Home Page
Please provide us with feedback. Feedback
Semantic text classification of disease reporting
Full text PdfPdf (191 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Amsterdam, The Netherlands
POSTER SESSION: Posters table of contents
Pages: 747 - 748  
Year of Publication: 2007
ISBN:978-1-59593-597-7
Authors
Yi Zhang  University of Illinois at Chicago, Chicago, IL
Bing Liu  University of Illinois at Chicago, Chicago, IL
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 97,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1277741.1277889
What is a DOI?

ABSTRACT

Traditional text classification studied in the IR literature is mainly based on topics. That is, each class or category represents a particular topic, e.g., sports, politics or sciences. However, many real-world text classification problems require more refined classification based on some semantic aspects. For example, in a set of documents about a particular disease, some documents may report the outbreak of the disease, some may describe how to cure the disease, some may discuss how to prevent the disease, and yet some others may include all the above information. To classify text at this semantic level, the traditional "bag of words" model is no longer sufficient. In this paper, we report a text classification study at the semantic level and show that sentence semantic and structure features are very useful for such kind of classification. Our experimental results based on a disease outbreak dataset demonstrated the effectiveness of the proposed approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Joachims, T. Making large-Scale SVM Learning Practical. Advances in Kernel Methods -- Support Vector Learning. Ed. Schölkopf, B., Burges, C. and Smola, A. MIT-Press, 1999.
 
2
 
3
McCallum, A. K. Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering. http://www.cs.cmu.edu/~mccallum/bow, 1996.
 
4
ProMED-mail. http://www.promedmail.org, 2007.
 
5
6