ACM Home Page
Please provide us with feedback. Feedback
Discovering relationships among categories using misclassification information
Full text PdfPdf (1.19 MB)
Source Symposium on Applied Computing archive
Proceedings of the 2008 ACM symposium on Applied computing table of contents
Fortaleza, Ceara, Brazil
SESSION: Data mining table of contents
Pages 932-937  
Year of Publication: 2008
ISBN:978-1-59593-753-7
Authors
Saket S. R. Mengle  Illinois Institute of Technology Chicago, Illinois
Nazli Goharian  Illinois Institute of Technology Chicago, Illinois
Alana Platt  Illinois Institute of Technology Chicago, Illinois
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 45,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1363686.1363899
What is a DOI?

ABSTRACT

Knowledge of relationships among categories is of the interest in different domains such as text classification, content analysis, and text mining. We propose and evaluate approaches to effectively identify relationships among document categories. Our proposed novel method capitalizes on the misclassification results of a text classifier to identify potential relationships among categories. We demonstrate that our system detects such relationships, even those relationships that assessors failed to identify in manual evaluation. Furthermore, we favorably compare the effectiveness of our methods with the state of art method and demonstrate a significant improvement in precision (34%) and recall (5%).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
20 News Groups dataset. http://people.csail.mit.edu/jrennie/20Newsgroups
2
 
3
 
4
Mengle S, Goharian N and Platt A., FACT: Fast Algorithm for Categorizing Text. The 5th IEEE International Conference on Intelligence and Security Informatics, 2007
 
5
Open Directory Project (http://dmoz.org)
6
 
7
8
 
9
Vapnik Vladimir, Statistical learning theory. Wiley, 1998
10
 
11
Yahoo Directories (http://dir.yahoo.com)
12
13


Collaborative Colleagues:
Saket S. R. Mengle: colleagues
Nazli Goharian: colleagues
Alana Platt: colleagues