ACM Home Page
Please provide us with feedback. Feedback
Interactive methods for taxonomy editing and validation
Full text PdfPdf (184 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the eleventh international conference on Information and knowledge management table of contents
McLean, Virginia, USA
SESSION: Poster session table of contents
Pages: 665 - 668  
Year of Publication: 2002
ISBN:1-58113-492-4
Authors
Scott Spangler  IBM Almaden Research Center, San Jose, CA
Jeffrey Kreulen  IBM Almaden Research Center, San Jose, CA
Sponsors
SIGMIS: ACM Special Interest Group on Management Information Systems
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 41,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/584792.584913
What is a DOI?

ABSTRACT

Taxonomies are meaningful hierarchical categorizations of documents into topics reflecting the natural relationships between the documents and their business objectives. Improving the quality of these taxonomies and reducing the overall cost required to create them is an important area of research. Supervised and unsupervised text clustering are important technologies that comprise only a part of a complete solution. However, there exists a great need for the ability for a human to efficiently interact with a taxonomy during the editing and validation phase. We have developed a comprehensive approach to solving this problem, and implemented this approach in a software tool called eClassifier. eClassifier provides features to help the taxonomy editor understand and evaluate each category of a taxonomy and visualize the relationships between the categories. Multiple techniques allow the user to make changes at both the category and document level. Metrics then establish how well the resultant taxonomy can be modeled for future document classification. In this paper, we present a comprehensive set of viewing, editing and validation techniques we have implemented in the Lotus Discovery Server resulting in a significant reduction in the time required to create a quality taxonomy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Dhillon, I., Modha, D., and Spangler, S. (1998). Visualizing Class Structures of Multi-Dimensional Data. Proceedings of 30th Conference on Interface, Computer Science and Statistics. May 1998.
 
4
Dom, B. (2001) "An Information-Theoretic External Cluster-Validity Measure", IBM Research Report RJ 10219, 10/5/2001
 
5
 
6
 
7
Jing, H., Barzilay, R., McKeown, K., and Elhadad, M. (1998) Summarization evaluation methods experiments and analysis. In AAAI Intelligent Text Summarization Workshop (Stanford, CA, Mar. 1998), pp. 60--68.
 
8
Pohs, W. (2001). Practical Knowledge Management - The Lotus Discovery Server System, IBM Press. 2001.
 
9
 
10
 
11
 
12
 
13
 
14


Collaborative Colleagues:
Scott Spangler: colleagues
Jeffrey Kreulen: colleagues