| Interactive methods for taxonomy editing and validation |
| Full text |
Pdf
(184 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the eleventh international conference on Information and knowledge management
table of contents
McLean, Virginia, USA
SESSION: Poster session
table of contents
Pages: 665 - 668
Year of Publication: 2002
ISBN:1-58113-492-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 41, Citation Count: 3
|
|
|
ABSTRACT
Taxonomies are meaningful hierarchical categorizations of documents into topics reflecting the natural relationships between the documents and their business objectives. Improving the quality of these taxonomies and reducing the overall cost required to create them is an important area of research. Supervised and unsupervised text clustering are important technologies that comprise only a part of a complete solution. However, there exists a great need for the ability for a human to efficiently interact with a taxonomy during the editing and validation phase. We have developed a comprehensive approach to solving this problem, and implemented this approach in a software tool called eClassifier. eClassifier provides features to help the taxonomy editor understand and evaluate each category of a taxonomy and visualize the relationships between the categories. Multiple techniques allow the user to make changes at both the category and document level. Metrics then establish how well the resultant taxonomy can be modeled for future document classification. In this paper, we present a comprehensive set of viewing, editing and validation techniques we have implemented in the Lotus Discovery Server resulting in a significant reduction in the time required to create a quality taxonomy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Dhillon, I., Modha, D., and Spangler, S. (1998). Visualizing Class Structures of Multi-Dimensional Data. Proceedings of 30th Conference on Interface, Computer Science and Statistics. May 1998.
|
| |
4
|
Dom, B. (2001) "An Information-Theoretic External Cluster-Validity Measure", IBM Research Report RJ 10219, 10/5/2001
|
| |
5
|
|
| |
6
|
|
| |
7
|
Jing, H., Barzilay, R., McKeown, K., and Elhadad, M. (1998) Summarization evaluation methods experiments and analysis. In AAAI Intelligent Text Summarization Workshop (Stanford, CA, Mar. 1998), pp. 60--68.
|
| |
8
|
Pohs, W. (2001). Practical Knowledge Management - The Lotus Discovery Server System, IBM Press. 2001.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
CITED BY 3
|
|
|
|
|
|
|
|
Indrajit Bhattacharya , Shantanu Godbole , Ajay Gupta , Ashish Verma , Jeff Achtermann , Kevin English, Enabling analysts in managed services for CRM analytics, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|