A machine learning approach for the curation of biomedical literature: KDD Cup 2002 (task 1)
Source: ACM SIGKDD Explorations Newsletter
Volume 4, Issue 2 (December 2002)
Pages: 93-94
Year of Publication: 2002
ISSN:1931-0145
Authors
S. Sathiya Keerthi  National University of Singapore, Singapore
Chong Jin Ong  National University of Singapore, Singapore
Keng Boon Siah  National University of Singapore, Singapore
David B. L. Lim  National University of Singapore, Singapore
Wei Chu  National University of Singapore, Singapore
Min Shi  National University of Singapore, Singapore
David S. Edwin  National University of Singapore, Singapore
Rakesh Menon  National University of Singapore, Singapore
Lixiang Shen  National University of Singapore, Singapore
Jonathan Y. K. Lim  National University of Singapore, Singapore
Han Tong Loh  National University of Singapore, Singapore
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/772862.772875

ABSTRACT

In this paper, we present an automated text classification system for the classification of biomedical papers. This classification is based on whether there is experimental evidence for the expression of molecular gene products for specified genes within a given paper. The system performs pre-processing and data cleaning, followed by feature extraction from the raw text. It subsequently classifies the paper using the extracted features with a Naïve Bayes Classifier. Our approach has made it possible to classify (and curate) biomedical papers automatically, thus potentially saving considerable time and resources.
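
The pipeline described in the abstract (pre-processing, feature extraction, Naïve Bayes classification) can be illustrated with a minimal sketch. The abstract does not specify the tokenization, the feature set, or the Naïve Bayes variant, so everything below is an assumption: a bag-of-words multinomial model with Laplace smoothing, a hypothetical `tokenize` helper, invented training sentences, and invented label names (`evidence`, `no_evidence`). It is not the authors' system.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split into alphanumeric tokens.

    A stand-in for the pre-processing and feature-extraction steps;
    the paper's actual features are not given in the abstract.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes over token counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # smoothing constant (assumed)
        self.class_doc_counts = Counter()        # documents seen per class
        self.token_counts = defaultdict(Counter) # token counts per class
        self.vocabulary = set()

    def fit(self, documents, labels):
        for text, label in zip(documents, labels):
            tokens = tokenize(text)
            self.class_doc_counts[label] += 1
            self.token_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def log_scores(self, text):
        # Tokens never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        total_docs = sum(self.class_doc_counts.values())
        vocab_size = len(self.vocabulary)
        scores = {}
        for label, doc_count in self.class_doc_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(doc_count / total_docs)
            class_total = sum(self.token_counts[label].values())
            denominator = class_total + self.alpha * vocab_size
            for token in tokens:
                count = self.token_counts[label][token]
                score += math.log((count + self.alpha) / denominator)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Invented toy data, for illustration only.
train_docs = [
    "northern blot showed expression of the transcript in embryos",
    "antibody staining detected the protein in wing discs",
    "we review the genetics of the locus and its mutant alleles",
    "sequence analysis predicts a conserved domain in the gene",
]
train_labels = ["evidence", "evidence", "no_evidence", "no_evidence"]

model = NaiveBayesTextClassifier().fit(train_docs, train_labels)
prediction = model.predict(
    "in situ hybridization detected transcript expression in embryos"
)
```

Working in log space avoids numerical underflow when many token probabilities are multiplied, and the smoothing constant keeps a token unseen in one class from driving that class's probability to zero.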

