ACM Home Page
Please provide us with feedback. Feedback
Learning with a non-exhaustive training dataset: a case study: detection of bacteria cultures using optical-scattering technology
Full text MovMov (25:21),  PdfPdf (769 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 279-288  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
M. Murat Dundar  Indiana University-Purdue University, Indianapolis, Indianapolis, IN, USA
E. Daniel Hirleman  Purdue University, West Lafayette, IN, USA
Arun K. Bhunia  Purdue University, West Lafayette, IN, USA
J. Paul Robinson  Purdue University, West Lafayette, IN, USA
Bartek Rajwa  Purdue University, West Lafayette, IN, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 96,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557055
What is a DOI?

ABSTRACT

For a training dataset with a nonexhaustive list of classes, i.e. some classes are not yet known and hence are not represented, the resulting learning problem is ill-defined. In this case a sample from a missing class is incorrectly classified to one of the existing classes. For some applications the cost of misclassifying a sample could be negligible. However, the significance of this problem can better be acknowledged when the potentially undesirable consequences of incorrectly classifying a food pathogen as a nonpathogen are considered. Our research is directed towards the real-time detection of food pathogens using optical-scattering technology. Bacterial colonies consisting of the progeny of a single parent cell scatter light at 635 nm to produce unique forward-scatter signatures. These spectral signatures contain descriptive characteristics of bacterial colonies, which can be used to identify bacteria cultures in real time. One bottleneck that remains to be addressed is the nonexhaustive nature of the training library. It is very difficult if not impractical to collect samples from all possible bacteria colonies and construct a digital library with an exhaustive set of scatter signatures. This study deals with the real-time detection of samples from a missing class and the associated problem of learning with a nonexhaustive training dataset. Our proposed method assumes a common prior for the set of all classes, known and missing. The parameters of the prior are estimated from the samples of the known classes. This prior is then used to generate a large number of samples to simulate the space of missing classes. Finally a Bayesian maximum likelihood classifier is implemented using samples from real as well as simulated classes. Experiments performed with samples collected for 28 bacteria subclasses favor the proposed approach over the state of the art.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Bae, P. P. Banada, K. Huff, A. K. Bhunia, J. P. Robinson, and E. D. Hirleman. Biophysical modeling of forward scattering from bacterial colonies using scalar diffraction theory. Applied Optics, 46(17):3639--48, June 2007.
 
2
P. P. Banada, S. Guo, B. Bayraktar, E. Bae, B. Rajwa, J. P. Robinson, E. D. Hirleman, and A. K. Bhunia. Optical forward-scattering for detection of listeria monocytogenes and other listeria species. Biosensors&Bioelectronics, 22(8):1664--71, Mar. 2007.
 
3
P. P. Banada, K. Huff, E. Bae, B. Rajwa, A. Aroonnual, B. Bayraktar, A. Adil, J. P. Robinson, E. D. Hirleman, and A. K. Bhunia. Label-free detection of multiple bacterial pathogens using light-scattering sensor. Biosensors&Bioelectronics, 24(6):1685--92, Feb. 2009.
 
4
B. Bayraktar, P. P. Banada, E. D. Hirleman, A. K. Bhunia, J. P. Robinson, and B. Rajwa. Feature extraction from light-scatter patterns of listeria colonies for identification and classification. Journal of Biomedical Optics, 11(3):34006, 2006.
 
5
 
6
 
7
T. H. J. Friedman and R. Tibshirani. Additive logistic regression: A statistical view of boosting. The Annals of Statistics, 38, 2000.
 
8
J. Munoz-Mari, L. Bruzzone, and G. Camps-Valls. A support vector domain description approach to supervised classification of remote sensing images. IEEE Transaction on Geoscience and Remote Sensing, 45(8):2683--2692, 2008.
 
9
B. Rajwa, M. Venkatapathi, K. Ragheb, P. P. Banada, E. D. Hirleman, T. Lary, and J. P. Robinson. Automated classification of bacterial particles in flow by multiangle scatter measurement and support vector machine classifier. Cytometry. Part A: The Journal of the International Society for Analytical Cytology, 73(4):369--79, Apr. 2008.
 
10
 
11
B. Scholkopf, R. Williamson, A. Smola, J. Shawe-Taylor, and J. Platt. Support vector method for novelty detection. volume 12, 2000.
 
12
E. J. Spinosa and A. C. Carvalho. Support vector machines for novel class detection in bioinformatics. Genet Mol Res., 4(3):608--15, 2005.
 
13
H. B. Steen. Light scattering measurement in an arc lamp-based flow cytometer. Cytometry, 11(2):223--30, 1990.
 
14
 
15
J. Theiler and D. M. Cai. Resampling approach for anomaly detection in multispectral images, 2003.
 
16
M. E. Tipping. The relevance vector machine. In S. Solla, T. Leen, and K.-R. Muller, editors, Advances in Neural Information Processing Systems 12, pages 652--658. MIT Press, Cambridge, MA, 2000.
 
17
 
18
P. J. Wyatt. Identification of bacteria by differential light scattering. Nature, 221(5187):1257--8, Mar. 1969.
 
19
P. J. Wyatt and D. T. Phillips. Structure of single bacteria from light scattering. Journal of Theoretical Biology, 37(3):493--501, Dec. 1972.

Collaborative Colleagues:
M. Murat Dundar: colleagues
E. Daniel Hirleman: colleagues
Arun K. Bhunia: colleagues
J. Paul Robinson: colleagues
Bartek Rajwa: colleagues