ACM Home Page
Please provide us with feedback. Feedback
A noise model on learning sets of strings
Full text PdfPdf (715 KB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the fifth annual workshop on Computational learning theory table of contents
Pittsburgh, Pennsylvania, United States
Pages: 295 - 302  
Year of Publication: 1992
ISBN:0-89791-497-X
Authors
Yasubumi Sakakibara  International Institute for Advanced Study of Social Information Science (IIAS-SIS), Fujitsu Laboratories Ltd., 140, Miyamoto, Numazu, Shizuoka 410-03, Japan
Rani Siromoney  Madras Christian College, Tambaram, Madras 600 059, India
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 30,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/130385.130418
What is a DOI?

ABSTRACT

In this paper, we introduce a new noise model on learning sets of strings in the framework of PAC learning and consider the effect of the noise on learning. The instance domain is the set &Sgr;n of strings over a finite alphabet &Sgr;, and the examples are corrupted by purely random errors affecting only the instances (and not the labels). We consider three types of errors on instances, called EDIT operation errors. EDIT operations consist of “insertion”, “deletion”, and “change” of a symbol in a string. We call such a noise where the examples are corrupted by random errors of EDIT operations on instances the EDIT noise. First we show general upper bounds on the EDIT noise rate that a learning algorithm of taking the strategy of minimizing disagreements can tolerate and a learning algorithm can tolerate. Next we present an efficient algorithm that can learn a class of decision lists over the attributes “a string w contains a pattern p?” from noisy examples under some restriction on the EDIT noise rate.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
AL88
 
GS91
Sally A. Goldman and Robert H. Sloan. The difficulty of random attribute noise. Technical Report WUCS-91-29, Department of Computer Science, Washington University, 1991.
KL88
 
Li90
Ming Li. Towards a DNA sequencing theory. In Proceedings of 31st IEEE Symposium on Foundations of Computer Science, pages 125-134. IEEE Computer Society Press, 1990.
 
LV88
 
Riv87
 
Slo88
 
Ukk85
Val84
 
Val85
Leslie G. Valiant. Learning disjunctions of conjunctions. In Proceedings of 9ih International Joint Conference on Artificial Intelligence, pages 560-566. Morgan Kaufmann, 1985.


Collaborative Colleagues:
Yasubumi Sakakibara: colleagues
Rani Siromoney: colleagues