ACM Home Page
Please provide us with feedback. Feedback
Diversity-based inference of finite automata
Full text PdfPdf (2.29 MB)
Source Journal of the ACM (JACM) archive
Volume 41 ,  Issue 3  (May 1994) table of contents
Pages: 555 - 589  
Year of Publication: 1994
ISSN:0004-5411
Authors
Ronald L. Rivest  MIT Lab. for Computer Science, Cambridge, MA
Robert E. Schapire  MIT Lab. for Computer Science, Cambridge, MA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 57,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/176584.176589
What is a DOI?

ABSTRACT

We present new procedures for inferring the structure of a finite-state automaton (FSA) from its input/output behavior, using access to the automaton to perform experiments. Our procedures use a new representation for finite automata, based on the notion of equivalence between tests. We call the number of such equivalence classes the diversity of the automaton; the diversity may be as small as the logarithm of the number of states of the automaton. For the special class of permutation automata, we describe an inference procedure that runs in time polynomial in the diversity and log(1/&dgr;), where &dgr; is a given upper bound on the probability that our procedure returns an incorrect result. (Since our procedure uses randomization to perform experiments, there is a certain controllable chance that it will return an erroneous result.) We also discuss techniques for handling more general automata. We present evidence for the practical efficiency of our approach. For example, our procedure is able to infer the structure of an automaton based on Rubik's Cube (which has approximately 1019 states) in about 2 minutes on a DEC MicroVax. This automaton is many orders of magnitude larger than possible with previous techniques, which would require time proportional at least to the number of global states. (Note that in this example, only a small fraction (10-14) of the global states were even visited.) Finally, we present a new procedure for inferring automata of a special type in which the global state is composed of a vector of binary local state variables, all of which are observable (or visible) to the experimenter. Our inference procedure runs provably in time polynomial in the size of this vector (which happens to be the diversity of the automaton), even though the global state space may be exponentially larger. The procedure plans and executes experiments on the unknown automaton; we show that the number of input symbols given to the automaton during this process is (to within a constant factor) the best possible.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
~ANGLUIN, D. 1978. On the complexity of mlmmum inference of regular sets. Inf. Cont. 39, ~337-350.
2
 
3
 
4
~ANGLUIN, m. 1987b. A note on diversity. Unpublished manuscript.
 
5
~BAINBR1DGE, E. S. 1977. The fundamental duality of system theory. In Systems: .4pproaches, ~Theones, Apphcattons, W. E. Hartnett, Ed. Reidel, Dordrecht, Holland, pp. 45-61
 
6
~BERMAN, A. AND PLEMMONS, R. J. 1979. Nonnegatwe 3lattices m the Mathematical Sciences. ~Academic Press, Orlando, Fla.
 
7
~CORMEN, T. H., LEiSERSON, C. E., AND RIVEST, R. L. 1990. hztroductton to/t{gortthms. MIT ~Press, Cambridge, Mass.
 
8
~DEAN, T., ANGLUIN, D., BASYE, K., ENGELSON, S., KAELBLING, L., KOKKEVIS, E., AND MARON, O. ~1992. Inferring finite automata with stochastic output functions and an application to map ~learning. In Proceedings of the lOth Natzoua/ Confereuce on ,4rtificial bztelhgence (July). MIT ~Press, Cambridge, Mass., pp. 208-214.
 
9
 
10
~DRESCHER, G. L. 1987. A mechanism for early Piagetian learning. In Proceedings of AAAI-87.' ~Sixth National Conference on Artificial Intelligence (Seattle, Wash., July). Morgan-Kaufmann, San ~Mateo, Calif., pp. 290-294.
 
11
~FIEDLER, M. 1972. Bounds for eigenvalues of doubly stochastic matrices. Lin..4lg Appl. 5, 3 ~(July), 299-310.
 
12
~F1LL, J. A. 1991. Eigenvalue bounds on convergence to stationarity for nonreversible Markov ~ chains, with an apphcation to the exclusion process. Ann Applied Prob. l, 1, 62-87.
 
13
~FRANKLIN, J. N. 1968. Matrix Theory. Prentice-Hall, Englewood Cliffs, N.J.
 
14
~GOLD, E. M. 1967. Language identification in the limit, h~f. Cont. 10, 447-474.
 
15
~GOLD, E. M. 1972. System identification via state characterization. Autornattca, 8, 621 636.
 
16
~GOLD, E. M. 1978. Complexity of automaton identification from given data. Inf. Cont. 37, ~302-320.
 
17
18
 
19
~KOHAVL Z. 1978. Swztchzng and Fintte Automata Theoo,. McGraw-Hill, New York.
 
20
~LOV/SZ, L. 1979. Combbmtorial Problems and Exercises. North-Holland, Amsterdam, The ~Netherlands.
 
21
22
 
23
24
 
25
~ DE SAINT-EXUPERY, A. 1943. The Little Prfi~ce. Harcourt, Brace, & World, New York.
 
26
27
 
28
~TRAKHTENBROT, B. A., AND BARZDIN', YA. M. 1973. Finite Automata: Behacior and Synthesis. ~North-Holland, Amsterdam, The Netherlands.

CITED BY  9

Collaborative Colleagues:
Ronald L. Rivest: colleagues
Robert E. Schapire: colleagues