|
ABSTRACT
We present new procedures for inferring the structure of a finite-state automaton (FSA) from its input/output behavior, using access to the automaton to perform experiments.
Our procedures use a new representation for finite automata, based on the notion of equivalence between tests. We call the number of such equivalence classes the diversity of the automaton; the diversity may be as small as the logarithm of the number of states of the automaton. For the special class of permutation automata, we describe an inference procedure that runs in time polynomial in the diversity and log(1/&dgr;), where &dgr; is a given upper bound on the probability that our procedure returns an incorrect result. (Since our procedure uses randomization to perform experiments, there is a certain controllable chance that it will return an erroneous result.) We also discuss techniques for handling more general automata.
We present evidence for the practical efficiency of our approach. For example, our procedure is able to infer the structure of an automaton based on Rubik's Cube (which has approximately 1019 states) in about 2 minutes on a DEC MicroVax. This automaton is many orders of magnitude larger than possible with previous techniques, which would require time proportional at least to the number of global states. (Note that in this example, only a small fraction (10-14) of the global states were even visited.)
Finally, we present a new procedure for inferring automata of a special type in which the global state is composed of a vector of binary local state variables, all of which are observable (or visible) to the experimenter. Our inference procedure runs provably in time polynomial in the size of this vector (which happens to be the diversity of the automaton), even though the global state space may be exponentially larger. The procedure plans and executes experiments on the unknown automaton; we show that the number of input symbols given to the automaton during this process is (to within a constant factor) the best possible.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
~ANGLUIN, D. 1978. On the complexity of mlmmum inference of regular sets. Inf. Cont. 39, ~337-350.
|
 |
2
|
|
| |
3
|
|
| |
4
|
~ANGLUIN, m. 1987b. A note on diversity. Unpublished manuscript.
|
| |
5
|
~BAINBR1DGE, E. S. 1977. The fundamental duality of system theory. In Systems: .4pproaches, ~Theones, Apphcattons, W. E. Hartnett, Ed. Reidel, Dordrecht, Holland, pp. 45-61
|
| |
6
|
~BERMAN, A. AND PLEMMONS, R. J. 1979. Nonnegatwe 3lattices m the Mathematical Sciences. ~Academic Press, Orlando, Fla.
|
| |
7
|
~CORMEN, T. H., LEiSERSON, C. E., AND RIVEST, R. L. 1990. hztroductton to/t{gortthms. MIT ~Press, Cambridge, Mass.
|
| |
8
|
~DEAN, T., ANGLUIN, D., BASYE, K., ENGELSON, S., KAELBLING, L., KOKKEVIS, E., AND MARON, O. ~1992. Inferring finite automata with stochastic output functions and an application to map ~learning. In Proceedings of the lOth Natzoua/ Confereuce on ,4rtificial bztelhgence (July). MIT ~Press, Cambridge, Mass., pp. 208-214.
|
| |
9
|
|
| |
10
|
~DRESCHER, G. L. 1987. A mechanism for early Piagetian learning. In Proceedings of AAAI-87.' ~Sixth National Conference on Artificial Intelligence (Seattle, Wash., July). Morgan-Kaufmann, San ~Mateo, Calif., pp. 290-294.
|
| |
11
|
~FIEDLER, M. 1972. Bounds for eigenvalues of doubly stochastic matrices. Lin..4lg Appl. 5, 3 ~(July), 299-310.
|
| |
12
|
~F1LL, J. A. 1991. Eigenvalue bounds on convergence to stationarity for nonreversible Markov ~ chains, with an apphcation to the exclusion process. Ann Applied Prob. l, 1, 62-87.
|
| |
13
|
~FRANKLIN, J. N. 1968. Matrix Theory. Prentice-Hall, Englewood Cliffs, N.J.
|
| |
14
|
~GOLD, E. M. 1967. Language identification in the limit, h~f. Cont. 10, 447-474.
|
| |
15
|
~GOLD, E. M. 1972. System identification via state characterization. Autornattca, 8, 621 636.
|
| |
16
|
~GOLD, E. M. 1978. Complexity of automaton identification from given data. Inf. Cont. 37, ~302-320.
|
| |
17
|
|
 |
18
|
|
| |
19
|
~KOHAVL Z. 1978. Swztchzng and Fintte Automata Theoo,. McGraw-Hill, New York.
|
| |
20
|
~LOV/SZ, L. 1979. Combbmtorial Problems and Exercises. North-Holland, Amsterdam, The ~Netherlands.
|
| |
21
|
|
 |
22
|
|
| |
23
|
|
 |
24
|
|
| |
25
|
~ DE SAINT-EXUPERY, A. 1943. The Little Prfi~ce. Harcourt, Brace, & World, New York.
|
| |
26
|
|
 |
27
|
|
| |
28
|
~TRAKHTENBROT, B. A., AND BARZDIN', YA. M. 1973. Finite Automata: Behacior and Synthesis. ~North-Holland, Amsterdam, The Netherlands.
|
CITED BY 7
|
|
|
|
|
|
|
|
|
Thomas Huining Feng , Lynn Wang , Wei Zheng , Sri Kanajan , Sanjit A. Seshia, Interactive presentation: Automatic model generation for black box real-time systems, Proceedings of the conference on Design, automation and test in Europe, April 16-20, 2007, Nice, France
|
|
|
|
|
Michael A. Bender , Antonio Fernández , Dana Ron , Amit Sahai , Salil Vadhan, The power of a pebble: exploring and mapping directed graphs, Proceedings of the thirtieth annual ACM symposium on Theory of computing, p.269-278, May 24-26, 1998, Dallas, Texas, United States
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|