ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Rapid on-line temporal sequence prediction by an adaptive agent
Full text PdfPdf (355 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems table of contents
The Netherlands
SESSION: Papers: learning table of contents
Pages: 67 - 73  
Year of Publication: 2005
ISBN:1-59593-093-0
Authors
Steven Jensen  University of Minnesota
Daniel Boley  University of Minnesota
Maria Gini  University of Minnesota
Paul Schrater  University of Minnesota
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 25,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1082473.1082484
What is a DOI?

ABSTRACT

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning agent operating in a non-stationary environment. The challenge for a learning agent under these conditions is to exploit the relevant experience from a limited environmental event history while preserving flexibility.We propose a novel time/space efficient method for learning temporal sequences and making short-term predictions. Our method operates on-line, requires few exemplars, and adapts easily and quickly to changes in the underlying stochastic world model. Using a short-term memory of recent observations, the method maintains a dynamic space of candidate hypotheses in which the growth of the space is systematically and dynamically pruned using an entropy measure over the observed predictive quality of each candidate hypothesis.The method compares well against Markov-chain predictions, and adapts faster than learned Markov-chain models to changes in the underlying distribution of events. We demonstrate the method using both synthetic data and empirical experience from a game-playing scenario with human opponents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
D. Fudenberg and D. K. Levine. The Theory of Learning in Games. MIT Press, Cambridge, Massachusetts, 1999.
 
5
 
6
S. A. Huettel, P. B. Mack, and G. McCarthy. Perceiving patterns in random series: dynamic processing of sequence in prefrontal cortex. Nature Neuroscience, 5(5):485--490, May 2002.
 
7
G. A. Miller. The magical number 7 plus or minus two: Some limits on our capacity in processing information. Psychol. Rev., 63:81--97, 1956.
 
8
 
9
L. K. Saul and M. I. Jordan. Mixed memory Markov models: decomposing complex stochastic processes as mixtures of simpler ones. Machine Learning, pages 1--11, 1998.
 
10


Collaborative Colleagues:
Steven Jensen: colleagues
Daniel Boley: colleagues
Maria Gini: colleagues
Paul Schrater: colleagues