ACM Home Page
Please provide us with feedback. Feedback
On-line discovery of temporal-difference networks
Full text PdfPdf (341 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 632-639  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Takaki Makino  Tokyo University, Kashiwa-shi, Chiba, Japan
Toshihisa Takagi  Research Organization of Information and Systems, Tokyo, Japan
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 24,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390236
What is a DOI?

ABSTRACT

We present an algorithm for on-line, incremental discovery of temporal-difference (TD) networks. The key contribution is the establishment of three criteria to expand a node in TD network: a node is expanded when the node is well-known, independent, and has a prediction error that requires further explanation. Since none of these criteria requires centralized calculation operations, they are easily computed in a parallel and distributed manner, and scalable for bigger problems compared to other discovery methods of predictive state representations. Through computer experiments, we demonstrate the empirical effectiveness of our algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Cassandra, A. (1999). Tony's POMDP file repository page. URL http://www.cs.brown.edu/research/ai/pomdp/examples/index.html.
 
3
4
 
5
Littman, M. L., Sutton, R. S., & Singh, S. (2002). Predictive representations of state. In Advances in neural information processing systems 14, 1555--1561. MIT Press.
 
6
McCracken, P. (2005). An online algorithm for discovery and learning of predictive state representations. Master's thesis, University of Alberta.
 
7
McCracken, P., & Bowling, M. (2006). Online discovery and learning of predictive state representations. In Advances in neural information processing systems 18, 875--882. MIT Press.
8
 
9
Rudary, M. R., & Singh, S. (2004). A nonlinear predictive state representation. In Advances in neural information processing systems 16. MIT Press.
 
10
Sutton, R. S., & Tanner, B. (2005). Temporal-difference networks. In Advances in neural information processing systems 17, 1377--1384. MIT Press.
11
 
12
Tanner, B., & Sutton, R. S. (2005b). Temporal-difference networks with history. In Proc. of IJCAI'05, 865--870.
13


Collaborative Colleagues:
Takaki Makino: colleagues
Toshihisa Takagi: colleagues