ACM Home Page
Please provide us with feedback. Feedback
Combining importance sampling and temporal difference control variates to simulate Markov Chains
Full text PdfPdf (209 KB)
Source ACM Transactions on Modeling and Computer Simulation (TOMACS) archive
Volume 14 ,  Issue 1  (January 2004) table of contents
Pages: 1 - 30  
Year of Publication: 2004
ISSN:1049-3301
Authors
R. S. Randhawa  Stanford University, Stanford, CA
S. Juneja  Tata Institute of Fundamental Research, Mumbai, India
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 63,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/974734.974735
What is a DOI?

ABSTRACT

It is well known that in estimating performance measures associated with a stochastic system a good importance sampling distribution (IS) can give orders of magnitude of variance reduction while a bad one may lead to large, even infinite, variance. In this paper we study how this sensitivity of the estimator variance to the importance sampling change of measure may be "dampened" by combining importance sampling with stochastic approximation based temporal difference (TD) method. We consider a finite state space discrete time Markov chain (DTMC) with one-step transition rewards and an absorbing set of states and focus on estimating the cumulative expected reward to absorption starting from any state. In this setting we develop sufficient conditions under which the estimate resulting from the combined approach has a mean square error that asymptotically equals zero even when the estimate formed by using only importance sampling change of measure has infinite variance. In particular, we consider the problem of estimating the small buffer overflow probability in a queuing network, where the change of measure suggested in literature is shown to have infinite variance under certain parameters and where the appropriate combination of IS and TD method can be empirically seen to have a much faster convergence rate compared to naive simulation.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Andradottir, S., Heyman, D., and Ott, T. J. 1995. On the choice of alternative measures in importance sampling with Markov chains. Operat. Res. 43, 3, 509--519.
 
3
 
4
Billingsley, P. 1995. Probability and Measure. John Wiley & Sons, New York, NY.
 
5
 
6
 
7
Crane, M. and Iglehart, D. 1975. Simulating stable stochastic systems, iii: Regenerative processes and discrete-event simulations. Operat. Res. 23, 33--45.
 
8
Dembo, A. and Zeitouni, O. 1992. Large Deviations Techniques and Applications. Jones and Bartlett, Boston, MA.
 
9
Frater, M., Lennon, T., and Anderson, B. 1991. Optimally efficient estimation of the statistics of rare events in queuing networks. IEEE Trans. Automat. Contr. 36, 12, 1395--1404.
10
 
11
Glasserman, P. and Wang, Y. 1997. Counterexamples in importance sampling for large deviations probabilities. Ann. Appl. Prob. 7, 731--746.
 
12
 
13
 
14
Heidelberger, P. 1980a. Variance reduction techniques for the simulation of Markov processes, i: Multiple estimates. IBM J. Res. Develop. 24, 570--581.
 
15
Heidelberger, P. 1980b. Variance reduction techniques for the simulation of Markov processes, ii: Matrix iterative methods. Acta Informatica 13, 21--37.
16
 
17
Hseih, M. and Glynn, P. 2002. Confidence regions for stochastic approximation algorithms. In Proceedings of the 2002 Winter Simulation Conference. 370--376.
 
18
 
19
Juneja, S. 2003. Efficient rare event simulation using importance sampling: An introduction. In Computational Mathematics, Modelling and Algorithms, J. C. Misra, Ed. Narosa Publishing House, New Delhi, India, 357--396.
 
20
 
21
Kushner, H. and Yin, G. 1997. Stochastic Approximation Algorithms and Applications. Springer-Verlag, New York, NY.
 
22
Parekh, S. and Walrand, J. 1989. A quick simulation method for excessive backlogs in networks of queues. IEEE Trans. Automat. Contr. 34, 1, 54--66.
 
23
 
24
Rubinstein, R. 1997. Optimization of computer simulation models with rare events. European J. Operat. Res. 99, 89--112.
 
25
Rubinstein, R. 1999. Rare event simulation via cross-entropy and importance sampling. Second Workshop on Rare Event Simulation (RESIM'99). 1--17.
 
26
 
27
 
28
 
29


Collaborative Colleagues:
R. S. Randhawa: colleagues
S. Juneja: colleagues