ACM Home Page
Please provide us with feedback. Feedback
Recovering latent time-series from their observed sums: network tomography with particle filters.
Full text PdfPdf (503 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
SESSION: Research track papers table of contents
Pages: 30 - 39  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Edoardo Airoldi  Carnegie Mellon University, Pittsburgh, PA
Christos Faloutsos  Carnegie Mellon University, Pittsburgh, PA
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 58,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014059
What is a DOI?

ABSTRACT

Hidden variables, evolving over time, appear in multiple settings, where it is valuable to recover them, typically from observed sums. Our driving application is 'network tomography', where we need to estimate the origin-destination (OD) traffic flows to determine, e.g., who is communicating with whom in a local area network. This information allows network engineers and managers to solve problems in design, routing, configuration debugging, monitoring and pricing. Unfortunately the direct measurement of the OD traffic is usually difficult, or even impossible; instead, we can easily measure the loads on every link, that is, sums of desirable OD flows.In this paper we propose i-FILTER, a method to solve this problem, which improves the state-of-the-art by (a) introducing explicit time dependence, and by (b) using realistic, non-Gaussian marginals in the statistical models for the traffic flows, as never attempted before. We give experiments on real data, where i-FILTER scales linearly with new observations and out-performs the best existing solutions, in a wide variety of settings. Specifically, on real network traffic measured at CMU, and at AT&T, i-FILTER reduced the estimation errors between 15% and 46% in all cases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. M. Airoldi. Advances in network tomography. Technical Report CMU-CALD-03-101, Carnegie Mellon University, 2003.
 
2
E. M. Airoldi and C. Faloutsos. Recovering Latent Time-Series from their Observed Sums. Technical Report CMU-CS-04-130, Carnegie Mellon University, 2004.
 
3
Y. Bejerano, Y. Breitbart, M. Garofalakis, and R. Rastogi. Physical topology discovery for large multi-subnet networks. In Proceedings of IEEE INFOCOM, pages 342--352, 2003.
4
 
5
J. Cao, D. Davis, S. Van Der Viel, and B. Yu. Time-varying network tomography: router link data. Journal of the American Statistical Association, 95:1063--75, 2000.
 
6
J. Cao, D. Davis, S. Van Der Viel, B. Yu, and Z. Zu. A scalable method for estimating network traffic matrices from link counts. Technical report, Bell Labs, 2001.
 
7
S. E. Fienberg. The geometry of an r x c contingency table. Annals of Mathematical Statistics, 39:1186--90, 1968.
 
8
S. E. Fienberg. The geometry of a two by two contingency table. Journal of the American Statistical Association, 65:694--701, 1970a.
 
9
W. R. Gilks and C. Berzuini. Following a moving target | monte carlo inference for dynamic bayesian models. Journal of the Royal Statistical Society, Series B, 63:127--146, 2001.
 
10
T. Higuchi. Self-organizing time series model. In A. Doucet, N. de Freitas, and N. Gordon, editors, Sequential Monte Carlo Methods in Practice. Springer-Verlag, 2001.
 
11
G. Liang and B. Yu. Pseudo-likelihood estimations in network tomography. In Proceedings of IEEE INFOCOM, 2003.
 
12
J. Liu. Monte Carlo Strategies in Scientific Computing. Springer-Verlag, 2001.
 
13
B. Mandelbrot. An informational theory of the statistical structure of language. In W. Jackson, editor, Communication Theory. Butterworths, 1953.
14
 
15
 
16
I. Rish, M. Brodie, N. Odintsova, S. Ma, and G. Grabarnik. Real-time problem determination in distributed systems using active probing. To appear in NOMS, 2004.
 
17
C. and G. Casella. Monte Carlo Statistical Methods. Springer-Verlag, 1999.
 
18
C. Tebaldi and M. West. Bayesian inference on network traffic using link count data. Journal of the American Statistical Association, 93:557--576, 1998.
 
19
R. J. Vanderbei and J. Iannone. An em approach to od matrix estimation. Technical Report SOR 94-04, Princeton University, 1994.
 
20
Y. Vardi. Network tomography: estimating source-destination traffic intensities from link data. Journal of the American Statistical Association, 91:365--377, 1996.
21
 
22
23
24
 
25
G. K. Zipf. Selected Studies of the Principle of Relative Frequency in Language. Harvard University Press, 1932.

Collaborative Colleagues:
Edoardo Airoldi: colleagues
Christos Faloutsos: colleagues