ACM Home Page
Please provide us with feedback. Feedback
Towards optimal sampling for flow size estimation
Full text PdfPdf (464 KB)
Source
Internet Measurement Conference archive
Proceedings of the 8th ACM SIGCOMM conference on Internet measurement table of contents
Vouliagmeni, Greece
SESSION: Sampling and probing table of contents
Pages 243-256  
Year of Publication: 2008
ISBN:978-1-60558-334-1
Authors
Paul Tune  University of Melbourne, Melbourne, Australia
Darryl Veitch  University of Melbourne, Melbourne, Australia
Sponsors
SIGCOMM: ACM Special Interest Group on Data Communication
SIGMETRICS: ACM Special Interest Group on Measurement and Evaluation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 24,   Downloads (12 Months): 285,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1452520.1452550
What is a DOI?

ABSTRACT

The flow size distribution is a useful metric for traffic modeling and management. It is well known however that its estimation based on sampled data is problematic. Previous work has shown that flow sampling (FS) offers enormous statistical benefits over packet sampling, however it suffers from high resource requirements and is not currently used in routers. In this paper we present Dual Sampling, which can to a large extent provide flow-sampling-like statistical performance for packet-sampling-like computational cost. Our work is grounded in a Fisher information based approach recently used to evaluate a number of sampling schemes, excluding however FS, for TCP flows. We show how to revise and extend the approach to include FS as well as DS and others, and how to make rigorous and fair comparisons. We show how DS significantly outperforms other packet based methods, but also prove that DS is inferior to flow sampling. However, since DS is a two-parameter family of methods which includes FS as a special case, DS can be used to approach flow sampling continuously. We then describe a packet sampling based implementation of DS and analyze its key computational costs to show that router implementation is feasible. Our approach offers insights into many issues, including how the notions of 'flow quality' and 'packet gain' can be used to understand the relative performance of methods, and how the problem of optimal sampling can be formulated. Our work is theoretical with some simulation support and a case study on Internet data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
4
 
5
G. Varghese, Network Algorithmics. San Francicso: Elsevier/Morgan Kaufmann, 2005.
 
6
L. Yang and G. Michailidis, "Estimation of flow lengths from sampled traffic," in Proc. GLOBECOM, San Francisco, CA, Nov 2006.
 
7
 
8
A. Hero, J. Fessler, and M. Usman, "Exploring estimator bias-variance tradeoffs using the Uniform CR bound," IEEE Trans. Sig. Proc., vol. 44, no. 8, pp. 2026--2041, Aug 1996.
 
9
P. Tune and D. Veitch, "Fisher information in flow size distribution estimation: Technical report," Dept. E&EE, The University of Melbourne, Tech. Rep., 2008, available: http://www.ee.unimelb.edu.au/people/lsptune/index.html/.
 
10
J. D. Gorman and A. O. Hero, "Lower bounds for parametric estimation with constraints," IEEE Trans. Info. Th., vol. 26, no. 6, pp. 1285--1301, Nov 1990.
 
11
D. Harville, Matrix Algebra from a Statistician's Perspective. Springer-Verlag, 1997.
 
12
J. E. Strum, "Binomial matrices," The Two Year College Mathematics Journal, vol. 8, no. 5, pp. 260--266, November 1977.
 
13
R. Zamir, "A proof of the Fisher information inequality via a data processing argument," IEEE Trans. Info. Th., vol. 44, no. 3, pp. 1246--1250, May 1998.
 
14
E. L. Lehmann and G. Casella, Theory of Point Estimation, 2nd ed., ser. Springer Texts in Statistics. Springer, 1998.
 
15
R. T. Rockafellar, Convex Analysis, ser. Princeton Landmarks in Mathematics and Physics. Princeton University Press, 1970.
16
 
17
NLANR, Leipzig-II Trace Data, http://pma.nlanr.net/Special/leip2.html.
 
18
F. Zhang, Matrix Theory: Basic Results and Techniques. Springer-Verlag, 1999.
 
19
K. S. Miller, "On the inverse of the sum of matrices," Mathematics Magazine, vol. 54, no. 2, March 1981.


Collaborative Colleagues:
Paul Tune: colleagues
Darryl Veitch: colleagues