|
ABSTRACT
Identifying elephant flows is very important in developing effective and efficient traffic engineering schemes. In addition, obtaining the statistics of these flows is also very useful for network operation and management. On the other hand, with the rapid growth of link speed in recent years, packet sampling has become a very attractive and scalable means to measure flow statistics; however, it also makes identifying elephant flows become much more difficult. Based on Bayes' theorem, this paper develops techniques and schemes to identify elephant flows in periodically sampled packets. We show that our basic framework is very flexible in making appropriate trade-offs between false positives (misidentified flows) and false negatives (missed elephant flows) with regard to a given sampling frequency. We further validate and evaluate our approach by using some publicly available traces. Our schemes are generic and require <i>no</i> per-packet processing; hence, they allow a very cost-effective implementation for being deployed in large-scale high-speed networks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
 |
3
|
Nick Duffield , Carsten Lund , Mikkel Thorup, Estimating flow distributions from sampled flow statistics, Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications, August 25-29, 2003, Karlsruhe, Germany
[doi> 10.1145/863955.863992]
|
 |
4
|
|
 |
5
|
S. Ben Fred , T. Bonald , A. Proutiere , G. Régnié , J. W. Roberts, Statistical bandwidth sharing: a study of congestion at flow level, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.111-122, August 2001, San Diego, California, United States
|
 |
6
|
Lukasz Golab , David DeHaan , Erik D. Demaine , Alejandro Lopez-Ortiz , J. Ian Munro, Identifying frequent items in sliding windows over on-line packet streams, Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement, October 27-29, 2003, Miami Beach, FL, USA
[doi> 10.1145/948205.948227]
|
 |
7
|
Abhishek Kumar , Jun (Jim) Xu , Li Li , Jia Wang, Space-code bloom filter for efficient traffic flow measurement, Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement, October 27-29, 2003, Miami Beach, FL, USA
[doi> 10.1145/948205.948226]
|
| |
8
|
T. Mori, R. Kawahara, S. Naito, and S. Goto, "On the characteristics of Internet Traffic variability: Spikes and Elephants," In Proceedings of IEEE/IPSJ SAINT, pp. 99--106, Tokyo, Japan, Jan 2004
|
| |
9
|
NLANR: Abilene-I data set, http://pma.nlanr.net/Traces/long/ipls1.html
|
| |
10
|
NLANR: CESCA-I data set, http://pma.nlanr.net/Special/cesc1.html
|
| |
11
|
Cisco NetFlow, http://www.cisco.com/warp/public/732/netflow/index.html
|
| |
12
|
K. Papagiannaki, N. Taft, S. Bhattacharya, P. Thiran, K. Salamatian, and C. Diot, "On the feasibility of identifying elephants in internet backbone traffic. Sprint ATL Technical Report TR01-ATL-110918," Sprint Labs, November 2001.
|
| |
13
|
IETF Packet Sampling (psamp) Working Group, http://www.ietf.org/html.charters/psamp-charter.html
|
| |
14
|
InMon sFlow Probe, http://www.inmon.com/products/probes.php
|
| |
15
|
K. Thompson, G. J. Miller, and R. Wilder, "Wide-area internet traffic patterns and characteristics," IEEE Network, vol. 11, no. 6, pp. 10--23, November/December 1997.
|
 |
16
|
Yin Zhang , Lee Breslau , Vern Paxson , Scott Shenker, On the characteristics and origins of internet flow rates, Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications, August 19-23, 2002, Pittsburgh, Pennsylvania, USA
|
CITED BY 8
|
|
Claude Chaudet , Eric Fleury , Isabelle Guérin Lassous , Hervé Rivano , Marie-Emilie Voge, Optimal positioning of active and passive monitoring devices, Proceedings of the 2005 ACM conference on Emerging network experiment and technology, October 24-27, 2005, Toulouse, France
|
|
|
|
|
|
|
|
|
|
|
|
Jeffrey Erman , Anirban Mahanti , Martin Arlitt , Ira Cohen , Carey Williamson, Offline/realtime traffic classification using semi-supervised learning, Performance Evaluation, v.64 n.9-12, p.1194-1213, October, 2007
|
|
|
Naimul Basher , Aniket Mahanti , Anirban Mahanti , Carey Williamson , Martin Arlitt, A comparative analysis of web and peer-to-peer traffic, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
|
|
|
|
|
|
Patrick Loiseau , Paulo Gonçalves , Stéphane Girard , Florence Forbes , Pascale Vicat-Blanc Primet, Maximum likelihood estimation of the flow size distribution tail index from sampled packet data, Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, June 15-19, 2009, Seattle, WA, USA
|
|