| Discovering critical edge sequences in E-commerce catalogs |
| Full text |
Pdf
(271 KB)
|
| Source
|
Electronic Commerce
archive
Proceedings of the 3rd ACM conference on Electronic Commerce
table of contents
Tampa, Florida, USA
Pages: 65 - 74
Year of Publication: 2001
ISBN:1-58113-387-1
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 19, Citation Count: 2
|
|
|
ABSTRACT
Web sites allow the collection of vast amounts of navigational data -- clickstreams of user traversals through the site. These massive data stores offer the tantalizing possibility of uncovering interesting patterns within the dataset. For e-businesses, always looking for an edge in the hyper-competitive online marketplace, this possibility is of particular interest. Of significant particular interest to e-businesses is the discovery of Critical Edge Sequences (CES), which denote frequently traversed subpaths in the catalog. CESs can be used to improve site performance and site management, increase the effectiveness of advertising on the site, and gather additional knowledge of customer interest patterns on the site.Using traditional graph-based and web mining strategies to find CESs could turn out to be expensive in both space and time. In this paper, we propose a method to compute the most popular paths bewteen node pairs in a catalog, which are then used to discover CESs. Our method is both space-efficient and accurate, providing a vast reduction in the storage requirement with a minimum impact on accuracy. This algorithm, executed off-line in batch mode, is also practical with respect to running time. As a variant of single-source shortest-path, it runs in log linear time.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
Roberto J. Bayardo, Jr. , Rakesh Agrawal, Mining the most interesting rules, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.145-154, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312219]
|
| |
6
|
|
| |
7
|
|
| |
8
|
A. Datta, D. VanderMeer, K. Ramamritham, and S. Navathe. Toward a comprehensive model of the content and structure of, and user interaction over, a web site. In Proceedings of the VLDB Workshop on Technologies for E-Services, Cairo, Egypt, September 2000.
|
| |
9
|
E. Dijkstra. A note on two problems in connexion with graphs. Numerische Mathematik, 1:269-271, 1959.
|
| |
10
|
K. Dutta, D. VanderMeer, A. Datta, and K. Ramamritham. Discovering critical edge sequences in e-commerce catalogs. Technical report, Chutney Technologies Technical Report TR2001-15, 2001.
|
| |
11
|
|
| |
12
|
|
| |
13
|
B. Mosbasher, N. Jain, E. Han, and J. Srivastava. Web mining: Pattern discovery from world wide web transactions. Technical Report 96-050, University of Minnesota, Dept. of Computer Science, Minneapolis, 1996.
|
| |
14
|
J. Pitkow and P. Priolli. Mining longest repeating subsequences to predict world wide web surfing. In Proceedings of USITS'99: The 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, Colorado, October 1999.
|
| |
15
|
D. Simpson. Corral your storage management costs. Datamation, pages 88-93, April 1997.
|
| |
16
|
M. Spiliopoulou, L. Faulstich, and K. Winkler. A data miner analyzing the navigaitional behavior of web users. In International Conference ofACAI'99: Workshop on Machine Learning in User Modelling, 1999.
|
| |
17
|
|
| |
18
|
M. Zaki, N. Lesh, and M. Ogihara. Planmine: Sequence mining for plan failures. In Proceedings of the 4th Intl. Conference on Knowledge Discovery and Data Mining, pages 369-373, 1998.
|
|