ACM Home Page
Please provide us with feedback. Feedback
Web mining research: a survey
Full text PdfPdf (1.58 MB)
Source ACM SIGKDD Explorations Newsletter archive
Volume 2 ,  Issue 1  (June 2000) table of contents
Pages: 1 - 15  
Year of Publication: 2000
ISSN:1931-0145
Authors
Raymond Kosala  Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, B-3001 Heverlee, Belgium
Hendrik Blockeel  Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, B-3001 Heverlee, Belgium
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 171,   Downloads (12 Months): 970,   Citation Count: 83
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/360402.360406
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
[2] S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. L. Wiener. The lorel query language for semistructured data. Int. J. on Digital Libraries, 1(1):68-88, 1997.
 
3
 
4
[4] H. Ahonen, O. Heinonen, M. Klemettinen, and A. Verkamo. Finding co-occurring text phrases by combining sequence and frequent set discovery. In R. Feldman, editor, Proceedings of 16th International Joint Conference on Artificial Intelligence IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications, pages 1-9, 1999.
 
5
[5] J. Allan, J. Carbonell, G. Doddington, J. Yamron, and Y. Yang. Topic detection and tracking pilot study: Final report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1998, 1998.
6
 
7
[7] D. E. Appelt and D. Israel. Introduction to information extraction technology. In Proceedings of 16th International Joint Conference on Artificial Intelligence IJCAI-99, Tutorial, 1999.
 
8
9
 
10
11
 
12
13
 
14
 
15
[15] J. Borges and M. Levene. Mining association rules in hypertext databases. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), August 27-31, 1998, New York City, New York, USA, 1998.
 
16
 
17
18
19
 
20
[20] J. Carbonell, M. Craven, S. Fienberg, T. Mitchell, and Y. Yang. Report on the conald workshop on learning from text and the web. In CONALD Workshop on Learning from Text and the Web, June, 1998, 1998.
 
21
 
22
[22] C. Cardie. Empirical methods in information extraction. AI Magazine, 18(4):65-79, 1997.
23
 
24
25
 
26
[26] S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Widom. The tsimmis project: Integration of heterogeneous information sources. In Proceedings of the 10th Meeting of the Information Processing Society of Japan, pages 7-18, 1994.
 
27
[27] W.W. Cohen. Learning to classify english text with ilp methods. In Advances in Inductive Logic Programming (Ed. L. De Raedt), IOS Press, 1995.
 
28
[28] W.W. Cohen. Some practical observations on integration of web information. In ACM SIGMOD Workshop on The Web and Databases (WebDB'99), pages 55-60, Philadelphia, Pennsylvania, USA, 1999.
 
29
[29] W. W. Cohen. What can we learn from the web? In Proceedings of the Sixteenth International Conference on Machine Learning (ICML'99), pages 515-521, 1999.
 
30
 
31
[31] R. Cooley, B. Mobasher, and J. Srivastava. Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems, 1(1), 1999.
 
32
33
 
34
 
35
 
36
[36] S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391-407, 1990.
 
37
 
38
[38] J. A. Delgado. Agent-Based Information Filtering and Recommender System On the Internet. PhD thesis, Dept. of Intelligence Computer Science, Nagoya Institute of Technology, March 2000.
39
 
40
[40] J. S. T. Eliassi-Rad. Intelligent agents for web-based tasks: An advice-taking approach. In Working Notes of the AAAI/ICML-98 Workshop on Learning for Text Categorization, Madison, WI, pages 588-589, 1999.
41
 
42
 
43
 
44
[44] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. Knowledge discovery and data mining: toward a unifying framework. In Proceeding of The Second Int. Conference on Knowledge Discovery and Data Mining , pages 82-88, 1996.
 
45
[45] R. Feldman and I. Dagan. Knowledge discovery in textual databases (kdt). In Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD-95), pages 112-117, Montreal, Canada, 1995.
 
46
 
47
[47] D. Fensel, C. Knoblock, N. Kushmerick, and M.-C. Rousset. Workshop on intelligent information integration (iii'99). AI Magazine, 21(1):91-94, 2000.
48
 
49
50
 
51
 
52
 
53
 
54
55
 
56
 
57
[57] R. Goldman and J. Widom. Approximate dataguides. In Proceedings of the Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats , 1999.
 
58
[58] S. Green, L. Hurst, B. Nangle, P. Cunningham, F. Somers, and R. Evans. Software agents: A review. Technical Report TCD-CS-1997-06, Technical Report of Trinity College, University of Dublin, 1997.
 
59
 
60
[60] J. Hammer, H. Garcia-Molina, J. Cho, A. Crespo, and R. Aranha. Extracting semistructured information from the web. In Proceedings of the Workshop on Management of Semistructured Data, pages 18-25, 1997.
 
61
 
62
 
63
 
64
[64] S. J. Hong and S. M. Weiss. Advances in predictive model generation for data mining. Technical Report Report RC-21570, IBM Research Report, 1999.
 
65
[65] T. Honkela, S. Kaski, K. Lagus, and T. Kohonen. Websom - self-organizing maps of document collections. In Proc. of Workshop on Self-Organizing Maps 1997 (WSOM'97), pages 310-315, 1997.
 
66
 
67
 
68
[68] T. Joachims, D. Freitag, and T. Mitchell. Webwatcher: A tour guide for the world wide web. In Proceedings of the International Joint Conference on Artificial Intelligence IJCAI-97, pages 770-777, 1997.
 
69
 
70
[70] H. L. K. Wang. Discovering association of structure from semistructured objects. To appear in IEEE Transactions on Knowledge and Data Engineering, 1999.
 
71
[71] H. Kargupta, I. Hamzaogiu, and B. Stafford. Distributed data mining using an agent based architecture. In Proceedings of Knowledge Discovery And Data Mining, pages 211-214. AAAI Press, 1997.
 
72
[72] H. Kautz, B. Selman, and M. Shah. The hidden web. Al magazine, 18(2):27-36, 1997.
 
73
 
74
 
75
[75] Y. Kodratoff. About knowledge discovery in texts: A definition and an example. In Proc. of Advanced Course on Artificial Intelligence 1999 (ACAI-99) on Machine Learning Applications (Invited talk), 1999.
 
76
 
77
 
78
[78] N. Kushmerick, D. Weld, and R. Doorenbos. Wrapper induction for information extraction. In Proceedings of the International Joint Conference on Artificial Intelligence IJCAI-97, pages 729-737, 1997.
 
79
 
80
 
81
[81] S. Lawrence and C. L. Giles. Accessibility of information on the web. Nature, 400:107-109, 1999.
 
82
 
83
[83] B. Lent, R. Agrawal, and R. Srikant. Discovering trends in text databases. In Proc. 3rd Int Conf. On Knowledge Discovery and Data Mining (KDD 1997), pages 227-230, 1997.
 
84
 
85
86
87
 
88
 
89
90
 
91
 
92
 
93
[93] I. Muslea. Extraction patterns for information extraction tasks: A survey. In AAAI-99 Workshop on Machine Learning for Information Extraction, 1999.
 
94
[94] I. Muslea, S. Minton, and C. Knoblock. Wrapper induction for semistructured, web-based information sources. In Proceedings of the Conference on Automatic Learning and Discovery CONALD-98, 1998.
 
95
96
97
 
98
[98] K. Nigam, J. Lafferty, and A. McCallum. Using maximum entropy for text classification. In Proceedings of the International Joint Conference on Artificial Intelligence IJCAI-99 Workshop on Machine Learning for Information Filtering, pages 61-67, 1999.
 
99
[99] G. Paliouras, C. Papatheodorou, V. Karkaletsis, P. Tzitziras, and C. D. Spyropoulos. Large-scale mining of usage data on web sites. In AAAI 2000 Spring Symposium on Adaptive User Interfaces, 2000.
 
100
[100] M. T. Pazienza, editor. Information Extraction: A multidisciplinary Approach to an Emerging Information Technology, volume 1299 of Lecture Notes in Computer Science. International Summer School, SCIE-97, Frascati (Rome), Springer, 1997.
 
101
[101] M. T. Pazienza, editor. Information Extraction, Frascati (Rome), 1999. International Summer School, SCIE-99, Frascati (Rome).
 
102
[102] G. Piatetsky-Shapiro, R. Braachman, T. Khabaza, W. Kloesgen, and E. Simoudis. An overview of issues in developing industrial data mining and knowledge discovery applications. In Proceeding of The Second Int. Conference on Knowledge Discovery and Data Mining, 1996, pages 89-95, 1996.
 
103
[103] M. Rajman and R. Besançon. Text mining - knowledge extraction from unstructured textual data. In Proc. of 6th Conference of International Federation of Classification Societies (IFCS-98), Roma (Italy), pages 473- 480, 1998.
 
104
 
105
106
 
107
 
108
 
109
[109] L. Singh, B. Chen, R. Haight, P. Scheuermann, and K. Aoki. A robust system architecture for mining semistructured data. In Proceeding of The Second Int. Conference on Knowledge Discovery and Data Mining, 1998, pages 329-333, 1998.
 
110
 
111
 
112
113
 
114
 
115
[115] A.-H. Tan. Text mining: The state of the art and the challenges. In Proc of the Pacific Asia Conf on Knowledge Discovery and Data Mining PAKDD'99 workshop on Knowledge Discovery from Advanced Databases, pages 65-70, 1999.
 
116
[116] H. Toivonen. On knowledge discovery in graph-structured data. In Workshop on Knowledge Discovery from Advanced Databases (KDAD'99), pages 26-31, 1999.
 
117
 
118
 
119
 
120
[120] K. Wang and H. Liu. Schema discovery for semistructured data. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD'97), pages 271-274, 1997.
 
121
 
122
[122] W. Wiener, J. Pedersen, and A. Weigend. A neural network approach to topic spotting. In Proceedings of the 4th Symposium on Document Analysis and Information Retrieval (SDAIR 95), pages 317-332, 1995.
 
123
 
124
 
125
 
126
 
127
[127] O. Zaïane and J. Han. Webml: Querying the world-wide web for resources and knowledge. In Proc. ACM CIKM'98 Workshop on Web Information and Data Management (WIDM'98), pages 9-12, 1998.
128

CITED BY  83

Collaborative Colleagues:
Raymond Kosala: colleagues
Hendrik Blockeel: colleagues