|
ABSTRACT
In this paper, we present a new multithreaded framework for information extraction with Java in heterogeneous enterprise application environments, which frees the developer from having to deal with the error-prone task of low-level thread programming. The power of this framework is demonstrated by an example of extracting product prices from web sites, but the framework is useful for numerous other purposes, too. Strong points of the framework are its performance, continuous feedback, and adherence to maximum response times. The description of the framework uses UML modeling techniques for visualizing multithreading. Moreover, we tackle Java problems of stopping running threads.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Robert B. Doorenbos , Oren Etzioni , Daniel S. Weld, A scalable comparison-shopping agent for the World-Wide Web, Proceedings of the first international conference on Autonomous agents, p.39-48, February 05-08, 1997, Marina del Rey, California, United States
[doi> 10.1145/267658.267666]
|
| |
3
|
Eikvil, L. (1999): Information Extraction from World Wide Web - A Survey. Norwegian Computing Center, P. B. 114 Blindern, N-0314 Oslo, Norwegen, Rapport Nr. 945
|
| |
4
|
|
 |
5
|
|
| |
6
|
Krulwich, B. T. (1996): The BargainFinder Agent - Comparison Price Shopping on the Internet, in: Williams, Joseph (ed.): Bots and other Internet Beasties, Sams. net Publishing (Macmillan), pp. 257--263
|
| |
7
|
Stefan Kuhlins , Ross Tredwell, Toolkits for Generating Wrappers, Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World, p.184-198, October 07-10, 2002
|
| |
8
|
Kushmerick, N. (1998): (Toward) an Extensible Wrapper Repository Standard, in: Proc. Workshop on AI & Information Integration, AAAI-98 (Madison), http://www.cs.ucd.ie/staff/nick/home/research/download/kushmerick-aaai98-aiii-panel.ps.gz
|
| |
9
|
Kushmerick, N. (2002): Gleaning Answers from the Web. Position paper, AAAI 2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases.
|
| |
10
|
|
| |
11
|
Roth, M. T. and Schwarz, P. (1997): A Wrapper Architecture for Legacy Data Sources. IBM Almaden Research Center.
|
| |
12
|
Schader, M., and Korthaus, A. (1998): Modeling Java Threads in UML. In: Schader, M., and Korthaus, A. (eds.): The Unified Modeling Language - Technical Aspects and Applications. Physica, Heidelberg, New York, pp. 122--143
|
| |
13
|
Sun Microsystems (2003): Java 2 Platform, Standard Edition, v 1.4.2, API Specification, Class Thread, http://java.sun.com/j2se/1.4.2/docs/api/java/lang/Thread.html#stop()
|
| |
14
|
|
|