|
ABSTRACT
Scientists, economists, and planners in government, industry and academia spend much of their time accessing, integrating, and analyzing data. However, many of their studies are one-of-a-kind with little sharing and reuse for subsequent endeavors. The Argos project seeks to improve the productivity of analysts by providing a framework that encourages reuse of data sources and data processing operations, and by developing tools to generate data processing workflows. In this paper, we present an approach to automatically generate data processing workflows. First, we define a methodology for assigning formal semantics to data and operations according to a domain ontology, which allows sharing and reuse. Specifically, we define data contents using relational descriptions in an expressive logic. Second, we develop a novel planner that uses relational subsumption to connect the output of a data processing operation with the input of another. Our modeling methodology has the significant advantage that the planner can automatically insert adaptor operations wherever necessary to bridge the inputs and outputs of operations in the workflow. We have implemented the approach in a transportation modeling domain.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
José Luis Ambite , Yigal Arens , Eduard Hovy , Andrew Philpot , Luis Gravano , Vasileios Hatzivassiloglou , Judith Klavans, Simplifying Data Access: The Energy Data Collection Project, Computer, v.34 n.2, p.47-54, February 2001
|
| |
3
|
Jose Luis Ambite , Genevieve Giuliano , Peter Gordon , Andreas Harth , LanLan Wang , Qisheng Pan , Stefan Decker, Argos: dynamic composition of web services for goods movement analysis and planning, Proceedings of the 2004 annual national conference on Digital government research, p.1-2, May 24-26, 2004, Seattle, WA
|
| |
4
|
Jose Luis Ambite , Genevieve Giuliano , Peter Gordon , Qisheng Pan , Sandipan Bhattacharjee, Integrating heterogeneous data sources for better freight flow analysis and planning, Proceedings of the 2002 annual national conference on Digital government research, p.1-12, May 19-22, 2002, Los Angeles, California
|
| |
5
|
|
| |
6
|
|
| |
7
|
Patricia G. Baker , Andy Brass , Sean Bechhofer , Carole A. Goble , Norman W. Paton , Robert Stevens, TAMBIS: Transparent Access to Multiple Bioinformatics Information Sources, Proceedings of the 6th International Conference on Intelligent Systems for Molecular Biology, p.25-34, July 01, 1998
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
M. R. Genesereth. Knowledge interchange format. In Proceedings of the 2nd International Conference on Principles of Knowledge Representation and Reasoning, 1991.
|
| |
13
|
G. Giuliano, P. Gordon, Q. Pan, J. Park, and L. Wang. Estimating freight flows for metropolitan highway networks using secondary data sources. In Proceedings of Transportation Research Board Commodity Flow Survey Conference, Transportation Research Circular, E-C088, July 2005.
|
| |
14
|
P. Gordon and Q. Pan. Assembling and processing freight shipment data: developing a gis-based origin-destination matrix for southern california freight flows. Final report of METRANS research project 99--25, University of Southern California, 2001. www.metrans.org/research/final/99--25_Final.pdf
|
| |
15
|
|
| |
16
|
I. Horrocks, U. Sattler, S. Tessaris, and S. Tobies. How to decide query containment under constraints using a description logic. In Proceedings of the 7th International Conference on Logic for Programming and Automated Reasoning (LPAR'2000), 2000.
|
 |
17
|
|
| |
18
|
C. A. Knoblock. Building a planner for information gathering: A report from the trenches. In 3rd International Conference on Artificial Intelligence Planning Systems, Edinburgh, Scotland, 1996.
|
| |
19
|
R. MacGregor. A deductive pattern matcher. In Proceedings of the 7th National Conference on Artificial Intelligence, Saint Paul, MN, 1988.
|
| |
20
|
|
| |
21
|
S. McIlraith and T. C. Son. Adapting Golog for composition of semantic web services. In Proceedings of the Eighth International Conference on Knowledge Representation and Reasoning (KR2002), 2002.
|
| |
22
|
J. S. Penberthy and D. S. Weld. UCPOP: A sound, complete, partial order planner for ADL. In Third International Conference on Principles of Knowledge Representation and Reasoning, Cambridge, MA, 1992.
|
| |
23
|
PowerLoom. The Power Loom knowledge representation & reasoning system. www.isi.edu/isd/LOOM/PowerLoom, 2003.
|
| |
24
|
Wanda Pratt , Marti A. Hearst , Lawrence M. Fagan, A knowledge-based approach to organizing retrieved documents, Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence, p.80-85, July 18-22, 1999, Orlando, Florida, United States
|
| |
25
|
Y. Sheffi. Urban Transportation Networks; Equilibrium Analysis with Mathematical Programming Methods. Prentice Hall, New Jersey, 1985.
|
| |
26
|
E. Sirin, B. Parsia, D. Wu, J. Hendler, and D. Nau. HTN planning for web service composition using SHOP2. Web Semantics, 1(4):377--396, 2004.
|
| |
27
|
R. D. Stevens, A. J. Robinson, and C. A. Goble. myGrid: personalised bioinformatics on the information grid. Bioinformatics, 19(1):302--304, 2003.
|
|