| Simplifying data access: the energy data collection (EDC) project |
| Full text |
Pdf
(190 KB)
|
| Source
|
dg.o; Vol. 128
archive
Proceedings of the 2000 annual national conference on Digital government research
table of contents
Pages: 1 - 11
Year of Publication: 2000
|
|
Authors
|
|
José Luis Ambite
|
University of Southern California, Marina del Rey, CA
|
|
Yigal Arens
|
University of Southern California, Marina del Rey, CA
|
|
Luis Gravano
|
Columbia University, New York, NY
|
|
Vasileios Hatzivassiloglou
|
Columbia University, New York, NY
|
|
Eduard Hovy
|
University of Southern California, Marina del Rey, CA
|
|
Judith Klavans
|
Columbia University, New York, NY
|
|
Andrew Philpot
|
University of Southern California, Marina del Rey, CA
|
|
Usha Ramachandran
|
University of Southern California, Marina del Rey, CA
|
|
Jay Sandhaus
|
Columbia University, New York, NY
|
|
Anurag Singla
|
Columbia University, New York, NY
|
|
Brian Whitman
|
Columbia University, New York, NY
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 1, Downloads (12 Months): 15, Citation Count: 0
|
|
|
ABSTRACT
The massive amount of statistical and text data available from government agencies has created a set of daunting challenges to both research and analysis communities. These problems include heterogeneity, size, distribution, and control of terminology. At the Digital Government Research Center we are investigating solutions to these key problems. In this paper we focus on (1) ontological mappings for terminology standardization, (2) data integration across data bases with high speed query processing, and (3) interfaces for query input and presentation of results. This collaboration between researchers from Columbia University and the Information Sciences Institute of the University of Southern California employs technology developed at both locations, in particular the SENSUS ontology, the SIMS multi-database access planner, the LKB automated dictionary and terminology analysis system, and others. The pilot application targets gasoline data from the Bureau of Labor Statistics, the Energy Information Administration of the Department of Energy, the Census Bureau, and other government agencies.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Alicia Ageno , Francese Ribas , German Rigau , Horacio Rodríguez , Anna Samiotou, TGE: Tlinks Generation Environment, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan
[doi> 10.3115/991886.991942]
|
| |
2
|
Arens, Y., C. A., Knoblock and C.-N. Hsu. 1996. Query Processing in the SIMS Information Mediator. In A. Tate (ed), Advanced Planning Technology. Menlo Park: AAAI Press.
|
| |
3
|
|
| |
4
|
Bateman, J. A., Kasper, R. T., Moore, J. D., and Whitney, R. A. 1989. A General Organization of Knowledge for Natural Language Processing: The Penman Upper Model. Unpublished research report, USC/Information Sciences Institute, Marina del Rey, CA.
|
| |
5
|
Fellbaum, C. 1998. (ed.) WordNet: An On-Line Lexical Database and Some of its Applications. Cambridge: MIT Press.
|
 |
6
|
Venky Harinarayan , Anand Rajaraman , Jeffrey D. Ullman, Implementing data cubes efficiently, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.205-216, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
7
|
Hovy, E. H. 1998. Combining and Standardizing Large-Scale, Practical Ontologies for Machine Translation and Other Uses. Proceedings of the 1st International Conference on Language Resources and Evaluation (LREC). Granada, Spain.
|
| |
8
|
Hovy, E. H., A. Philpot, J.-L. Ambite, and U. Ramachandran. 2000. Automating the Placement of Database Concepts into a Large Ontology. In preparation.
|
| |
9
|
|
| |
10
|
Klavans, J. L., C. Jacquemin and E. Tzoukermann. 1997. "A Natural language approach to multi-word term conflation". Proceedings of the DELOS conference from the European Research Consortium on Information Management (ERCIM). Zurich, Switzerland.
|
| |
11
|
Klavans, J. L. and Muresan S. 2000 (in press). "DEFINDER: Rule-Based Methods for the Extraction of Medical Terminology and their Assocciated Definitions from On-line Text". Proceedings of 2000 American Medical Informatics Association (AMIA) Annual Symposium, Los Angeles, California.
|
| |
12
|
|
| |
13
|
MacGregor, R. 1990. The Evolving Technology of Classification-Based Knowledge Representation Systems. In John Sowa (ed.), Principles of Semantic Networks: Explorations in the Representation of Knowledge. Morgan Kaufmann.
|
| |
14
|
Muslea, I. and S. Minton and C. A. Knoblock. 1998. Wrapper Induction for Semistructured Web-based Information Sources. Proceedings of the Conference on Automated Learning and Discovery. Pittsburgh, PA.
|
| |
15
|
Okumura, A. and E. H. Hovy. 1994. Ontology Concept Association using a Bilingual Dictionary. Proceedings of the 1st AMTA Conference. Columbia, MD.
|
| |
16
|
Rigau, G. and E. Agirre. 1995. Disambiguating Bilingual Nominal Entries against WordNet. Proceedings of the 7th ESSLI Symposium. Barcelona, Spain.
|
| |
17
|
Swartout, W. R., R. Patil, K. Knight, and T. Russ. 1996. Toward Distributed Use of Large-Scale Ontologies. Proceedings of the 10th Knowledge Acquisition for Knowledge-Based Systems Workshop. Banff, Canada.
|
|