Simplifying data access: the energy data collection (EDC) project
Source: dg.o; Vol. 128
Proceedings of the 2000 annual national conference on Digital government research
Pages: 1 - 11
Year of Publication: 2000
Authors
José Luis Ambite  University of Southern California, Marina del Rey, CA
Yigal Arens  University of Southern California, Marina del Rey, CA
Luis Gravano  Columbia University, New York, NY
Vasileios Hatzivassiloglou  Columbia University, New York, NY
Eduard Hovy  University of Southern California, Marina del Rey, CA
Judith Klavans  Columbia University, New York, NY
Andrew Philpot  University of Southern California, Marina del Rey, CA
Usha Ramachandran  University of Southern California, Marina del Rey, CA
Jay Sandhaus  Columbia University, New York, NY
Anurag Singla  Columbia University, New York, NY
Brian Whitman  Columbia University, New York, NY

ABSTRACT

The massive amount of statistical and text data available from government agencies has created a set of daunting challenges for both the research and analysis communities. These challenges include heterogeneity, size, distribution, and control of terminology. At the Digital Government Research Center we are investigating solutions to these key problems. In this paper we focus on (1) ontological mappings for terminology standardization, (2) data integration across databases with high-speed query processing, and (3) interfaces for query input and presentation of results. This collaboration between researchers from Columbia University and the Information Sciences Institute of the University of Southern California employs technology developed at both locations, in particular the SENSUS ontology, the SIMS multi-database access planner, the LKB automated dictionary and terminology analysis system, and others. The pilot application targets gasoline data from the Bureau of Labor Statistics, the Energy Information Administration of the Department of Energy, the Census Bureau, and other government agencies.

