ACM Home Page
Please provide us with feedback. Feedback
An architecture for biological information extraction and representation
Full text PdfPdf (356 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2004 ACM symposium on Applied computing table of contents
Nicosia, Cyprus
SESSION: Bioinformatics (BIO) table of contents
Pages: 103 - 110  
Year of Publication: 2004
ISBN:1-58113-812-1
Authors
Aditya Vailaya  Agilent Laboratories, Palo Alto, CA
Peter Bluvas  Agilent Laboratories, Palo Alto, CA
Robert Kincaid  Agilent Laboratories, Palo Alto, CA
Allan Kuchinsky  Agilent Laboratories, Palo Alto, CA
Michael Creech  Agilent Laboratories, Palo Alto, CA
Annette Adler  Agilent Laboratories, Palo Alto, CA
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 52,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/967900.967924
What is a DOI?

ABSTRACT

Technological advances in biomedical research are generating a plethora of heterogeneous data at a high rate. There is a critical need for extraction, integration and management tools for information discovery and synthesis from these heterogeneous data. In this paper, we present a general architecture, called ALFA, for information extraction and representation from diverse biological data. The ALFA architecture consists of: (i) a networked, hierarchical object model for representing information from heterogeneous data sources in a standardized, structured format; and (ii) a suite of integrated, interactive software tools for information extraction and representation from diverse biological data sources. As part of our research efforts to explore this space, we have currently prototyped the ALFA object model and a set of interactive software tools for searching, filtering, and extracting information from scientific text. In particular, we describe BioFerret, a meta-search tool for searching and filtering relevant information from the web, and ALFA Text Viewer, an interactive tool for user-guided extraction, disambiguation, and representation of information from scientific text. We further demonstrate the potential of our tools in integrating the extracted information with experimental data and diagrammatic biological models via the common underlying ALFA representation.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Clare, A., and King, R.D. How Well Do We Understand the Clusters Found in Microarray Data? InSilico Biology, 2 (4):p511--522, 2002.
 
2
 
3
Friedman, C., Kra, P., Yu, H., Krauthammer, M., Rzhetsky, A. GENIES: A Natural-Language Processing System for the Extraction of Molecular Pathways from Journal Articles. Bioinformatics: 1(1), 1--9, 2001.
 
4
Fukuda, K., Tsunoda, T., Tamura, A., Takagi, A. Toward Information Extraction: Identifying Protein Names from Biological Papers. PSB1998, p705--716, 1998.
 
5
Gene Ontology#8482; (GO) Consortium, http://www.geneontology.org/.
 
6
Humphreys, K., Demetriou, G., Gaizauskas, R. Two Applications of Information Extraction to Biological Science Journal Articles: Enzyme Interactions and Protein Structures. PSB2000, Hawaii, 2000.
 
7
Iliopoulos, I., Enright, A.J., Ouzounis, C. A. TEXTQUEST: Document Clustering of Medline Abstracts for Concept Discovery in Molecular Biology. PSB2001, Hawaii, 2001.
 
8
 
9
Kincaid, R., Kleusing, D., Vailaya, A. BNS: An LDAP-based Biomolecule Naming Service. OiBC2002, Washington DC, http://openbns.sourceforge.net/.
 
10
Krauthammer, M., Rzhetsky, A., Morozov, P., Friedman, C. Using BLAST for Identifying Gene and Protein Names in Journal Articles. Gene 259, p245--252, 2000.
 
11
Ng, S.-K., and Wong, M. Toward Routine Automatic Pathway Discovery from On-line Scientific Text Abstracts. Genome Informatics, 10:104--112, 1999.
 
12
O'Day, V.L., Adler, A., Kuchinsky, A., Bouch, A. When Worlds Collide: Molecular Biology as Interdisciplinary Collaboration. ECSCW 2001, p399--418, 2001.
 
13
Palakal, M., Mukhopadhyay, S., Mostafa, J., Raje, R. N'Cho, M., Mishra, S. An Intelligent Biological Information Management System. Bioinformatics:18(10), p1283--8, Oct 2002.
 
14
Palakal, M., Stephens, M., Mukhpodhyay, S., Raje, R., and Rhodes, S. Tagging and Disambiguation of Multiple Object Classes from Biological Text. IEEE CSB 2002, Stanford, CA, 2002.
 
15
Park, J.C., Kim, H.S., Kim, J.J. Bidirectional Incremental Parsing for Automatic Pathway Identification with Combinatory Categorical Grammar. PSB2001, Hawaii, 2001.
 
16
Porter, M. F. An Algorithm for Suffix Stripping. Program, 14(3):130--137, 1980.
 
17
Rindflesch, T.C., Tanabe, L., Weinstein, J.N., Hunter, L. EDGAR: Extraction of and Drugs, Genes, and Relations from the Biomedical Literature. PSB2000, Hawaii, 2000.
 
18
Sekimizu, T., Park, H.S., Tsujii, J. Identifying the Interaction between Genes and Gene Products based on Frequently seen Verbs in Medline Abstracts. Genome Informatics, 9:62--71, 1998.
 
19
Stephens, M., Palakal, M., Mukhopadhyay, S., Raje, R., Mostafa, J. Detecting Gene Relations from Medline Abstracts. PSB2001, Hawaii, 2001.
 
20
Wahl, M., Howes, T., Kille, S. Lightweight Directory Access Protocol (v3). ITF RFC2551, December 1997.
 
21
Wong, L. A Protein Interaction Extraction System. PSB2001, Hawaii, 2001.
 
22
Yakushiji, A., Tateisi, Y., Miyao, Y., Tsujii, J. Event Extraction from Biomedical Papers using a Full Parser. PSB2001, Hawaii, 2001.

Collaborative Colleagues:
Aditya Vailaya: colleagues
Peter Bluvas: colleagues
Robert Kincaid: colleagues
Allan Kuchinsky: colleagues
Michael Creech: colleagues
Annette Adler: colleagues