FlagelLink: a decision support system for distributed flagellar data using data warehouse
Source
Symposium on Applied Computing
Proceedings of the 2008 ACM symposium on Applied computing
Fortaleza, Ceará, Brazil
SESSION: Bioinformatics
Pages: 1268-1272
Year of Publication: 2008
ISBN: 978-1-59593-753-7
Authors
Fabiana F. Araújo, Universidade Federal do Ceará, Fortaleza, CE, Brazil
Ângela M. A. Pinheiro, Universidade Estadual do Ceará, Fortaleza, CE, Brazil
Kaio M. Farias, Universidade Estadual do Ceará, Fortaleza, CE, Brazil
Bernadette F. Lóscio, Universidade Estadual do Ceará, Fortaleza, CE, Brazil
Diana M. Oliveira, Universidade Estadual do Ceará, Fortaleza, CE, Brazil
ABSTRACT
Combining different types of data from multiple databases (DBs) is a key task in bioinformatics, because each DB resource usually contains a different subset of biological knowledge and answers only questions within its own domain; it neither helps with nor considers questions that span domain boundaries. As bioinformatics DBs grow in size and biological questions grow in scope, better solutions will inevitably involve preserving the autonomy and diversity of DBs while developing new systems that offer integrated, transparent access to existing distributed data sources (DSs). In this paper, we present a decision support system (DSS), called FlagelLink, that provides access to distributed information about one particular domain: the flagellum, a cellular organelle responsible for motility. It employs bioinformatics tools (such as BLAST, MUSCLE, and HMMER) in a dedicated data warehouse (DW), driven by terminology and ontology resources (semantic-driven), to maintain a working DSS for a specific knowledge domain. FlagelLink (available at http://flagellink.nugen.uece.br/flagellink) takes a unified, on-demand integration approach that merges the identified ontological knowledge (a defined number of test cases and scenarios of genes and proteins involved in flagellar activities) with traditional and ontology-based information integration techniques.