ACM Home Page
Please provide us with feedback. Feedback
A brief look at some machine learning problems in genomics
Full text PdfPdf (744 KB)
Source Annual Workshop on Computational Learning Theory archive
Proceedings of the tenth annual conference on Computational learning theory table of contents
Nashville, Tennessee, United States
Pages: 109 - 113  
Year of Publication: 1997
ISBN:0-89791-891-6
Author
David Haussler  Computer Science Department, University of California, Santa Cruz, CA
Sponsors
AT&T Labs :
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
Vanderbilt University : Vanderbilt University
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 32,   Citation Count: 0
Additional Information:

references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/267460.267486
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
AGM+90
S.F. Altshul, W. Gish, W. Miller, Myers E. W., and Lipman D. J. Basic local alignment search tool. J. Mol. Biol., 215, 1990.
 
AHO93
K. Asai, S. Hayamizu, and K. Onizuka. HMM with protein structure grammar. In Proceedings r~'the Hawaii International Conference on System Sciences, pages 783-791, Los Alamitos, CA, 1993. IEEE Computer Society Press.
 
B.96
Hazes B. The (qxw)3 domain: A flexible lectin scaffold. Protein Science, 5(8): 1490-1501, August 1996.
 
Bat96
A. Bateman. Fibronectin type iii domains ill yeast detected by a hidden markov model. Current Biology, 6( 12): 1544-1547, December 1996.
 
Bat97
A. Bateman. The structure of a domain common to archaebacteria and the homocystinuria disease protein. Trends in Biochemical Sciences, 22( 1 ): 12-13, January i 997.
 
BB94
Philipp Bucher and Amos Bairoch. A generalized prolile syntax for biomolecular sequence motifs and its function in automatic sequence interpretation. In ISMB-94, pages 53- 6 I, Menlo Park, CA, 1994. AAAI/MIT Press.
 
BCHM94
R Baldi, Y. Chauvin, T Hunkapillar, and M. McClure. Hidden Markov models of biological primary sequence information. PNAS, 91:1059-1063, 1994.
 
BE95
Timothy L. Bailey and Charles Elkan. The value of prior knowledge in discovering motifs with MEME. In ISMB-95, pages 21-29, Menlo Park, CA, July 1995. AAAI/MIT Press.
 
BEC96
A. Bateman, S. Eddy, and C. Chothia. Members of the immunoglobulin superfamily in bacteria. Protein Science, 5:1939-194 I, 1996.
 
BG96
M. Burset and R. Guigo. Evaluation of gene structure prediction programs. Genomics, 34(3):353-367, 1996. Data set and evaluation results can be found at http://www.imim.es/GeneldentificationfEvaluation/Index.html.
 
BHK+93
 
BK97
C. Burge and S. Karlin. Prediction of complete gene structure in genomic DNA. JMB, 1997. In press.
 
BLE91
J.U. Bowie, R. L0thy, and D. Eisenherg. A method to identify protein sequences that fold into a known three-dimensional structure. Science, 253:164-170, 1991.
 
BM93
M. Borodovsky and J. Mclninch. Genmark: Parallel gene recognition for both DNA strands. Computers and Chemistr3,, 17(2):123-133, 1993.
 
BS90
G.J. Barton and M. J. Sternherg. Flexible protein sequence patterns: A sensitive method to detect weak structural similarities. JMB, 212(2):389-402, 1990.
 
Cho92
C. Chothia. One thousand families for the molecular biologist. Nature, 357:543-544, 1992.
 
CS92
L.R. Cardon and G. D. Stormo. Expectation maximization algorithm for identifying proteinbinding sites with variable lengths from unaligned DNA fragments. JMB. 223:159-170, 1992.
 
DMHM97
J. Dalgaard, M Moser, R. Hughey, and S. Mian. Statistical modeling, phylogenetic analysis and structure prediction of a protein splicing domain common to inteins and hedgehog proteins. manuscript, 1997.
 
DS94
S. Dong and D. B. Searls. Gene structure prediction by linguistic methods. Genomics, 162:705-708, 1994.
 
ED94
S.R. Eddy and R. Durbin. RNA sequence analysis using covariance models. NAR, 22:2079- 2088, 1994.
 
Edd95
Sean Eddy. Multiple alignment using hidden Markov models. In Christopher Railings et al., editors, ISMB-95, pages 114-120, Menlo Park, CA, July 1995. AAAI/M1T Press.
 
Edd96
S.R. Eddy. Hidden markov models. Curr. Opin. Struct. Biol., 1996.
 
EMD95
S.R. Eddy, G. Mitchison, and R. Durbin. Maximum discrimination hidden Markov models of sequence consensus. J. Comput. Biol., 2:9-23, 1995.
 
Fic96a
J.W. Fickett. Finding genes by computer - the state of the art. Trends in Genetics, 12(8):316- 320, 1996.
 
Fic96b
J.W. Fickett. The gene identification problem - an overview for developers. Computers and Chemistry, 20(1): 103-118, 1996.
 
Gel95
M.S. Gelfand. Prediction of function in dna sequence analysis. Jour. Comp. Biol., 2(1):87- 115, 1995.
 
GLE90
M. Gribskov, R. L/ithy, and D. Eisenberg. Profile analysis. Methods in Enzymology, 183:146- 159, 1990.
 
GMP96
M.S. Gelfand, A. A. Mironov, and P. A. Pevzner. Gene recognition via spliced sequence alignment. PNAS, 93(17):9061-9066, 1996.
 
HH91
Steven Henikoff and Jorja G. Henikoff. Automated assembly of protein blocks for database searching. NAR, 19(23):6565-6572, 1991.
 
HH96
Jorja G. Henikoff and Steven Henikoff. Using substitution probabilities to improve positionspecific scoring matrices. CABIOS, 12(2): 135- 143, 1996.
 
HK95
 
HWB90
Steven Henikoff, James C. Wallace, and Joseph P. Brown. Finding protein similarities with nucleofide sequence databases. Methods Enzymol., 183:111-132, 1990.
 
KBM+94
A. Krogh, M. Brown, I. S. Mian, K. Sj61ander, and D. Haussler. Hidden Markov models in computational biology: Applications to protein modeling. JMB, 235:150 !-1531, February 1994.
 
KHRE96
 
KMH94
A. Krogh, I. S. Mian, and D. Haussler. A Hidden Markov Model that finds genes in E. coli DNA. NAR, 22:4768-4778, 1994.
 
LAB +93
C.E. Lawrence, S. Altschul, M. Boguski, J. Liu, A. Neuwald, and J. Wootton. Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment. Science, 262:208-214, 1993.
 
LME91
R. Liithy, A. D. McLachlan, and D. Eisenberg. Secondary structure-based profiles: Use of structure-conserving scoring table in searching protein sequence databases for structural similarities. Proteins: Structure, Function, and Genetics, 10:229-239, 1991.
 
LR90
C.E. Lawrence and A. A. Reilly. An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences. Proteins, 7:41-51, 1990.
 
NJD96
Goldman N., Thorne J.L., and Jones D.T Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses. JMB, 263(2):196-208, October 25 1996.
 
Pea90
W.R. Pearson. Rapid and sensitive sequence comparison with FASTP and FASTA. Methods Enzymol., 183:63-98, 1990.
 
Pen96
Elizabeth Pennisi. Teams tackle protein prediction. Science, 273:426-428, 26 July 1996.
 
Rab89
Lawrence R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. of the IEEE, 77(2):257-286, February 1989.
 
SAL94
V. Soiovyev, Salamov A., and C. Lawrence. Predicting internal exons by oligonucleotide composition and discriminant analysis of splicable open reading frames. Nucl. Acids Res., 22:5156-5163, 1994.
 
SBH+94
Yasubumi Sakakibara, Michael Brown, Richard Hughey, I. Saira Mian, Kimmen SjiSlander, Rebecca C. Underwood, and David Haussler. Stochastic context-free grammars for tRNA modeling. NAR, 22:5112-5120, 1994.
 
SKB+96
K. Sj61ander, K. Karplus, M. P. Brown, R. Hughey, A. Krogh, I. S. Mian, and D. Haussler. Dirichlet mixtures: A method for improving detection of weak but significant protein sequence homology. CABIOS, 12(4):327-345, 1996.
 
SS95
E. Snyder and G. Stormo. Indentification of protein coding regions in genomic dna. JMB, 248, 1995.
 
SWS93
C.M. Stultz, J. V. White, and T. F. Smith. Structural analysis based on state-space modeling. Protein Science, 2:305-315, 1993.
 
TAK94
Roman L. Tatusov, Stephen E Altschul, and Eugen V. Koonin. Detection of conserved segments in proteins: Iterative scanning of sequence databases with alignment blocks. PNAS, 91:12091-12095, December 1994.
 
Tau96
Gary Taubes. Software matchmakers help make sense of sequences. Science, 273:588-590, 2 August 1996.
 
THG94
Julie D. Thompson, Desmond G. Higgins, and Toby J. Gibson. Improved sensitivity of profile searches through the use of sequence weights and gap excision. CABIOS, 10(1 ): 19-29, 1994.
 
WMS+96
Jason T L. Wang, Thomas G. Mart, Dennis Shasha, Bruce Shapiro, Gung-Wei Chirn, and T. Y. Lee. Complementary classification approaches for protein sequences. Protein Engr., to appear 1996.
 
WP86
M.S. Waterman and M. D. Periwitz. Line geometries for sequence comparisons. Bull. Math. Biol., 46:567-577, 1986.
 
XESU94
Y. Xu, J. R. Einstein, M. Shah, and E. C. Uberbacher. An improved system for exon recognition and gene modeling in human dna sequences. In ISMB-94, Menlo Park, CA, 1994. AAAI/MIT Press.