|
ABSTRACT
The cell defense mechanism of RNA interference has applications in gene function analysis and promising potentials in human disease therapy. To effectively silence a target gene, it is desirable to select appropriate initiator siRNA molecules having satisfactory silencing capabilities. Computational prediction for silencing efficacy of siRNAs can assist this screening process before using them in biological experiments. String kernel functions, which operate directly on the string objects representing siRNAs and target mRNAs, have been applied to support vector regression for the prediction and improved accuracy over numerical kernels in multidimensional vector spaces constructed from descriptors of siRNA design rules. To fully utilize information provided by string and numerical data, we propose to unify the two in a kernel feature space by devising a multiple kernel regression framework where a linear combination of the kernels is used. We formulate the multiple kernel learning into a quadratically constrained quadratic programming (QCQP) problem, which although yields global optimal solution, is computationally demanding and requires a commercial solver package. We further propose three heuristics based on the principle of kernel-target alignment and predictive accuracy. Empirical results demonstrate that multiple kernel regression can improve accuracy, decrease model complexity by reducing the number of support vectors, and speed up computational performance dramatically. In addition, multiple kernel regression evaluates the importance of constituent kernels, which for the siRNA efficacy prediction problem, compares the relative significance of the design rules. Finally, we give insights into the multiple kernel regression mechanism and point out possible extensions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G.J. Hannon, "RNA Interference," Nature, vol. 418, pp. 244-251, July 2002.
|
| |
2
|
E. Check, "Hopes Rise for RNA Therapy as Mouse Study Hits Target," Nature, vol. 432, p. 136, 2004.
|
| |
3
|
T.R. Brummelkamp, R. Bernards, and R. Agami, "A System for Stable Expression of Short Interfering RNAs in Mammalian Cells," Science, vol. 296, pp. 550-553, 2002.
|
| |
4
|
Y. Pei and T. Tuschl, "On the Art of Identifying Effective and Specific siRNA," Nature Methods, vol. 3, no. 9, pp. 670-676, 2006.
|
| |
5
|
M. Amarzguioui and H. Prydz, "An Algorithm for Selection of Functional siRNA Sequences," Biochemical and Biophysical Research Comm, vol. 316, pp. 1050-1058, 2004.
|
| |
6
|
A. Reynolds, D. Leake, Q. Boese, S. Scaringe, W.S. Marshall, and A. Khovorova, "Rational siRNA Design for RNA Interference," Nature Biotechnology, vol. 22, pp. 326-330, 2004.
|
| |
7
|
K. Ui-Tei, Y. Naito, F. Takahashi, T. Haraguchi, H. Ohki-Hamazaki, A. Juni, R. Ueda, and K. Saigo, "Guidelines for the Selection of Highly Effective siRNA Sequences for Mammalian and Chick RNA Interference," Nucleic Acids Research, vol. 32, pp. 936-948, 2004.
|
| |
8
|
B. Jagla, N. Aulner, P. Kelly, D. Song, A. Volchuk, A. Zatorski, D. Shum, T. Mayer, D.D. Angelis, O. Ouerfelli, U. Rutishauser, and J. Rothman, "Sequence Characteristics of Functional siRNAs," RNA, vol. 11, pp. 864-872, June 2005.
|
| |
9
|
D. Huesken, J. Lange, C. Mickanin, J. Weiler, F. Asselbergs, J. Warner, B. Meloon, S. Engel, A. Rosenberg, D. Cohen, M. Labow, M. Reinhardt, F. Natt, and J. Hall, "Design of a Genome-Wide siRNA Library Using an Artificial Neural Network," Nature Biotechnology, vol. 23, no. 8, pp. 995-1001, 2005.
|
| |
10
|
G. Ge, G. Wong, and B. Luo, "Prediction of siRNA Knockdown Efficacy Using Artificial Neural Network Models," Biochemical and Biophysical Research Comm., vol. 336, pp. 723-728, 2005.
|
| |
11
|
P. Sætrom and O. Snøve, Jr., "A Comparison of siRNA Efficacy Predictors," Biochemical and Biophysical Research Comm., vol. 321, pp. 247-253, 2004.
|
| |
12
|
S. Qiu, T. Lane, and L. Buturovic, "A Randomized String Kernel and Its Applications to RNA Interference," Proc. 22nd AAAI Conf. Artificial Intelligence, pp. 627-632, July 2007.
|
| |
13
|
R. Teramoto, M. Aoki, T. Kimura, and M. Kanaoka, "Prediction of siRNA Functionality Using Generalized String Kernel and Support Vector Machine," FEBS Letters, vol. 579, pp. 2878-2882, 2005.
|
| |
14
|
P. Jia, T. Shi, Y. Cai, and Y. Li, "Demonstration of Two Novel Methods for Predicting Functional siRNA Efficiency," BMC Bioinformatics, vol. 7, p. 271, 2006.
|
| |
15
|
J.-P. Vert, N. Foveau, C. Lajaunie, and Y. Vandenbrouck, "An Accurate and Interpretable Model for siRNA Efficacy Prediction," MBC Bioinformatics, vol. 7, p. 520, 2006.
|
| |
16
|
V.N. Vapnik, Statistical Learning Theory. John Wiley and Sons, 1998.
|
| |
17
|
S. Qiu and T. Lane, "The RNA String Kernel for siRNA Efficacy Prediction," Proc. Seventh IEEE Int'l Conf. Bioinformatics and Bioeng. (BIBE2007), pp. 307-314, Oct. 2007.
|
| |
18
|
S. Qiu, C. Adema, and T. Lane, "A Computational Study of Off-Target Effects of RNA Interference," Nucleic Acids Research, vol. 33, pp. 1834-1847, 2005.
|
| |
19
|
|
| |
20
|
|
| |
21
|
N. Cristianini, J. Shawe-Taylor, A. Elissee, and J. Kandola, "On Kernel-Target Alignment," Advances in Neural Information Processing Systems, T. Dietterich, S. Becker, and Z. Ghahramani, eds., vol. 14, MIT Press, 2002.
|
| |
22
|
A. Smola and B. Schölkopf, "A Tutorial on Support Vector Regression," Technical Report NC2-TR-1998-030, NeuroCOLT2, 1998.
|
| |
23
|
|
| |
24
|
S. Qiu and T. Lane, "Multiple Kernel Learning for Support Vector Regression," Technical Report TR-CS-2005-42, Computer Science Dept., The Univ. of New Mexico, 2005.
|
 |
25
|
|
| |
26
|
S.M. Elbashir, J. Martinez, A. Patkaniowska, W. Lendeckel, and T. Tuschl, "Functional Anatomy of siRNA for Mediating Efficient RNAi in Drosophila Melanogaster Embryo Lysate," The EMBO J., vol. 20, no. 23, pp. 6877-6888, 2001.
|
| |
27
|
S. Saxena, Z.O. Jonsson, and A. Dutta, "Small RNAs with Imperfect Match to Endogenous mRNA Repress Translation," J. Biological Chemistry, vol. 278, no. 45, pp. 44312-44319, 2003.
|
| |
28
|
A. Jackson, S. Bartz, J. Schelter, S. Kobayashi, J. Burchard, M. Mao, B. Li, G. Cavet, and P.S. Linsley, "Expression Profiling Reveals Off-Target Gene Regulation by RNAi," Nature Biotechnology, vol. 21, no. 6, pp. 635-637, 2003.
|
| |
29
|
M. Amarzguioui and H. Prydz, "An Algorithm for Selection of Functional siRNA Sequences," Biochemical and Biophysical Research Comm., vol. 316, pp. 1050-1058, 2004.
|
| |
30
|
A. Reynolds, D. Leake, Q. Boese, S. Scaringe, W.S. Marshall, and A. Khovorova, "Rational siRNA Design for RNA Interference," Nature Biotechnology, vol. 22, pp. 326-330, 2004.
|
| |
31
|
C.-C. Chang and C.-J. Lin, LIBSVM: A Library for Support Vector Machines, http://www.csie.ntu.edu.tw/~cjlin/libsvm, 2001.
|
| |
32
|
|
| |
33
|
S. Qiu and T. Lane, "Parallel Computation of RBF Kernels for Support Vector Classifiers," Proc. Fifth SIAM Int'l Conf. Data Mining (SDM05), pp. 334-345, Apr. 2005.
|
INDEX TERMS
Primary Classification:
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.6
Learning
Additional Classification:
G.
Mathematics of Computing
G.1
NUMERICAL ANALYSIS
G.1.6
Optimization
Subjects:
Quadratic programming methods
G.2
DISCRETE MATHEMATICS
G.2.1
Combinatorics
Subjects:
Combinatorial algorithms
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.8
Problem Solving, Control Methods, and Search
Subjects:
Heuristic methods
I.6
SIMULATION AND MODELING
I.6.4
Model Validation and Analysis
J.
Computer Applications
J.3
LIFE AND MEDICAL SCIENCES
Subjects:
Biology and genetics
General Terms:
Algorithms,
Design,
Performance,
Theory
Keywords:
Multiple kernel learning,
multiple kernel heuristics,
support vector regression,
QCQP optimization,
RNA interference,
siRNA efficacy.
|