|
ABSTRACT
Microarray data integration is an important yet challenging problem. Usually, direct integration of microarrays after normalization is ineffective because of the diverse types of experiment specific variations. To address this issue, two novel integration approaches were proposed in recent microarray studies. The first study[16] presented a cancer classification technique which identifies gene pairs whose expression orders are consistent within class and different across classes. The other study[18] presented a promising gene expression analysis technique which utilizes pairwise correlations of gene expressions across different microarray datasets. Interestingly, we observe that both of the independently developed techniques rely on inter-gene information and noise filtering strategy to achieve satisfactory performance in microarray integration. Motivated by this observation, we propose in this paper a formal data model for microarray integration using inter-gene information and effective filtering, which generalizes the previous two frameworks. We also show how the proposed model can handle a broader range of problems than the previous frameworks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Ashburner, C. Ball, J. Blake, D. Botstein, H. Butler, J. Cherry, A. Davis, K. Dolinski, S. Dwight, J. Eppig, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genetics, 25(1):25--9, 2000.
|
| |
2
|
A. Bernard and A. Hartemink. Informative structure priors: joint learning of dynamic regulatory networks from multiple types of data. PSB, 10:459--70, 2005.
|
| |
3
|
Karsten M. Borgwardt , Cheng Soon Ong , Stefan Schönauer , S. V. N. Vishwanathan , Alex J. Smola , Hans-Peter Kriegel, Protein function prediction via graph kernels, Bioinformatics, v.21 n.1, p.47-56, January 2005
[doi> 10.1093/bioinformatics/bti1007]
|
| |
4
|
J. DeRisi, L. Penland, P. Brown, M. Bittner, P. Meltzer, M. Ray, Y. Chen, Y. Su, and J. Trent. Use of a cDNA microarray to analyse gene expression patterns in human cancer. Nature Genetics, 14(4):457--60, 1996.
|
| |
5
|
M. Eisen, P. Spellman, P. Brown, and D. Botstein. Cluster analysis and display of genome-wide expression patterns. PNAS, 95(25):14863--8, 1998.
|
| |
6
|
D. Geman et al. Classifying gene expression profiles from pairwise mRNA comparisons. Stat. App. in Genetics and Mol. Biol., 3:19, 2004.
|
| |
7
|
L. Heyer, S. Kruglyak, and S. Yooseph. Exploring expression data: identification and analysis of coexpressed genes. Genome Res., 9(11):1106, 1999.
|
| |
8
|
R. Jansen, D. Greenbaum, and M. Gerstein. Relating whole-genome expression data with protein-protein interactions. Genome Res., 12(1):37, 2002.
|
| |
9
|
G. Lanckriet, M. Deng, N. Cristianini, M. Jordan, and W. Noble. Kernel-based data fusion and its application to protein function prediction in yeast. PSB, 9:300--11, 2004.
|
| |
10
|
|
| |
11
|
M. Rubin, R. Dunn, M. Strawderman, and K. Pienta. Tissue microarray sampling strategy for prostate cancer biomarker analysis. Am. J. Surg. Pathol., 26(3):312--19, 2002.
|
| |
12
|
J. Tenenbaum, V. de Silva, and J. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--23, 2000.
|
| |
13
|
R. Tibshirani, T. Hastie, B. Narasimhan, and G. Chu. Diagnosis of multiple cancer types by shrunken centroids of gene expression. PNAS, 99(10):6567--72, 2002.
|
| |
14
|
S. Tornow. Functional modules by relating protein interaction networks and gene expression. Nucleic Acids Res., 31(21):6283--9, 2003.
|
| |
15
|
G. Tseng and W. Wong. Tight clustering: a resampling-based approach for identifying stable and tight patterns in data. Biometrics, 61(1):10--6, 2005.
|
| |
16
|
|
| |
17
|
Z. Zhang, R. Bast Jr, Y. Yu, J. Li, L. Sokoll, A. Rai, J. Rosenzweig, B. Cameron, Y. Wang, X. Meng, et al. Three biomarkers identified from serum proteomic analysis for the detection of early stage ovarian cancer. Cancer Res., 64:5882--90, 2004.
|
| |
18
|
X. Zhou, M. Kao, H. Huang, A. Wong, J. Nunez-Iglesias, M. Primig, O. Aparicio, C. Finch, T. Morgan, and W. Wong. Functional annotation and network reconstruction through cross-platform integration of microarray data. Nature Biotechnology, 23(2):238--43, 2005.
|
| |
19
|
X. Zhou, M. Kao, and W. Wong. Transitive functional annotation by shortest-path analysis of gene expression data. PNAS, 99(20):12783--8, 2002.
|
| |
20
|
W. Zhu, X. Wang, Y. Ma, M. Rao, J. Glimm, and J. Kovach. Detection of cancer-specific markers amid massive mass spectral data. PNAS, 100(25):14666--71, 2003.
|
|