| Latent space domain transfer between high dimensional overlapping distributions |
| Full text |
Pdf
(1.53 MB)
|
Source
|
International World Wide Web Conference
archive
Proceedings of the 18th international conference on World wide web
table of contents
Madrid, Spain
SESSION: Data mining/session: statistical methods
table of contents
Pages 91-100
Year of Publication: 2009
ISBN:978-1-60558-487-4
|
|
Authors
|
|
Sihong Xie
|
Sun Yat-Sen University, Guangzhou, China
|
|
Wei Fan
|
IBM T.J. Watson Research Center, New York, USA
|
|
Jing Peng
|
Montclair State University, Montclair, USA
|
|
Olivier Verscheure
|
IBM T.J. Watson Research Center, New York, USA
|
|
Jiangtao Ren
|
Sun Yat-Sen University, Guangzhou, China
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 22, Downloads (12 Months): 111, Citation Count: 0
|
|
|
ABSTRACT
Transferring knowledge from one domain to another is challenging due to a number of reasons. Since both conditional and marginal distribution of the training data and test data are non-identical, model trained in one domain, when directly applied to a different domain, is usually low in accuracy. For many applications with large feature sets, such as text document, sequence data, medical data, image data of different resolutions, etc. two domains usually do not contain exactly the same features, thus introducing large numbers of "missing values" when considered over the union of features from both domains. In other words, its marginal distributions are at most overlapping. In the same time, these problems are usually high dimensional, such as, several thousands of features. Thus, the combination of high dimensionality and missing values make the relationship in conditional probabilities between two domains hard to measure and model. To address these challenges, we propose a framework that first brings the marginal distributions of two domains closer by "filling up" those missing values of disjoint features. Afterwards, it looks for those comparable sub-structures in the "latent-space" as mapped from the expanded feature vector, where both marginal and conditional distribution are similar. With these sub-structures in latent space, the proposed approach then find common concepts that are transferable across domains with high probability. During prediction, unlabeled instances are treated as "queries", the mostly related labeled instances from out-domain are retrieved, and the classification is made by weighted voting using retrieved out-domain examples. We formally show that importing feature values across domains and latent semantic index can jointly make the distributions of two related domains easier to measure than in original feature space, the nearest neighbor method employed to retrieve related out domain examples is bounded in error when predicting in-domain examples. Software and datasets are available for download.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
Chih-Chung Chang and Chih-Jen Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.
|
| |
4
|
T. Cover and P. Hart. Nearest neighbor pattern classification. Information Theory, IEEE Transactions on, 13(1):21--27, 1967.
|
 |
5
|
Wenyuan Dai , Qiang Yang , Gui-Rong Xue , Yong Yu, Boosting for transfer learning, Proceedings of the 24th international conference on Machine learning, p.193-200, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273521]
|
 |
6
|
|
| |
7
|
|
| |
8
|
Wei Fan and Ian Davidson. On sample selection bias and its efficient correction via model averaging and unlabeled examples. In SDM, 2007.
|
 |
9
|
Jing Gao , Wei Fan , Jing Jiang , Jiawei Han, Knowledge transfer via multiple model local structure mapping, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
[doi> 10.1145/1401890.1401928]
|
 |
10
|
Xiao Ling , Wenyuan Dai , Gui-Rong Xue , Qiang Yang , Yong Yu, Spectral domain-transfer learning, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
[doi> 10.1145/1401890.1401951]
|
 |
11
|
Xiao Ling , Gui-Rong Xue , Wenyuan Dai , Yun Jiang , Qiang Yang , Yong Yu, Can chinese web pages be classified with english data source?, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
[doi> 10.1145/1367497.1367628]
|
| |
12
|
Jiangtao Ren, Xiaoxiao Shi, Wei Fan, and Philip S. Yu. Type-independent correction of sample selection bias via structural discovery and re-balancing. In SDM, pages 565--576, 2008.
|
| |
13
|
|
| |
14
|
A. J. Smola and B. Schoelkopf. A tutorial on support tor regression, 1998
|
| |
15
|
|
| |
16
|
|
 |
17
|
|
 |
18
|
Keisuke Yamazaki , Motoaki Kawanabe , Sumio Watanabe , Masashi Sugiyama , Klaus-Robert Müller, Asymptotic Bayesian generalization error when training and test distributions are different, Proceedings of the 24th international conference on Machine learning, p.1079-1086, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273632]
|
|