| Customer targeting models using actively-selected web content |
| Full text |
Pdf
(337 KB)
|
Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
Las Vegas, Nevada, USA
SESSION: Industrial papers
table of contents
Pages 946-953
Year of Publication: 2008
ISBN:978-1-60558-193-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 19, Downloads (12 Months): 189, Citation Count: 0
|
|
|
ABSTRACT
We consider the problem of predicting the likelihood that a company will purchase a new product from a seller. The statistical models we have developed at IBM for this purpose rely on historical transaction data coupled with structured firmographic information like the company revenue, number of employees and so on. In this paper, we extend this methodology to include additional text-based features based on analysis of the content on each company's website. Empirical results demonstrate that incorporating such web content can significantly improve customer targeting. Furthermore, we present methods to actively select only the web content that is likely to improve our models, while reducing the costs of acquisition and processing.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
D. A. Cohn, Z. Ghahramani, and M. I. Jordan. Active learning with statistical models. Journal of Artificial Intelligence Research, 4:129--145, 1996.
|
| |
4
|
V. Federov. Theory of optimal experiments. Academic Press, 1972.
|
| |
5
|
|
| |
6
|
E. Frank and R. R. Bouckaert. Naive bayes for text classification with unbalanced classes. In Proc 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, pages 503--510, 2006.
|
| |
7
|
|
| |
8
|
J. Keifer. Optimal experimental designs. Journal of the Royal Statistical Society, 21B:272--304, 1959.
|
| |
9
|
R. Lawrence , C. Perlich , S. Rosset , J. Arroyo , M. Callahan , J. M. Collins , A. Ershov , S. Feinzig , I. Khabibrakhmanov , S. Mahatma , M. Niemaszyk , S. M. Weiss, Analytics-driven solutions for customer targeting and sales-force allocation, IBM Systems Journal, v.46 n.4, p.797-816, October 2007
|
| |
10
|
A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In Papers from the AAAI-98 Workshop on Text Categorization, pages 41--48, Madison, WI, July 1998.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
J. Rennie, L. Shih, J. Teevan, and D. Karger. Tackling the poor assumptions of naive bayes text classifiers. In Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), pages 616--623, 2003.
|
| |
15
|
S. Rosset and R. D. Lawrence. Data enhanced predictive modeling for sales targeting. In Proceedings of the 2005 SIAM International Conference on Data Mining (SDM-05), 2005.
|
| |
16
|
M. Saar-Tsechansky and F. J. Provost. Active learning for class probability estimation and ranking. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-2001), pages 911--920, 2001.
|
| |
17
|
R. E. Schapire, Y. Freund, P. Bartlett, and W. S. Lee. Boosting the margin: A new explanation for the effectiveness of voting methods. The Annals of Statistics, 26(5):1651--1686, 1998.
|
| |
18
|
|
| |
19
|
|
 |
20
|
|
|