| Spam decisions on gray e-mail using personalized ontologies |
| Full text |
Pdf
(490 KB)
|
Source
|
Symposium on Applied Computing
archive
Proceedings of the 2009 ACM symposium on Applied Computing
table of contents
Honolulu, Hawaii
SESSION: The semantic web and applications track
table of contents
Pages 1262-1266
Year of Publication: 2009
ISBN:978-1-60558-166-8
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 23, Downloads (12 Months): 100, Citation Count: 0
|
|
|
ABSTRACT
E-mail is one of the most common communication methods among people on the Internet. However, the increase of e-mail misuse/abuse has resulted in an increasing volume of spam e-mail over recent years. As spammers always try to find a way to evade existing spam filters, new filters need to be developed to catch spam. A statistical learning filter is at the core of many commercial anti-spam filters. It can either be trained globally for all users, or personally for each user. Generally, globally-trained filters outperform personally-trained filters for both small and large collections of users under a real environment. However, globally-trained filters sometimes ignore personal data. Globally-trained filters cannot retain personal preferences and contexts as to whether a feature should be treated as an indicator of legitimate e-mail or spam. Gray e-mail is a message that could reasonably be considered either legitimate or spam. In this paper, a personalized ontology spam filter was implemented to make decisions for gray e-mail. In the future, by considering both global and personal ontology-based filters, we can show a significant improvement in overall performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Burns, E. The deadly duo: Spam and viruses. Jun. 2006. http://www.clickz.com/stats/sectors/e-mail/print.php/3614491.
|
| |
2
|
Commtouch. http://www.commtouch.com.
|
| |
3
|
An Introduction to RDF and the Jena RDF API. http://jena.sourceforge.net/tutorial/RDF_API/index.html.
|
| |
4
|
Liu, R. Dynamic Category Profiling for Text Filtering and Classification. In Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD '06), 2006, 255--264.
|
| |
5
|
Medlock, B. An adaptive approach to spam filtering on a new corpus. In Proceedings of the 3rd Conference on E-mail and Anti-Spam (CEAS '06), 2006.
|
| |
6
|
Quinlan, J. Bagging, Boosting, and C4.5. In proceedings of AAAI/IAAI, Vol. 1, 1996, 725--730.
|
| |
7
|
Sahami, M., Dumais, S., Heckerman, D., and Horvitz, E. A Bayesian Approach to Filtering Junk E-Mail. In Proceedings of the AAAI Workshop on Learning for Text Categorization, 1998, 55--62.
|
| |
8
|
Segal, R. Combining global and personal anti-spam. In Proceedings of 4th Conference on E-mail and Anti-Spam, (CEAS '07), 2007.
|
| |
9
|
Kazem Taghva , Julie Borsack , Jeffrey Coombs , Allen Condit , Steve Lumos , Tom Nartker, Ontology-based Classification of Email, Proceedings of the International Conference on Information Technology: Computers and Communications, p.194, April 28-30, 2003
|
| |
10
|
Weka: the Waikato Environment for Knowledge Analysis. http://www.cs.waikato.ac.nz/~ml/publications/1995/Garner9 5-WEKA.pdf.
|
| |
11
|
Yih, W., McCann, R., and Kolcz, A. Improving Spam Filtering by Detecting Gray Mail. In Proceedings of the 4th Conference on E-mail and Anti-Spam (CEAS '07), 2007.
|
| |
12
|
Youn, S., and McLeod, D. A Comparative Study for E-mail Classification," In Proceedings of International Joint Conferences on Computer, Information, System Sciences, and Engineering (CISSE '06), 2006, 387--391.
|
| |
13
|
Youn, S. and McLeod, D. Spam E-mail Classification using an Adaptive Ontology, Journal of Software (JSW) 2, 3, (2007), 43--55.
|
|