Distributed layer-3 e-mail classification for spam control
This paper proposes a distributed layer-3 e-mail classification for spam control. E-mail packets are inferred in transit and tagged with an intra-packet spam score to indicate whether the packet forms a legitimate or spam e-mail. During e-mail packet reassembly, tags for an e-mail are aggregated to...
Main Authors: | Marsono, Muhammad N.; El-Kharashi, M. Watheq; Gebali, Fayez; Ganti, Sudhakar |
---|---|
Format: | Book Section |
Published: | IEEE Explore 2007 |
Subjects: | TK Electrical engineering. Electronics. Nuclear engineering |
Online Access: | http://eprints.utm.my/id/eprint/17110/ http://dx.doi.org/10.1109/CCECE.2006.277810 |
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.17110 |
---|---|
record_format |
eprints |
spelling |
my.utm.17110 2017-02-05T03:12:26Z; http://eprints.utm.my/id/eprint/17110/; IEEE Explore; 2007-01-15; Book Section; PeerReviewed; Marsono, Muhammad N. and El-Kharashi, M. Watheq and Gebali, Fayez and Ganti, Sudhakar (2007) Distributed layer-3 e-mail classification for spam control. In: Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on. IEEE Explore, Ottawa, Ont., pp. 742-745. ISBN 1-4244-0038-4; http://dx.doi.org/10.1109/CCECE.2006.277810 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
TK Electrical engineering. Electronics. Nuclear engineering |
description |
This paper proposes a distributed layer-3 e-mail classification approach for spam control. E-mail packets are classified in transit and tagged with an intra-packet spam score that indicates whether the packet belongs to a legitimate or a spam e-mail. During e-mail packet reassembly, the tags for an e-mail are aggregated to give an inter-packet spam score. The naive Bayes inference technique is used to evaluate the performance of the proposed approach compared to the full e-mail classification approach. Our simulation results show that the proposed approach exhibits spam precision (and confidence) comparable to that of the full e-mail classification approach. Spam recall increases from 63% to 85% depending on the maximum transmission unit size, approaching the 87% of the full e-mail classification. For a 67% spam-to-legitimate ratio, we obtain a reduction in end servers' workload of 42% to 57% of the total e-mail traffic across all maximum transmission unit sizes tested. Thus, the proposed approach can complement existing anti-spam systems by pre-processing e-mail packets on upstream nodes. Layer-3 e-mail processing requires less processing complexity than layer-7 processing and is viable for high-throughput hardware-based implementations. |
format |
Book Section |
author |
Marsono, Muhammad N.; El-Kharashi, M. Watheq; Gebali, Fayez; Ganti, Sudhakar |
title |
Distributed layer-3 e-mail classification for spam control |
publisher |
IEEE Explore |
publishDate |
2007 |
url |
http://eprints.utm.my/id/eprint/17110/ http://dx.doi.org/10.1109/CCECE.2006.277810 |
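
The description in this record outlines a two-stage scheme: each packet receives an intra-packet spam score in transit, and the scores are aggregated into an inter-packet score at reassembly. The following is a minimal Python sketch of that idea, not the authors' implementation. The function names, the toy training data, the character-based packet split (a stand-in for the maximum transmission unit), the equal class priors, and the choice of summing per-packet log-likelihood ratios are all assumptions made for illustration. The last helper encodes one reading of the workload figures: a spam share of 0.67 multiplied by the reported recall of 0.63 to 0.85 gives roughly 0.42 to 0.57, which matches the reported range, although the record does not state this formula.

```python
import math
from collections import Counter


def train(samples):
    """Count tokens per class from (text, is_spam) pairs."""
    counts = {True: Counter(), False: Counter()}
    for text, is_spam in samples:
        counts[is_spam].update(text.lower().split())
    return counts


def packet_score(payload, counts):
    """Intra-packet score: naive Bayes log-likelihood ratio (spam vs. legitimate).

    Uses add-one smoothing and assumes equal class priors (assumption of this sketch).
    """
    vocab_size = len(set(counts[True]) | set(counts[False]))
    spam_total = sum(counts[True].values()) + vocab_size
    ham_total = sum(counts[False].values()) + vocab_size
    score = 0.0
    for token in payload.lower().split():
        p_spam = (counts[True][token] + 1) / spam_total
        p_ham = (counts[False][token] + 1) / ham_total
        score += math.log(p_spam / p_ham)
    return score


def split_into_packets(message, mtu):
    """Hypothetical stand-in for fragmentation: fixed-size character chunks."""
    return [message[i:i + mtu] for i in range(0, len(message), mtu)]


def message_score(tags):
    """Inter-packet score: sum of the per-packet tags (assumed aggregation rule)."""
    return sum(tags)


def upstream_reduction(spam_share, recall):
    """Share of total traffic removed upstream if all recalled spam is filtered."""
    return spam_share * recall


# Toy training data, invented for this example only.
training = [
    ("win free money now", True),
    ("free prize claim now", True),
    ("meeting agenda for monday", False),
    ("please review the attached report", False),
]
counts = train(training)

spam_tags = [packet_score(p, counts) for p in split_into_packets("claim free money now", 10)]
ham_tags = [packet_score(p, counts) for p in split_into_packets("please review the report", 10)]

print("spam message score:", message_score(spam_tags))
print("legitimate message score:", message_score(ham_tags))
print("workload reduction range:",
      round(upstream_reduction(0.67, 0.63), 2),
      round(upstream_reduction(0.67, 0.85), 2))
```

In this sketch a positive aggregate score leans toward spam and a negative one toward legitimate mail. Because the split is by character count, a token can be cut at a packet boundary and then scored as an unknown word, which is one way a per-packet score can drift from a whole-message score as the packet size shrinks.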