A Modified Minimum Risk Bayes and It’s Application in Spam Filtering

  • Zhenfang Zhu
  • Peipei Wang
  • Zhiping Jia
  • Hairong Xiao
  • Guangyuan Zhang
  • Hao Liang
Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 269)


To settle the problem of the flood spam, a spam filtering algorithm based on AdaBoost algorithm and minimum Risk Bayes algorithm is created by the combination of the latter two after in-depth analysis and research of them. Experiments have been run to apply it to spam filtering, the result of which shows that this algorithm can better the performance of spam filtering system by improving the accuracy of mail filtering.


Mail filtering Minimum risk bayes AdaBoost 


  1. 1.
    Wang L, Lin Y-P, Peng Y et al (2004) An algorithm of filtering junk mails based on cognition-learning and minimum risk naive bayes. Acta Simulata Systematica Sinica 32:69–85Google Scholar
  2. 2.
    Bedi P, Vashisth P (2011) Interest based recommendations with argumentation. J Artif Intell 4:119–142CrossRefGoogle Scholar
  3. 3.
    Muda Z, Yassin W, Sulaiman MN, Udzir NI (2011) A K-Means and naive Bayes learning approach for better intrusion detection. Inf Technol J 10:648–655CrossRefGoogle Scholar
  4. 4.
    Yoav F, Schapire RE (1999) A short introduction to boosting. J Japanese Soc Artif Intell 5:771–778Google Scholar
  5. 5.
    Chang HL, Yue TW (2011) Entropy-directed AdaBoost algorithm with NBBP features for face detection. Inf Technol J, 10:1518−1526Google Scholar
  6. 6.
    Lee LH, Wan CH, Yong TF, Kok HM (2010) A review of nearest neighbor-support vector machines hybrid classification models. J Appl Sci 10:1841–1858CrossRefGoogle Scholar
  7. 7.
    Li XY, Ye F (2006) Method of spam filtering based on multi-bayes algorithms. Comput Eng Appl 9:114–116Google Scholar
  8. 8.
    Zhen L, Liang T, Ming-Tian Z (2008) Research on spam classifier based on features of spammer’s behaviours. Inf Technol J 7:165–169CrossRefGoogle Scholar
  9. 9.
    Jing-ping J, Fei-zhou Z, Yan-mei C (2009) Adaboost object tracking algorithm. Pattern Recogn Artif Intell 3:477–478Google Scholar
  10. 10.
    Jiang Y, Ding XQ (2008) AdaBoost algorithm using multi-step correction. J Tsinghua Univ Sci Technol 10:1610–1611Google Scholar
  11. 11.
    Hammoud D, Maamri R, Sahnoun Z (2011) Machine learning in an agent: a generic model and an intelligent agent based on inductive decision learning. J Artif Intell 4:29–44CrossRefGoogle Scholar
  12. 12.
    Stambouli TB, Keche M, Ouamri A (2010) Iterative feature selection for classification. J Appl Sci 10:1015–1018CrossRefGoogle Scholar
  13. 13.
    Wang T, Qiu GY, He JH, (2008) New bayes e-mail filtering model based on risk minimization. Appl Res Comput 4:1147−1149Google Scholar
  14. 14.
    Su S, Hongfei L, Ye Z (2009) Character-based language modeling approach for spam filtering. J Chin Inf Process 2:41–47Google Scholar
  15. 15.
    Sebastini F (2002) Machine learning in automated text categorization. JACM 1:1–47Google Scholar
  16. 16.
    Li L (2006) Data complexity in machine learning and novel classification algorithms. Ph.D. Thesis, California Institute of Technology, CaliforniaGoogle Scholar
  17. 17.
    Wang B, Pan WF (2005) A survey of content-based anti-spam email filtering. J Chin Inf Process 5:1–6Google Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2014

Authors and Affiliations

  • Zhenfang Zhu
    • 1
    • 2
  • Peipei Wang
    • 3
  • Zhiping Jia
    • 1
  • Hairong Xiao
    • 2
  • Guangyuan Zhang
    • 2
  • Hao Liang
    • 2
  1. 1.School of Computer Science and TechnologyShandong UniversityJinanChina
  2. 2.School of Information Science and Electric EngineeringShandong Jiaotong UniversityJinanChina
  3. 3.School of Accountancy Shandong management UniversityJinanChina

Personalised recommendations