Performance Analysis of NSL-KDD Dataset Using Classification Algorithms with Different Feature Selection Algorithms and Supervised Filter Discretization

  • Shailesh Singh PanwarEmail author
  • Y. P. Raiwani
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 989)


Naive Bayes and Bayes Net are critical classification methods in data mining classification and have build up being important software tools for the classification, the description and generalization of information. All classification algorithms are open sources, which are implemented on Java (C4.5 algorithms) for WEKA software tool. In this paper, we exhibit the strategy for increasing the performance of Naive Bayes and Bayes Net algorithms with supervised filter discretization after we applied feature selection techniques. We have used the supervised filter discretization on two classification algorithms and compared the result with and without discretization. The outcomes acquired from the experiment showed significant improvement over the existing classification algorithms.


Naive Bayes Bayes net WEKA NSL-KDD dataset Preprocessing Discretization Feature selection algorithms 


  1. 1.
    Kantardzic, M.: Data Mining: Concepts, Models, Methods, and Algorithms. Wiley, ISBN: 0471228524 (2003)Google Scholar
  2. 2.
    Mitra, S., Acharya, T.: Data Mining Multimedia, Soft Computing, and Bioinformatics. Wiley (2003)Google Scholar
  3. 3.
    Shun, J., Malki, H.A.: Network intrusion detection system using neural networks. Fourth international conference on natural computation, Jinan, pp. 242–246 (2008)Google Scholar
  4. 4.
    Xue, B., Zhang, M., Browne W.N., Yao, X.: A survey on evolutionary computation approaches to feature selection. In: IEEE transactions on evolutionary computation, vol. 20, no. 4, pp. 606–626, Aug. 2016 (2016)CrossRefGoogle Scholar
  5. 5.
    Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. In: IEEE transactions on systems, man, and cybernetics, part C (Applications and reviews), vol. 40, no. 2, pp. 121–144, March 2010 (2010)CrossRefGoogle Scholar
  6. 6.
    Taheri, S., Mammadov, M.: Learning the naive bayes classifier with optimization models. Int. J. Appl. Math. Comput. Sci. 23(4), 787–795 (2013)MathSciNetCrossRefGoogle Scholar
  7. 7.
    Panwar, S.S., Raiwani, Y.P.: Data reduction techniques to analyze NSL-KDD dataset. Int. J. Comput. Eng. Technol. 5(10), 21–31 (2014)Google Scholar
  8. 8.
    Raiwani, Y.P., Panwar, S.S.: Research challenges and performance of clustering techniques to analyze NSL-KDD dataset. Int. J. Emerg. Trends Technol. Comput. Sci. (IJETTCS) 3(6), 172–177 (2014)Google Scholar
  9. 9.
  10. 10.
  11. 11.
    Gama, J., Pinto, C.: Discretization from data streams: applications to histograms and data mining. In: Proceedings of the 2006 ACM Symposium on Applied Computing, SAC, New York, NY, USA, pp. 662–667 (2006)Google Scholar
  12. 12.
    Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous valued attributes for classification learning. In: Thirteenth International Joint Conference on Artificial Intelligence, vol. 2, pp. 1022–1027. Morgan Kaufmann Publishers (1993)Google Scholar
  13. 13.
    Kononenko, I.: On biases in estimating multivalve attributes. In: 14th International Joint Conference on Artificial Intelligence, pp. 1034–1040 (1995)Google Scholar
  14. 14.
    WEKA User Manual: [Available Online]
  15. 15.
    NSL-KDD dataset: [Available Online]
  16. 16.
    Raiwani, Y.P., Panwar, S.S.: Data reduction and neural networking algorithms to improve intrusion detection system with NSL-KDD dataset. Int. J. Emerg. Trends Technol. Comput. Sci. (IJETTCS) 4(1), 219–225 (2015)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.H.N.B. Garhwal UniversitySrinagar, GarhwalIndia

Personalised recommendations