Skip to main content

A Performance Study of Hidden Markov Model and Random Forest in Internet Traffic Classification

  • Conference paper
  • First Online:
Information Science and Applications (ICISA) 2016

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 376))

Abstract

Currently, the emphasis on internet traffic classification is gaining momentum and a broad range of procedures has been developed for a variety of network tools. Machine learning methods are commonly utilized for the recognition and classification of internet traffic. This investigation trains its sights on two frequently employed techniques in this area. These are the Hidden Markov Model and the Random Forest procedure. Furthermore, the study offers a detailed explanation and analysis for the methodologies for each and forwards the advantages and setbacks attributed through comparing the performance in different significant factors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Callado, A., Kamienski, C., Fernandes, S., Sadok, D.: A Survey on Internet Traffic Identification and Classification (2009)

    Google Scholar 

  2. IANA. Internet Assigned Numbers Authority. http://www.iana.org/assignments/port-numbers

  3. Bernaille, L., Teixeira, R., Salamatian, K.: Early application identification. In: Proceedings of the 2006 ACM CoNEXT Conference, p. 6 (2006)

    Google Scholar 

  4. Haffner, P., Sen, S., Spatscheck, O., Wang, D.: ACAS: automated construction of application signatures. In: Proceedings of the 2005 ACM SIGCOMM Workshop on Mining Network Data, pp. 197–202 (2005)

    Google Scholar 

  5. Marques Nt, H.T., Rocha, L.C., Guerra, P.H., Almeida, J.M., Meira Jr, W., Almeida, V.A.: Characterizing broadband user behavior. In: Proceedings of the 2004 ACM Workshop on Next-Generation Residential Broadband Challenges, pp. 11–18 (2004)

    Google Scholar 

  6. Shi, Z.: Principles of machine learning. International Academic Publishers (1992)

    Google Scholar 

  7. Munther, A., Razif, R., Nizam, S., Sabri, N., Anbar, M.: Active Build-Model Random Forest Method for Network Traffic Classification. International Journal of Engineering & Technology (0975-4024) 6 (2014)

    Google Scholar 

  8. Mitchell, T.M.: Machine learning. 1997, vol. 45. McGraw Hill, Burr Ridge (1997)

    MATH  Google Scholar 

  9. Wang, Y., Li, Q.: Review on the Studies and Advances of Machine Learning Approaches. TELKOMNIKA Indonesian Journal of Electrical Engineering 12, 1487–1494 (2014)

    Google Scholar 

  10. Maia, J.E.B.: Internet traffic classification using a Hidden Markov model. In: 2010 10th International Conference on Hybrid Intelligent Systems (HIS), pp. 37–42 (2010)

    Google Scholar 

  11. Dainotti, A., De Donato, W., Pescape, A., Rossi, P.S.: Classification of network traffic via packet-level hidden Markov models. In: Global Telecommunications Conference, IEEE GLOBECOM 2008. IEEE, pp. 1–5 (2008)

    Google Scholar 

  12. Wright, C., Monrose, F., Masson, G.M.: HMM profiles for network traffic classification. In: Proceedings of the 2004 ACM Workshop on Visualization and Data Mining for Computer Security, pp. 9–15 (2004)

    Google Scholar 

  13. Wright, C.V., Monrose, F., Masson, G.M.: Towards better protocol identification using profile HMMs. JHU Technical Report JHU-SPAR051201, p. 14 (2005)

    Google Scholar 

  14. Banfield, R.E., Hall, L.O., Bowyer, K.W., Kegelmeyer, W.P.: A comparison of decision tree ensemble creation techniques. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 173–180 (2007)

    Article  Google Scholar 

  15. Breiman, L.: Bagging predictors. Machine learning 24, 123–140 (1996)

    MATH  Google Scholar 

  16. Freund, Y., Schapire, R.E.: A desicion-theoretic generalization of on-line learning and an application to boosting. In: Computational Learning Theory, pp. 23–37 (1995)

    Google Scholar 

  17. Breiman, L.: Random forests. Machine learning 45, 5–32 (2001)

    Article  MATH  Google Scholar 

  18. Hou, Y.: P2P Network Traffic Identification Based on Random Forest Algorithm. Journal of Networks 9, 2456–2461 (2014)

    Google Scholar 

  19. Munther, A., Alalousi, A., Nizam, S., Othman, R.R., Anbar, M.: Network traffic classification––a comparative study of two common decision tree methods: C4. 5 and Random forest. In: 2014 2nd International Conference on Electronic Design (ICED), pp. 210–214 (2014)

    Google Scholar 

  20. Mohd, A.B., bin Mohd Nor, S.: Towards a flow-based internet traffic classification for bandwidth optimization. International Journal of Computer Science and Security (IJCSS) 3, 146–153 (2009)

    Google Scholar 

  21. Johnson, M.T.: Capacity and complexity of HMM duration modeling techniques. IEEE Signal Processing Letters 12, 407–410 (2005)

    Article  Google Scholar 

  22. Jun, L., Shunyi, Z., Yanqing, L., Zailong, Z.: Internet traffic classification using machine learning. In: Second International Conference on Communications and Networking in China, CHINACOM 2007, pp. 239–243 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alhamza Munther .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media Singapore

About this paper

Cite this paper

Munther, A., Othman, R.R., Alsaadi, A.S., Anbar, M. (2016). A Performance Study of Hidden Markov Model and Random Forest in Internet Traffic Classification. In: Kim, K., Joukov, N. (eds) Information Science and Applications (ICISA) 2016. Lecture Notes in Electrical Engineering, vol 376. Springer, Singapore. https://doi.org/10.1007/978-981-10-0557-2_32

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-0557-2_32

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-0556-5

  • Online ISBN: 978-981-10-0557-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics