Skip to main content

A Classification Model for Drug Addicts Based on Improved Random Forests Algorithm

  • Conference paper
  • First Online:
Artificial Intelligence and Security (ICAIS 2019)

Abstract

To accomplish the rapid screening of drug addicts and to meet the requirements of modern police work, this research employs a data-mining technology that utilizes real samples of drug addicts as well as non-drug addicts. The aim is to construct a classification model based on pulse wave data. After the pre-processing of pulse wave data, the original random forest classification model is initially established with high accuracy, but with a relatively low recall rate and F1 score. To resolve this issue, an improved classification model is henceforth proposed. The improved model mainly involves three improvement strategies: firstly, perform cross-validation by dividing multiple training sets and test sets to obtain generalization errors; secondly, balance the sample distribution using down-sampling techniques; and finally, select model parameters based on multi-criteria analysis. According to the evaluation results of accuracy, precision, recall rates, and F1 scores, the performance of the improved random forest classification model has demonstrated its superiority and robustness via experiments using different datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Li, S., Zhang, F.: Point-contact type FBG dynamic pressure sensor and its application in the measurement of pulse information. Optoelectron. Laser 27(10), 1017–1022 (2016)

    MathSciNet  Google Scholar 

  2. Xiaorui, S., Aike, Q.: Evaluation of cardiovascular health based on pulse wave detection technology. J. Med. Biomech. 30(5), 468–473 (2015)

    Google Scholar 

  3. Chen, M., Cai, K.: A modern spectrum analysis method for pulse abnormalities of drug abusers. J. Chongqing Univ. (Nat. Sci. Ed.) 24(4), 98–102 (2001)

    Google Scholar 

  4. Breiman, L.: Random forests. Mach. Learn. 54(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  5. Ming, L.: The solution of multicollinearity: a new standard to rule out variables. Stat. Decis. Mak. 5, 82–83 (2013)

    Google Scholar 

  6. Zhang, Y., Gong, H.: Approximate linear Bayesian estimation of skewness coefficient. Stat. Decis. Mak. 78–81 (2017)

    Google Scholar 

  7. Sun, H., McIntosh, S.: Analyzing cross-domain transportation big data of New York City with semi-supervised and active learning. Comput. Mater. Continua 57(1), 1–9 (2018)

    Article  Google Scholar 

  8. Chen, Y., Zhou, J., Du, J.: Credit evaluation method based on transaction data. Comput. Appl. Softw. 35(5), 168–171 (2018)

    Google Scholar 

  9. Feifei, S., Zhuo, C., Xiaolei, X.: Application of an improved random forest based classifier in crime prediction domain. J. Intell. 33(10), 148–152 (2014)

    Google Scholar 

  10. Hao, H., Xianhui, W.: Maximum F1-score criterion based discriminative feature compensation training algorithm for automatic mispronunciation detection. Acta Electronica Sin. 43(7), 1294–1299 (2015)

    Google Scholar 

  11. Verikas, A., Gelzinis, A., Bacauskiene, M.: Mining data with random forests: a survey and results of new tests. Pattern Recognit. 22(2), 330–349 (2011)

    Article  Google Scholar 

  12. Zhihua, Z.: Machine Learning. Tsinghua University Press, Beijing (2016)

    Google Scholar 

  13. Yin, H., Hu, Y.: Unbalanced feature selection algorithm based on random forest. J. Sun Yat-Sen Univ. (Nat. Sci. Ed.) 53(5), 59–65 (2014)

    MATH  Google Scholar 

  14. Shishi, D.: Analysis of random forest theory. Integr. Technol. 2(1), 1–7 (2013)

    Google Scholar 

  15. Wu, X., Zhang, C., Zhang, R., et al.: A distributed intrusion detection model via nondestructive partitioning and balanced allocation for big data. CMC Comput. Mater. Continua 56(1), 61–72 (2018)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Tianyue Chen or Haiyan Gu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, T., Gu, H. (2019). A Classification Model for Drug Addicts Based on Improved Random Forests Algorithm. In: Sun, X., Pan, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2019. Lecture Notes in Computer Science(), vol 11632. Springer, Cham. https://doi.org/10.1007/978-3-030-24274-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-24274-9_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24273-2

  • Online ISBN: 978-3-030-24274-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics