Simulation of English part-of-speech classification based on artificial intelligence and additive logistic regression

  • Focus
Soft Computing

Abstract

English part-of-speech classification processes text data: it brings order to unorganized categories of text information, gives the data structure, and makes it easier to extract the useful information implicit in a text. This article recasts the original multinomial distribution as a generalized linear model and implements it with the logistic regression algorithm. The proposed model inherits the interpretability of the decision tree and fits the data locally with logistic regression, which greatly enlarges the space of functions that logistic regression can fit. Because placing logistic regression at the leaf nodes changes the decision rule there, the tree's branching rule must change accordingly. Finally, experiments are designed to evaluate the performance of the model. The results show that the model achieves high accuracy in extracting and classifying English part-of-speech features.
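
The idea of fitting logistic regression locally inside a tree can be illustrated with a minimal sketch. It is not the model from the article: the one-dimensional toy data, the hand-fixed split at `x = 0`, and the helper names `fit_logistic`, `predict_global`, and `predict_tree` are all assumptions made for illustration, and the article's own branching rule is not reproduced here. The sketch only shows why leaf-level logistic models can fit a pattern that a single logistic model cannot.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.5, epochs=2000):
    """Fit p(y=1|x) = sigmoid(w*x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        gw = gb = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y  # gradient of the log-loss w.r.t. the logit
            gw += err * x
            gb += err
        w -= lr * gw / n
        b -= lr * gb / n
    return w, b

def accuracy(predict, xs, ys):
    return sum(predict(x) == y for x, y in zip(xs, ys)) / len(xs)

# Toy data (assumed): label 1 at both extremes, 0 in the middle.
xs = [-2.0, -1.5, -0.5, -0.25, 0.25, 0.5, 1.5, 2.0]
ys = [1, 1, 0, 0, 0, 0, 1, 1]

# 1) One global logistic regression over all points.
w_g, b_g = fit_logistic(xs, ys)

def predict_global(x):
    return 1 if sigmoid(w_g * x + b_g) >= 0.5 else 0

# 2) A one-split tree with a logistic regression in each leaf.
#    The split threshold is fixed by hand here, not learned.
SPLIT = 0.0
left = [(x, y) for x, y in zip(xs, ys) if x < SPLIT]
right = [(x, y) for x, y in zip(xs, ys) if x >= SPLIT]
w_l, b_l = fit_logistic([x for x, _ in left], [y for _, y in left])
w_r, b_r = fit_logistic([x for x, _ in right], [y for _, y in right])

def predict_tree(x):
    w, b = (w_l, b_l) if x < SPLIT else (w_r, b_r)
    return 1 if sigmoid(w * x + b) >= 0.5 else 0

global_acc = accuracy(predict_global, xs, ys)
tree_acc = accuracy(predict_tree, xs, ys)
print("global logistic regression accuracy:", global_acc)
print("tree with logistic leaves accuracy:", tree_acc)
```

On this toy data no single threshold on `x` can classify more than six of the eight points correctly, so a single logistic model is limited, while each leaf sees a linearly separable subset. A learned split and a multinomial (softmax) leaf model would be needed for an actual part-of-speech task.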

Data availability

Data will be made available on request.

Funding

The authors have not disclosed any funding.

Author information

Corresponding author

Correspondence to Hongchun Jia.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Jia, H. Simulation of English part-of-speech classification based on artificial intelligence and additive logistic regression. Soft Comput (2023). https://doi.org/10.1007/s00500-023-08490-5

  • DOI: https://doi.org/10.1007/s00500-023-08490-5
