Abstract
Text classification and time-varying problem solving are two basic practical problems frequently encountered in science and engineering, and most solutions to both are realized with recurrent neural networks (RNNs). Improving the convergence and robustness of RNN models is therefore increasingly important. Motivated by this, two novel activation functions (NAFs) are proposed in this work to improve the performance of RNN models for text classification, time-varying problem solving and dynamic matrix inversion. First, the first NAF (\(\textrm{NAF}_1\)) is applied to the two-layer simple RNN model, the long short-term memory RNN model and the gated recurrent unit RNN model for text classification. Compared with the same three RNN models activated by previously reported activation functions (rectified linear unit (ReLU), leaky ReLU (LReLU), exponential linear unit (ELU) and scaled ELU), the \(\textrm{NAF}_1\)-activated RNN models achieve higher accuracy in text classification. In addition, based on the second NAF (\(\textrm{NAF}_2\)), an improved fixed-time convergent recurrent neural network (IFTCRNN) model for solving time-varying problems is constructed. The \(\textrm{NAF}_2\)-based IFTCRNN model achieves fixed-time convergence and strong robustness to noise in time-varying Sylvester matrix equation solving, dynamic matrix inversion and robot manipulator trajectory tracking.
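For orientation, the role an activation function plays in this kind of recurrent model can be sketched with the standard zeroing-neural-network design. The abstract does not define \(\textrm{NAF}_2\) or the IFTCRNN dynamics, so everything below is an assumption: the sign convention of the Sylvester equation, the symbols \(A(t)\), \(B(t)\), \(C(t)\), \(X(t)\), \(E(t)\), the gain \(\gamma > 0\), and a generic element-wise activation \(\Phi(\cdot)\) standing in for \(\textrm{NAF}_2\).

\[
\begin{aligned}
&\text{Problem (assumed form):} && A(t)X(t) - X(t)B(t) + C(t) = 0,\\
&\text{Error function:} && E(t) = A(t)X(t) - X(t)B(t) + C(t),\\
&\text{Design formula:} && \dot{E}(t) = -\gamma\,\Phi\big(E(t)\big),\\
&\text{Resulting dynamics:} && A(t)\dot{X}(t) - \dot{X}(t)B(t) = -\dot{A}(t)X(t) + X(t)\dot{B}(t) - \dot{C}(t) - \gamma\,\Phi\big(E(t)\big).
\end{aligned}
\]

Under the same assumptions, dynamic matrix inversion corresponds to the error function \(E(t) = A(t)X(t) - I\), which gives \(A(t)\dot{X}(t) = -\dot{A}(t)X(t) - \gamma\,\Phi\big(A(t)X(t) - I\big)\). In this framework the choice of \(\Phi\) determines how quickly \(E(t)\) is driven to zero and how well the model tolerates noise, which is where an activation function such as \(\textrm{NAF}_2\) would enter.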
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant No. 62273141), the Natural Science Foundation of Hunan Province (Grant No. 2020JJ4315), and the Scientific Research Fund of Hunan Provincial Education Department (Grant No. 20B216).
About this article
Cite this article
Chen, W., Jin, J., Gerontitis, D. et al. Improved Recurrent Neural Networks for Text Classification and Dynamic Sylvester Equation Solving. Neural Process Lett 55, 8755–8784 (2023). https://doi.org/10.1007/s11063-023-11176-6
DOI: https://doi.org/10.1007/s11063-023-11176-6