Attention-based BiGRU-CNN for Chinese question classification

Liu, Jin; Yang, Yihe; Lv, Shiqi; Wang, Jin; Chen, Hui

doi:10.1007/s12652-019-01344-9

Abstract

Chinese question classification is an essential task in natural language processing (NLP) for Chinese, owing to the language's distinctive characteristics. Existing methods are usually based on rules or traditional machine learning, both of which require manually created rules or features, so classification accuracy is constrained by the inherent limitations of these methods. Because deep learning-based methods have been shown to mine deep information from text, this article proposes a novel deep neural network model, the Attention-Based BiGRU-CNN network (ABBC), and applies it to the Chinese question classification task. The model combines the characteristics and advantages of convolutional neural networks, the attention mechanism, and recurrent neural networks. It not only extracts the features of Chinese questions effectively but also learns the context information of words, addressing the problem that the Text-CNN model can lose position features. In a comparison with four other classic models, the experimental results show that our model achieves the best performance on the Chinese question classification task.
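
The abstract names three components (a bidirectional GRU, an attention mechanism, and a CNN) but does not say how they are wired together. The sketch below is therefore only one plausible arrangement, written as a NumPy forward pass with random, untrained weights: BiGRU states are reweighted by attention, then passed through a convolution with max-over-time pooling and a softmax classifier. The layer order, all dimensions, the attention scoring form, and every name (`gru_run`, `bigru`, `attention_weights`, `conv_maxpool`, `forward`) are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def init_gru(rng, d_in, d_h):
    # One weight matrix per gate, applied to the concatenation [x; h].
    # Biases are omitted to keep the sketch short.
    return {g: rng.normal(0.0, 0.1, (d_h, d_in + d_h)) for g in ("z", "r", "n")}

def gru_run(p, xs):
    # Run a GRU over a sequence xs of shape (T, d_in); return all states (T, d_h).
    h = np.zeros(p["z"].shape[0])
    states = []
    for x in xs:
        xh = np.concatenate([x, h])
        z = sigmoid(p["z"] @ xh)                          # update gate
        r = sigmoid(p["r"] @ xh)                          # reset gate
        n = np.tanh(p["n"] @ np.concatenate([x, r * h]))  # candidate state
        h = (1.0 - z) * n + z * h
        states.append(h)
    return np.stack(states)

def bigru(p_fwd, p_bwd, xs):
    # Forward pass plus a pass over the reversed sequence, re-aligned in time.
    fwd = gru_run(p_fwd, xs)
    bwd = gru_run(p_bwd, xs[::-1])[::-1]
    return np.concatenate([fwd, bwd], axis=1)             # (T, 2 * d_h)

def attention_weights(H, v):
    # Assumed scoring form: one score per time step, normalised with softmax.
    return softmax(np.tanh(H) @ v)                        # (T,)

def conv_maxpool(H, filters):
    # filters has shape (n_filters, k, d); slide windows of k time steps over H.
    n_f, k, d = filters.shape
    windows = np.stack([H[t:t + k].ravel() for t in range(H.shape[0] - k + 1)])
    feats = np.maximum(windows @ filters.reshape(n_f, k * d).T, 0.0)  # ReLU
    return feats.max(axis=0)                              # max over time, (n_f,)

def forward(params, xs):
    H = bigru(params["fwd"], params["bwd"], xs)
    a = attention_weights(H, params["v"])
    feats = conv_maxpool(H * a[:, None], params["filters"])
    return softmax(params["W_out"] @ feats), a

# Hypothetical sizes: 7 tokens, 8-dim embeddings, 5 question classes.
rng = np.random.default_rng(0)
d_in, d_h, n_f, k, n_cls, T = 8, 6, 4, 3, 5, 7
params = {
    "fwd": init_gru(rng, d_in, d_h),
    "bwd": init_gru(rng, d_in, d_h),
    "v": rng.normal(0.0, 0.1, 2 * d_h),
    "filters": rng.normal(0.0, 0.1, (n_f, k, 2 * d_h)),
    "W_out": rng.normal(0.0, 0.1, (n_cls, n_f)),
}
xs = rng.normal(size=(T, d_in))   # stand-in for the word embeddings of one question
probs, attn = forward(params, xs)
```

Here `probs` is a distribution over the hypothetical question classes and `attn` holds one weight per token. Because the recurrent states depend on word order, the convolution sees position-aware inputs, which is the property the abstract contrasts with a plain Text-CNN.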

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61872231, 61772454, 61701297, 61811530332, 61811540410).

Author information

Authors and Affiliations

College of Information Engineering, Shanghai Maritime University, Shanghai, China
Jin Liu, Yihe Yang & Shiqi Lv
School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, China
Jin Wang
College of Education, Shanghai Normal University, Shanghai, China
Hui Chen

Corresponding author

Correspondence to Hui Chen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Liu, J., Yang, Y., Lv, S. et al. Attention-based BiGRU-CNN for Chinese question classification. J Ambient Intell Human Comput (2019). https://doi.org/10.1007/s12652-019-01344-9

Received: 15 October 2018
Accepted: 05 June 2019
Published: 13 June 2019
DOI: https://doi.org/10.1007/s12652-019-01344-9
