One-Document Training for Vietnamese Sentiment Analysis

Nguyen-Nhat, Dang-Khoa; Duong, Huu-Thanh

doi:10.1007/978-3-030-34980-6_21

Dang-Khoa Nguyen-Nhat¹⁰ &
Huu-Thanh Duong¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11917))

Included in the following conference series:

International Conference on Computational Data and Social Networks

977 Accesses
6 Citations

Abstract

The traditional studies which have based on machine learning are usually supervised learning for sentiment analysis problem, this is costly time and money to build pre-labeled dataset, not domain adaptation and hard to handle unseen data. In this paper, we have approached semi-supervised learning for Vietnamese sentiment analysis, training data is only one document. Many preprocessing techniques have been performed to clean and normalize data, complemented semantic lexicons such as negation handling, intensification handling, also augmented training data from one-document training. In experiments, we have performed various aspects and obtained competitive results which may motivate next propositions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Article Google Scholar
Soleymani, M., Garcia, D., Jou, B., Schuller, B., Chang, S.-F., Pantic, M.: A survey of multimodal sentiment analysis. Image Vis. Comput. 65, 3–14 (2017)
Article Google Scholar
Fernández-Gavilanes, M., Àlvarez-López, T., Juncal-Martínez, J., Costa-Montenegro, E., González-Castaño, F.J.: GTI: an unsupervised approach for sentiment analysis in Twitter. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, pp. 533–538 (2015)
Google Scholar
Fernández-Gavilanes, M., Juncal-Martínez, J., García-Méndez, S., Costa-Montenegro, E., González-Castaño, F.J.: Creating emoji lexica from unsupervised sentiment analysis of their descriptions. Expert Syst. Appl. 103, 74–91 (2018)
Article Google Scholar
AL-Sharuee, M.T., Liu, F., Pratama, M.: Sentiment analysis: an automatic contextual analysis and ensemble clustering approach and comparison. Data Knowl. Eng. 115, 194–213 (2018)
Article Google Scholar
Kobayashi, S.: Contextual augmentation: data augmentation by words with paradigmatic relations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), New Orleans, Louisiana, pp. 452–457 (2018)
Google Scholar
Symeonidis, S., Effrosynidis, D., Arampatzis, A.: A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst. Appl. 110, 298–310 (2018)
Article Google Scholar
Kim, K.: An improved semi-supervised dimensionality reduction using feature weighting: application to sentiment analysis. Expert Syst. Appl. 109, 49–65 (2018)
Article Google Scholar
Yoo, S., Song, J., Jeong, O.: Social media contents based sentiment analysis and prediction system. Expert Syst. Appl. 105, 102–111 (2018)
Article Google Scholar
Hussein, D.M.E.-D.M.: A survey on sentiment analysis challenges. J. King Saud Univ.- Eng. Sci. 30(4), 330–338 (2018)
Google Scholar
Nguyen-Thi, B.-T., Duong, H.-T.: A Vietnamese sentiment analysis system based on multiple classifiers with enhancing lexicon features. In: Duong, T.Q., Vo, N.-S., Nguyen, L.K., Vien, Q.-T., Nguyen, V.-D. (eds.) INISCOM 2019. LNICST, vol. 293, pp. 240–249. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30149-1_20
Chapter Google Scholar
Wei, J., Zou, K.: EDA: easy data augmentation techniques for boosting performance on text classification. In: ICLR 2019–7th International Conference on Learning Representations (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, Ho Chi Minh City Open University, 97 Vo Van Tan, Ward 6, District 3, Ho Chi Minh City, Vietnam
Dang-Khoa Nguyen-Nhat & Huu-Thanh Duong

Authors

Dang-Khoa Nguyen-Nhat
View author publications
You can also search for this author in PubMed Google Scholar
Huu-Thanh Duong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huu-Thanh Duong .

Editor information

Editors and Affiliations

University of Calabria, Rende, Italy
Andrea Tagarelli
University of Illinois at Urbana-Champaign, Urbana, IL, USA
Hanghang Tong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nguyen-Nhat, DK., Duong, HT. (2019). One-Document Training for Vietnamese Sentiment Analysis. In: Tagarelli, A., Tong, H. (eds) Computational Data and Social Networks. CSoNet 2019. Lecture Notes in Computer Science(), vol 11917. Springer, Cham. https://doi.org/10.1007/978-3-030-34980-6_21

Download citation

DOI: https://doi.org/10.1007/978-3-030-34980-6_21
Published: 11 November 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34979-0
Online ISBN: 978-3-030-34980-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics