Abstract
The traditional studies which have based on machine learning are usually supervised learning for sentiment analysis problem, this is costly time and money to build pre-labeled dataset, not domain adaptation and hard to handle unseen data. In this paper, we have approached semi-supervised learning for Vietnamese sentiment analysis, training data is only one document. Many preprocessing techniques have been performed to clean and normalize data, complemented semantic lexicons such as negation handling, intensification handling, also augmented training data from one-document training. In experiments, we have performed various aspects and obtained competitive results which may motivate next propositions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Soleymani, M., Garcia, D., Jou, B., Schuller, B., Chang, S.-F., Pantic, M.: A survey of multimodal sentiment analysis. Image Vis. Comput. 65, 3–14 (2017)
Fernández-Gavilanes, M., Àlvarez-López, T., Juncal-MartÃnez, J., Costa-Montenegro, E., González-Castaño, F.J.: GTI: an unsupervised approach for sentiment analysis in Twitter. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, pp. 533–538 (2015)
Fernández-Gavilanes, M., Juncal-MartÃnez, J., GarcÃa-Méndez, S., Costa-Montenegro, E., González-Castaño, F.J.: Creating emoji lexica from unsupervised sentiment analysis of their descriptions. Expert Syst. Appl. 103, 74–91 (2018)
AL-Sharuee, M.T., Liu, F., Pratama, M.: Sentiment analysis: an automatic contextual analysis and ensemble clustering approach and comparison. Data Knowl. Eng. 115, 194–213 (2018)
Kobayashi, S.: Contextual augmentation: data augmentation by words with paradigmatic relations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), New Orleans, Louisiana, pp. 452–457 (2018)
Symeonidis, S., Effrosynidis, D., Arampatzis, A.: A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst. Appl. 110, 298–310 (2018)
Kim, K.: An improved semi-supervised dimensionality reduction using feature weighting: application to sentiment analysis. Expert Syst. Appl. 109, 49–65 (2018)
Yoo, S., Song, J., Jeong, O.: Social media contents based sentiment analysis and prediction system. Expert Syst. Appl. 105, 102–111 (2018)
Hussein, D.M.E.-D.M.: A survey on sentiment analysis challenges. J. King Saud Univ.- Eng. Sci. 30(4), 330–338 (2018)
Nguyen-Thi, B.-T., Duong, H.-T.: A Vietnamese sentiment analysis system based on multiple classifiers with enhancing lexicon features. In: Duong, T.Q., Vo, N.-S., Nguyen, L.K., Vien, Q.-T., Nguyen, V.-D. (eds.) INISCOM 2019. LNICST, vol. 293, pp. 240–249. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30149-1_20
Wei, J., Zou, K.: EDA: easy data augmentation techniques for boosting performance on text classification. In: ICLR 2019–7th International Conference on Learning Representations (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Nguyen-Nhat, DK., Duong, HT. (2019). One-Document Training for Vietnamese Sentiment Analysis. In: Tagarelli, A., Tong, H. (eds) Computational Data and Social Networks. CSoNet 2019. Lecture Notes in Computer Science(), vol 11917. Springer, Cham. https://doi.org/10.1007/978-3-030-34980-6_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-34980-6_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34979-0
Online ISBN: 978-3-030-34980-6
eBook Packages: Computer ScienceComputer Science (R0)