Abstract
Context: Opinions given by hotel clients in tourism social networks, the ones which can be a great source of knowledge extraction in the Big Data context, including the sustainable aspects of the hotels clients opinions. Objective: Evaluate performance and quality of deep learning neural networks, especially the Target-Connection LSTM (TC-LSTM) and Attention-based LSTM (AT-LSTM) algorithms, aiming to mine and classify the opinions posted on the TripAdvisor and Booking social networks, by considering sustainability aspects. Method: A controlled experiment to compare the efficiency and efficacy of the classifiers was carried out. Results: The AT-LSTM algorithm presented the best results, especially in terms of accuracy, precision, f-measure, average training time and average classification time. The first with 74,58%, the second with 95,54%, the third with 85,37%, then fourth with 7,3 s and the last one with 1,12 s. Conclusion: The AT-LSTM algorithm was expressly more effective than TC-LSTM, making it an option to be considered for mining opinions based on specific aspects of tourism and peculiar market niches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hartmann, N., et al.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of International Conference on Learning Representations Workshop (ICLR-2013) (2013)
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empiricial Methods in Natural Language Processing (EMNLP-2014), vol. 12, pp. 1532–1543 (2014)
Greff, K., et al.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28(10), 2222–2232 (2017)
Mikolov, T., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. arXiv preprint arXiv:1503.00075 (2015)
Tang, D., et al.: Effective LSTMs for target-dependent sentiment classification. arXiv preprint arXiv:1512.01100 (2015)
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)
Ling, W., Dyer, C., Black, A., Trancoso, I.: Two/too simple adaptations of word2vec for syntax problems. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics (2015)
Severino, A.J.: Metodologia do trabalho científico. Cortez editora (2017)
Lima, T., Junior, M.C., Augusta, M., Nunes, S.N.: Mining on line general opinions about sustainability of hotels: a systematic literature mapping. In: International Conference on Computational Science and Its Applications. Springer, Cham (2018)
Wang, Y., Huang, M., Zhao, L.: Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (2016)
Young, T., et al.: Recent trends in deep learning based natural language processing. arXiv preprint arXiv:1708.02709 (2017) 18
Wohlin, C., et al.: Experimentation in Software Engineering. Springer Science & Business Media, Berlin/New York (2012)
Juristo, N., Moreno, A.M.: Basics of Software Engineering Experimentation. Springer Science & Business Media, New York (2013)
Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. OSDI, vol. 16 (2016)
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)
Tsytsarau, M., Palpanas, T.: Survey on mining subjective data on the web. Data Min. Knowl. Disc. 24(3), 478–514 (2012)
Srisuan, J., Hanskunatai, A.: The ensemble of Naïve Bayes classifiers for hotel searching. In: 2014 International Computer Science and Engineering Conference (ICSEC). IEEE (2014)
Songpan, W.: The analysis and prediction of customer review rating using opinion mining. In: 2017 IEEE 15th International Conference on Software Engineering Research, Management and Applications (SERA). IEEE (2017)
Chaabani, Y., Toujani, R., Akaichi, J.: Sentiment analysis method for tracking touristics reviews in social media network. In: International Conference on Intelligent Interactive Multimedia Systems and Services. Springer, Cham (2017)
Hung, C.: Word of mouth quality classification based on contextual sentiment lexicons. Inf. Process. Manag. 53(4), 751–763 (2017)
Hu, Y.-H., Chen, Y.-L., Chou, H.-L.: Opinion mining from online hotel reviews–a text summarization approach. Inf. Process. Manag. 53(2), 436–449 (2017)
James, G., et al.: An Introduction to Statistical Learning, vol. 112. Springer, New York (2013)
Patro, V.M., Ranjan Patra, M.: Augmenting weighted average with confusion matrix to enhance classification accuracy. Transactions on Machine Learning and Artificial Intelligence 2(4), 77–91 (2014)
Makhtar, M., Neagu, D.C., Ridley, M.J.: Binary classification models comparison: on the similarity of datasets and confusion matrix for predictive toxicology applications. In: International Conference on Information Technology in Bio-and Medical Informatics. Springer, Berlin/Heidelberg (2011)
Caelen, O.: A Bayesian interpretation of the confusion matrix. Ann. Math. Artif. Intell. 81(3–4), 429–450 (2017)
Dwivedi, A.K.: Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Comput. Appl. 1–9, 29:685–693 (2016)
de Oliveira, N., Robert A., Junior, M.C.: Experimental analysis of stemming on jurisprudential documents retrieval. Information 9(2), 28 (2018)
Santos, B.S., Junior, M.C., de Souza, J.G.: A initial experimental evaluation of the neuromessenger: a collaborative tool to improve the empathy of text interactions. In: Latifi, S. (ed.) Information Technology-New Generations, pp. 411–419. Springer, Cham (2018)
Basili, V.R., Weiss, D.M.: A methodology for collecting valid software engineering data. IEEE Trans. Softw. Eng. 6, 728–738 (1984)
do Turismo, M.: Índice de Competividade do Turismo Nacional. Available in: https://goo.gl/c3agrm. Access in: 01 out (2018)
de Matos, J.K.E., Costa, M.A.N.: Sustentabilidade nos meios de hospedagem no Brasil: a norma NBR 15401: 2006. Congresso de Arquitetura, Turismo e Sustentabilidade, vol. 1 (2012)
John Lu, Z.Q.: The elements of statistical learning: data mining, inference, and prediction. J. R. Stat. Soc. A. Stat. Soc. 173(3), 693–694 (2010)
Travassos, G.H., Gurov, D., Amaral, E.A.G.G.: Introdução àǎ Engenharia de Software Experimental. UFRJ, Rio de Janeiro (Brazil) (2002)
Kluyver, T., et al.: Jupyter Notebooks-a publishing format for reproducible computational workflows. ELPUB (2016)
Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. OSDI, vol. 16 (2016)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Shapiro, S.S., Wilk, M.B.: An analysis of variance test for normality (complete samples). Biometrika 52(3/4), 591–611 (1965)
Field, A.: Descobrindo a Estatística Usando o SPSS-2. Bookman Editora, São Paulo (Brazil) (2009)
Levene, H.: Robust tests for equality of variances. In: Olkin, I., et al. (eds.) Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, pp. 279–292. Stanford University Press, Stanford (1961)
SPSS IBM.: IBM SPSS Statistics for Windows, Version 20.0. IBM Corp, New York (2011)
Litchfield, J.T. Jr., Wilcoxon, F.: A simplified method of evaluating dose-effect experiments. J. Pharmacol. Exp. Ther. 96(2), 99–113 (1949)
del Pino Medina Brito, M., dos Santos Júnior, A.: La gestión sostenible de los establecimientos de hospedaje de la provincia Sur de Brasil. I Foro Internacional de Turismo Maspalomas Costa Canaria. Universidad de Las Palmas de Gran Canaria (2013)
Hartmann, N., et al.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
de Oliveira Lima, T., Júnior, M.C., de J. Prado, K.H., dos S. Júnior, A. (2019). A Big Data Experiment to Assess the Effectiveness of Deep Learning Neural Networks in the Mining of Sustainable Aspects of the Hotels Clients Opinions. In: Latifi, S. (eds) 16th International Conference on Information Technology-New Generations (ITNG 2019). Advances in Intelligent Systems and Computing, vol 800. Springer, Cham. https://doi.org/10.1007/978-3-030-14070-0_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-14070-0_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14069-4
Online ISBN: 978-3-030-14070-0
eBook Packages: EngineeringEngineering (R0)