Skip to main content

A Big Data Experiment to Assess the Effectiveness of Deep Learning Neural Networks in the Mining of Sustainable Aspects of the Hotels Clients Opinions

  • Conference paper
  • First Online:
16th International Conference on Information Technology-New Generations (ITNG 2019)

Abstract

Context: Opinions given by hotel clients in tourism social networks, the ones which can be a great source of knowledge extraction in the Big Data context, including the sustainable aspects of the hotels clients opinions. Objective: Evaluate performance and quality of deep learning neural networks, especially the Target-Connection LSTM (TC-LSTM) and Attention-based LSTM (AT-LSTM) algorithms, aiming to mine and classify the opinions posted on the TripAdvisor and Booking social networks, by considering sustainability aspects. Method: A controlled experiment to compare the efficiency and efficacy of the classifiers was carried out. Results: The AT-LSTM algorithm presented the best results, especially in terms of accuracy, precision, f-measure, average training time and average classification time. The first with 74,58%, the second with 95,54%, the third with 85,37%, then fourth with 7,3 s and the last one with 1,12 s. Conclusion: The AT-LSTM algorithm was expressly more effective than TC-LSTM, making it an option to be considered for mining opinions based on specific aspects of tourism and peculiar market niches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hartmann, N., et al.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017)

    Google Scholar 

  2. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of International Conference on Learning Representations Workshop (ICLR-2013) (2013)

    Google Scholar 

  3. Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empiricial Methods in Natural Language Processing (EMNLP-2014), vol. 12, pp. 1532–1543 (2014)

    Google Scholar 

  4. Greff, K., et al.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28(10), 2222–2232 (2017)

    Article  MathSciNet  Google Scholar 

  5. Mikolov, T., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

    Google Scholar 

  6. Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. arXiv preprint arXiv:1503.00075 (2015)

    Google Scholar 

  7. Tang, D., et al.: Effective LSTMs for target-dependent sentiment classification. arXiv preprint arXiv:1512.01100 (2015)

    Google Scholar 

  8. Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)

    Google Scholar 

  9. Ling, W., Dyer, C., Black, A., Trancoso, I.: Two/too simple adaptations of word2vec for syntax problems. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics (2015)

    Google Scholar 

  10. Severino, A.J.: Metodologia do trabalho científico. Cortez editora (2017)

    Google Scholar 

  11. Lima, T., Junior, M.C., Augusta, M., Nunes, S.N.: Mining on line general opinions about sustainability of hotels: a systematic literature mapping. In: International Conference on Computational Science and Its Applications. Springer, Cham (2018)

    Google Scholar 

  12. Wang, Y., Huang, M., Zhao, L.: Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (2016)

    Google Scholar 

  13. Young, T., et al.: Recent trends in deep learning based natural language processing. arXiv preprint arXiv:1708.02709 (2017) 18

    Google Scholar 

  14. Wohlin, C., et al.: Experimentation in Software Engineering. Springer Science & Business Media, Berlin/New York (2012)

    Google Scholar 

  15. Juristo, N., Moreno, A.M.: Basics of Software Engineering Experimentation. Springer Science & Business Media, New York (2013)

    MATH  Google Scholar 

  16. Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. OSDI, vol. 16 (2016)

    Google Scholar 

  17. Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)

    Article  Google Scholar 

  18. Tsytsarau, M., Palpanas, T.: Survey on mining subjective data on the web. Data Min. Knowl. Disc. 24(3), 478–514 (2012)

    Article  Google Scholar 

  19. Srisuan, J., Hanskunatai, A.: The ensemble of Naïve Bayes classifiers for hotel searching. In: 2014 International Computer Science and Engineering Conference (ICSEC). IEEE (2014)

    Google Scholar 

  20. Songpan, W.: The analysis and prediction of customer review rating using opinion mining. In: 2017 IEEE 15th International Conference on Software Engineering Research, Management and Applications (SERA). IEEE (2017)

    Google Scholar 

  21. Chaabani, Y., Toujani, R., Akaichi, J.: Sentiment analysis method for tracking touristics reviews in social media network. In: International Conference on Intelligent Interactive Multimedia Systems and Services. Springer, Cham (2017)

    Google Scholar 

  22. Hung, C.: Word of mouth quality classification based on contextual sentiment lexicons. Inf. Process. Manag. 53(4), 751–763 (2017)

    Article  Google Scholar 

  23. Hu, Y.-H., Chen, Y.-L., Chou, H.-L.: Opinion mining from online hotel reviews–a text summarization approach. Inf. Process. Manag. 53(2), 436–449 (2017)

    Article  Google Scholar 

  24. James, G., et al.: An Introduction to Statistical Learning, vol. 112. Springer, New York (2013)

    Book  Google Scholar 

  25. Patro, V.M., Ranjan Patra, M.: Augmenting weighted average with confusion matrix to enhance classification accuracy. Transactions on Machine Learning and Artificial Intelligence 2(4), 77–91 (2014)

    Article  Google Scholar 

  26. Makhtar, M., Neagu, D.C., Ridley, M.J.: Binary classification models comparison: on the similarity of datasets and confusion matrix for predictive toxicology applications. In: International Conference on Information Technology in Bio-and Medical Informatics. Springer, Berlin/Heidelberg (2011)

    Google Scholar 

  27. Caelen, O.: A Bayesian interpretation of the confusion matrix. Ann. Math. Artif. Intell. 81(3–4), 429–450 (2017)

    Article  MathSciNet  Google Scholar 

  28. Dwivedi, A.K.: Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Comput. Appl. 1–9, 29:685–693 (2016)

    Google Scholar 

  29. de Oliveira, N., Robert A., Junior, M.C.: Experimental analysis of stemming on jurisprudential documents retrieval. Information 9(2), 28 (2018)

    Article  Google Scholar 

  30. Santos, B.S., Junior, M.C., de Souza, J.G.: A initial experimental evaluation of the neuromessenger: a collaborative tool to improve the empathy of text interactions. In: Latifi, S. (ed.) Information Technology-New Generations, pp. 411–419. Springer, Cham (2018)

    Chapter  Google Scholar 

  31. Basili, V.R., Weiss, D.M.: A methodology for collecting valid software engineering data. IEEE Trans. Softw. Eng. 6, 728–738 (1984)

    Article  Google Scholar 

  32. do Turismo, M.: Índice de Competividade do Turismo Nacional. Available in: https://goo.gl/c3agrm. Access in: 01 out (2018)

  33. de Matos, J.K.E., Costa, M.A.N.: Sustentabilidade nos meios de hospedagem no Brasil: a norma NBR 15401: 2006. Congresso de Arquitetura, Turismo e Sustentabilidade, vol. 1 (2012)

    Google Scholar 

  34. John Lu, Z.Q.: The elements of statistical learning: data mining, inference, and prediction. J. R. Stat. Soc. A. Stat. Soc. 173(3), 693–694 (2010)

    Article  Google Scholar 

  35. Travassos, G.H., Gurov, D., Amaral, E.A.G.G.: Introdução àǎ Engenharia de Software Experimental. UFRJ, Rio de Janeiro (Brazil) (2002)

    Google Scholar 

  36. Kluyver, T., et al.: Jupyter Notebooks-a publishing format for reproducible computational workflows. ELPUB (2016)

    Google Scholar 

  37. Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. OSDI, vol. 16 (2016)

    Google Scholar 

  38. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  39. Shapiro, S.S., Wilk, M.B.: An analysis of variance test for normality (complete samples). Biometrika 52(3/4), 591–611 (1965)

    Article  MathSciNet  Google Scholar 

  40. Field, A.: Descobrindo a Estatística Usando o SPSS-2. Bookman Editora, São Paulo (Brazil) (2009)

    Google Scholar 

  41. Levene, H.: Robust tests for equality of variances. In: Olkin, I., et al. (eds.) Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, pp. 279–292. Stanford University Press, Stanford (1961)

    Google Scholar 

  42. SPSS IBM.: IBM SPSS Statistics for Windows, Version 20.0. IBM Corp, New York (2011)

    Google Scholar 

  43. Litchfield, J.T. Jr., Wilcoxon, F.: A simplified method of evaluating dose-effect experiments. J. Pharmacol. Exp. Ther. 96(2), 99–113 (1949)

    Google Scholar 

  44. del Pino Medina Brito, M., dos Santos Júnior, A.: La gestión sostenible de los establecimientos de hospedaje de la provincia Sur de Brasil. I Foro Internacional de Turismo Maspalomas Costa Canaria. Universidad de Las Palmas de Gran Canaria (2013)

    Google Scholar 

  45. Hartmann, N., et al.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

de Oliveira Lima, T., Júnior, M.C., de J. Prado, K.H., dos S. Júnior, A. (2019). A Big Data Experiment to Assess the Effectiveness of Deep Learning Neural Networks in the Mining of Sustainable Aspects of the Hotels Clients Opinions. In: Latifi, S. (eds) 16th International Conference on Information Technology-New Generations (ITNG 2019). Advances in Intelligent Systems and Computing, vol 800. Springer, Cham. https://doi.org/10.1007/978-3-030-14070-0_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-14070-0_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-14069-4

  • Online ISBN: 978-3-030-14070-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics