Skip to main content

Tourism Web Filtering and Analysis Using Naïve Bay with Boundary Values and Text Mining

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 924))

Abstract

This paper proposes a technique to filter tourism websites and datamining to analyze keywords to improve searching on tourism websites as Search Engine Optimization (SEO). This work focuses on tourist sites or attractions, hotel or accommodation, and restaurant in the tourist provinces as a core keywords. Content in websites is retrieved from Google with queries of 11 Thai famous tourism provinces. From all retrieved results, filtering using Naïve Bayes algorithm with Boundary Values is performed to detect only relevant content as 6,171 filtered websites (66.55%) from 9,273 retrieved websites. From keyword analysis method, we compared three methods including (1) keywords from Apriori algorithm, (2) keywords from frequent terms within websites, and (3) keywords by frequency of terms from the ontology. The experiment results are conclusive that keywords from frequent terms within websites performed best. The keywords are usable to customize the websites to improve search ranking.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Cao, T., Nguyen, Q.: Semantic approach to travel information search and Itinerary recommendation. Int. J. Web Inf. Syst. 8(3), 256–277 (2012)

    Article  Google Scholar 

  2. The Electronic Transactions Development Agency (Public Organization).: Thailand Internet User Profile 2017. Retrieved 27 March 2018 from https://www.aripfan.com/thailand-internet-user-profile-2017 (2017) (in Thai)

  3. Lu, P., Cong, X.: The research on webpage ranking algorithm based on topic-expert documents. Adv. Intell. Syst. Comput. 361, 195–204 (2015)

    Google Scholar 

  4. Deka, R.: Increasing website visibility using search engine optimization. Int. J. Eng. Technol. Sci. Res. 1(1), 18–22 (2014)

    Google Scholar 

  5. Vignesh, J., Deepa, V.: Search engine optimization to increase website visibility. Int. J. Sci. Res. 3(2), 425–430 (2014)

    Google Scholar 

  6. Jain, A., Sharma, S.: An efficient keyword driven test automation framework for web application. Int. J. Eng. Sci. Adv. Technol. 2(3), 600–604 (2012)

    Google Scholar 

  7. Ozbal, G., Pighin, D., Strapparava, C.: Automation and evaluation of the keyword method for second language learning. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Baltimore, USA, pp. 352–357 (2014)

    Google Scholar 

  8. Theparat, C.: Tourism to continue growth spurt in 2017 (2017). Retrieved 27 Mar 2018 from https://www.bangkokpost.com/business/tourism-and-transport/1199925/tourism-to-continue-growth-spurt-in-2017

  9. Panawong, N., Namahoot, C.S., Brueckner, M.: Classification of tourism web with modified Naïve Bayes algorithm. Adv. Mater. Res. 931–932, 1360–1364 (2014)

    Article  Google Scholar 

  10. Namahoot, C.S., Lobo, D., Kabbua, S.: Enhancement of a text clustering technique for the classification of Thai tourism websites. In: Proceedings of Computer Science and Engineering Conference (ICSEC), IEEE, Khon Kaen, Thailand, pp. 203–208 (2014)

    Google Scholar 

  11. Tosqui-Lucks, P., Silva, B.C.D.D.: Structuring an ontology of the basic vocabulary of tourism. Int. J. Inf. Educ. Technol. 2(4), 221–334 (2012)

    Google Scholar 

  12. Khan, A., Baharudin, B., Lee, L.H., Khan, K.: A review of machine learning algorithms for text-documents classification. J. Adv. Inf. Technol. 1(1), 4–20 (2010)

    Google Scholar 

  13. Namahoot, C.S., Brueckner, M., Panawong, N.: Context-aware tourism recommender system using temporal ontology and naïve bayes. Adv. Intell. Syst. Comput. 361, 183–194 (2015)

    Google Scholar 

  14. Hashimi, H., Hafez, A., Mathkour, H.: Selection criteria for text mining approaches. Comput. Hum. Behav. 51, 729–733 (2015)

    Article  Google Scholar 

  15. Noh, H., Jo, Y., Lee, S.: Keyword selection and processing strategy for applying text mining to patent analysis. Expert Syst. Appl. 42, 4348–4360 (2015)

    Article  Google Scholar 

  16. Berezina, K., Bilgihan, A., Cobanoglu, C., Okumus, F.: Understanding satisfied and dissatisfied hotel customers: text mining of online hotel reviews. J. Hospitality Market. Manag. 25(1), 1–24 (2015)

    Google Scholar 

  17. Wasilewska, A.: APRIORI algorithm (2016). Retrieved 2 Apr 2018 from http://www.cs.sunysb.edu/~cse634/lecture_notes/07apriori.pdf

  18. Lemnitzer, L., Monachesi, P.: Extraction and evaluation of keywords from learning objects—a multilingual approach. In: Proceedings of the Language Resources and Evaluation Conference (LREC 2008), Morocco, pp. 1–8 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naruepon Panawong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Panawong, N., Sittisaman, A. (2019). Tourism Web Filtering and Analysis Using Naïve Bay with Boundary Values and Text Mining. In: Bhatia, S., Tiwari, S., Mishra, K., Trivedi, M. (eds) Advances in Computer Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 924. Springer, Singapore. https://doi.org/10.1007/978-981-13-6861-5_46

Download citation

Publish with us

Policies and ethics