Abstract
This paper proposes a technique to filter tourism websites and datamining to analyze keywords to improve searching on tourism websites as Search Engine Optimization (SEO). This work focuses on tourist sites or attractions, hotel or accommodation, and restaurant in the tourist provinces as a core keywords. Content in websites is retrieved from Google with queries of 11 Thai famous tourism provinces. From all retrieved results, filtering using Naïve Bayes algorithm with Boundary Values is performed to detect only relevant content as 6,171 filtered websites (66.55%) from 9,273 retrieved websites. From keyword analysis method, we compared three methods including (1) keywords from Apriori algorithm, (2) keywords from frequent terms within websites, and (3) keywords by frequency of terms from the ontology. The experiment results are conclusive that keywords from frequent terms within websites performed best. The keywords are usable to customize the websites to improve search ranking.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Cao, T., Nguyen, Q.: Semantic approach to travel information search and Itinerary recommendation. Int. J. Web Inf. Syst. 8(3), 256–277 (2012)
The Electronic Transactions Development Agency (Public Organization).: Thailand Internet User Profile 2017. Retrieved 27 March 2018 from https://www.aripfan.com/thailand-internet-user-profile-2017 (2017) (in Thai)
Lu, P., Cong, X.: The research on webpage ranking algorithm based on topic-expert documents. Adv. Intell. Syst. Comput. 361, 195–204 (2015)
Deka, R.: Increasing website visibility using search engine optimization. Int. J. Eng. Technol. Sci. Res. 1(1), 18–22 (2014)
Vignesh, J., Deepa, V.: Search engine optimization to increase website visibility. Int. J. Sci. Res. 3(2), 425–430 (2014)
Jain, A., Sharma, S.: An efficient keyword driven test automation framework for web application. Int. J. Eng. Sci. Adv. Technol. 2(3), 600–604 (2012)
Ozbal, G., Pighin, D., Strapparava, C.: Automation and evaluation of the keyword method for second language learning. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Baltimore, USA, pp. 352–357 (2014)
Theparat, C.: Tourism to continue growth spurt in 2017 (2017). Retrieved 27 Mar 2018 from https://www.bangkokpost.com/business/tourism-and-transport/1199925/tourism-to-continue-growth-spurt-in-2017
Panawong, N., Namahoot, C.S., Brueckner, M.: Classification of tourism web with modified Naïve Bayes algorithm. Adv. Mater. Res. 931–932, 1360–1364 (2014)
Namahoot, C.S., Lobo, D., Kabbua, S.: Enhancement of a text clustering technique for the classification of Thai tourism websites. In: Proceedings of Computer Science and Engineering Conference (ICSEC), IEEE, Khon Kaen, Thailand, pp. 203–208 (2014)
Tosqui-Lucks, P., Silva, B.C.D.D.: Structuring an ontology of the basic vocabulary of tourism. Int. J. Inf. Educ. Technol. 2(4), 221–334 (2012)
Khan, A., Baharudin, B., Lee, L.H., Khan, K.: A review of machine learning algorithms for text-documents classification. J. Adv. Inf. Technol. 1(1), 4–20 (2010)
Namahoot, C.S., Brueckner, M., Panawong, N.: Context-aware tourism recommender system using temporal ontology and naïve bayes. Adv. Intell. Syst. Comput. 361, 183–194 (2015)
Hashimi, H., Hafez, A., Mathkour, H.: Selection criteria for text mining approaches. Comput. Hum. Behav. 51, 729–733 (2015)
Noh, H., Jo, Y., Lee, S.: Keyword selection and processing strategy for applying text mining to patent analysis. Expert Syst. Appl. 42, 4348–4360 (2015)
Berezina, K., Bilgihan, A., Cobanoglu, C., Okumus, F.: Understanding satisfied and dissatisfied hotel customers: text mining of online hotel reviews. J. Hospitality Market. Manag. 25(1), 1–24 (2015)
Wasilewska, A.: APRIORI algorithm (2016). Retrieved 2 Apr 2018 from http://www.cs.sunysb.edu/~cse634/lecture_notes/07apriori.pdf
Lemnitzer, L., Monachesi, P.: Extraction and evaluation of keywords from learning objects—a multilingual approach. In: Proceedings of the Language Resources and Evaluation Conference (LREC 2008), Morocco, pp. 1–8 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Panawong, N., Sittisaman, A. (2019). Tourism Web Filtering and Analysis Using Naïve Bay with Boundary Values and Text Mining. In: Bhatia, S., Tiwari, S., Mishra, K., Trivedi, M. (eds) Advances in Computer Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 924. Springer, Singapore. https://doi.org/10.1007/978-981-13-6861-5_46
Download citation
DOI: https://doi.org/10.1007/978-981-13-6861-5_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6860-8
Online ISBN: 978-981-13-6861-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)