Mining Tourist Preferences with Twice-Learning
Data mining techniques have been recognized as powerful tools for predictive modeling tourist decision-making process. However, two practical yet important problems have not been resolved by the data miners in empirical tourism research. Firstly, comprehensibility-the role of the data mining should not only generate accurate predictions, but also provide insights why certain prediction is made. But most widely used data mining methods that can generalize well are black-box in nature and can provide little information on the tourist decision-making facts. Secondly, the lack of training samples-it is usually rather difficult to collect enough training samples through surveying the tourist on site, especially for surveying the tourist’s decision-making facts. Many data mining methods may not achieve satisfactory performance if learned on small data set. In this paper, we show that these two problems can be addressed simultaneously using a twice-learning framework on the travel preference data. The results indicate that by addressing these two problems properly, we can predict tourist preferences accurately as well as extracting meaningful insights which would be useful for tourism marketing.
KeywordsTarget Concept Gain Ratio Data Mining Method Data Mining Approach Neural Network Ensemble
Unable to display preview. Download preview PDF.
- 8.Pai, P.F., Hong, W.C., Chang, P.T., Chen, C.T.: The application of support vector machines to forecast tourist arrivals in Barbados: An empirical study. International Journal of Management 23, 375–385 (2006)Google Scholar
- 10.Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann, San Mateo (1993)Google Scholar
- 13.Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)Google Scholar
- 16.World Tourism Organization. Chinese outbound tourism, Madrid (2003)Google Scholar