Analysis of the prediction capability of web search data based on the HE-TDC method ‒ prediction of the volume of daily tourism visitors
Web search query data are obtained to reflect social spots and serve as novel economic indicators. When faced with high-dimensional query data, selecting keywords that have plausible predictive ability and can reduce dimensionality is critical. This paper presents a new integrative method that combines Hurst Exponent (HE) and Time Difference Correlation (TDC) analysis to select keywords with powerful predictive ability. The method is called the HE-TDC screening method and requires keywords with predictive ability to satisfy two characteristics, namely, high correlation and fluctuation memorability similar to the predicting target series. An empirical study is employed to predict the volume of tourism visitors in the Jiuzhai Valley scenic area. The study shows that keywords selected using HE-TDC method produce a model with better robustness and predictive ability.
KeywordsTourism visitor volume prediction web-search data HE-TDC method Jiuzhai Valley time series Hurst exponent
Unable to display preview. Download preview PDF.
- Brynjolfsson, E., Geva, T. & Reichman, S. (2015). Crowd-squared: amplifying the predictive power of search trend data. MIS Quarterly (Forthcoming). Available at http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2513559. Cited January 18, 2016.Google Scholar
- CNNIC. (2014). Statistical Report on the Development of China Internet Network in the Thirty-Fifth Time. China Internet Network Information Center. Available at http://www.cac.gov.cn/cnnic35fzzktjbg.htm. Cited March 1st, 2015.Google Scholar
- Lazer, D., Kennedy, R., King, G. & Vespignani, A. (2014). Big data. The parable of Google flu: traps in big data analysis. Science (NY), 343(6176): 1203.Google Scholar
- Peng, G. & Wang, J.Y. (2014). Detecting syphilis amount in China based on Baidu query data. In: International Conference on Soft Computing in Information Communication Technology (SCICT 2014), Atlantis Press.Google Scholar
- Wang, J.Y., Peng, G. & Dai, W. (2014). Prediction of online trade growth using search-ANFIS: transactions on Taobao as examples. In: 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), July 6-11, 2014, Beijing, China.Google Scholar
- Wu, L. & Brynjolfsson, E. (2014). The future of prediction: how Google searches foreshadow housing prices and sales. Economics of Digitization, University of Chicago Press.Google Scholar