, Volume 87, Issue 3, pp 425–450 | Cite as

Research trends analysis by comparing data mining and customer relationship management through bibliometric methodology

  • Hsu-Hao TsaiEmail author


There are few comprehensive studies and categorization schemes to discuss the characteristics for both data mining and customer relationship management (CRM) although they have already become more important recently. Using a bibliometric approach, this paper analyzes data mining and CRM research trends from 1989 to 2009 by locating headings “data mining” and “customer relationship management” or “CRM” in topics in the SSCI database. The bibliometric analytical technique was used to examine these two topics in SSCI journals from 1989 to 2009, we found 1181 articles with data mining and 1145 articles with CRM. This paper implemented and classified data mining and CRM articles using the following eight categories—publication year, citation, country/territory, document type, institute name, language, source title and subject area—for different distribution status in order to explore the differences and how data mining and CRM technologies have developed in this period and to analyze data mining and CRM technology tendencies under the above result. Also, the paper performs the K–S test to check whether the analysis follows Lotka’s law. The research findings can be extended to investigate author productivity by analyzing variables such as chronological and academic age, number and frequency of previous publications, access to research grants, job status, etc. In such a way characteristics of high, medium and low publishing activity of authors can be identified. Besides, these findings will also help to judge scientific research trends and understand the scale of development of research in data mining and CRM through comparing the increases of the article author. Based on the above information, governments and enterprises may infer collective tendencies and demands for scientific researcher in data mining and CRM to formulate appropriate training strategies and policies in the future. This analysis provides a roadmap for future research, abstracts technology trends and facilitates knowledge accumulations so that data mining and CRM researchers can save some time since core knowledge will be concentrated in core categories. This implies that the phenomenon “success breeds success” is more common in higher quality publications.


Data mining Customer relationship management Research trend analysis Bibliometric methodology 


  1. Adam, D. (2002). The counting house. Nature, 415, 726–729.Google Scholar
  2. Ahmed, S. R. (2004). Effectiveness of neural network types for prediction of business failure. Information Technology: Coding and Computing, 2, 455–459.Google Scholar
  3. Berry, M. J. A., & Linoff, G. S. (2004). Data mining techniques second edition—for marketing, sales, and customer relationship management. New York: Wiley.Google Scholar
  4. Berson, A., Smith, S., & Thearling, K. (2000). Building data mining applications for CRM. New York: McGraw-Hill.Google Scholar
  5. Bortiz, J. E., & Kennedy, D. B. (1995). Effectiveness of neural network types for prediction of business failure. Expert Systems with Applications, 9, 503–512.CrossRefGoogle Scholar
  6. Brachman, R. J., Khabaza, T., Kloesgen, W., Piatetsky-Shapiro, G., & Simoudis, E. (1996). Mining business databases. Communication of the ACM, 39(11), 42–48.CrossRefGoogle Scholar
  7. Broadus, R. N. (1987). Toward a definition of bibliometrics. Scientometrics, 12(5–6), 373–379.CrossRefGoogle Scholar
  8. Chen, M. S., Han, J., & Yu, P. S. (1996). Data mining: an overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering, 8(6), 866–883.CrossRefGoogle Scholar
  9. Coille, R. C. (1977). Lotka’s frequency distribution of scientific productivity. Journal of American Society for Information Science, 28, 366–370.CrossRefGoogle Scholar
  10. Eustace, R., & Merrington, G. (1995). A probabilistic neural network approach to jet engine fault diagnosis. In Proceedings of the 8th international conference on industrial and engineering applications of artificial intelligence and expert systems (pp. 67–76) Melbourne, Australia.Google Scholar
  11. Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996a). From data mining to knowledge discovery: an overview. In: Advances in Knowledge Discovery and Data Mining (pp. 1–34). California: American Association for Artificial Intelligence.Google Scholar
  12. Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996b). The KDD process for extracting useful knowledge from volumes of data. Communications of the ACM, 39(11), 27–34.CrossRefGoogle Scholar
  13. Fletcher, D., & Goss, E. (1993). Forecasting with neural networks: An application using bankruptcy data. Information and Management, 24(3), 159–167.CrossRefGoogle Scholar
  14. Gupta, D. K. (1987). Lotka’s law and productivity of entomological research in Nigeria for the period 1900–1973. Scientometrics, 12, 33–46.CrossRefGoogle Scholar
  15. He, Z., Xu, X., Huang, J. Z., & Deng, S. (2004). Mining class outliers: Concepts, algorithms and applications in CRM. Expert Systems with Applications, 27, 681–697.CrossRefGoogle Scholar
  16. Kincaid, J. W. (2003). Customer relationship management: Getting it right. New Jersey: Prentice Hall.Google Scholar
  17. Langley, P., & Simon, H. A. (1995). Applications of machine learning and rule induction. Communication of the ACM, 38(11), 54–64.CrossRefGoogle Scholar
  18. Lau, H. C. W., Wong, C. W. Y., Hui, I. K., & Pun, K. F. (2003). Design and implementation of an integrated knowledge system. Knowledge-Based Systems, 16, 69–76.CrossRefGoogle Scholar
  19. Lejeune, M. A. P. M. (2001). Measuring the impact of data mining on churn management. Internet Research: Electronic Networking Applications and Policy, 11, 375–387.CrossRefGoogle Scholar
  20. Ling, R., & Yen, D. C. (2001). Customer relationship management: An analysis framework and implementation strategies. Journal of Computer Information Systems, 41, 82–97.Google Scholar
  21. Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317–324.Google Scholar
  22. Moed, H. F., & Van Leeuwen, Th. N. (1995). Improving the accuracy of the institute for scientific information’s journal impact factors. Journal of the American Society for Information Science, 46, 461–467.CrossRefGoogle Scholar
  23. Ngai, E. W. T. (2005). Customer relationship management research (1992–2002): An academic literature review and classification. Marketing Intelligence & Planning, 23, 582–605.CrossRefGoogle Scholar
  24. Nicholls, P. T. (1989). Bibliometric modeling processes and empirical validity of Lotka’s law. Journal of American Society for Information Science, 40(6), 379–385.CrossRefGoogle Scholar
  25. Pao, M. L. (1985). Lotka’s law, a testing procedure. Information Processing and Management, 21, 305–320.CrossRefGoogle Scholar
  26. Pao, M. L. (1989). Concept of information retrieve. Colorado: Libraries Unlimited.Google Scholar
  27. Parvatiyar, A., & Sheth, J. N. (2001). Customer relationship management: Emerging practice, process, and discipline. Journal of Economic & Social Research, 3, 1–34.Google Scholar
  28. Potter, W. G. (1981). Lotka’s law revisited. Library Trends, 30(1), 21–39.Google Scholar
  29. Potter, W. G. (1988). ‘Of making many books there is no end’: Bibliometrics and libraries. The Journal of Academic Librarianship, 14, 238a–238c.Google Scholar
  30. Pritchard, A. (1969). Statistical bibliography or bibliometrics. Journal of Documentation, 25(4), 348–349.MathSciNetGoogle Scholar
  31. Rao, I. K. R. (1980). The distribution of scientific productivity and social change. Journal of American Society for Information Science, 31, 111–122.CrossRefGoogle Scholar
  32. Salchenberger, L. M., Cinar, E. M., & Lash, N. A. (1992). Neural networks: A new tool for predicting thrift failures. Decision Sciences, 23, 899–916.CrossRefGoogle Scholar
  33. Su, C. T., Hsu, H. H., & Tsai, C. H. (2002). Knowledge mining from trained neural networks. Journal of Computer Information Systems, 42, 61–70.Google Scholar
  34. Swift, R. S. (2001). Accelerating customer relationships: Using CRM and relationship technologies. New Jersey: Prentice Hall.Google Scholar
  35. Tam, K. Y., & Kiang, M. Y. (1992). Managerial applications of neural networks: The case of bank failure predictions. Management Science, 38, 926–947.zbMATHCrossRefGoogle Scholar
  36. Teo, T. S. H., Devadoss, P., & Pan, S. L. (2006). Towards a holistic perspective of customer relationship management implementation: A case study of the housing and development board, Singapore. Decision Support Systems, 42, 1613–1627.Google Scholar
  37. Tsai, H. H., & Chi, Y. P. (2011). Trend analysis of supply chain management by bibliometric methodology. International Journal of Digital Content Technology and its Applications, 5(1), 285–295.Google Scholar
  38. Tsai, H. H., & Chiang, J. K. (2011). E-commerce research trend forecasting: A study of bibliometric methodology. International Journal of Digital Content Technology and its Applications, 5(1), 101–111.Google Scholar
  39. Tsai, H. H., & Yang, J. M. (2010). Analysis of knowledge management trend by bibliometric approach. In Proceeding(s) of the WASET on knowledge management (Vol. 62, pp. 174–178). Google Scholar
  40. Turban, E., Aronson, J. E., Liang, T. P., & Sharda, R. (2007). Decision support and business intelligence systems (8th ed.). Taiwan: Pearson Education.Google Scholar
  41. Van Raan, A. F. J. (1996). Advanced bibliometric methods as quantitative core of peer review based evaluation and foresight exercises. Scientometrics, 36, 397–420.CrossRefGoogle Scholar
  42. Van Raan, A. F. J. (2000). The Pandora’s box of citation analysis: Measuring scientific excellence, the last evil? In B. Cronin & H. B. Atkins (Eds.), The web of knowledge. A festschrift in honor of Eugene Garfield. Chap. 15 (pp. 301–319). New Jersey: ASIS Monograph Series.Google Scholar
  43. Van Raan, A. F. J., & Van Leeuwen, Th. N. (2002). Assessment of the scientific basis of interdisciplinary, applied research. Application of bibliometric methods in nutrition and food research. Research Policy, 31, 611–632.CrossRefGoogle Scholar
  44. Vlachy, J. (1978). Frequency distribution of scientific performance: A bibliography of Lotka’s law and related phenomena. Scientometrics, 1, 109–130.Google Scholar
  45. Weingart, P. (2003), Evaluation of research performance: the danger of numbers. In: Bibliometric analysis in science and research. Applications, benefits and limitations. Second conference of the Central Library (pp. 7–19) Forschungszentrum Jülich.Google Scholar
  46. Weingart, P. (2004). Impact of bibliometrics upon the science system: inadvertent consequences? In H. F. Moed, W. Glanzel, & U. Schmoch (Eds.), Handbook on quantitative science and technology research. The Netherlands: Kluwer Academic Publishers.Google Scholar
  47. Zhang, G., Hu, M. Y., Patuwo, B. E., & Indro, D. C. (1999). Artificial neural networks in bankruptcy prediction: General framework and cross validation analysis. European Journal of Operational Research, 116, 16–32.zbMATHCrossRefGoogle Scholar

Copyright information

© Akadémiai Kiadó, Budapest, Hungary 2011

Authors and Affiliations

  1. 1.Department of Management Information SystemNational Chengchi UniversityWenshan District, Taipei CityTaiwan, R.O.C.

Personalised recommendations