Abstract
As the importance of knowledge creation gains recognition, knowledge maps are increasingly regarded as a critical tool for successful knowledge management. However, most methods of developing knowledge maps rely on unsystematic processes and on the judgment of domain experts, leaving a wide range of information untapped. This research therefore proposes a new approach that generates knowledge maps by mining document databases that have hardly been examined, enabling an automatic development process and the extraction of significant implications from the maps. To this end, the accepted research proposal database of the Korea Research Foundation (KRF), a large repository of research knowledge, is investigated to derive a keyword-based knowledge map. In the development process, text mining is used to extract meaningful information from documents, and network analysis is applied to visualize the relations between research categories and to measure network indices. Five types of knowledge maps (core R&D map, R&D trend map, R&D concentration map, R&D relation map, and R&D cluster map) are developed to explore the main research themes, monitor research trends, discover relations between R&D areas, regions, and universities, and derive clusters of research categories. The results can be used to establish policies that support promising R&D areas and to devise long-term research plans.
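The pipeline summarized in the abstract (keyword extraction followed by network analysis over research categories) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy proposals, the stopword list, the shared-keyword linking rule, and all function names are assumptions made for illustration, and degree centrality stands in for whichever network indices the paper actually measures.

```python
from collections import Counter
from itertools import combinations
import re

# Hypothetical toy data: (research category, proposal text). Not KRF data.
PROPOSALS = [
    ("Materials", "Nanotube synthesis for battery electrodes"),
    ("Energy", "Battery electrodes with improved storage capacity"),
    ("Biology", "Protein folding simulation methods"),
    ("Computing", "Simulation methods for protein structure prediction"),
]

# Assumed stopword list, kept tiny for the example.
STOPWORDS = {"for", "with", "and", "the", "of", "in"}


def extract_keywords(text):
    """Lowercase the text, tokenize on letters, and drop stopwords."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def category_keywords(proposals):
    """Merge the keyword sets of all proposals in each category."""
    merged = {}
    for category, text in proposals:
        merged.setdefault(category, set()).update(extract_keywords(text))
    return merged


def relation_edges(keywords_by_category, min_shared=1):
    """Link two categories when they share at least min_shared keywords.

    The edge weight is the number of shared keywords (an assumed rule).
    """
    edges = {}
    for a, b in combinations(sorted(keywords_by_category), 2):
        shared = keywords_by_category[a] & keywords_by_category[b]
        if len(shared) >= min_shared:
            edges[(a, b)] = len(shared)
    return edges


def degree_centrality(edges, nodes):
    """Normalized degree: number of neighbors divided by (n - 1)."""
    degree = Counter()
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    n = len(nodes)
    return {node: (degree[node] / (n - 1) if n > 1 else 0.0) for node in nodes}


keywords = category_keywords(PROPOSALS)
edges = relation_edges(keywords)
centrality = degree_centrality(edges, list(keywords))
```

On this toy input, `edges` links Biology with Computing and Energy with Materials, which is the kind of structure a relation map would visualize. A real application would need proper keyword weighting and a much larger corpus.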
Acknowledgments
This research was supported by the Basic Science Research Program through the National Research Foundation (NRF) and funded by the Ministry of Education, Science, and Technology (Grant No. 2009-0073285).
Cite this article
Yoon, B., Lee, S. & Lee, G. Development and application of a keyword-based knowledge map for effective R&D planning. Scientometrics 85, 803–820 (2010). https://doi.org/10.1007/s11192-010-0294-5
DOI: https://doi.org/10.1007/s11192-010-0294-5