Abstract
All research or development activities produce many kinds of outcome such as article, patent, research report, human resources information, application method for some equipment, experimental data and so on. The NTIS (National Science & Technology Information Service) in Korea offers a unified search service using national R&D outcomes data to researchers. But this function does not meet the academic requirements of users who want to use the relevance of papers, patents, research reports, etc. It is needs to display related documents together when a user stays in a page which offers detail metadata about one outcome, this helps users to diminish effort to search their interesting information. In this paper, we propose the method for similar document retrieval among heterogeneous kinds of R&D outcomes. A combination of user query and search factor extracted from the search engine are used to search some similar documents, and the boosting technology using the author field and subject code (S&T standard code) field is applied to document ranking process. We show usefulness of proposed method in this paper as developing the intelligent system of NTIS or many metadata search services.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Saracoglu, R., Tutuncu, K., Allahverdi, N.: A fuzzy clustering approach for finding similar documents using a novel similarity measure. Expert Systems with Applications 33(3), 600–605 (2007)
Chen, C.-M., Liu, D.-R.: Tree indexing for efficient search of similar documents. In: Computer Software and Applications Conference, pp. 210–211. IEEE Comput. Soc. (2000)
Fox, T.W.: Document vector compression and its application in document clustering. In: Canadian Conference on Computer Engineering, pp. 2029–2032. IEEE (2005)
FAST ESP User Manual
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media Dordrecht
About this paper
Cite this paper
Han, H., Choi, K., Kim, J., Choi, H. (2012). Similar Document Retrieval among the Different Kind of National R&D Outcomes. In: Park, J., Leung, V., Wang, CL., Shon, T. (eds) Future Information Technology, Application, and Service. Lecture Notes in Electrical Engineering, vol 179. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-5064-7_8
Download citation
DOI: https://doi.org/10.1007/978-94-007-5064-7_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-5063-0
Online ISBN: 978-94-007-5064-7
eBook Packages: EngineeringEngineering (R0)