Abstract
Research project selection is an essential task for government and private agencies. When a huge number of research proposals are received, it is common to group them along with their similarities in research discipline areas. Then the grouped proposals are assigned to the appropriate experts for peer review. Existing approaches are not efficient to classify the document of project proposal correctly. Text-mining methods are used to solve the problem of classifying text documents automatically. In this paper, research proposals are classified based on the discipline areas and proposals in each discipline are grouped using the text-mining technique. Dice’s coefficient, Damerau–Levenshtein distance, Tversky index, Cosine similarity and Jaro–Winkler distance are used to find the similarity between the documents. The classification is used to predict the target class for each document proposal in the dataset accurately. In this work, Proposed Ensemble classifiers are used that contain various classifiers such as Kernel Support Vector Machine (KSVM), Self-Organizing Map (SOM), K-Nearest Neighbor (KNN) and Naïve Bayes. These are the four classification methods used for proposed work and get an accuracy of results.
Similar content being viewed by others
References
Arratia-Martinez NM, Caballero-Fernandez R, Litvinchev I, Lopez-Irarragorri F (2017) Research and development project portfolio selection under uncertainty. Journal of Ambient Intelligence and Humanized Computing, Springer, pp. 1–10
Darena F, Zizka J (2015) Interdependence of text mining quality and the input data preprocessing, artificial intelligence perspectives and applications, Springer, pp. 141–150
Estep J (2015) Development of a technology transfer score to inform the selection of a research proposal. IEEE 2015 Portland International Conference on Management of Engineering and Technology (PICMET), pp. 1754–1768
Hirankitti M, Mai TX (2012) A meta-logical approach for reasoning with an OWL 2 ontology. J Ambient Intell Humaniz Comput, Springer, 3(4):293–303
Horecki K, Mazurkiewicz J (2015) Natural language processing methods used for automatic prediction mechanism of related phenomenon. ICAISC 2015, Springer, pp 13–24
Jeno S, Arthi T, Lakshmipathi B (2014) Ontology based mining techniques for systematic allocation of project proposals to external reviewers. Int J Innovative Res Sci Eng Technol 3(3):1825–1830
Kumar M, Vig R (2012) Term-frequency inverse-document frequency definition semantic (TIDS) based focused web crawler. Global Trends in Information Systems and Software Applications, Springer, pp 31–36
Ma J, Xu W, Sun Y-H, Turban E, Wang S, Liu O (2012) An ontology-based text-mining method to cluster proposals for research project selection. IEEE Trans Syst Man Cybern Part A Syst Hum 42(3):784–790
Madhavan K, Chen X (2014) DIA2: web-based cyber infrastructure for visual analysis of funding portfolios. IEEE Trans Vis Comput Gr 20(2):1823–1832
Mardiana T, Adji T, Hidayah I (2015) The comparition of distance-based similarity measure to detection of plagiarism in Indonesian text: intelligence in the era of big data, Springer, pp 155–164
Materia VC, Pascucci S, Kolympiris C (2015), “Understanding the selection processes of public research projects in agriculture: The role of scientific merit”, Food Policy, Elsevier, pp. 87–99
Morita T, Fukuta N, Izumi N, Yamaguch T (2006) Doddle-owl: a domain ontology construction tool with owl, Asian Semantic Web Conference, Springer pp 537–551
Patil SS, Uddin SA (2015) Research paper selection based on an ontology and text mining technique using clustering. J Comput Eng 17(1):65–71
Raja K, Saravanakumar R (2014) Evaluating Data Reliability: An Evidential Answer with Application to a Web-Enabled Data Warehouse”. International Journal of Innovations in Scientific Engineering Research 1(5):359–365
Rathore DS, Jain RC, Ujjainiya B (2013) A text mining method for research project selection using KNN. IEEE 2013 International Conference on Green Computing, Communication and Conservation of Energy (ICGCE), pp. 900–904
Sergio Jimenez C, Becerra A, Gelbukh F Gonzalez (2009) Generalized Mongue-Elkan Method for Approximate Text String Comparison”. Computational Linguistics and Intelligent Text Processing, Springer, pp 559–570
Silva T, Guo Z, Ma J, Jiang H, Chen H (2013) A social network-empowered research analytics framework for project selection. Decision Support Systems, Elsevier, pp 957–968
Singh J, Kumar M (2011) A meta search approach to find similarity between web pages using different similarity measures. Advances in computing, communication and control, Springer, pp. 150–160
Sultani W, Zhang D (2017) Unsupervised action proposal ranking through proposal recombination. Computer vision and image understanding, Elsevier, pp 1–26
Tian Q, Ma J, Liang J, Kowk R, Liu O, Zhang Q (2005) An organizational decision support system for effective R&D project selection, Decision. Support System, Elsevier Volume. 39(3):403–413
Wang Y, Xu W, Jiang H (2015) Using text mining and clustering to group research proposals for research project selection. IEEE 2015 48th Hawaii International Conference on System Sciences, pp. 1256–1263
Xu W, Xu Y, Ma J (2013) An ontology based frequent item set method to support research proposal grouping for research project selection, IEEE 2013 46th Hawaii International Conference on System Sciences, pp. 1174–1182
Yun H, Xu X, Ii Z (2016) A LDA model based text-mining method to recommend reviewer for proposal of research project selection. IEEE 2016 13th International Conference on Service Systems and Service Management (ICSSSM), pp. 1–5
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Saravanan, R.A., Rajesh Babu, M. Enhanced text mining approach based on ontology for clustering research project selection. J Ambient Intell Human Comput (2017). https://doi.org/10.1007/s12652-017-0637-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12652-017-0637-7