Skip to main content
Log in

Enhanced text mining approach based on ontology for clustering research project selection

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

Research project selection is an essential task for government and private agencies. When a huge number of research proposals are received, it is common to group them along with their similarities in research discipline areas. Then the grouped proposals are assigned to the appropriate experts for peer review. Existing approaches are not efficient to classify the document of project proposal correctly. Text-mining methods are used to solve the problem of classifying text documents automatically. In this paper, research proposals are classified based on the discipline areas and proposals in each discipline are grouped using the text-mining technique. Dice’s coefficient, Damerau–Levenshtein distance, Tversky index, Cosine similarity and Jaro–Winkler distance are used to find the similarity between the documents. The classification is used to predict the target class for each document proposal in the dataset accurately. In this work, Proposed Ensemble classifiers are used that contain various classifiers such as Kernel Support Vector Machine (KSVM), Self-Organizing Map (SOM), K-Nearest Neighbor (KNN) and Naïve Bayes. These are the four classification methods used for proposed work and get an accuracy of results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Arratia-Martinez NM, Caballero-Fernandez R, Litvinchev I, Lopez-Irarragorri F (2017) Research and development project portfolio selection under uncertainty. Journal of Ambient Intelligence and Humanized Computing, Springer, pp. 1–10

  • Darena F, Zizka J (2015) Interdependence of text mining quality and the input data preprocessing, artificial intelligence perspectives and applications, Springer, pp. 141–150

  • Estep J (2015) Development of a technology transfer score to inform the selection of a research proposal. IEEE 2015 Portland International Conference on Management of Engineering and Technology (PICMET), pp. 1754–1768

  • Hirankitti M, Mai TX (2012) A meta-logical approach for reasoning with an OWL 2 ontology. J Ambient Intell Humaniz Comput, Springer, 3(4):293–303

    Article  Google Scholar 

  • Horecki K, Mazurkiewicz J (2015) Natural language processing methods used for automatic prediction mechanism of related phenomenon. ICAISC 2015, Springer, pp 13–24

  • Jeno S, Arthi T, Lakshmipathi B (2014) Ontology based mining techniques for systematic allocation of project proposals to external reviewers. Int J Innovative Res Sci Eng Technol 3(3):1825–1830

    Google Scholar 

  • Kumar M, Vig R (2012) Term-frequency inverse-document frequency definition semantic (TIDS) based focused web crawler. Global Trends in Information Systems and Software Applications, Springer, pp 31–36

  • Ma J, Xu W, Sun Y-H, Turban E, Wang S, Liu O (2012) An ontology-based text-mining method to cluster proposals for research project selection. IEEE Trans Syst Man Cybern Part A Syst Hum 42(3):784–790

  • Madhavan K, Chen X (2014) DIA2: web-based cyber infrastructure for visual analysis of funding portfolios. IEEE Trans Vis Comput Gr 20(2):1823–1832

    Article  Google Scholar 

  • Mardiana T, Adji T, Hidayah I (2015) The comparition of distance-based similarity measure to detection of plagiarism in Indonesian text: intelligence in the era of big data, Springer, pp 155–164

  • Materia VC, Pascucci S, Kolympiris C (2015), “Understanding the selection processes of public research projects in agriculture: The role of scientific merit”, Food Policy, Elsevier, pp. 87–99

  • Morita T, Fukuta N, Izumi N, Yamaguch T (2006) Doddle-owl: a domain ontology construction tool with owl, Asian Semantic Web Conference, Springer pp 537–551

  • Patil SS, Uddin SA (2015) Research paper selection based on an ontology and text mining technique using clustering. J Comput Eng 17(1):65–71

    Google Scholar 

  • Raja K, Saravanakumar R (2014) Evaluating Data Reliability: An Evidential Answer with Application to a Web-Enabled Data Warehouse”. International Journal of Innovations in Scientific Engineering Research 1(5):359–365

    Google Scholar 

  • Rathore DS, Jain RC, Ujjainiya B (2013) A text mining method for research project selection using KNN. IEEE 2013 International Conference on Green Computing, Communication and Conservation of Energy (ICGCE), pp. 900–904

  • Sergio Jimenez C, Becerra A, Gelbukh F Gonzalez (2009) Generalized Mongue-Elkan Method for Approximate Text String Comparison”. Computational Linguistics and Intelligent Text Processing, Springer, pp 559–570

    Google Scholar 

  • Silva T, Guo Z, Ma J, Jiang H, Chen H (2013) A social network-empowered research analytics framework for project selection. Decision Support Systems, Elsevier, pp 957–968

    Google Scholar 

  • Singh J, Kumar M (2011) A meta search approach to find similarity between web pages using different similarity measures. Advances in computing, communication and control, Springer, pp. 150–160

  • Sultani W, Zhang D (2017) Unsupervised action proposal ranking through proposal recombination. Computer vision and image understanding, Elsevier, pp 1–26

  • Tian Q, Ma J, Liang J, Kowk R, Liu O, Zhang Q (2005) An organizational decision support system for effective R&D project selection, Decision. Support System, Elsevier Volume. 39(3):403–413

  • Wang Y, Xu W, Jiang H (2015) Using text mining and clustering to group research proposals for research project selection. IEEE 2015 48th Hawaii International Conference on System Sciences, pp. 1256–1263

  • Xu W, Xu Y, Ma J (2013) An ontology based frequent item set method to support research proposal grouping for research project selection, IEEE 2013 46th Hawaii International Conference on System Sciences, pp. 1174–1182

  • Yun H, Xu X, Ii Z (2016) A LDA model based text-mining method to recommend reviewer for proposal of research project selection. IEEE 2016 13th International Conference on Service Systems and Service Management (ICSSSM), pp. 1–5

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Annamalai Saravanan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saravanan, R.A., Rajesh Babu, M. Enhanced text mining approach based on ontology for clustering research project selection. J Ambient Intell Human Comput (2017). https://doi.org/10.1007/s12652-017-0637-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12652-017-0637-7

Keywords

Navigation