Enhanced text mining approach based on ontology for clustering research project selection

Saravanan, R. Annamalai; Rajesh Babu, M.

doi:10.1007/s12652-017-0637-7

Enhanced text mining approach based on ontology for clustering research project selection

Original Research
Published: 04 December 2017

(2017)
Cite this article

Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

338 Accesses
8 Citations
Explore all metrics

Abstract

Research project selection is an essential task for government and private agencies. When a huge number of research proposals are received, it is common to group them along with their similarities in research discipline areas. Then the grouped proposals are assigned to the appropriate experts for peer review. Existing approaches are not efficient to classify the document of project proposal correctly. Text-mining methods are used to solve the problem of classifying text documents automatically. In this paper, research proposals are classified based on the discipline areas and proposals in each discipline are grouped using the text-mining technique. Dice’s coefficient, Damerau–Levenshtein distance, Tversky index, Cosine similarity and Jaro–Winkler distance are used to find the similarity between the documents. The classification is used to predict the target class for each document proposal in the dataset accurately. In this work, Proposed Ensemble classifiers are used that contain various classifiers such as Kernel Support Vector Machine (KSVM), Self-Organizing Map (SOM), K-Nearest Neighbor (KNN) and Naïve Bayes. These are the four classification methods used for proposed work and get an accuracy of results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Research paper classification systems based on TF-IDF and LDA schemes

Article Open access 26 August 2019

Domain Ontology Graph Approach Using Markov Clustering Algorithm for Text Classification

The study of subject-classification based on journal coupling and expert subject-classification system

Article 15 February 2016

References

Arratia-Martinez NM, Caballero-Fernandez R, Litvinchev I, Lopez-Irarragorri F (2017) Research and development project portfolio selection under uncertainty. Journal of Ambient Intelligence and Humanized Computing, Springer, pp. 1–10
Darena F, Zizka J (2015) Interdependence of text mining quality and the input data preprocessing, artificial intelligence perspectives and applications, Springer, pp. 141–150
Estep J (2015) Development of a technology transfer score to inform the selection of a research proposal. IEEE 2015 Portland International Conference on Management of Engineering and Technology (PICMET), pp. 1754–1768
Hirankitti M, Mai TX (2012) A meta-logical approach for reasoning with an OWL 2 ontology. J Ambient Intell Humaniz Comput, Springer, 3(4):293–303
Article Google Scholar
Horecki K, Mazurkiewicz J (2015) Natural language processing methods used for automatic prediction mechanism of related phenomenon. ICAISC 2015, Springer, pp 13–24
Jeno S, Arthi T, Lakshmipathi B (2014) Ontology based mining techniques for systematic allocation of project proposals to external reviewers. Int J Innovative Res Sci Eng Technol 3(3):1825–1830
Google Scholar
Kumar M, Vig R (2012) Term-frequency inverse-document frequency definition semantic (TIDS) based focused web crawler. Global Trends in Information Systems and Software Applications, Springer, pp 31–36
Ma J, Xu W, Sun Y-H, Turban E, Wang S, Liu O (2012) An ontology-based text-mining method to cluster proposals for research project selection. IEEE Trans Syst Man Cybern Part A Syst Hum 42(3):784–790
Madhavan K, Chen X (2014) DIA2: web-based cyber infrastructure for visual analysis of funding portfolios. IEEE Trans Vis Comput Gr 20(2):1823–1832
Article Google Scholar
Mardiana T, Adji T, Hidayah I (2015) The comparition of distance-based similarity measure to detection of plagiarism in Indonesian text: intelligence in the era of big data, Springer, pp 155–164
Materia VC, Pascucci S, Kolympiris C (2015), “Understanding the selection processes of public research projects in agriculture: The role of scientific merit”, Food Policy, Elsevier, pp. 87–99
Morita T, Fukuta N, Izumi N, Yamaguch T (2006) Doddle-owl: a domain ontology construction tool with owl, Asian Semantic Web Conference, Springer pp 537–551
Patil SS, Uddin SA (2015) Research paper selection based on an ontology and text mining technique using clustering. J Comput Eng 17(1):65–71
Google Scholar
Raja K, Saravanakumar R (2014) Evaluating Data Reliability: An Evidential Answer with Application to a Web-Enabled Data Warehouse”. International Journal of Innovations in Scientific Engineering Research 1(5):359–365
Google Scholar
Rathore DS, Jain RC, Ujjainiya B (2013) A text mining method for research project selection using KNN. IEEE 2013 International Conference on Green Computing, Communication and Conservation of Energy (ICGCE), pp. 900–904
Sergio Jimenez C, Becerra A, Gelbukh F Gonzalez (2009) Generalized Mongue-Elkan Method for Approximate Text String Comparison”. Computational Linguistics and Intelligent Text Processing, Springer, pp 559–570
Google Scholar
Silva T, Guo Z, Ma J, Jiang H, Chen H (2013) A social network-empowered research analytics framework for project selection. Decision Support Systems, Elsevier, pp 957–968
Google Scholar
Singh J, Kumar M (2011) A meta search approach to find similarity between web pages using different similarity measures. Advances in computing, communication and control, Springer, pp. 150–160
Sultani W, Zhang D (2017) Unsupervised action proposal ranking through proposal recombination. Computer vision and image understanding, Elsevier, pp 1–26
Tian Q, Ma J, Liang J, Kowk R, Liu O, Zhang Q (2005) An organizational decision support system for effective R&D project selection, Decision. Support System, Elsevier Volume. 39(3):403–413
Wang Y, Xu W, Jiang H (2015) Using text mining and clustering to group research proposals for research project selection. IEEE 2015 48th Hawaii International Conference on System Sciences, pp. 1256–1263
Xu W, Xu Y, Ma J (2013) An ontology based frequent item set method to support research proposal grouping for research project selection, IEEE 2013 46th Hawaii International Conference on System Sciences, pp. 1174–1182
Yun H, Xu X, Ii Z (2016) A LDA model based text-mining method to recommend reviewer for proposal of research project selection. IEEE 2016 13th International Conference on Service Systems and Service Management (ICSSSM), pp. 1–5

Download references

Author information

Authors and Affiliations

Research and Development Center, Bharathiar University, Coimbatore, 641046, Tamil Nadu, India
R. Annamalai Saravanan
Department of Computer Science and Engineering, Karpagam College of Engineering (Autonomous), Coimbatore, 641032, Tamil Nadu, India
M. Rajesh Babu

Authors

R. Annamalai Saravanan
View author publications
You can also search for this author in PubMed Google Scholar
M. Rajesh Babu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Annamalai Saravanan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Saravanan, R.A., Rajesh Babu, M. Enhanced text mining approach based on ontology for clustering research project selection. J Ambient Intell Human Comput (2017). https://doi.org/10.1007/s12652-017-0637-7

Download citation

Received: 22 June 2017
Accepted: 23 November 2017
Published: 04 December 2017
DOI: https://doi.org/10.1007/s12652-017-0637-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhanced text mining approach based on ontology for clustering research project selection

Abstract

Access this article

Similar content being viewed by others

Research paper classification systems based on TF-IDF and LDA schemes

Domain Ontology Graph Approach Using Markov Clustering Algorithm for Text Classification

The study of subject-classification based on journal coupling and expert subject-classification system

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Enhanced text mining approach based on ontology for clustering research project selection

Abstract

Access this article

Similar content being viewed by others

Research paper classification systems based on TF-IDF and LDA schemes

Domain Ontology Graph Approach Using Markov Clustering Algorithm for Text Classification

The study of subject-classification based on journal coupling and expert subject-classification system

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation