Multicriteria decision making taxonomy of code recommendation system challenges: a fuzzy-AHP analysis

Akbar, Muhammad Azeem; Khan, Arif Ali; Huang, Zhiqiu

doi:10.1007/s10799-021-00355-3

Open access
Published: 14 February 2022

Volume 24, pages 115–131, (2023)

Information Technology and Management

Abstract

Recommendation systems play an important role in daily life because they assist in the reliable selection of common utilities. Code recommendation systems are used by code repositories (GitHub, SourceForge, etc.) to recommend the most appropriate code to users. Several factors could negatively affect the performance of code recommendation systems (CRS). This study aims to empirically explore the challenges that could have a critical impact on the performance of CRS. First, using systematic literature review and questionnaire survey approaches, 19 challenges were identified. Second, the identified challenges were prioritized using fuzzy-AHP analysis. The identification of the challenges, their categorization, and the fuzzy-AHP analysis provide a prioritization-based taxonomy of the explored challenges. The study findings will assist industry practitioners and academic researchers in developing new techniques for the improvement of CRS.

1 Introduction

In the modern world, recommendation systems have become indispensable, especially for e-commerce, medical, and social media systems. Recommendation systems provide suggestions based on user interests. They usually use data sources to develop the system components and to train them to make appropriate decisions. In the digital business world, a recommender system can predict whether a particular user would prefer an item based on the user's profile. Recommender systems are beneficial to both service providers and users [20]. They reduce the transaction costs of finding and selecting items in an online shopping environment [20].

In this paper, we consider the following formal definitions of a recommendation system. Ricci et al. [44] define it as follows: “A recommender system or a recommendation system is a subclass of information filtering system that seeks to predict the rating, suitability or preference a user would give to an item.” Hossain et al. [20] state that “A Recommender System refers to a system that is capable of predicting the future preference of a set of items for a user and recommends the top items.” Moreover, Sridevi et al. [50] underline that “A recommendation engine filters the data using different algorithms and recommends the most relevant items to users.”

The current era is considered a period of revolution in the field of artificial intelligence (AI). To train AI models, the selection of efficient source code is a critical phase [7]. Most practitioners use source code that is available from public sources, e.g., GitHub and SourceForge, and the recommender system plays an important role in the selection of appropriate source code [7]. A code recommender system is a system that uses code sources (e.g., GitHub, SourceForge) and recommends the most suitable source code to developers and researchers. When selecting code, the affectability and reliability of the code are very important [34]. There are critical concerns related to code recommender systems, namely code analysis with respect to code quality and code implementation capability [34]. Mark et al. [18] underlined the significance of analyzing the code before preprocessing it for feature extraction and implementation. Moreover, Gregorio et al. [45] highlighted the importance of analyzing the complexity, the code size, and the resources available for code implementation. Yamashita and Moonen [55] emphasized the compatibility between the capacity of the development environment and the selected source code. They further mention that, to obtain successful results, the code should be executed in a compatible development environment.

The importance of CRS in the recent era motivated us to conduct a comprehensive empirical study to explore the key challenges that might hinder the performance of CRS. The objective of the study is twofold: (1) to explore the CRS challenges from the literature and verify them against industry practice; (2) to prioritize the investigated challenges with respect to their significance for CRS using fuzzy-AHP. The results and analysis of this study provide a prioritization-based taxonomy of the investigated challenges. We believe that the in-depth analysis of CRS challenges will assist industry experts and researchers in addressing the highest-priority challenges and in developing new techniques for improving code recommendation systems. The following research questions were developed to achieve the key objectives of this study:

[RQ1] What challenges of code recommendation systems are discussed in the existing literature?
[RQ2] What is the real-world significance of the code recommendation system challenges?
[RQ3] What would be the prioritization-based taxonomy of the investigated challenges?

2 Related work

Recommendations are helpful for making a decision when there is no prior experience or knowledge about a particular matter. This is a technologically revolutionized era, and people trust automatic recommendation systems [38]. Due to the higher acceptance level, the business industry is motivated to automate its businesses with strong and reliable recommendation systems [24]. We found several studies conducted to improve the performance of recommendation systems, e.g., [16, 39].

As in other areas of life, the recommender system also has significant importance in the selection of source code for training artificial intelligence systems. A code recommender system is a system that uses code sources (e.g., GitHub, SourceForge) and recommends the most suitable code to developers and researchers [41]. Currently, source code related to the machine learning field receives much attention from the software industry and the academic research community. At the intersection of the research areas “software engineering,” “programming languages,” “machine learning,” and “natural language processing,” various communities have formed around “big code” or “code naturalness,” with numerous significant outcomes [44]. In machine learning, researchers mostly need large corpora of code to train artificial intelligence models and to learn probabilistic patterns concerning coding practices at a large scale. The primary aim is to train a model and deploy it as a useful tool in the required area. However, despite the importance of source code, little research has been conducted to address the problems of code recommendation systems. Mens and Lozano [40] highlighted that selecting reliable source code from the available sources is an important activity for obtaining fruitful results. Janjic et al. [22] also emphasized the importance of a reliable code recommendation system.

Considering the state-of-the-art literature, and to the best of our knowledge, little empirical research has been conducted to address and highlight the concerns of CRS. We therefore tried to fill this gap by conducting a comprehensive empirical study aiming to identify the key challenges that could hinder the performance of CRS. The systematic literature review (SLR) approach was adopted to explore the existing literature and investigate the factors that could be critical challenges for CRS. The questionnaire survey method was then used to evaluate the SLR results and capture the perceptions of field experts. The final summarized list of challenging factors is used to develop the prioritization taxonomy using the fuzzy-AHP technique. Fuzzy AHP is a widely used approach for multicriteria problems and has been used in different software engineering research projects [37, 43, 48, 49, 54]. For example, Khan and Shameem [29] used fuzzy-AHP analysis to rank the success factors of the software process improvement paradigm. Similarly, Shameem et al. [47] developed a taxonomy of the factors that could influence agile processes in geographically distributed environments. Moreover, Akbar et al. [3] prioritized DevOps challenging factors using fuzzy AHP. Based on the above discussion, we can justify the application of the fuzzy-AHP method in this research study.

3 Research methodology

To address the study objectives, three steps were adopted. In the first step, a systematic literature review was conducted to explore the challenges of CRS reported by researchers. In the second step, the findings of the literature review were verified by conducting a questionnaire survey with industry experts. In the third step, fuzzy-AHP was used to prioritize the identified challenges according to their criticality for CRS. All the adopted research methodology steps are presented in Fig. 1 and described in the sections below.
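To give a concrete picture of the third step, the following is a minimal sketch of how fuzzy-AHP can turn pairwise comparisons into priority weights. It is an illustration under stated assumptions, not the procedure of this study: it uses the geometric-mean variant of fuzzy AHP (often attributed to Buckley) with centroid defuzzification, which may differ from the variant applied in this paper, and the three challenges and their triangular fuzzy judgments are hypothetical.

```python
from math import prod
from typing import List, Tuple

# A triangular fuzzy number (lower, modal, upper).
TFN = Tuple[float, float, float]

def reciprocal(t: TFN) -> TFN:
    """Reciprocal of a triangular fuzzy number: (1/u, 1/m, 1/l)."""
    l, m, u = t
    return (1 / u, 1 / m, 1 / l)

def fuzzy_ahp_weights(matrix: List[List[TFN]]) -> List[float]:
    """Crisp, normalized weights from a fuzzy pairwise comparison matrix.

    Assumption: geometric-mean fuzzy AHP with centroid defuzzification.
    """
    n = len(matrix)
    # Fuzzy geometric mean of each row, computed component-wise.
    geo = [
        tuple(prod(row[j][k] for j in range(n)) ** (1 / n) for k in range(3))
        for row in matrix
    ]
    sum_l = sum(g[0] for g in geo)
    sum_m = sum(g[1] for g in geo)
    sum_u = sum(g[2] for g in geo)
    # Fuzzy weight: row mean times the inverse of the fuzzy column sum.
    fuzzy_w = [(g[0] / sum_u, g[1] / sum_m, g[2] / sum_l) for g in geo]
    # Centroid defuzzification, then normalization to sum to 1.
    crisp = [(l + m + u) / 3 for l, m, u in fuzzy_w]
    total = sum(crisp)
    return [c / total for c in crisp]

# Hypothetical judgments for three placeholder challenges C1, C2, C3.
ONE: TFN = (1.0, 1.0, 1.0)
c1_vs_c2: TFN = (2.0, 3.0, 4.0)
c1_vs_c3: TFN = (4.0, 5.0, 6.0)
c2_vs_c3: TFN = (1.0, 2.0, 3.0)

matrix = [
    [ONE, c1_vs_c2, c1_vs_c3],
    [reciprocal(c1_vs_c2), ONE, c2_vs_c3],
    [reciprocal(c1_vs_c3), reciprocal(c2_vs_c3), ONE],
]

weights = fuzzy_ahp_weights(matrix)
for name, w in zip(["C1", "C2", "C3"], weights):
    print(f"{name}: {w:.3f}")
```

With these made-up judgments, C1 receives the largest weight and C3 the smallest; sorting challenges by weight is what yields a prioritized list.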

3.1 Systematic literature review (SLR)

The SLR is one of the most significant approaches for identifying and interpreting the available research evidence in a formal manner, based on predefined research questions and protocols. In this study, we performed the SLR following the guidelines of [31]. The SLR steps are discussed in the sections below.

3.1.1 Review process planning

3.1.1.1 Research questions

To identify the challenges related to CRS, the following research question was developed:

[RQ1] What challenges of code recommendation systems are reported in the literature?

3.1.1.2 Database selection

The selection of appropriate data sources is critical for collecting the literature most relevant to the proposed research questions [12]. We considered the following seven databases for data extraction, following the recommendations provided by Chen et al. [12, 25, 42]: “IEEE Xplore (http://ieeexplore.ieee.org)”, “ACM Digital Library (http://dl.acm.org)”, “Springer Link (link.springer.com)”, “Wiley Inter Science (www.wiley.com)”, “Science Direct (http://www.sciencedirect.com)”, “Google Scholar (scholar.google.com)”, “IET-digital libraries (www.theiet.org)”.

3.1.1.3 Search strings

A search string is a combination of text, symbols, keywords, and their alternatives used to extract data from digital repositories [11, 15, 19, 23, 30, 42]. The selected keywords and their alternatives were concatenated using the Boolean operators “OR” and “AND”:

(“barriers” OR “obstacles” OR “hurdles” OR “difficulties” OR “impediments” OR “hindrance” OR “challenges” OR “limitations”) AND (“code recommendation systems” OR “code recommender systems” OR “code filtering systems”).
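The construction of the search string can be sketched as follows. This is a minimal illustration that joins the alternatives within each keyword group with OR and the groups with AND; the function names are hypothetical, straight quotes are used instead of typographic ones, and individual databases may require a different query syntax.

```python
from typing import List

# Keyword groups taken from the search string above.
challenge_terms = [
    "barriers", "obstacles", "hurdles", "difficulties",
    "impediments", "hindrance", "challenges", "limitations",
]
system_terms = [
    "code recommendation systems",
    "code recommender systems",
    "code filtering systems",
]

def or_group(terms: List[str]) -> str:
    """Quote each term and join the alternatives with OR, in parentheses."""
    return "(" + " OR ".join(f'"{t}"' for t in terms) + ")"

def build_search_string(*groups: List[str]) -> str:
    """Join the OR-groups with AND."""
    return " AND ".join(or_group(g) for g in groups)

query = build_search_string(challenge_terms, system_terms)
print(query)
```

Keeping the keyword groups as lists makes it easy to add or drop an alternative term and regenerate the same string for each database.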

3.1.1.4 Inclusion and exclusion criteria

The inclusion criteria are the characteristics that a study must have to be included, while the exclusion criteria are the characteristics that disqualify certain material from inclusion in the study. The same approach has been used in other studies in the software engineering domain, e.g., [35, 42, 57].

Inclusion criteria:

(1) Studies published in conference proceedings, workshops, journals, and book chapters.
(2) The selected literature should be in English.
(3) Articles whose findings are directly related to the objective of this study.
(4) The most recent article is considered if two or more studies are of a similar nature or from the same research project.

Exclusion criteria:

(1) Articles outside the scope of CRS.
(2) Studies that do not provide a detailed discussion of CRS.
(3) Studies that do not focus on CRS challenging factors.
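The criteria above can be read as a screening filter over candidate studies. The sketch below is one possible encoding, not the tooling used in this study: the `Study` record, its field names, the `project` identifier used to detect related studies, and the sample entries are all hypothetical, and in practice each flag would come from a reviewer's manual judgment.

```python
from dataclasses import dataclass
from typing import Dict, List

ALLOWED_VENUES = {"conference", "workshop", "journal", "book chapter"}

@dataclass
class Study:
    title: str
    venue_type: str
    language: str
    year: int
    project: str                        # hypothetical id grouping related studies
    relevant_to_objective: bool = True  # inclusion (3)
    in_crs_scope: bool = True           # exclusion (1)
    discusses_crs_in_detail: bool = True   # exclusion (2)
    addresses_challenges: bool = True      # exclusion (3)

def passes_criteria(s: Study) -> bool:
    """Apply inclusion criteria (1)-(3) and exclusion criteria (1)-(3)."""
    included = (
        s.venue_type in ALLOWED_VENUES
        and s.language == "English"
        and s.relevant_to_objective
    )
    excluded = (
        not s.in_crs_scope
        or not s.discusses_crs_in_detail
        or not s.addresses_challenges
    )
    return included and not excluded

def screen(studies: List[Study]) -> List[Study]:
    """Keep passing studies; per project keep only the most recent, inclusion (4)."""
    latest: Dict[str, Study] = {}
    for s in studies:
        if not passes_criteria(s):
            continue
        kept = latest.get(s.project)
        if kept is None or s.year > kept.year:
            latest[s.project] = s
    return list(latest.values())

# Made-up candidates for illustration only.
candidates = [
    Study("CRS challenges, early report", "conference", "English", 2017, "proj-1"),
    Study("CRS challenges, extended", "journal", "English", 2019, "proj-1"),
    Study("Movie recommenders", "journal", "English", 2018, "proj-2",
          in_crs_scope=False),
    Study("CRS-Herausforderungen", "journal", "German", 2020, "proj-3"),
]

selected = screen(candidates)
print([s.title for s in selected])
```

Here the earlier report from the same project, the out-of-scope article, and the non-English article are all dropped, leaving a single study.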

3.1.1.5 Study quality assessment (QA)

QA is performed to determine the degree of conformity of the primary selected studies. The checklist questions and the Likert scale used for QA are given in Table 1. The objective of QA is to check the suitability and appropriateness of the selected literature for addressing the research questions of this paper.

Table 1 Selected studies quality assessment criteria
