On the similarity measures of N-cubic Pythagorean fuzzy sets using the overlapping ratio

Similarity measures are essential for describing the closeness between sets. Fuzzy similarity measures and intuitionistic fuzzy similarity measures handle incomplete and inconsistent data efficiently. Over time, however, decision-making theory has encountered complex environments that these sets cannot describe entirely. A generalization such as the Pythagorean fuzzy set handles such situations more efficiently. The applicability of this set has led researchers to generalize it to N-Pythagorean, interval-valued N-Pythagorean, and N-cubic Pythagorean fuzzy sets. In this paper, we first define the overlapping ratios of N-interval-valued Pythagorean and N-Pythagorean fuzzy sets. We then define similarity measures on these sets. Finally, we apply the proposed measure to a comparative analysis of plagiarism software.


Introduction
The similarity measure is a tool used in many decision-making processes, reasoning under uncertainty, data mining, artificial intelligence, measuring instruments, machine learning, and approximate reasoning. Its primary purpose is to measure the similar portion of two objects; that is, it expresses the degree of similarity as a real number between 0 and 1 [1][2][3]. Similarity measures were later defined for interval-valued data, because crisp values are no longer effective in many decision-making problems. The similarity interval was introduced to evaluate the similarity of interval-valued fuzzy sets [4][5][6][7]. Similarity has been applied in many areas of an imprecise nature, including the diagnosis of diseases, ranking, and engineering tools [8][9][10][11][12]. Going further, similarity measures for intuitionistic fuzzy sets were introduced, which account for both membership and non-membership values, along with their everyday applications [13][14][15][16]. Similarity measures were then extended to Pythagorean fuzzy sets, which have proved valuable in various fuzzy environments, and were defined to capture the similarity among Pythagorean fuzzy sets [17][18][19]. Because a fuzzy set and an interval-valued fuzzy set are the components of a cubic set, similarity measures were also described in terms of cubic sets [20][21][22]. A similarity measure for cubic Pythagorean fuzzy sets was introduced to express the similarity between two such sets through the similarity of their component sets [23]. The effect of priority degrees on the overall result has been thoroughly investigated. Furthermore, a decision-making strategy based on these operators has been proposed in the Pythagorean fuzzy set environment [24][25].
The concepts of PmFSs have been expanded with additional operations and findings, and the Pythagorean m-polar fuzzy relation has been used to select the most suitable life partner [26]. A bipolar fuzzy soft set (BFSS) is a powerful mathematical tool for dealing with uncertainty and unreliability in real-world situations such as logistics and supply chain management [27]. Some basic properties of a proposed similarity measure have also been discussed, for example that the SM of any two PFS-sets equals unity if the two PFS-sets coincide [28]. A problem on a linear Diophantine fuzzy graph has been studied in which each arc length is assigned a linear Diophantine fuzzy number rather than a real number; the linear Diophantine fuzzy number can represent the uncertainty of the arc costs of the graph [29]. Single-valued neutrosophic Einstein interactive weighted averaging and geometric operators have been proposed using SVNSs and smooth approximation with interactive Einstein operations [30].
Further developments by different researchers in the area of cubic sets and their applications can be seen in [31][32][33][34][35][36][37][38][39][40][41][42][43][44][45][46][47]. Similarity measures have served for many years as a powerful mechanism in various fields that use these sets. Many other methods are available for similarity evaluation, including the Euclidean, Manhattan, Chebyshev, Minkowski, cosine, Pearson, Mahalanobis, SED, Jaccard, Levenshtein, Dice, Jensen-Shannon, Canberra, Hamming, Spearman, and Chi-square measures, among others. These tools are used in many fuzzy environments to analyze similarities among different sets. In science, a similarity measure expresses how closely related data samples are, whereas dissimilarity expresses how diverse or disparate they are. The similarity measure based on the overlapping ratio uses a set's similarity and dissimilarity to find the overall similarity of a given set. Similarity measures in fuzzy environments have already been studied. However, these measures only capture similarities between sets with positive impacts; the negative similarity of sets also needs to be expressed.
To introduce negative similarity, we define the overlapping ratio of the N-Pythagorean fuzzy set and of the N-interval-valued Pythagorean fuzzy set. Then we define the overlapping ratio for N-cubic Pythagorean fuzzy sets. We also describe the similarity of N-Pythagorean fuzzy sets and N-interval-valued Pythagorean fuzzy sets, and then introduce the similarity measure of the N-cubic Pythagorean fuzzy set. We further compare two popular plagiarism-checking software packages using the overlapping-ratio similarity measure under N-cubic Pythagorean fuzzy sets. In this study, we show that our method produces intuitively appealing results even when the similarity is analyzed using a graph visualization method. In our technique (the new measure), we compare two N-cubic Pythagorean fuzzy sets with a measure that uses two functions to represent them. When evaluating the results of this comparison, we draw a graph of the new similarity measure between two N-cubic Pythagorean fuzzy sets, in which the membership value of the N-interval-valued Pythagorean fuzzy set is compared with the membership function of the second N-cubic Pythagorean fuzzy set, and the non-membership value corresponds to the non-membership value of that set.

Background
We start with a review of similarity measures. First, we recall the definition of a similarity measure:

Definition [1]
A similarity measure is a real-valued mapping S: (E, F) → [0, 1] that describes the similar area between the objects E and F. The similarity lies in the interval [0, 1], where 0 specifies that the objects are totally disjoint and 1 specifies that the objects behave alike. The basic properties of a similarity measure for the sets E, F and G are as follows.

Definition [5]
The overlapping ratio (OV) of an interval A within a pair of intervals {A, B} is the cardinality of the common portion of the pair divided by the size of A:
OV(A, B) = |A ∩ B| / |A|,
where |A ∩ B| is the cardinality of the common portion of the pair of intervals and |A| is the cardinality of A.
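As a minimal sketch of this definition, the following Python function computes the overlapping ratio for closed real intervals. It assumes that the cardinality of an interval is its width and that an interval is stored as a `(low, high)` tuple; the names `width` and `overlap_ratio` are illustrative and do not come from [5]. A zero-width interval is assigned a ratio of 0.

```python
from typing import Tuple

Interval = Tuple[float, float]  # assumed representation: (low, high)


def width(iv: Interval) -> float:
    """Cardinality of a closed interval, taken here to be its width."""
    low, high = iv
    return max(0.0, high - low)


def overlap_ratio(a: Interval, b: Interval) -> float:
    """OV(A, B) = |A ∩ B| / |A|, with OV(A, B) = 0 when |A| = 0."""
    size_a = width(a)
    if size_a == 0.0:
        return 0.0
    # Width of the common portion; zero when the intervals are disjoint.
    common = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    return common / size_a


if __name__ == "__main__":
    A, B = (0.0, 4.0), (2.0, 10.0)
    print(overlap_ratio(A, B), overlap_ratio(B, A))
```

The ratio is not symmetric in general: the same common portion is a larger share of the narrower interval than of the wider one.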

Remarks
For an interval A with cardinality 0, i.e., |A| = 0, OV(A, B) = 0. The overlapping ratio for a pair of intervals satisfies one of the cases given below:

Definition[5]
The similarity measure using the overlapping ratio, S_OR, for a pair of intervals A and B is the t-norm of the inverse overlapping ratios, defined as:
S_OR(A, B) = T(OV(A, B), OV(B, A)),
where T is a t-norm.
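The definition can be sketched in Python as follows, under the assumption that the "inverse overlapping ratios" are the two directed ratios OV(A, B) and OV(B, A) and that intervals are `(low, high)` tuples whose cardinality is their width. The minimum and product t-norms are used only as familiar examples; the choice of t-norm, like the function names, is an assumption of this sketch.

```python
from typing import Callable, Tuple

Interval = Tuple[float, float]  # assumed representation: (low, high)


def overlap_ratio(a: Interval, b: Interval) -> float:
    """OV(A, B) = |A ∩ B| / |A|, with OV(A, B) = 0 when |A| = 0."""
    size_a = max(0.0, a[1] - a[0])
    if size_a == 0.0:
        return 0.0
    common = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    return common / size_a


def product_t_norm(x: float, y: float) -> float:
    """Product t-norm T(x, y) = x * y."""
    return x * y


def similarity(a: Interval, b: Interval,
               t_norm: Callable[[float, float], float] = min) -> float:
    """S_OR(A, B): a t-norm applied to OV(A, B) and OV(B, A)."""
    return t_norm(overlap_ratio(a, b), overlap_ratio(b, a))


if __name__ == "__main__":
    A, B = (0.0, 4.0), (2.0, 10.0)
    print(similarity(A, B))                  # minimum t-norm
    print(similarity(A, B, product_t_norm))  # product t-norm
```

With the minimum t-norm the result is governed by the smaller of the two directed ratios, so a narrow interval contained in a much wider one does not score as highly similar.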

Similarity measures of N-cubic Pythagorean fuzzy sets
The purpose of introducing this similarity measure is to evaluate the similarity of two N-cubic Pythagorean fuzzy sets. To evaluate the overall similarity, we consider the reciprocal similarity along with the similarity measure.

Overlapping ratio for N-Pythagorean fuzzy set
Definition: The overlapping ratio of the set A within a pair {A, B} is the cardinality of the union of the pair divided by the cardinality of the given N-Pythagorean fuzzy set A:
OV(A, B) = |A ∪ B| / |A|, (a)
where |A ∪ B| is the cardinality of the union of A and B, and |A| is the cardinality of the set A.
Remark: For any N-Pythagorean fuzzy set with cardinality zero, i.e., |A| = 0, OV(A, B) is set to 0. The overlapping ratio for N-Pythagorean fuzzy sets defined by Eq. (a) then satisfies one of the following cases:

Overlapping ratio for N-interval valued Pythagorean fuzzy set
Definition: The overlapping ratio of the set C within a pair {C, D} is the cardinality of the union of the pair divided by the cardinality of the given N-interval-valued Pythagorean fuzzy set C:
OV(C, D) = |C ∪ D| / |C|, (b)
where |C ∪ D| is the cardinality of the union of C and D, and |C| is the cardinality of the set C. Eq. (b) implies that OV(C, D)
For a pair {X, Y}, |X ∪ Y| is the cardinality of the union of X and Y, and |X| is the cardinality of the set X. Eq. (c) implies that OV(X, Y)

Similarity measure of N-Pythagorean fuzzy set
Definition: The similarity measure using the overlapping ratio, S_OV, for a pair of N-Pythagorean fuzzy sets (A, B) is the t-norm of their inverse overlapping ratios OV(A, B) and OV(B, A). In the corresponding graph, the grey portion is the similar part, which illustrates the similarity measure.

Similarity measure of N-interval valued Pythagorean fuzzy set
Definition: The similarity measure using the overlapping ratio, S_OV, for a pair of N-interval-valued Pythagorean fuzzy sets (C, D) is the t-norm T_N of their inverse overlapping ratios OV(C, D) and OV(D, C), where T_N is a t-norm of NCPFNs.
Fig. 2 illustrates the similar portion of two N-interval-valued Pythagorean fuzzy sets.

Similarity measure for N-Cubic Pythagorean fuzzy set
Definition: The similarity measure using the overlapping ratio, S*_OV, for a pair of N-cubic Pythagorean fuzzy sets (X, Y) is the t-norm T*_N of their inverse overlapping ratios OV(X, Y) and OV(Y, X), where T*_N (the t-norm of N-cubic Pythagorean fuzzy sets) is a t-norm of NCPFNs. T*_N satisfies the following conditions:
• Boundedness.
So T*_N is a t-norm.
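For orientation, the standard axioms of a t-norm on the unit interval are commutativity, associativity, monotonicity, and the boundary condition T(a, 1) = a. The sketch below checks these axioms numerically on a grid for two familiar candidates. It works with ordinary real numbers in [0, 1], not with NCPFNs, so it illustrates the kind of conditions involved rather than the operator T*_N itself; all function names are hypothetical.

```python
from itertools import product as cartesian
from typing import Callable, Sequence

TNorm = Callable[[float, float], float]


def t_min(a: float, b: float) -> float:
    """Minimum t-norm."""
    return min(a, b)


def t_product(a: float, b: float) -> float:
    """Product t-norm."""
    return a * b


def is_t_norm(t: TNorm, grid: Sequence[float], tol: float = 1e-12) -> bool:
    """Check the four t-norm axioms on every combination of grid points."""
    for a, b in cartesian(grid, repeat=2):
        if abs(t(a, b) - t(b, a)) > tol:          # commutativity
            return False
        if abs(t(a, 1.0) - a) > tol:              # boundary: T(a, 1) = a
            return False
    for a, b, c in cartesian(grid, repeat=3):
        if abs(t(a, t(b, c)) - t(t(a, b), c)) > tol:  # associativity
            return False
        if b <= c and t(a, b) > t(a, c) + tol:        # monotonicity
            return False
    return True


GRID = [i / 10 for i in range(11)]  # 0.0, 0.1, ..., 1.0

if __name__ == "__main__":
    print(is_t_norm(t_min, GRID), is_t_norm(t_product, GRID))
```

A grid check of this kind can only refute a candidate, not prove that it is a t-norm; for example, the maximum fails the boundary condition and is rejected.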

Properties of the similarity measure using overlapping ratio
The following are some characteristics of the similarity measure using the overlapping ratio. (Symmetry) S*_OV(X, Y) is symmetric, that is, S*_OV(X, Y) = S*_OV(Y, X). The t-norm T*_N is symmetric; therefore, S*_OV(X, Y) is also symmetric.
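The symmetry argument can be written out in one line. The derivation below assumes that the measure has the form S*_OV(X, Y) = T*_N(OV(X, Y), OV(Y, X)), that is, the t-norm applied to the two directed overlapping ratios; the middle step uses only the commutativity of T*_N.

```latex
\begin{align*}
S^{*}_{OV}(X, Y) &= T^{*}_{N}\bigl(OV(X, Y),\, OV(Y, X)\bigr) \\
                 &= T^{*}_{N}\bigl(OV(Y, X),\, OV(X, Y)\bigr) \\
                 &= S^{*}_{OV}(Y, X).
\end{align*}
```

The individual overlapping ratios need not be equal; only their combination through the t-norm is symmetric.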

Application
In a previous study, college students at California State University, Northridge, provided information about plagiarism and the use of plagiarism-detection software. In the first study, half of the students in two classes were randomly selected and told by their instructors that their term papers would be checked for plagiarism with the software. The students had no further information about the software that would be used. The researchers expected that students who had been warned about the use of the software would plagiarize less than students who had not been warned. However, the warning had no effect. In the second study, college students wrote a series of two papers. Their knowledge of plagiarism-detection software was inversely related to the rate of plagiarism on the first paper, but no relation between knowledge and plagiarism was found for the second paper. Instead, participants tended to draw repeatedly on the same sources of plagiarized material across papers.

Comparison between two plagiarism software (X and Y)
The university community has free access to two different electronic originality-checking tools designed to help authors avoid plagiarism. The following is a brief outline of each tool and how to use it. Both Software(X) and Software(Y) expect submissions to be made by the original author; they are not intended for use by other people.

Software(X)
Software(X) has been developed explicitly for classroom use and is intended for reviewing students' work. It can be integrated into Canvas courses and allows the instructor to view reports on students' submissions. It compares students' assignments against about 60 billion pages, about 600 million student papers that have already been added to its database, and more than 100 million articles from professionals. Submitted articles are added to a database of material from institutions worldwide.

Software(Y)
This software is intended to help academic authors avoid plagiarism and copyright infringement while preparing items for publication and is NOT intended for classroom use. It is also intended for editors to check submitted articles before publication. Papers checked with this software are NOT added to or stored in any other database. In this section, the similarity measure of N-cubic Pythagorean fuzzy sets based on the overlapping ratio is used to compare two popular software packages, Software(X) and Software(Y).
Now, our aim is to classify the software C into one of the software sets A_j (j = 1, 2). For this purpose, the proposed measure is evaluated from C to A_j (j = 1, 2), as given in Table 1. The classification follows the principle of the maximum value of the similarity measure between two NCPFNs.
After the numerical discussion given in Table 1, we conclude the following results.
• The features of software C are 10% like A1 (software X) in a negative sense, and the sources of software C are 30% like A2 (software Y) in a negative sense.
• In a negative sense, the similarity measure of A2 (software Y) is much greater than that of A1 (software X).
• By the idea of maximum degree, the degree of A1 is much greater than that of A2, which means that software C relates to software X; so, A2 is preferable to A1.

Comparison analysis
In the comparative analysis, we establish three different sets (A1, A2, C) and use the predefined Dice and Jaccard similarity measures to identify the similarity between A1 and C, as well as between A2 and C, which demonstrates the novelty of our proposed overlapping-ratio similarity measure.
Table 2 shows the similarities.
In this research, we present a new similarity measure that computes the overall similarity of a pair of N-cubic Pythagorean fuzzy sets by considering their common similarity. We employ the overlapping ratio of the sets within the pair to capture the asymmetric likeness. We have also shown that the new measure meets all of the fundamental properties of a similarity measure.
Finally, we used synthetic datasets to compare the behavior of the proposed measure with the well-established Jaccard and Dice similarity measures. The findings reveal that the proposed similarity measure is more sensitive to changes in set width and is invariant and linear.
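To make the comparison concrete, the sketch below evaluates the Jaccard and Dice measures next to an overlapping-ratio similarity. It uses plain closed intervals whose cardinality is their width, the intersection-based overlapping ratio recalled in the Background section, and the minimum t-norm; these are simplifying assumptions. The intervals are illustrative and are not the sets A1, A2, and C of Table 2.

```python
from typing import Tuple

Interval = Tuple[float, float]  # assumed representation: (low, high)


def width(iv: Interval) -> float:
    """Cardinality of a closed interval, taken here to be its width."""
    return max(0.0, iv[1] - iv[0])


def intersection_width(a: Interval, b: Interval) -> float:
    """Width of the common portion; zero when the intervals are disjoint."""
    return max(0.0, min(a[1], b[1]) - max(a[0], b[0]))


def jaccard(a: Interval, b: Interval) -> float:
    """Jaccard measure |A ∩ B| / |A ∪ B|."""
    inter = intersection_width(a, b)
    union = width(a) + width(b) - inter
    return inter / union if union > 0 else 0.0


def dice(a: Interval, b: Interval) -> float:
    """Dice measure 2 |A ∩ B| / (|A| + |B|)."""
    total = width(a) + width(b)
    return 2 * intersection_width(a, b) / total if total > 0 else 0.0


def overlap_similarity(a: Interval, b: Interval) -> float:
    """Minimum t-norm of the two directed overlapping ratios."""
    inter = intersection_width(a, b)
    ov_ab = inter / width(a) if width(a) > 0 else 0.0
    ov_ba = inter / width(b) if width(b) > 0 else 0.0
    return min(ov_ab, ov_ba)


if __name__ == "__main__":
    A, B = (0.0, 4.0), (2.0, 10.0)  # illustrative intervals only
    print(jaccard(A, B), dice(A, B), overlap_similarity(A, B))
```

Widening one interval while keeping the common portion fixed changes all three values, but at different rates, which is one way to probe the sensitivity to set width on synthetic data.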

Conclusion
In this article, we defined a new similarity measure for N-cubic Pythagorean fuzzy sets based on the overlapping ratio of two N-cubic Pythagorean fuzzy sets, after introducing the overlapping ratio for N-interval-valued Pythagorean fuzzy sets and N-Pythagorean fuzzy sets. We described the similarity measures of N-cubic Pythagorean fuzzy sets, N-interval-valued Pythagorean fuzzy sets, and N-Pythagorean fuzzy sets. Further, we applied this similarity measure to compare two plagiarism-detection software packages, Software(X) and Software(Y). Finally, we demonstrated the efficacy of the newly proposed similarity measure. This similarity measure can be used in many decision-making processes to analyze similarities in a negative sense.
Future studies could include applications of the proposed overlapping-ratio similarity measure in multi-criteria decision making (MCDM) and research analysis. A similarity measure for probabilistic N-cubic Pythagorean fuzzy sets could be developed to characterize stochastic and non-stochastic uncertainty in a single framework. Future studies could also focus on creating an independent module that uses the overlapping-ratio similarity measure for existing expert or intelligent systems, software utilities, and web browsers.
Author contributions MAAS provided the final proofreading, project administration, supervision and funding; MG wrote the initial draft, compiled the results, and performed the data analysis.
Funding This project was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah, under grant no. G: 025-130-1442. The authors, therefore, acknowledge the DSR for technical and financial support.

Availability of data and material
No data were used to support this study.

Conflict of interest
The authors declare that there are no conflicts of interest regarding the publication of this article.
Ethics approval and consent to participate All authors have participated in the completion of this paper and are well aware of the submission of this article.

Consent for publication All authors agree to the publication of this article.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.