A new goodness of fit test in the presence of uncertain parameters

The Weibull distribution has been widely used in the areas of quality and reliability. The Anderson–Darling test has been popularly used either the data in hand follow the Weibull distribution or not. The existing Anderson–Darling test under classical statistics is applied when all the observations in quality and reliability work are determined, précised, and exact. In the areas of reliability and quality, the data may indeterminate, in-interval and fuzzy. In this case, the existing Anderson–Darling test cannot be applied for testing the assumption of the Weibull distribution. In this paper, we present the Anderson–Darling test under neutrosophic statistics. We present the methodology to fit the neutrosophic Weibull distribution on the data. We discuss the testing procedure with the help of reliability data. We present the comparisons of the proposed test with the existing Anderson–Darling the goodness of fit test under classical statistics. From the comparison, it is concluded that the proposed test is more informative than the existing Anderson–Darling test under an indeterminate environment. In addition, the proposed test gives information about the measure of indeterminacy.


Introduction
The derivation of statistical methods is based on the assumption that a random variable or the data follow some specific distribution. According to Romeu [1] "when we assume that our data follow a specific distribution, we take a serious risk. If our assumption is wrong, then the results obtained may invalid". For example, before testing a hypothesis, the suitable test statistic is chosen according to the nature of the data in hand. The tests based on normal distribution are chosen when the assumption of the normality is met; otherwise, the non-parametric tests are applied for testing the hypothesis. Two approaches have been widely used to checking the assumption of any distribution. An approach in which the assumption of the data is checked using the graphical properties is called the empirical procedure. Another approach which provides the more formal, a quantifiable, and reliable result is called the goodness of fit test. The goodness of fit tests is based on the cumulative distribution function (cdf) or the probability density function (pdf) of the underlying distribution. Arshad et al. [2] applied the Anderson [3] and Razali and Wah [4] presented a study of the performance evaluation of this test. Jäntschi and Bolboacȃ [5] worked on the computational probabilities of Anderson-Darling test. Formenti et al. [6] applied the Anderson-Darling test in risk assessment. Islam [7] worked on the ranking of skewed distribution using this test. Jäntschi [8] and Jäntschi [9] worked on detecting outliers for continuous distributions. More information can be read et al., in Rahman [10], Anderson [11], Li et al. [12] and Wijekularathna et al. [13].
The statistical test and models have been widely used for the testing of energy generating devices. The choice of the better statistical model will lead to the best estimation and forecasting of the lifetime of energy produced items. Zhang and Lee [14] worked on the health monitoring of batteries. He et al. [15] and Nuhic et al. [16] used the Bayesian approach and data-driven approach for the batteries' data, respectively. Hu et al. [17] worked on the capacity estimation of the batteries. Ng et al. [18] applied the Bayes model for the life prediction of the batteries. Barré et al. [19] presented a statistical study for batteries used in vehicles. Chiodo et al. [20] worked on an accelerated test using batteries' data. Chiodo et al. [20] presented statistical analysis of lithium-ion battery recycling processes. Mathis et al. [21] presented statistical work on the consumption due to the energy heating system. Pramanik et al. [21] provided a review on energy equipment. For more applications of statistical models, the reader may refer to Shim et al. [22], Andre et al. [23], Xing et al. [24], and Harris et al. [25].
In the traditional tests under classical statistics, it is assumed that all observations are crisp in the population or the sample. But, the data obtained from the complex system may not be determined, exact, and certain. To test this type of data, the statistical tests based on the fuzzy approach are applied. Arnold [26] discussed the fuzzy test and power function of the test. Przemysław Grzegorzewski [27] and Jamkhaneh and Ghara [28] discussed the application of the statistical test for vague data. Montenegro et al. [29] presented a fuzzy-based test for two populations. Taheri and Behboodian [30] used the Bayesian approach to develop a test under fuzzy logic. Wu [31] presented a test for more than two populations using fuzzy logic. Przemyslaw et al. [32] and Noughabi and Akbari [33] presented the testing of hypothesis procedure for fuzzy logic. Momeni et al. [34] presented Kolomogorov-Smirnov for testing the normality of the fuzzy data. More applications of fuzzy-based tests can be seen in Van Cutsem and Gath [29,35], Mohanty and AnnanNaidu [36], Moradnezhadi [37], Moewes et al. [38] and Choi et al. [39].
A generalization of fuzzy logic is called the neutrosophic logic was introduced by Smarandache [40]. The neutrosophic provides information about the measure of indeterminacy, the measure of truthiness, and measure of falseness. Smarandache and Khalid [41] proved the efficiency of the neutrosophic logic over the fuzzy logic and interval-based analysis. The applications of neutrosophic logic can be seen in Hanafy et al. [42], Broumi and Smarandache [43], Guo and Sengur [44], Guo [55]. Smarandache [56] introduced the neutrosophic statistics as the extension of classical statistics. The neutrosophic statistics can be applied when the data have indeterminacy. Chen et al. [57] and Chen et al. [58] presented the idea of analyzing the neutrosophic numbers. Aslam [59] and Aslam [60] proposed a statistical test to test normality using the neutrosophic statistics. For more applications of neutrosophic statistics, the reader may refer to Aslam and Albassam [61] and Aslam [62].
Our literature search shows that there is no work on the Anderson-Darling test in the presence of indeterminacy. The existing Anderson-Darling test cannot be applied when the data are given in neutrosophic numbers. In this paper, we will present the Anderson-Darling test under neutrosophic statistics. We will present the methodology to fit the neutrosophic Weibull distribution on the batteries' data. We will discuss the testing procedure with the help of batteries' reliability data. From the comparison, it is concluded that the proposed test is more informative than the existing Anderson-Darling test. We expect that the proposed test will help the energy experts in the selection of appropriate statistical distribution for better estimation of energy produced devices.

Preliminaries
Let I N ∈ [I L , I U ] be an indeterminacy interval. Suppose that X N X L + X U I N ; I N ∈ [I L , I U ] denotes the lifetime follows the neutrosophic Weibull distribution with neutrosophic scale parameter θ N θ L + θ U I N ; I N ∈ [I L , I U ] and neutrosophic shape parameter β N β L + β U I N ; The neutrosophic cumulative distribution function (ncdf) is defined as Note here that the neutrosophic Weibull distribution is given in Eq. (1) is the generalization of the Weibull distribution under classical statistics. The neutrosophic Weibull distribution reduces to the Weibull distribution under classical statistics if I L 0.

Fitting of neutrosophic Weibull distribution
Now, we discuss the methodology to test the assumption either the given data having neutrosophic numbers follow the neutrosophic Weibull distribution or not. To develop the proposed test, it is assumed that the neutrosophic shape parameter and neutrosophic scale parameter of the neutrosophic Weibull distribution are unknown and estimated from the given neutrosophic data. The proposed test will be applied to test the null hypothesis that the neutrosophic data are fitted to the neutrosophic Weibull distribution versus the alternative hypothesis that the neutrosophic data do not follow the neutrosophic Weibull distribution. The goodness of fit test statistic, when the neutrosophic data do, not follows the neutrosophic statistics is given by where n N ∈ [n L , n U ] be a neutrosophic random sample and Z N (i) [x N (i)/θ N ] β N . According to Romeu [1], the Anderson-Darling test can be applied for small and large samples.
Based on the proposed test, if OSL N < 0.05, the null hypothesis that the failure time having neutrosophic numbers follow the neutrosophic Weibull distribution is rejected and this error committed is less than 0.05. Based on this study, it is concluded that the given neutrosophic data do not fit the neutrosophic Weibull distribution if the calculated values of the statistic OSL N is less than 0.05, otherwise, the neutrosophic data follow the neutrosophic Weibull distribution. The operational process of the proposed test is shown in Fig. 1.

Application of the proposed test
To discuss the application of the proposed test, we consider a life test experiment where 23 batteries are put on the test. A tested battery is labeled as a failed item if at least one of its parts fails to meet the given specification limits. According to Khoolenjani and Shahsanaie [65] "tested the battery  Fig. 2 The procedure of the proposed test for batteries' data may be considered as failed, or-strictly speaking-as nonconforming, when at least one value of its parameters falls beyond specification limits. In practice, however, we do not have the possibility to measure all parameters and are not able to define precisely the moment of a failure". The industrial engineer is interested to test either the given data follow the Weibull distribution or not. It is clear that the lifetime of batteries is given in indeterminacy interval rather than the exact number. For this data, the use Anderson-Darling goodness of fit test under classical statistics is not suitable. Therefore, the alternative of the existing test is the proposed Anderson-Darling goodness of fit test under neutrosophic statistics. The necessary computation to perform the proposed test is shown in Table 1. The values of the statistic AD N ∈ [AD L ,AD U ] for the real data are shown as AD N i 1−2i [23,23] ln(1 − exp(−Z N (i)))− Z n N −i+1 − [23,23]  We note that OSL N < 0.05 for the batteries' data. Therefore, it is concluded that the lifetime of batteries does not follow the neutrosophic Weibull distribution. The operational process of the proposed test for the batteries' data are shown in Fig. 2.

Comparative study
Now we compare the efficiency of the proposed Anderson--Darling goodness of fit test under neutrosophic statistics with the Anderson-Darling goodness of fit test under classical statistics in the measure of indeterminacy. For the fair comparison, we will fix the same values of n N ∈ [n L , n U ] as used in the real example.

Concluding remarks
In this paper, we presented the Anderson-Darling test under neutrosophic statistics. We presented the methodology to fit the neutrosophic Weibull distribution on the data. We discussed the testing procedure with the help of reliability data. We applied the proposed test for batteries' failure data and found that the data do not follow the neutrosophic Weibull distribution. From the application and comparative studies, it can be concluded that the proposed test is more informative, flexible, and effective to be applied in an uncertainty environment as compared to the existing Anderson-Darling test. The proposed test provides information about the measure of indeterminacy for testing of the null hypothesis. The proposed test has the limitation that it can be applied to test either the data follow the non-normal distribution such as neutrosophic Weibull distribution. The proposed test can be modified for other distribution accordingly. The proposed test cannot apply for testing the hypothesis for neutrosophic normal distribution. The efficiency of the proposed test in the power of the test can be studied as future research. The proposed test can be applied for big data as future research. In addition, the proposed test can be extended for other nonnormal distributions as future research.