Introduction

The derivation of statistical methods is based on the assumption that a random variable or the data follow some specific distribution. According to Romeu [1] “when we assume that our data follow a specific distribution, we take a serious risk. If our assumption is wrong, then the results obtained may invalid”. For example, before testing a hypothesis, the suitable test statistic is chosen according to the nature of the data in hand. The tests based on normal distribution are chosen when the assumption of the normality is met; otherwise, the non-parametric tests are applied for testing the hypothesis. Two approaches have been widely used to checking the assumption of any distribution. An approach in which the assumption of the data is checked using the graphical properties is called the empirical procedure. Another approach which provides the more formal, a quantifiable, and reliable result is called the goodness of fit test. The goodness of fit tests is based on the cumulative distribution function (cdf) or the probability density function (pdf) of the underlying distribution. Arshad et al. [2] applied the Anderson–Darling test for testing the assumption of generalized Pareto distribution. Marsaglia and Marsaglia [3] and Razali and Wah [4] presented a study of the performance evaluation of this test. Jäntschi and Bolboacă [5] worked on the computational probabilities of Anderson–Darling test. Formenti et al. [6] applied the Anderson–Darling test in risk assessment. Islam [7] worked on the ranking of skewed distribution using this test. Jäntschi [8] and Jäntschi [9] worked on detecting outliers for continuous distributions. More information can be read et al., in Rahman [10], Anderson [11], Li et al. [12] and Wijekularathna et al. [13].

The statistical test and models have been widely used for the testing of energy generating devices. The choice of the better statistical model will lead to the best estimation and forecasting of the lifetime of energy produced items. Zhang and Lee [14] worked on the health monitoring of batteries. He et al. [15] and Nuhic et al. [16] used the Bayesian approach and data-driven approach for the batteries’ data, respectively. Hu et al. [17] worked on the capacity estimation of the batteries. Ng et al. [18] applied the Bayes model for the life prediction of the batteries. Barré et al. [19] presented a statistical study for batteries used in vehicles. Chiodo et al. [20] worked on an accelerated test using batteries’ data. Chiodo et al. [20] presented statistical analysis of lithium-ion battery recycling processes. Mathis et al. [21] presented statistical work on the consumption due to the energy heating system. Pramanik et al. [21] provided a review on energy equipment. For more applications of statistical models, the reader may refer to Shim et al. [22], Andre et al. [23], Xing et al. [24], and Harris et al. [25].

In the traditional tests under classical statistics, it is assumed that all observations are crisp in the population or the sample. But, the data obtained from the complex system may not be determined, exact, and certain. To test this type of data, the statistical tests based on the fuzzy approach are applied. Arnold [26] discussed the fuzzy test and power function of the test. Przemysław Grzegorzewski [27] and Jamkhaneh and Ghara [28] discussed the application of the statistical test for vague data. Montenegro et al. [29] presented a fuzzy-based test for two populations. Taheri and Behboodian [30] used the Bayesian approach to develop a test under fuzzy logic. Wu [31] presented a test for more than two populations using fuzzy logic. Przemyslaw et al. [32] and Noughabi and Akbari [33] presented the testing of hypothesis procedure for fuzzy logic. Momeni et al. [34] presented Kolomogorov-Smirnov for testing the normality of the fuzzy data. More applications of fuzzy-based tests can be seen in Van Cutsem and Gath [29, 35], Mohanty and AnnanNaidu [36], Moradnezhadi [37], Moewes et al. [38] and Choi et al. [39].

A generalization of fuzzy logic is called the neutrosophic logic was introduced by Smarandache [40]. The neutrosophic provides information about the measure of indeterminacy, the measure of truthiness, and measure of falseness. Smarandache and Khalid [41] proved the efficiency of the neutrosophic logic over the fuzzy logic and interval-based analysis. The applications of neutrosophic logic can be seen in Hanafy et al. [42], Broumi and Smarandache [43], Guo and Sengur [44], Guo and Sengur [45], Guo et al. [46], Patro and Smarandache [47], Broumi et al. [48], Peng and Dai [49], Abdel-Baset et al. [50], Abdel-Baset et al. [51], Abdel-Baset et al. [52], Nabeeh et al. [53], Pratihar et al. [54] and Pratihar et al. [55]. Smarandache [56] introduced the neutrosophic statistics as the extension of classical statistics. The neutrosophic statistics can be applied when the data have indeterminacy. Chen et al. [57] and Chen et al. [58] presented the idea of analyzing the neutrosophic numbers. Aslam [59] and Aslam [60] proposed a statistical test to test normality using the neutrosophic statistics. For more applications of neutrosophic statistics, the reader may refer to Aslam and Albassam [61] and Aslam [62].

Our literature search shows that there is no work on the Anderson–Darling test in the presence of indeterminacy. The existing Anderson–Darling test cannot be applied when the data are given in neutrosophic numbers. In this paper, we will present the Anderson–Darling test under neutrosophic statistics. We will present the methodology to fit the neutrosophic Weibull distribution on the batteries’ data. We will discuss the testing procedure with the help of batteries’ reliability data. From the comparison, it is concluded that the proposed test is more informative than the existing Anderson–Darling test. We expect that the proposed test will help the energy experts in the selection of appropriate statistical distribution for better estimation of energy produced devices.

Preliminaries

Let \(I_{{\text{N}}} \in \left[ {I_{{\text{L}}} , I_{{\text{U}}} } \right]\) be an indeterminacy interval. Suppose that \(X_{{\text{N}}} = X_{{\text{L}}} + X_{{\text{U}}} I_{{\text{N}}}\); \( I_{\rm N} \in \left[ {I_{\rm L} ,I_{\rm U} } \right] \) denotes the lifetime follows the neutrosophic Weibull distribution with neutrosophic scale parameter \(\theta_{N} = \theta_{L} + \theta_{U} I_{N}\); \(I_{{\text{N}}} \in \left[ {I_{{\text{L}}} , I_{{\text{U}}} } \right]\) and neutrosophic shape parameter \(\beta_{{\text{N}}} = \beta_{{\text{L}}} + \beta_{{\text{U}}} I_{{\text{N}}}\); \(I_{{\text{N}}} \in \left[ {I_{{\text{L}}} , I_{{\text{U}}} } \right]\). The neutrosophic cumulative distribution function (ncdf) is defined as

$$ F_{{\text{N}}} \left( {x_{{\text{N}}} } \right) = 1 - \exp \left\{ { - \left( {\frac{{x_{{\text{N}}} }}{{\theta_{{\text{N}}} }}} \right)^{{\beta_{{\text{N}}} }} } \right\};\;\theta_{{\text{N}}} \in \left[ {\theta_{{\text{L}}} , \theta_{{\text{U}}} } \right],\; \beta_{{\text{N}}} \in \left[ {\beta_{{\text{L}}} , \beta_{{\text{U}}} } \right],\;I_{{\text{N}}} \in \left[ {I_{{\text{L}}} , I_{{\text{U}}} } \right] $$
(1)

Note here that the neutrosophic Weibull distribution is given in Eq. (1) is the generalization of the Weibull distribution under classical statistics. The neutrosophic Weibull distribution reduces to the Weibull distribution under classical statistics if \(I_{{\text{L}}} = 0\).

Fitting of neutrosophic Weibull distribution

Now, we discuss the methodology to test the assumption either the given data having neutrosophic numbers follow the neutrosophic Weibull distribution or not. To develop the proposed test, it is assumed that the neutrosophic shape parameter and neutrosophic scale parameter of the neutrosophic Weibull distribution are unknown and estimated from the given neutrosophic data. The proposed test will be applied to test the null hypothesis that the neutrosophic data are fitted to the neutrosophic Weibull distribution versus the alternative hypothesis that the neutrosophic data do not follow the neutrosophic Weibull distribution. The goodness of fit test statistic, when the neutrosophic data do, not follows the neutrosophic statistics is given by

$$ \begin{aligned} {\text{AD}}_{{\text{N}}} & = \left[ {\mathop \sum \limits_{i} \frac{1 - 2i}{{n_{{\text{N}}} }}\left\{ {\ln \left( {1 - \exp \left( { - Z_{{\text{N}}} \left( i \right)} \right)} \right) - Z_{{n_{{\text{N}}} - i + 1}} } \right\} - n_{{\text{N}}} } \right];\\ &\quad n_{{\text{N}}} \in \left[ {n_{{\text{L}}} ,n_{{\text{U}}} } \right],\;{\text{AD}}_{{\text{N}}} \in \left[ {{\text{AD}}_{{\text{L}}} {\text{,AD}}_{{\text{U}}} } \right] \end{aligned} $$
(2)

where \(n_{{\text{N}}} \in \left[ {n_{{\text{L}}} ,n_{{\text{U}}} } \right]\) be a neutrosophic random sample and \(Z_{{\text{N}}} \left( i \right) = \left[ {x_{{\text{N}}} \left( i \right)/\theta_{{\text{N}}} } \right]^{{\beta_{{\text{N}}} }}\). According to Romeu [1], the Anderson–Darling test can be applied for small and large samples.

The neutrosophic form of statistic \({\text{AD}}_{{\text{N}}} \in \left[ {{\text{AD}}_{{\text{L}}} {\text{,AD}}_{{\text{U}}} } \right]\) can be expressed as follows

$$ {\text{AD}}_{{\text{N}}} {\text{ } = \text{ AD}}_{{\text{L}}} + {\text{AD}}_{{\text{U}}} I_{{\text{N}}} ;\;I_{{{\text{ND}}}} \in \left[ {I_{{{\text{LD}}}} ,I_{{{\text{UD}}}} } \right] $$
(3)

Note here that the proposed neutrosophic statistic \({\text{AD}}_{{\text{N}}} \in \left[ {{\text{AD}}_{{\text{L}}} {\text{,AD}}_{{\text{U}}} } \right]\) has two parts. The first part \({\text{AD}}_{{\text{L}}}\) is determined part and \({\text{AD}}_{{\text{U}}} I_{{{\text{ND}}}}\) is an indeterminate part of the test and \(I_{{{\text{ND}}}} \in \left[ {I_{{{\text{LD}}}} ,I_{{{\text{UD}}}} } \right]\) shows the indeterminate interval. The proposed statistic is the generalization of the goodness of fit test statistic under classical statistic discussed by Romeu and Grethlein [63], Amsc & CMPS [64] and Romeu [1]. The proposed test statistic reduces to the existing test statistic when \(I_{{\text{L}}}\) = 0. The modified statistic \({\text{AD}}_{{\text{N}}}^{*} \in \left[ {{\text{AD}}_{{\text{L}}}^{*} {\text{,AD}}_{{\text{U}}}^{*} } \right]\) is given by

$$ {\text{AD}}_{{\text{N}}}^{*} = \left( {1 + 0.2/\sqrt {n_{{\text{N}}} } } \right){\text{AD}}_{{\text{N}}} {;}\;\;n_{{\text{N}}} \in \left[ {n_{{\text{L}}} ,n_{{\text{U}}} } \right],\;{\text{AD}}_{{\text{N}}} \in \left[ {{\text{AD}}_{{\text{L}}},\,\, {\text{AD}}_{{\text{U}}} } \right] $$
(4)

The neutrosophic form of statistic \({\text{AD}}_{{\text{N}}}^{*} \in \left[ {{\text{AD}}_{{\text{L}}}^{*} {\text{,AD}}_{{\text{U}}}^{*} } \right]\) can be expressed as follows

$$ {\text{AD}}_{{\text{N}}}^{*} \;{ = }\;{\text{AD}}_{{\text{L}}}^{*} {\text{ } + \text{ AD}}_{{\text{U}}}^{*} I_{{\text{N}}} ;\;\;I_{{{\text{ND}}^{*} }} \in \left[ {I_{{{\text{LD}}^{*} }},\,\,I_{{{\text{UD}}^{*} }} } \right] $$
(5)

The statistic \({\text{AD}}_{{\text{N}}}^{*} \in \left[ {{\text{AD}}_{{\text{L}}}^{*} {\text{,AD}}_{{\text{U}}}^{*} } \right]\) reduces to statistic under classical statistics when no Neutrosophy is recorded. Note here that the first part of the neutrosophic form presents the statistic under classical statistics and \(D_{{\text{U}}}^{*} I_{{\text{N}}} ;\;I_{{{\text{ND}}^{*} }} \in \left[ {I_{{{\text{LD}}^{*} }} ,I_{{{\text{UD}}^{*} }} } \right]\) shows the indeterminate part, where \(I_{{{\text{ND}}^{*} }} \in \left[ {I_{{{\text{LD}}^{*} }} ,I_{{{\text{UD}}^{*} }} } \right]\) denotes the indeterminate interval associated with the statistic \( D_{{\text{N}}}^{*} \in \left[ {{\text{AD}}_{{\text{L}}}^{*} {,}\;{\text{AD}}_{{\text{U}}}^{*} } \right]\).

The p value of the test will be calculated using neutrosophic observed significance level (NOSL) which is given by

$$ \begin{aligned} {\text{OSL}}_{{\text{N}}} & = \frac{1}{{\left\{ {1 + \exp \left[ { - \;0.1 + 1.24\ln \left( {{\text{AD}}_{{\text{N}}}^{*} } \right) + 4.48\left( {{\text{AD}}_{{\text{N}}}^{*} } \right)} \right]} \right\}}};\\ &\quad {\text{OSL}}_{{\text{N}}} \in \left[ {{\text{OSL}}_{{\text{L}}} {\text{,OSL}}_{{\text{U}}} } \right] \end{aligned} $$
(6)

Based on the proposed test, if \({\text{OSL}}_{{\text{N}}} < 0.05\), the null hypothesis that the failure time having neutrosophic numbers follow the neutrosophic Weibull distribution is rejected and this error committed is less than 0.05. Based on this study, it is concluded that the given neutrosophic data do not fit the neutrosophic Weibull distribution if the calculated values of the statistic \({\text{OSL}}_{{\text{N}}}\) is less than 0.05, otherwise, the neutrosophic data follow the neutrosophic Weibull distribution. The operational process of the proposed test is shown in Fig. 1.

Fig. 1
figure 1

The procedure of the proposed test

Application of the proposed test

To discuss the application of the proposed test, we consider a life test experiment where 23 batteries are put on the test. A tested battery is labeled as a failed item if at least one of its parts fails to meet the given specification limits. According to Khoolenjani and Shahsanaie [65] “tested the battery may be considered as failed, or—strictly speaking—as nonconforming, when at least one value of its parameters falls beyond specification limits. In practice, however, we do not have the possibility to measure all parameters and are not able to define precisely the moment of a failure”. The lifetime in 100 h of 23 batteries are given as follows

[2.9, 3.99], [5.24,7.2], [6.56,9.02], [7.14,9.82], [11.6,15.96], [12.14,16.69], [12.65,17.4], [13.24,18.21], [13.67,18.79], [13.88,19.09], [15.64,21.51], [17.05,23.45], [17.4,23.93], [17.8,24.48], [19.01,26.14], [19.34,26.59], [23.13,31.81], [23.34,32.09], [26.07,35.84], [30.29,41.65], [43.97,60.46], [48.09,66.13], [73.48,98.04].

The industrial engineer is interested to test either the given data follow the Weibull distribution or not. It is clear that the lifetime of batteries is given in indeterminacy interval rather than the exact number. For this data, the use Anderson–Darling goodness of fit test under classical statistics is not suitable. Therefore, the alternative of the existing test is the proposed Anderson–Darling goodness of fit test under neutrosophic statistics. The necessary computation to perform the proposed test is shown in Table 1. The values of the statistic \({\text{AD}}_{{\text{N}}} \in \left[ {{\text{AD}}_{{\text{L}}} {\text{,AD}}_{{\text{U}}} } \right]\) for the real data are shown as

Table 1 The necessary computations for the proposed test

\({\text{AD}}_{{\text{N}}} = \Big[ \mathop \sum \limits_{i} \frac{1 - 2i}{{\left[ {23,23} \right]}}\Big\{ \ln \left( {1 - \exp \left( { - Z_{{\text{N}}} \left( i \right)} \right)} \right) - Z_{{n_{{\text{N}}} - i + 1}} \Big\} - \left[ {23,23} \right] \Big];\;\;{\text{ AD}}_{{\text{N}}} \in \left[ {67.54, 67.47} \right].\)

The calculation for modified statistic \({\text{AD}}_{{\text{N}}}^{*} \in \left[ {{\text{AD}}_{{\text{L}}}^{*} {\text{,AD}}_{{\text{U}}}^{*} } \right]\) is shown as follows

\({\text{AD}}_{{\text{N}}}^{*} = \left( {1 + 0.2/\sqrt {\left[ {23,23} \right]} } \right)\left[ {67.54, 67.47} \right];\, {\text{AD}}_{{\text{N}}}^{*} \in [ 70.35,70.28 ]\); .

The approximate values \({\text{OSL}}_{{\text{N}}} \in \left[ {{\text{OSL}}_{{\text{L}}} {\text{,OSL}}_{{\text{U}}} } \right]\) is shown as follows

$$ \begin{aligned} {\text{OSL}}_{{\text{N}}} & = \frac{1}{{\left\{ {1 + \exp \left[ { - \;0.1 + 1.24\ln \left( {\left[ {70.35,70.28} \right]} \right) + 4.48\left( {\left[ {70.35,70.28} \right]} \right)} \right]} \right\}}};\\ &\quad {\text{OSL}}_{{\text{N}}} \in \left[ {0,0} \right] \end{aligned} $$

We note that \({\text{OSL}}_{{\text{N}}} < 0.05\) for the batteries’ data. Therefore, it is concluded that the lifetime of batteries does not follow the neutrosophic Weibull distribution. The operational process of the proposed test for the batteries’ data are shown in Fig. 2.

Fig. 2
figure 2

The procedure of the proposed test for batteries’ data

Comparative study

Now we compare the efficiency of the proposed Anderson–Darling goodness of fit test under neutrosophic statistics with the Anderson–Darling goodness of fit test under classical statistics in the measure of indeterminacy. For the fair comparison, we will fix the same values of \(n_{{\text{N}}} \in \left[ {n_{{\text{L}}} ,n_{{\text{U}}} } \right]\) as used in the real example. The values of the proposed statistic \({\text{AD}}_{{\text{N}}} \in \left[ {67.54, 67.47} \right]\) in neutrosophic form can be written as \({\text{AD}}_{{\text{N}}} = 67.54 - 67.47I_{{\text{N}}} ;\;I_{{\text{N}}} \in \left[ {0,0.001} \right]\). The values modified statistic \({\text{AD}}_{{\text{N}}}^{*} \in \left[ {70.35,70.28} \right]\) can be written as \({\text{AD}}_{{\text{N}}}^{*} = 70.35 - 70.28I_{{\text{N}}} ;\;I_{{\text{N}}} \in \left[ {0,0.0009} \right]\). Note here that the values 67.54 and 70.35 represent the values of Anderson–Darling the goodness of fit test under classical statistics when \(I_{{\text{N}}} = 0\). For the real data when \(\alpha\) = 0.05, the chance of accepting the null hypothesis is 0.95, the chance of rejecting the null hypothesis is 0.05 and measures of indeterminacy are 0.001 and 0.0009. From this study, it can be noted that the proposed Anderson–Darling, the goodness of fit test under neutrosophic statistics provides information about the measure of indeterminacy while the existing Anderson–Darling, the goodness of fit test, does not provide any information about the measure of indeterminacy. In addition, the existing test provides the exact values of the statistics that are not required in uncertainty. Therefore, the proposed test Anderson–Darling, the goodness of fit test, under neutrosophic statistics is quite effective to be applied under uncertainty.

Concluding remarks

In this paper, we presented the Anderson–Darling test under neutrosophic statistics. We presented the methodology to fit the neutrosophic Weibull distribution on the data. We discussed the testing procedure with the help of reliability data. We applied the proposed test for batteries’ failure data and found that the data do not follow the neutrosophic Weibull distribution. From the application and comparative studies, it can be concluded that the proposed test is more informative, flexible, and effective to be applied in an uncertainty environment as compared to the existing Anderson–Darling test. The proposed test provides information about the measure of indeterminacy for testing of the null hypothesis. The proposed test has the limitation that it can be applied to test either the data follow the non-normal distribution such as neutrosophic Weibull distribution. The proposed test can be modified for other distribution accordingly. The proposed test cannot apply for testing the hypothesis for neutrosophic normal distribution. The efficiency of the proposed test in the power of the test can be studied as future research. The proposed test can be applied for big data as future research. In addition, the proposed test can be extended for other non-normal distributions as future research.