1 Introduction

Since the end of 2019, the coronavirus disease (COVID-19) pandemic has raged worldwide. The pandemic has seriously hindered social and economic development in most countries (Shorfuzzaman 2021). It is undeniable that enterprises have been plagued by many problems such as rising costs, declining revenue, and low operating efficiency. Minimizing the negative impact of the pandemic on enterprises has become a research hotspot (Alraja et al. 2022; Zamani et al. 2022). Hence, investigating the issues of performance prediction and optimization for enterprises in this mutant environment is of highly consequential. Specifically, an effective model is required to predict the performance in advance, and the corresponding performance optimization solution is generated simultaneously.

However, the issues of performance prediction and optimization are more complex in the context of the COVID-19 pandemic. Scholars have studied different techniques, such as statistical methods and artificial intelligence (AI) approaches. In the era of big data, the latter has rapidly developed in recent years. It is well known that the algorithms based on machine learning theory simulate human learning behavior, obtain new knowledge, and reorganize the existing knowledge structure to continuously improve performance (Huang et al. 2021a, b). These AI algorithms not only deal with large-scale data with nonlinear relationships but also obtain more accurate results. However, it is worth noting that reliable data guarantee the performance of AI algorithms. In the context of the COVID-19 pandemic, the business data of enterprises have fluctuated significantly (Lahmiri and Bekiros 2021; Xu et al. 2022), thereby aggravating performance prediction errors. The concept of efficiency from an input–output perspective makes it possible to solve this problem (Jiang et al. 2019). Efficiency is a more stable measure of an enterprise’s capabilities. Focusing on the role of multidimensional efficiency, this research proposes a more effective approach by combining the theory of machine learning and operations research. Meanwhile, it is necessary to optimize the performance of low-performance enterprises. A scientific optimization solution would enhance the resistance of enterprises to the COVID-19 pandemic. Therefore, two critical capabilities are involved in the proposed model: performance prediction and performance optimization.

Notably, the healthcare sector has played an irreplaceable role in the fight against COVID-19 (Hossain et al. 2020), including healthcare equipment and services enterprises, as well as pharmaceutical, biotechnology, and life sciences enterprises. For example, with the support of healthcare technology enterprises, China has effectively improved its response to the pandemic by using “health codes.” Owing to the efforts of biotechnology and pharmaceutical enterprises, various COVID-19 vaccines have been developed to reduce the risk of human mortality. Therefore, the proposed model is applied to the healthcare industry, representing not only an empirical study but also an avenue to help healthcare enterprises enhance their resistance to the COVID-19 pandemic.

In summary, this research investigates a valuable issue at the intersection of AI algorithms, operations research, and management science, and the main contributions of this research are threefold. First, the theory of machine learning and operations research are combined to develop an intelligent DEA-SVM model with interdisciplinary attributes, which typifies a robust theoretical innovation in combinatorial optimization. Second, the proposed approach conforms to the COVID-19 pandemic environment, and the theoretical innovation thus meets the needs of the times. Third, the proposed approach is successfully applied to the healthcare industry. Through a large volume of real data, the effectiveness and reliability of the approach are verified, and this has a significant application value.

The remainder of this paper is organized as follows: Sect. 2 presents a review of the relevant literature, Sect. 3 proposes the intelligent DEA-SVM model, Sect. 4 explains the data from the healthcare industry, Sect. 5 presents the numerical experiments and analyses, and the conclusions are presented in Sect. 6.

2 Literature review

In the era of big data, enterprise performance management relies heavily on information-driven decision-making, and machine learning is thus a promising source of new techniques (Mashrur et al. 2020; Tsai and Hung 2021). To enhance the prediction accuracy of credit performance, Wang and Ku (2021) propose a parallel artificial neural network (PANNs) ensemble model that creates several independent artificial neural networks (ANNs), each of which deals with the financial performance of the firms for each year, and the final output of the PANNs is aggregated via ensemble learning. Tong and Tong (2022) established a financial early warning system to predict financial operations using a decision tree algorithm. These studies predicted enterprise performance based on different AI techniques. The primary purpose of performance prediction is to screen out enterprises with financial or operational problems. Thus, this issue is a classification problem involving both high- and low-performance enterprises. Therefore, a classification algorithm based on machine learning theory, such as the SVM algorithm, is an effective tool (Huang et al. 2021a, b). To forecast the performance of listed companies, Yan et al. (2020) developed a new framework for a financial early warning system by combining several techniques such as SVM. Meanwhile, Li and Liu (2019) presented a novel method based on the gray kernel AR-SVM model, and the experimental results showed that their method was effective in the analysis of market preference data. Similarly, focusing on the impact of corporate debt financing capabilities and debt financing costs, Liu (2021) established an effective financial crisis early warning model by employing SVM. Considering the excellent classification capability of the SVM algorithm in previous studies, it was also introduced as a classifier in this study.

However, most studies ignore the impact of significant changes in an enterprise’s external environment on prediction results. Under the COVID-19 pandemic, there were huge fluctuations in financial and operational data. Thus, this irregular change is a great challenge for traditional prediction techniques. Therefore, a more reliable approach, which can convert original business data into more stable data, is required in the context of the COVID-19 pandemic. Moreover, to enhance resilience to the great changes in the external environment, it is essential to investigate scientific optimization solutions for low-performance enterprises. Consequently, the traditional SVM algorithm needs to be improved. Efficiency is a relatively stable measure of an enterprise’s capabilities, such as marketing, research and development (R&D), and operations capabilities (Li et al. 2010). Based on linear programming, DEA is a quantitative analysis approach for evaluating the relative effectiveness of comparable units of the same type (Huang et al. 2019; Soltanifar and Sharafi 2022). With DEA-based models, enterprise organizational capability was studied by Lin et al. (2021), sustainable development capability was measured by Jin et al. (2022), and technological innovation capability was evaluated by Gu et al. (2018). In terms of performance optimization, DEA-based approaches can also generate the optimization solutions for input/output resources. Concerning the financial field, Wu et al. (2022) proposed a novel DEA model, and they found that some samples had insufficient variables that needed to be increased, while others had redundant resources that required to be reduced. The corresponding optimization solutions are discussed further. Zhao and Ge (2020) applied DEA to academic entrepreneurship issues, and Wu et al. (2018) designed a DEA-based performance evaluation system for construction enterprises. From an input–output perspective, their approaches successfully generated solutions for performance optimization. Previous studies have demonstrated that enterprise performance can be optimized by increasing (reducing) insufficient (redundant) resources.

3 Proposed intelligent model

In this section, an intelligent DEA-SVM model is developed for two purposes. One purpose is to accurately predict enterprise performance using fluctuation data during the COVID-19 pandemic, while the other is to generate scientific performance optimization solutions for low-performance enterprises from the perspective of input/output resources. According to the SVM (Du et al. 2021) and DEA (Raayatpanah et al. 2021) theories, the proposed model is developed as follows, and the corresponding schematic is depicted in Fig. 1.

Fig. 1
figure 1

Schematic of the intelligent DEA-SVM model

In the context of the COVID-19 pandemic, it is supposed that there are n healthcare enterprises and n decision-making units (DMUs), where DMUj (j = 1,2,…,n). From an input–output perspective, each enterprise has m input resources, xij (i = 1,2,…,m), and s output resources, yrj (r = 1,2,…,s). Thus, the relative efficiencies of these n DMUs are generated by Eq. 1. Here, ε is the non-Archimedean infinitesimal. The \({s}_{i}^{-}\) and \({s}_{r}^{+}\) indicate the redundancy/shortage of the input/output variables, respectively. This model is based on the assumption of constant returns to scale, and the efficiency value is thus represented as technical efficiency (TE).

$$\mathrm{min}\theta ={\theta }_{0}-\varepsilon \left(\sum_{r=1}^{s}{s}_{r}^{+}+\sum_{i=1}^{m}{s}_{i}^{-}\right)$$
(1)
$$ {\text{s}}.{\text{t}}.\left\{ {\begin{array}{*{20}l} {\sum\limits_{j = 1}^{n} {\lambda_{j} } x_{ij} + s_{i}^{ - } = \theta_{0} x_{i0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} - s_{r}^{ + } = y_{r0} } \hfill \\ {\lambda_{j} ,s_{i}^{ - } ,s_{r}^{ + } \ge 0} \hfill \\ {i = 1,2, \ldots ,m} \hfill \\ {r = 1,2, \ldots ,s} \hfill \\ {j = 1,2, \ldots ,n} \hfill \\ \end{array} } \right. $$

Under the assumption of variable returns to scale, Eq. 1 is transformed into Eq. 2, and the corresponding efficiency value is pure technical efficiency (PTE). Based on the values of TE and PTE, scale efficiency (SE) is further calculated as SE = TE/PTE.

$$\mathrm{min}\theta ={\theta }_{0}-\varepsilon \left(\sum_{r=1}^{s}{s}_{r}^{+}+\sum_{i=1}^{m}{s}_{i}^{-}\right)$$
(2)
$$ {\text{s}}.{\text{t}}.\left\{ {\begin{array}{*{20}l} {\sum\limits_{j = 1}^{n} {\lambda_{j} } x_{ij} + s_{i}^{ - } = \theta_{0} x_{i0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} - s_{r}^{ + } = y_{r0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } = 1} \hfill \\ {\lambda_{j} ,s_{i}^{ - } ,s_{r}^{ + } \ge 0} \hfill \\ {i = 1,2, \ldots ,m} \hfill \\ {r = 1,2, \ldots ,s} \hfill \\ {j = 1,2, \ldots ,n} \hfill \\ \end{array} } \right. $$

Consequently, for a non-efficient DMU, the target values of the input/output resources are generated based on \({s}_{i}^{-}\) and \({s}_{r}^{+}\). The proportion of resources that should be reduced is then defined (see Eq. 3). Here, pro is the proportion, DO is the original data, and DT is the target data.

$$pro={D}_{O}-{D}_{T}/{D}_{O}$$
(3)

The efficiency values obtained are used as the inputs for the performance prediction. It is well known that an SVM constructs a hyperplane or set of hyperplanes that can be used for classification. In this research, enterprises are divided into high- and low-performance categories after the COVID-19 outbreak. Hence, the first task of the SVM algorithm is to determine the hyperplane that maximizes the margin (w) between the two classes. Here, \(w*x+b=0\) denotes the hyperplane. Thus, the problem is to determine w and b by solving the following objective function using quadratic programming (see Eq. 4).

$$ \begin{aligned} & \mathop {\min }\limits_{w,b} \frac{1}{2}||w^{2}|| \\ & {\text{s}}.{\text{t}}.\,y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right) \ge 1 \\ & i = 1,2, \ldots ,n \\ \end{aligned} $$
(4)

An ideal SVM analysis should produce a hyperplane that completely separates the vectors (cases) into two nonoverlapping classes. However, perfect separation may not be possible, or it may result in a model with so many cases that it does not classify correctly. Therefore, the slack parameter ξ is introduced to allow some instances to fall into the margin, but they are penalized. This means that parameter C is also introduced to penalize the misclassified instances and those within the margin (see Eq. 5).

$$ \begin{aligned} & \mathop {\min }\limits_{w,b,\xi \ge 0} \frac{1}{2}W^{T} W + C\mathop \sum \limits_{i} \xi_{i} \\ & {\text{s}}.{\text{t}}.\;y_{i} \left( {W^{T} x_{i} + b} \right) \ge 1 - \xi_{i} \\ & i = 1,2, \ldots ,n \\ & \xi_{i} \ge 0 \\ \end{aligned} $$
(5)

Using the Lagrange operator, this problem is transformed into Eqs. 6 and 7:

$$L\left(w,b,\xi ,\alpha ,\lambda \right)=\frac{1}{2}{W}^{T}W+C\sum_{i}{\xi }_{i}+\sum_{i}{\alpha }_{i}\left(1-{\xi }_{i}-{y}_{i}\left({W}^{T}{x}_{i}+b\right)\right)-\sum_{i}{\lambda }_{i}{\xi }_{i}$$
(6)
$$L\left(\xi ,\alpha ,\lambda \right)={\sum }_{i}{\alpha }_{i}-\frac{1}{2}{\sum }_{i,j}{\alpha }_{i}{\alpha }_{j}{y}_{i}{y}_{j}{X}_{i}^{T}{X}_{j}$$
(7)

Furthermore, the dual form of this problem (Eq. 8) and the Karush–Kuhn–Tucker conditions (Eq. 9) to be satisfied are given.

$$ \mathop {\max }\limits_{\alpha } W\left( \alpha \right) = \mathop \sum \limits_{i = 1}^{n} \alpha_{i} - \frac{1}{2}\mathop \sum \limits_{i,j = 1}^{n} y^{\left( i \right)} y^{\left( j \right)} \alpha_{i} \alpha_{j} (x^{\left( i \right)} ,x^{\left( j \right)}) $$
(8)
$$ \begin{aligned} & {\text{s}}.{\text{t}}.\;0 \le \alpha_{i} \le C \\ & i = 1,2, \ldots ,n \\ & \mathop \sum \limits_{i = 1}^{n} \alpha_{i} y^{\left( i \right)} = 0 \\ \end{aligned} $$
$$ \left\{ {\begin{array}{*{20}l} {\begin{array}{*{20}l} {\alpha_{i} = 0} \hfill & {y^{\left( i \right)} } \hfill \\ \end{array} \left( {w^{T} x^{\left( i \right)} + b} \right) \ge 1} \hfill \\ {\begin{array}{*{20}c} {\alpha_{i} = C} & {y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right)} \\ \end{array} \le 1} \hfill \\ {\begin{array}{*{20}l} {0 < \alpha_{i} < C} \hfill & {y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right) = 1} \hfill \\ \end{array} } \hfill \\ \end{array} } \right. $$
(9)

Moreover, there are situations whereby a nonlinear region can separate groups more efficiently. The kernel function is then employed to map the data into a different space, where a hyperplane (linear) cannot be used to perform the separation. This problem is then transformed into Eq. 10.

$$ \begin{aligned} & \mathop {\max }\limits_{\alpha \ge 0} \mathop \sum \limits_{i} \alpha_{i} - \frac{1}{2}\mathop \sum \limits_{i,j} \alpha_{i} \alpha_{j} y_{i} y_{j} K\left( {X_{i} ,X_{j} } \right) \\ & {\text{s}}.{\text{t}}.\mathop \sum \limits_{i} \alpha_{i} y_{i} = 0 \\ & \alpha_{i} \le C \\ & i = 1,2, \ldots ,n \\ \end{aligned} $$
(10)

The data are mapped into a new space and the inner product of the new vectors is obtained. The image of the inner product of the data is the inner product of the data images. The two kernel functions are shown below.

$$ K\left( {X_{i} ,X_{j} } \right) = X_{i}^{T} X_{j} \quad \left( {{\text{Linear}}\,{\text{Kernal}}} \right) $$
$$ K\left( {X_{i} ,X_{j} } \right) = {\text{exp}}\left( { - \frac{||{X_{i} - X_{j}^{2}|| }}{{2\sigma^{2} }}} \right)\quad \left( {{\text{RBF}}\,{\text{Kernal}}} \right) $$

4 Experimental data

In this study, the data of the healthcare industry were rigorously collected for numerical experiments, and the variables and datasets in line with the proposed DEA-SVM model were selected scientifically.

4.1 Data acquisition

First, the data of healthcare enterprises were collected through the Wharton Research Data Services (WRDS) database, and the Global Industry Classification Standard (GICS) was employed to screen out healthcare enterprises among various types of enterprises. Healthcare equipment and services enterprises and pharmaceutical, biotechnology, and life sciences enterprises were both considered. Second, to effectively validate the capability of the DEA-SVM model under the COVID-19 pandemic, the enterprises’ business data before the outbreak of the COVID-19 (2018), and the enterprises’ performance data after the outbreak of the COVID-19 (2020) were collected.

4.2 Variable explanation

Based on the analysis of relevant variables in previous studies, 10 vital variables were selected for this study.

  1. 1.

    Total revenue (million). This variable represents the gross income received from all the divisions of the company. High total revenue is usually desirable for companies. It is the primary output of the firms from the input–output perspective.

  2. 2.

    Cost of goods sold (COGS, million). This variable represents all the costs directly allocated by the company to production, such as material, labor, and overhead.

  3. 3.

    Capital expenditure (million). This variable represents the cash outflow or the funds used for additions to the company's property, plant, and equipment (PPE).

  4. 4.

    Research and development expense (R&D expense, million). This variable represents all the costs related to the development of new products or services, and it is indispensable for analyzing healthcare enterprises.

  5. 5.

    Employees (thousand). This variable represents the number of company workers.

  6. 6.

    Selling, general, and administrative expense (SGA expense, million). This variable represents all the commercial expenses incurred in the regular course of business pertaining to the securing of operating income.

  7. 7.

    Working capital (million). This variable represents the difference between total current assets minus total current liabilities.

  8. 8.

    Total current asset (million). This variable represents the cash and other assets that are expected to be realized in cash or used in revenue generation.

  9. 9.

    Total property, plant, and equipment (Total PPE, million). This variable represents the cost, less accumulated depreciation, of tangible fixed properties used in the production of revenue.

  10. 10.

    Performance category. In the WRDS database, quality ranking is an item of appraisal of firm performance. Based on this item, the performance category variable was defined as two categories. Numbers 1 and 0 represent high and low performances, respectively.

Importantly, four efficiency variables were generated based on these variables. The four efficiency variables comprehensively reflect a firm’s capability from different perspectives.

  1. 1.

    Operation Efficiency. Inputs are cost of goods sold and capital expenditures; output is total revenue.

  2. 2.

    Research & development Efficiency. Inputs are R&D expense and employees; output is total revenue.

  3. 3.

    Marketing Efficiency. Inputs are SGA expense and working capital; output is total revenue.

  4. 4.

    Finance Efficiency. Inputs are total current assets and total PPE; output is total revenue.

4.3 Dataset explanation

To improve the reliability and effectiveness of the numerical experiments, an equal number of high- and low-performance enterprises were included in the data samples. In addition, each enterprise has business data for 2018 and performance data for 2020. After removing invalid data, the final sample comprised 120 healthcare enterprises. Among them, there were 60 enterprises each in the high- and low-performance categories. These 120 enterprises were randomly divided into two data groups: training and testing. There were 90 enterprises in the training data group, with 45 high- and 45 low-performance enterprises. For the testing data group, 15 high- and 15 low-performance enterprises were involved.

The descriptive statistical analyses for the business data variables are given in Table 1, and these data are normalized before the numerical experiments using Eq. 11. Here, Vnew is the normalized data, V is the original data, Vmin is the minimum value of the corresponding variable, and Vmax is the maximum value of the corresponding variable.

Table 1 Descriptive statistical analysis for business data variables
$${V}_{new}=\frac{V-{V}_{min}}{{V}_{max}-{V}_{min}}$$
(11)

5 Numerical experiments

Numerical experiments are conducted for two purposes: one is to examine whether the proposed model can achieve satisfactory prediction results in the context of the COVID-19 pandemic, and the other is to examine whether the proposed model can provide scientific performance optimization solutions for enterprises in advance.

5.1 Performance prediction and analysis

In this section, the proposed intelligent DEA-SVM model is compared with the traditional SVM algorithm. Both approaches utilized the experimental data of 120 healthcare enterprises: 90 training samples and 30 testing samples.

Because the business data fluctuated significantly in the context of the COVID-19 pandemic, the DEA-SVM model intelligently transformed the original business data into efficiency data, which improved the stability of the inputs. Consequently, the DEA-SVM model took the efficiency data of four variables (operations, R&D, marketing, and finance efficiencies) in 2018 as inputs and the performance category data in 2020 as the output. However, the traditional SVM algorithm predicted the performance by utilizing the original business data. Therefore, the inputs were the eight business variables (cost of goods sold, capital expenditure, R&D expense, employees, SGA expense, working capital, total current asset, and total PPE) corresponding to the four efficiency variables, and the output was also the performance category.

To ensure the reliability of the results, experiments were conducted with different cost values (c = 0.5, c = 1, c = 5, c = 10) and kernel types (linear and radial basis function (RBF)), which are two key parameters in the SVM algorithm. The SVM type was C-SVC. The prediction accuracy results are shown in Figs. 2 and 3.

Fig. 2
figure 2

Comparison of prediction results (kernel type of SVM is linear)

Fig. 3
figure 3

Comparison of prediction results (kernel type of SVM is RBF)

From Figs. 2 and 3, it is clear that the intelligent DEA-SVM model outperforms the traditional SVM algorithm under different cost values and kernel types; however, there is a slight drop when c = 0.5, and the kernel type is RBF. Second, the results of the linear kernel type are generally better than those of the RBF kernel type. In particular, when the kernel type is linear and c = 1.00, the prediction accuracy of the DEA-SVM model reaches 90%, which is 20% higher than that of the traditional SVM algorithm.

Owing to the pronounced impact of the COVID-19 pandemic on healthcare enterprises, business data fluctuated irregularly. There would be considerable errors if the original business data were directly used for the performance prediction. The efficiency variable generated from the input–output perspective can steadily reflect an enterprise’s capabilities. Therefore, the proposed intelligent DEA-SVM model achieved a better prediction capability by combining the theory of AI algorithms and operations research.

5.2 Performance optimization and analysis

For enterprises with low-performance, it is necessary to optimize their performance in advance to combat the negative impact of the COVID-19 pandemic. Thus, the purpose of this section is to verify how to conceptualize optimization solutions for enterprises with low performance using the DEA-SVM model.

By carefully observing the efficiency data of 120 sample enterprises, it is found that the average efficiency value in the high-performance category is significantly higher than that in the low-performance category, and this result is consistent for three efficiency types: technical, pure technical, and scale. In particular, for the technical efficiency used in the prediction experiment, the average value of the high-performance category is 18% higher than that of the low-performance category.

As a result, the proposed DEA-SVM model further computes the improvement proportion of eight business variables to realize performance optimization, that is, the proportion by which each variable should be reduced. Figure 4 shows the average improvement proportion among the 120 enterprises.

Fig. 4
figure 4

Performance optimization for all enterprises (average value)

From Fig. 4, it is demonstrated that the proposed DEA-SVM model can generate the proportion of improvement for each business variable. To enhance the healthcare industry’s resistance to the COVID-19 pandemic, working capital and R&D expenses, which are the most redundant variables, should be reduced by 94.51% and 90.22%, respectively, while capital expenditure only need to be reduced by 47.01%. The redundancy of employees and total PPE are medium, and they should be reduced by 73.57% and 82.60%, respectively.

However, these results are the average values for 120 sample enterprises, whereas the optimization solutions for each enterprise are different. Some enterprises need to reduce the number of input variables, while others do not. In fact, the DEA-SVM model can simultaneously generate the performance optimization solution for each enterprise, and Fig. 5 shows the optimization solutions for two randomly selected enterprises. Abbott Laboratories is a healthcare equipment and services enterprise in the USA, and Thermo Fisher Scientific is a pharmaceutical, biotechnology, and life sciences enterprise also in the USA. Evidently, the results for the two enterprises are completely different. Thus, each enterprise should optimize its performance according to its unique situation.

Fig. 5
figure 5

Performance optimization for two case-study enterprises

6 Conclusion

In the context of the COVID-19 pandemic, this research presented an intelligent DEA-SVM model to predict and optimize the performance of healthcare enterprises. The WRDS database was used to collect business (performance) data before (after) the COVID-19 outbreak. Through a numerical experiment on performance prediction, four efficiency variables that reflect an enterprise’s capabilities are generated: operation, R&D, marketing, and finance efficiencies. The prediction accuracy of the DEA-SVM model was then compared with that of the traditional SVM algorithm. By combining various parameters, it was comprehensively verified that the DEA-SVM model had an excellent prediction capability. The prediction accuracy was optimal when the kernel type was linear and the cost parameter was 1.00. In terms of the numerical experiment for performance optimization, the optimization solutions for low-performance enterprises were obtained. Working capital and R&D expense were the most redundant resources for all sample enterprises. For each specific enterprise, the corresponding optimization solution was simultaneously generated. These experiments not only proved the theoretical innovation of the DEA-SVM model but also reflected its application value. In the future, efficiency-related parameters could be improved through additional experiments. It is also important to explore the feasibility of applying this model to other fields.