Performance prediction and optimization for healthcare enterprises in the context of the COVID-19 pandemic: an intelligent DEA-SVM model

Huang, He; Zhong, Liwei; Shen, Ting; Wang, Huixin

doi:10.1007/s10878-022-00911-9

Performance prediction and optimization for healthcare enterprises in the context of the COVID-19 pandemic: an intelligent DEA-SVM model

Published: 12 October 2022

Volume 44, pages 3778–3791, (2022)
Cite this article

Download PDF

Journal of Combinatorial Optimization Aims and scope Submit manuscript

Performance prediction and optimization for healthcare enterprises in the context of the COVID-19 pandemic: an intelligent DEA-SVM model

Download PDF

He Huang¹,
Liwei Zhong ORCID: orcid.org/0000-0002-7945-5248²,
Ting Shen² &
…
Huixin Wang²

1735 Accesses
Explore all metrics

Abstract

The coronavirus disease (COVID-19) pandemic has caused significant changes in the external environment of enterprises, resulting in tremendous negative impacts. Accordingly, the irregular fluctuation of business data poses a critical challenge to traditional approaches. Therefore, to combat the effects of the COVID-19 pandemic, an effective model is required to proactively predict an enterprise’s performance and simultaneously generate scientific performance optimization solutions. Consequently, at the intersection of artificial intelligence algorithms, operations research, and management science, an intelligent DEA-SVM model, which has a theoretical contribution, is developed in this study. The capabilities of this model are verified through sufficient numerical experiments. On the one hand, this model outperforms traditional algorithms in prediction accuracy. On the other hand, effective performance optimization solutions for low-performance enterprises are obtained from the input–output perspective. Moreover, the application value of this model is reflected in its successful implementation in the healthcare industry. Thus, it is a user-friendly tool for realizing the stable operation of enterprises in the context of the COVID-19 pandemic.

Artificial Intelligence and Business Value: a Literature Review

Article Open access 25 August 2021

Predictive big data analytics for supply chain demand forecasting: methods, applications, and research opportunities

Article Open access 25 July 2020

Big Data Analytics: Applications, Prospects and Challenges

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Since the end of 2019, the coronavirus disease (COVID-19) pandemic has raged worldwide. The pandemic has seriously hindered social and economic development in most countries (Shorfuzzaman 2021). It is undeniable that enterprises have been plagued by many problems such as rising costs, declining revenue, and low operating efficiency. Minimizing the negative impact of the pandemic on enterprises has become a research hotspot (Alraja et al. 2022; Zamani et al. 2022). Hence, investigating the issues of performance prediction and optimization for enterprises in this mutant environment is of highly consequential. Specifically, an effective model is required to predict the performance in advance, and the corresponding performance optimization solution is generated simultaneously.

However, the issues of performance prediction and optimization are more complex in the context of the COVID-19 pandemic. Scholars have studied different techniques, such as statistical methods and artificial intelligence (AI) approaches. In the era of big data, the latter has rapidly developed in recent years. It is well known that the algorithms based on machine learning theory simulate human learning behavior, obtain new knowledge, and reorganize the existing knowledge structure to continuously improve performance (Huang et al. 2021a, b). These AI algorithms not only deal with large-scale data with nonlinear relationships but also obtain more accurate results. However, it is worth noting that reliable data guarantee the performance of AI algorithms. In the context of the COVID-19 pandemic, the business data of enterprises have fluctuated significantly (Lahmiri and Bekiros 2021; Xu et al. 2022), thereby aggravating performance prediction errors. The concept of efficiency from an input–output perspective makes it possible to solve this problem (Jiang et al. 2019). Efficiency is a more stable measure of an enterprise’s capabilities. Focusing on the role of multidimensional efficiency, this research proposes a more effective approach by combining the theory of machine learning and operations research. Meanwhile, it is necessary to optimize the performance of low-performance enterprises. A scientific optimization solution would enhance the resistance of enterprises to the COVID-19 pandemic. Therefore, two critical capabilities are involved in the proposed model: performance prediction and performance optimization.

Notably, the healthcare sector has played an irreplaceable role in the fight against COVID-19 (Hossain et al. 2020), including healthcare equipment and services enterprises, as well as pharmaceutical, biotechnology, and life sciences enterprises. For example, with the support of healthcare technology enterprises, China has effectively improved its response to the pandemic by using “health codes.” Owing to the efforts of biotechnology and pharmaceutical enterprises, various COVID-19 vaccines have been developed to reduce the risk of human mortality. Therefore, the proposed model is applied to the healthcare industry, representing not only an empirical study but also an avenue to help healthcare enterprises enhance their resistance to the COVID-19 pandemic.

In summary, this research investigates a valuable issue at the intersection of AI algorithms, operations research, and management science, and the main contributions of this research are threefold. First, the theory of machine learning and operations research are combined to develop an intelligent DEA-SVM model with interdisciplinary attributes, which typifies a robust theoretical innovation in combinatorial optimization. Second, the proposed approach conforms to the COVID-19 pandemic environment, and the theoretical innovation thus meets the needs of the times. Third, the proposed approach is successfully applied to the healthcare industry. Through a large volume of real data, the effectiveness and reliability of the approach are verified, and this has a significant application value.

The remainder of this paper is organized as follows: Sect. 2 presents a review of the relevant literature, Sect. 3 proposes the intelligent DEA-SVM model, Sect. 4 explains the data from the healthcare industry, Sect. 5 presents the numerical experiments and analyses, and the conclusions are presented in Sect. 6.

2 Literature review

In the era of big data, enterprise performance management relies heavily on information-driven decision-making, and machine learning is thus a promising source of new techniques (Mashrur et al. 2020; Tsai and Hung 2021). To enhance the prediction accuracy of credit performance, Wang and Ku (2021) propose a parallel artificial neural network (PANNs) ensemble model that creates several independent artificial neural networks (ANNs), each of which deals with the financial performance of the firms for each year, and the final output of the PANNs is aggregated via ensemble learning. Tong and Tong (2022) established a financial early warning system to predict financial operations using a decision tree algorithm. These studies predicted enterprise performance based on different AI techniques. The primary purpose of performance prediction is to screen out enterprises with financial or operational problems. Thus, this issue is a classification problem involving both high- and low-performance enterprises. Therefore, a classification algorithm based on machine learning theory, such as the SVM algorithm, is an effective tool (Huang et al. 2021a, b). To forecast the performance of listed companies, Yan et al. (2020) developed a new framework for a financial early warning system by combining several techniques such as SVM. Meanwhile, Li and Liu (2019) presented a novel method based on the gray kernel AR-SVM model, and the experimental results showed that their method was effective in the analysis of market preference data. Similarly, focusing on the impact of corporate debt financing capabilities and debt financing costs, Liu (2021) established an effective financial crisis early warning model by employing SVM. Considering the excellent classification capability of the SVM algorithm in previous studies, it was also introduced as a classifier in this study.

However, most studies ignore the impact of significant changes in an enterprise’s external environment on prediction results. Under the COVID-19 pandemic, there were huge fluctuations in financial and operational data. Thus, this irregular change is a great challenge for traditional prediction techniques. Therefore, a more reliable approach, which can convert original business data into more stable data, is required in the context of the COVID-19 pandemic. Moreover, to enhance resilience to the great changes in the external environment, it is essential to investigate scientific optimization solutions for low-performance enterprises. Consequently, the traditional SVM algorithm needs to be improved. Efficiency is a relatively stable measure of an enterprise’s capabilities, such as marketing, research and development (R&D), and operations capabilities (Li et al. 2010). Based on linear programming, DEA is a quantitative analysis approach for evaluating the relative effectiveness of comparable units of the same type (Huang et al. 2019; Soltanifar and Sharafi 2022). With DEA-based models, enterprise organizational capability was studied by Lin et al. (2021), sustainable development capability was measured by Jin et al. (2022), and technological innovation capability was evaluated by Gu et al. (2018). In terms of performance optimization, DEA-based approaches can also generate the optimization solutions for input/output resources. Concerning the financial field, Wu et al. (2022) proposed a novel DEA model, and they found that some samples had insufficient variables that needed to be increased, while others had redundant resources that required to be reduced. The corresponding optimization solutions are discussed further. Zhao and Ge (2020) applied DEA to academic entrepreneurship issues, and Wu et al. (2018) designed a DEA-based performance evaluation system for construction enterprises. From an input–output perspective, their approaches successfully generated solutions for performance optimization. Previous studies have demonstrated that enterprise performance can be optimized by increasing (reducing) insufficient (redundant) resources.

3 Proposed intelligent model

In this section, an intelligent DEA-SVM model is developed for two purposes. One purpose is to accurately predict enterprise performance using fluctuation data during the COVID-19 pandemic, while the other is to generate scientific performance optimization solutions for low-performance enterprises from the perspective of input/output resources. According to the SVM (Du et al. 2021) and DEA (Raayatpanah et al. 2021) theories, the proposed model is developed as follows, and the corresponding schematic is depicted in Fig. 1.

In the context of the COVID-19 pandemic, it is supposed that there are n healthcare enterprises and n decision-making units (DMUs), where DMU_j (j = 1,2,…,n). From an input–output perspective, each enterprise has m input resources, x_ij (i = 1,2,…,m), and s output resources, y_rj (r = 1,2,…,s). Thus, the relative efficiencies of these n DMUs are generated by Eq. 1. Here, ε is the non-Archimedean infinitesimal. The ${s}_{i}^{-}$ and ${s}_{r}^{+}$ indicate the redundancy/shortage of the input/output variables, respectively. This model is based on the assumption of constant returns to scale, and the efficiency value is thus represented as technical efficiency (TE).

$$\mathrm{min}\theta ={\theta }_{0}-\varepsilon \left(\sum_{r=1}^{s}{s}_{r}^{+}+\sum_{i=1}^{m}{s}_{i}^{-}\right)$$

(1)

$$ {\text{s}}.{\text{t}}.\left\{ {\begin{array}{*{20}l} {\sum\limits_{j = 1}^{n} {\lambda_{j} } x_{ij} + s_{i}^{ - } = \theta_{0} x_{i0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} - s_{r}^{ + } = y_{r0} } \hfill \\ {\lambda_{j} ,s_{i}^{ - } ,s_{r}^{ + } \ge 0} \hfill \\ {i = 1,2, \ldots ,m} \hfill \\ {r = 1,2, \ldots ,s} \hfill \\ {j = 1,2, \ldots ,n} \hfill \\ \end{array} } \right. $$

Under the assumption of variable returns to scale, Eq. 1 is transformed into Eq. 2, and the corresponding efficiency value is pure technical efficiency (PTE). Based on the values of TE and PTE, scale efficiency (SE) is further calculated as SE = TE/PTE.

$$\mathrm{min}\theta ={\theta }_{0}-\varepsilon \left(\sum_{r=1}^{s}{s}_{r}^{+}+\sum_{i=1}^{m}{s}_{i}^{-}\right)$$

(2)

$$ {\text{s}}.{\text{t}}.\left\{ {\begin{array}{*{20}l} {\sum\limits_{j = 1}^{n} {\lambda_{j} } x_{ij} + s_{i}^{ - } = \theta_{0} x_{i0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} - s_{r}^{ + } = y_{r0} } \hfill \\ {\sum\limits_{j = 1}^{n} {\lambda_{j} } = 1} \hfill \\ {\lambda_{j} ,s_{i}^{ - } ,s_{r}^{ + } \ge 0} \hfill \\ {i = 1,2, \ldots ,m} \hfill \\ {r = 1,2, \ldots ,s} \hfill \\ {j = 1,2, \ldots ,n} \hfill \\ \end{array} } \right. $$

Consequently, for a non-efficient DMU, the target values of the input/output resources are generated based on ${s}_{i}^{-}$ and ${s}_{r}^{+}$. The proportion of resources that should be reduced is then defined (see Eq. 3). Here, pro is the proportion, D_O is the original data, and D_T is the target data.

$$pro={D}_{O}-{D}_{T}/{D}_{O}$$

(3)

The efficiency values obtained are used as the inputs for the performance prediction. It is well known that an SVM constructs a hyperplane or set of hyperplanes that can be used for classification. In this research, enterprises are divided into high- and low-performance categories after the COVID-19 outbreak. Hence, the first task of the SVM algorithm is to determine the hyperplane that maximizes the margin (w) between the two classes. Here, $w*x+b=0$ denotes the hyperplane. Thus, the problem is to determine w and b by solving the following objective function using quadratic programming (see Eq. 4).

$$ \begin{aligned} & \mathop {\min }\limits_{w,b} \frac{1}{2}||w^{2}|| \\ & {\text{s}}.{\text{t}}.\,y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right) \ge 1 \\ & i = 1,2, \ldots ,n \\ \end{aligned} $$

(4)

An ideal SVM analysis should produce a hyperplane that completely separates the vectors (cases) into two nonoverlapping classes. However, perfect separation may not be possible, or it may result in a model with so many cases that it does not classify correctly. Therefore, the slack parameter ξ is introduced to allow some instances to fall into the margin, but they are penalized. This means that parameter C is also introduced to penalize the misclassified instances and those within the margin (see Eq. 5).

$$ \begin{aligned} & \mathop {\min }\limits_{w,b,\xi \ge 0} \frac{1}{2}W^{T} W + C\mathop \sum \limits_{i} \xi_{i} \\ & {\text{s}}.{\text{t}}.\;y_{i} \left( {W^{T} x_{i} + b} \right) \ge 1 - \xi_{i} \\ & i = 1,2, \ldots ,n \\ & \xi_{i} \ge 0 \\ \end{aligned} $$

(5)

Using the Lagrange operator, this problem is transformed into Eqs. 6 and 7:

$$L\left(w,b,\xi ,\alpha ,\lambda \right)=\frac{1}{2}{W}^{T}W+C\sum_{i}{\xi }_{i}+\sum_{i}{\alpha }_{i}\left(1-{\xi }_{i}-{y}_{i}\left({W}^{T}{x}_{i}+b\right)\right)-\sum_{i}{\lambda }_{i}{\xi }_{i}$$

(6)

$$L\left(\xi ,\alpha ,\lambda \right)={\sum }_{i}{\alpha }_{i}-\frac{1}{2}{\sum }_{i,j}{\alpha }_{i}{\alpha }_{j}{y}_{i}{y}_{j}{X}_{i}^{T}{X}_{j}$$

(7)

Furthermore, the dual form of this problem (Eq. 8) and the Karush–Kuhn–Tucker conditions (Eq. 9) to be satisfied are given.

$$ \mathop {\max }\limits_{\alpha } W\left( \alpha \right) = \mathop \sum \limits_{i = 1}^{n} \alpha_{i} - \frac{1}{2}\mathop \sum \limits_{i,j = 1}^{n} y^{\left( i \right)} y^{\left( j \right)} \alpha_{i} \alpha_{j} (x^{\left( i \right)} ,x^{\left( j \right)}) $$

(8)

$$ \begin{aligned} & {\text{s}}.{\text{t}}.\;0 \le \alpha_{i} \le C \\ & i = 1,2, \ldots ,n \\ & \mathop \sum \limits_{i = 1}^{n} \alpha_{i} y^{\left( i \right)} = 0 \\ \end{aligned} $$

$$ \left\{ {\begin{array}{*{20}l} {\begin{array}{*{20}l} {\alpha_{i} = 0} \hfill & {y^{\left( i \right)} } \hfill \\ \end{array} \left( {w^{T} x^{\left( i \right)} + b} \right) \ge 1} \hfill \\ {\begin{array}{*{20}c} {\alpha_{i} = C} & {y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right)} \\ \end{array} \le 1} \hfill \\ {\begin{array}{*{20}l} {0 < \alpha_{i} < C} \hfill & {y^{\left( i \right)} \left( {w^{T} x^{\left( i \right)} + b} \right) = 1} \hfill \\ \end{array} } \hfill \\ \end{array} } \right. $$

(9)

Moreover, there are situations whereby a nonlinear region can separate groups more efficiently. The kernel function is then employed to map the data into a different space, where a hyperplane (linear) cannot be used to perform the separation. This problem is then transformed into Eq. 10.

$$ \begin{aligned} & \mathop {\max }\limits_{\alpha \ge 0} \mathop \sum \limits_{i} \alpha_{i} - \frac{1}{2}\mathop \sum \limits_{i,j} \alpha_{i} \alpha_{j} y_{i} y_{j} K\left( {X_{i} ,X_{j} } \right) \\ & {\text{s}}.{\text{t}}.\mathop \sum \limits_{i} \alpha_{i} y_{i} = 0 \\ & \alpha_{i} \le C \\ & i = 1,2, \ldots ,n \\ \end{aligned} $$

(10)

The data are mapped into a new space and the inner product of the new vectors is obtained. The image of the inner product of the data is the inner product of the data images. The two kernel functions are shown below.

$$ K\left( {X_{i} ,X_{j} } \right) = X_{i}^{T} X_{j} \quad \left( {{\text{Linear}}\,{\text{Kernal}}} \right) $$

$$ K\left( {X_{i} ,X_{j} } \right) = {\text{exp}}\left( { - \frac{||{X_{i} - X_{j}^{2}|| }}{{2\sigma^{2} }}} \right)\quad \left( {{\text{RBF}}\,{\text{Kernal}}} \right) $$

4 Experimental data

In this study, the data of the healthcare industry were rigorously collected for numerical experiments, and the variables and datasets in line with the proposed DEA-SVM model were selected scientifically.

4.1 Data acquisition

First, the data of healthcare enterprises were collected through the Wharton Research Data Services (WRDS) database, and the Global Industry Classification Standard (GICS) was employed to screen out healthcare enterprises among various types of enterprises. Healthcare equipment and services enterprises and pharmaceutical, biotechnology, and life sciences enterprises were both considered. Second, to effectively validate the capability of the DEA-SVM model under the COVID-19 pandemic, the enterprises’ business data before the outbreak of the COVID-19 (2018), and the enterprises’ performance data after the outbreak of the COVID-19 (2020) were collected.

4.2 Variable explanation

Based on the analysis of relevant variables in previous studies, 10 vital variables were selected for this study.

1.
Total revenue (million). This variable represents the gross income received from all the divisions of the company. High total revenue is usually desirable for companies. It is the primary output of the firms from the input–output perspective.
2.
Cost of goods sold (COGS, million). This variable represents all the costs directly allocated by the company to production, such as material, labor, and overhead.
3.
Capital expenditure (million). This variable represents the cash outflow or the funds used for additions to the company's property, plant, and equipment (PPE).
4.
Research and development expense (R&D expense, million). This variable represents all the costs related to the development of new products or services, and it is indispensable for analyzing healthcare enterprises.
5.
Employees (thousand). This variable represents the number of company workers.
6.
Selling, general, and administrative expense (SGA expense, million). This variable represents all the commercial expenses incurred in the regular course of business pertaining to the securing of operating income.
7.
Working capital (million). This variable represents the difference between total current assets minus total current liabilities.
8.
Total current asset (million). This variable represents the cash and other assets that are expected to be realized in cash or used in revenue generation.
9.
Total property, plant, and equipment (Total PPE, million). This variable represents the cost, less accumulated depreciation, of tangible fixed properties used in the production of revenue.
10.
Performance category. In the WRDS database, quality ranking is an item of appraisal of firm performance. Based on this item, the performance category variable was defined as two categories. Numbers 1 and 0 represent high and low performances, respectively.

Importantly, four efficiency variables were generated based on these variables. The four efficiency variables comprehensively reflect a firm’s capability from different perspectives.

1.
Operation Efficiency. Inputs are cost of goods sold and capital expenditures; output is total revenue.
2.
Research & development Efficiency. Inputs are R&D expense and employees; output is total revenue.
3.
Marketing Efficiency. Inputs are SGA expense and working capital; output is total revenue.
4.
Finance Efficiency. Inputs are total current assets and total PPE; output is total revenue.

4.3 Dataset explanation

To improve the reliability and effectiveness of the numerical experiments, an equal number of high- and low-performance enterprises were included in the data samples. In addition, each enterprise has business data for 2018 and performance data for 2020. After removing invalid data, the final sample comprised 120 healthcare enterprises. Among them, there were 60 enterprises each in the high- and low-performance categories. These 120 enterprises were randomly divided into two data groups: training and testing. There were 90 enterprises in the training data group, with 45 high- and 45 low-performance enterprises. For the testing data group, 15 high- and 15 low-performance enterprises were involved.

The descriptive statistical analyses for the business data variables are given in Table 1, and these data are normalized before the numerical experiments using Eq. 11. Here, V_new is the normalized data, V is the original data, V_min is the minimum value of the corresponding variable, and Vmax is the maximum value of the corresponding variable.

Table 1 Descriptive statistical analysis for business data variables

Full size table

$${V}_{new}=\frac{V-{V}_{min}}{{V}_{max}-{V}_{min}}$$

(11)

5 Numerical experiments

Numerical experiments are conducted for two purposes: one is to examine whether the proposed model can achieve satisfactory prediction results in the context of the COVID-19 pandemic, and the other is to examine whether the proposed model can provide scientific performance optimization solutions for enterprises in advance.

5.1 Performance prediction and analysis

In this section, the proposed intelligent DEA-SVM model is compared with the traditional SVM algorithm. Both approaches utilized the experimental data of 120 healthcare enterprises: 90 training samples and 30 testing samples.

Because the business data fluctuated significantly in the context of the COVID-19 pandemic, the DEA-SVM model intelligently transformed the original business data into efficiency data, which improved the stability of the inputs. Consequently, the DEA-SVM model took the efficiency data of four variables (operations, R&D, marketing, and finance efficiencies) in 2018 as inputs and the performance category data in 2020 as the output. However, the traditional SVM algorithm predicted the performance by utilizing the original business data. Therefore, the inputs were the eight business variables (cost of goods sold, capital expenditure, R&D expense, employees, SGA expense, working capital, total current asset, and total PPE) corresponding to the four efficiency variables, and the output was also the performance category.

To ensure the reliability of the results, experiments were conducted with different cost values (c = 0.5, c = 1, c = 5, c = 10) and kernel types (linear and radial basis function (RBF)), which are two key parameters in the SVM algorithm. The SVM type was C-SVC. The prediction accuracy results are shown in Figs. 2 and 3.

From Figs. 2 and 3, it is clear that the intelligent DEA-SVM model outperforms the traditional SVM algorithm under different cost values and kernel types; however, there is a slight drop when c = 0.5, and the kernel type is RBF. Second, the results of the linear kernel type are generally better than those of the RBF kernel type. In particular, when the kernel type is linear and c = 1.00, the prediction accuracy of the DEA-SVM model reaches 90%, which is 20% higher than that of the traditional SVM algorithm.

Owing to the pronounced impact of the COVID-19 pandemic on healthcare enterprises, business data fluctuated irregularly. There would be considerable errors if the original business data were directly used for the performance prediction. The efficiency variable generated from the input–output perspective can steadily reflect an enterprise’s capabilities. Therefore, the proposed intelligent DEA-SVM model achieved a better prediction capability by combining the theory of AI algorithms and operations research.

5.2 Performance optimization and analysis

For enterprises with low-performance, it is necessary to optimize their performance in advance to combat the negative impact of the COVID-19 pandemic. Thus, the purpose of this section is to verify how to conceptualize optimization solutions for enterprises with low performance using the DEA-SVM model.

By carefully observing the efficiency data of 120 sample enterprises, it is found that the average efficiency value in the high-performance category is significantly higher than that in the low-performance category, and this result is consistent for three efficiency types: technical, pure technical, and scale. In particular, for the technical efficiency used in the prediction experiment, the average value of the high-performance category is 18% higher than that of the low-performance category.

As a result, the proposed DEA-SVM model further computes the improvement proportion of eight business variables to realize performance optimization, that is, the proportion by which each variable should be reduced. Figure 4 shows the average improvement proportion among the 120 enterprises.

From Fig. 4, it is demonstrated that the proposed DEA-SVM model can generate the proportion of improvement for each business variable. To enhance the healthcare industry’s resistance to the COVID-19 pandemic, working capital and R&D expenses, which are the most redundant variables, should be reduced by 94.51% and 90.22%, respectively, while capital expenditure only need to be reduced by 47.01%. The redundancy of employees and total PPE are medium, and they should be reduced by 73.57% and 82.60%, respectively.

However, these results are the average values for 120 sample enterprises, whereas the optimization solutions for each enterprise are different. Some enterprises need to reduce the number of input variables, while others do not. In fact, the DEA-SVM model can simultaneously generate the performance optimization solution for each enterprise, and Fig. 5 shows the optimization solutions for two randomly selected enterprises. Abbott Laboratories is a healthcare equipment and services enterprise in the USA, and Thermo Fisher Scientific is a pharmaceutical, biotechnology, and life sciences enterprise also in the USA. Evidently, the results for the two enterprises are completely different. Thus, each enterprise should optimize its performance according to its unique situation.

6 Conclusion

In the context of the COVID-19 pandemic, this research presented an intelligent DEA-SVM model to predict and optimize the performance of healthcare enterprises. The WRDS database was used to collect business (performance) data before (after) the COVID-19 outbreak. Through a numerical experiment on performance prediction, four efficiency variables that reflect an enterprise’s capabilities are generated: operation, R&D, marketing, and finance efficiencies. The prediction accuracy of the DEA-SVM model was then compared with that of the traditional SVM algorithm. By combining various parameters, it was comprehensively verified that the DEA-SVM model had an excellent prediction capability. The prediction accuracy was optimal when the kernel type was linear and the cost parameter was 1.00. In terms of the numerical experiment for performance optimization, the optimization solutions for low-performance enterprises were obtained. Working capital and R&D expense were the most redundant resources for all sample enterprises. For each specific enterprise, the corresponding optimization solution was simultaneously generated. These experiments not only proved the theoretical innovation of the DEA-SVM model but also reflected its application value. In the future, efficiency-related parameters could be improved through additional experiments. It is also important to explore the feasibility of applying this model to other fields.

Data Availability

Enquiries about data availability should be directed to the authors.

References

Alraja MN, Imran R, Khashab BM, Shah M (2022) Technological innovation, sustainable green practices and SMEs sustainable performance in times of crisis (COVID-19 pandemic). Inf Syst Front. https://doi.org/10.1007/s10796-022-10250-z
Du Y, Yu H, Li ZJ (2021) Research of SVM ensembles in medical examination scheduling. J Comb Optim 42(4):1042–1052
Article MathSciNet Google Scholar
Gu W, Saaty TL, Wei LR (2018) Evaluating and optimizing technological innovation efficiency of industrial enterprises based on both data and judgments. Int J Inf Technol Decis Mak 17(1):9–43
Article Google Scholar
Hossain MS, Muhammad G, Guizani N (2020) Explainable AI and mass surveillance system-based healthcare framework to combat COVID-I9 like pandemics. IEEE Netw 34(4):126–132
Article Google Scholar
Huang H, Gao W, Ye CM (2021a) An intelligent data-driven model for disease diagnosis based on machine learning theory. J Comb Optim 42(4):884–895
Article MathSciNet MATH Google Scholar
Huang H, Li SL, Yu Y (2019) Evaluation of the allocation performance in a fashion retail chain using data envelopment analysis. J Text Inst 110(6):901–910
Article Google Scholar
Huang H, Shih PC, Zhu YL, Gao W (2021b) An integrated model for medical expense system optimization during diagnosis process based on artificial intelligence algorithm. J Comb Optim. https://doi.org/10.1007/s10878-021-00761-x
Jiang B, Lio WC, Li X (2019) An uncertain DEA model for scale efficiency evaluation. IEEE Trans Fuzzy Syst 27(8):1616–1624
Article Google Scholar
Jin FF, Zhang Y, Garg H, Liu JP, Chen J (2022) Evaluation of small and medium-sized enterprises’ sustainable development with hesitant fuzzy linguistic group decision-making method. Appl Intell 52(5):4940–4960
Article Google Scholar
Lahmiri S, Bekiros S (2021) The effect of COVID-19 on long memory in returns and volatility of cryptocurrency and stock markets. Chaos Solitons Fractals 151:111221
Article MathSciNet MATH Google Scholar
Li FT, Liu L (2019) Decision making based on grey model and support vector machine. Clust Comput- J Netw Softw Tools Appl 22(2):S4603–S4609
Google Scholar
Li SL, Shang J, Slaughter SA (2010) Why do software firms fail? Capabilities, competitive actions, and firm survival in the software industry from 1995 to 2007. Inf Syst Res 21(3):631–654
Article Google Scholar
Lin C, Ting SF, Lee L, Lin ST (2021) Firm capability assessment via the BSC and DEA. Ind Manag Data Syst 121(3):567–593
Article Google Scholar
Liu BY (2021) Influence of enterprise cloud computing database application and enterprise debt financing based on SVM algorithm: internal control quality analysis. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-021-03103-1
Mashrur A, Luo W, Zaidi NA, Robles-Kelly A (2020) Machine learning for financial risk management: a survey. IEEE Access 8:203203–203223
Article Google Scholar
Raayatpanah MA, Khodayifar S, Weise T, Pardalos P (2021) A novel approach to subgraph selection with multiple weights on arcs. J Comb Optim 44:242–268
Article MathSciNet MATH Google Scholar
Shorfuzzaman M (2021) IoT-enabled stacked ensemble of deep neural networks for the diagnosis of COVID-19 using chest CT scans. Computing. https://doi.org/10.1007/s00607-021-00971-5
Soltanifar M, Sharafi H (2022) A modified DEA cross efficiency method with negative data and its application in supplier selection. J Comb Optim 43(1):265–296
Article MathSciNet MATH Google Scholar
Tong LL, Tong GL (2022) A novel financial risk early warning strategy based on decision tree algorithm. Sci Program. https://doi.org/10.1155/2022/4648427
Tsai JK, Hung CH (2021) Improving AdaBoost classifier to predict enterprise performance after COVID-19. Mathematics 9(18):2215
Article Google Scholar
Wang MF, Ku HJ (2021) Utilizing historical data for corporate credit rating assessment. Expert Syst Appl 165:113925
Article Google Scholar
Wu PY, Ma J, Guo XY (2022) Efficiency evaluation and influencing factors analysis of fiscal and taxation policies: a method combining DEA-AHP and CD function. Ann Oper Res 309(1):325–345
Article MathSciNet Google Scholar
Wu WD, Ren CX, Wang YH, Liu T, Li L (2018) DEA-based performance evaluation system for construction enterprises based on BIM technology. J Comput Civ Eng 32(2):04017081
Article Google Scholar
Xu WJ, Fu ZN, Li H, Huang JL, Xu W, Luo YY (2022) A study of the impact of COVID-19 on the Chinese stock market based on a new textual multiple ARMA model. Stat Anal Data Min. https://doi.org/10.1002/sam.11582
Yan DW, Chi GT, Lai KK (2020) Financial distress prediction and feature selection in multiple periods by lassoing unconstrained distributed lag non-linear models. Mathematics 8(8):1275
Article Google Scholar
Zamani ED, Griva A, Conboy K (2022) Using business analytics for SME business model transformation under pandemic time pressure. Inf Syst Front. https://doi.org/10.1007/s10796-022-10255-8
Zhao JF, Ge ZF (2020) Analysis of influencing factors of academic entrepreneurship based on blockchain. Wirel Commun Mobile Comput 2020. https://doi.org/10.1155/2020/8825318

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China [72211530040; 71931008]; Shanghai Shenkang Hospital Development Center (SHDC22021204); Shanghai Municipal Health Commission (202140160); National Key R&D Program of China (2021YFF0900400).

Funding

Fundings are disclosed in Acknowledgements.

Author information

Authors and Affiliations

Business School, University of Shanghai for Science and Technology, Shanghai, China
He Huang
Shanghai Municipal Hospital of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China
Liwei Zhong, Ting Shen & Huixin Wang

Authors

He Huang
View author publications
You can also search for this author in PubMed Google Scholar
Liwei Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Ting Shen
View author publications
You can also search for this author in PubMed Google Scholar
Huixin Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liwei Zhong.

Ethics declarations

Competing Interests

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Huang, H., Zhong, L., Shen, T. et al. Performance prediction and optimization for healthcare enterprises in the context of the COVID-19 pandemic: an intelligent DEA-SVM model. J Comb Optim 44, 3778–3791 (2022). https://doi.org/10.1007/s10878-022-00911-9

Download citation

Accepted: 15 September 2022
Published: 12 October 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s10878-022-00911-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Performance prediction and optimization for healthcare enterprises in the context of the COVID-19 pandemic: an intelligent DEA-SVM model

Abstract

Similar content being viewed by others

Artificial Intelligence and Business Value: a Literature Review

Predictive big data analytics for supply chain demand forecasting: methods, applications, and research opportunities

Big Data Analytics: Applications, Prospects and Challenges

1 Introduction

2 Literature review

3 Proposed intelligent model