FormalPara Key Summary Points

Why carry out this study?

Myasthenia gravis (MG) is a rare but impactful autoimmune neuromuscular disorder. Its variable treatment response and a significant number of patients experiencing suboptimal therapy outcomes highlight the need for effective predictive tools

The study focuses on developing and validating a predictive nomogram for short-term (6 months) treatment efficacy in patients with acetylcholine receptor antibody-positive (AChR-Ab+) generalized myasthenia gravis (GMG), addressing the need for personalized treatment strategies

What did the study ask?/What was the hypothesis of the study?

The research aimed to construct a nomogram model that could accurately predict the short-term treatment response in AChR-Ab + GMG patients

What was learned from the study?

The developed nomogram model, incorporating factors like gender, body mass index (BMI), disease duration, hemoglobin levels, and white blood cell count, proved highly accurate in predicting suboptimal responses to 6-month standard treatment in patients with AChR-Ab+ GMG

This model signifies a step towards personalized treatment strategies in MG, emphasizing the importance of individual patient characteristics in predicting treatment efficacy

The study underscores the need for further validation of this nomogram in larger multicenter cohorts to confirm its effectiveness and applicability in diverse patient populations

Introduction

Myasthenia gravis (MG) is an autoimmune neuromuscular disorder characterized by fluctuating muscle weakness and fatigability [1, 2]. It occurs when antibodies attack and disrupt the communication between nerves and muscles at the neuromuscular junction. Despite its rarity with a prevalence ranging from 15 to 179 per million people [3], the impact of MG on patients’ quality of life and health care systems is substantial [4].

Treatment response in MG is variable and can range from complete symptom relief to minimal improvement or even disease progression [5]. Unfortunately, a significant proportion of patients have a suboptimal response to therapy. Studies report that up to 10–20% of patients do not achieve satisfactory control of their symptoms despite treatment [6], highlighting the need for better therapeutic strategies and prognostic tools.

The ability to predict which patients are more likely to have a poor response to treatment would have significant clinical implications. Early prediction of treatment response can guide personalized treatment plans, potentially enabling more aggressive or alternative therapies for patients identified as high-risk. This can improve the therapeutic success rate, mitigate disease progression, decrease hospitalization rates, and improve quality of life [7]. Furthermore, it can facilitate patient counseling regarding the prognosis and expected course of the disease, which is important in disease acceptance and treatment adherence [7].

Current prediction of treatment response primarily relies on clinical experience and certain indicators such as disease duration and severity [8,9,10]. However, these measures are not always reliable or precise. Therefore, there is a pressing need to develop reliable, data-driven prediction tools to guide therapeutic decision-making in MG. Zhao et al. developed a model for the 12-month outcome of AChR-Ab+ (acetylcholine receptor antibody-positive) GMG (generalized myasthenia gravis) patients, which could predict short-term prognosis effectively [11]. However, the treatment responses of a shorter term are also of great importance. Physicians and patients are more interested in whether a balance between efficacy and side effects can be achieved within a short period.

In the realm of clinical decision-making, particularly in neurology, the nomogram stands as a pivotal tool, bridging the gap between statistical predictions and practical patient care. A nomogram is a graphical representation tool that simplifies the statistical prediction model into a user-friendly interface, allowing for the visualization and calculation of the probability of a clinical event based on multiple predictors. In the context of AChR-Ab+ GMG, the complexity of the disease necessitates an approach that can encompass multiple variables to predict patient outcomes accurately. Here, the nomogram’s ability to integrate various clinical parameters into a single predictive model is invaluable. It aids neurologists in making informed, personalized treatment decisions, aligning with the evolving paradigm of precision medicine in neurology. The present study aims to develop and validate a predictive nomogram for poor shorter-term (6 months) outcomes in AChR-Ab+ GMG patients.

Methods

Study Design and Setting

This research is a single-center, retrospective, observational study that involved a consecutive case series design. It adhered to the ethical guidelines set forth in the Declaration of Helsinki and was approved by the ethical committee at The First Hospital of Shanxi Medical University (no. KYLL-2023-107). Due to the retrospective nature of the study, the requirement for written informed consent for the utilization of clinical data was waived.

Study Population

Between September 2020 and November 2022, a retrospective cohort study was conducted at The First Hospital of Shanxi Medical University to analyze patients diagnosed with GMG. The inclusion criteria encompassed those who tested positive for the acetylcholine receptor antibody (serum AChR-Ab was detected by enzyme linked immunosorbent assay, and ≥ 0.450 nmol/l was positive; the detection limit is 20 nmol/l), demonstrated adherence to their therapeutic interventions, maintained a minimum outpatient follow-up of 6 months post-diagnosis, and underwent all essential diagnostic procedures, including electromyography and blood biochemistry (detailed in Supplemental Table 1). Power analysis using G*Power (version 3.1.9.7) showed that the sample size was 44 when the effect size was 80%. To analyze the effect of AChR-Ab level on the prognosis of the disease, according to the level of AChR-Ab, the patients were divided into three groups: < 10 nmol/l, 10-20 nmol/l and > 20 nmol/l. An additional criterion was the absence of other infectious diseases at the time of initial diagnosis. Therapeutically, 49 patients initiated treatment with a regimen starting at 20 mg prednisone acetate, increased by 10 mg weekly until the desired dose, and then gradually reduced. Additionally, 64 patients were prescribed tacrolimus, targeting a blood concentration of 3 to 10 mmol/l by the end of the first week, with 23 of them concurrently treated with both prednisone acetate tablets and tacrolimus capsules. All patients received pyridostigmine 60 mg three times daily.

Outcome Measures

The primary outcome of the present study was the Myasthenia Gravis Foundation of America (MGFA) post-intervention status after 6-month treatment. Poor and good prognosis was defined as “unchanged or worse” and “improved or better,” respectively. This outcome measure leverages the MGFA post-intervention status, a metric frequently utilized in both clinical practice and research environments, reflecting its robustness and widely recognized utility in assessing treatment effectiveness in the context of MG [12, 13].

For a thorough evaluation of treatment efficacy predictors in AChR-Ab+ GMG patients, we analyzed a spectrum of demographic and clinical indicators. Demographic insights were derived from age of onset and gender, with lifestyle elements captured via smoking and alcohol consumption status. Vital parameters included body mass index (BMI), duration, hematological measures [red blood cell (RBC), hemoglobin (Hb), white blood cell (WBC), lymphocyte (LYM), albumin (Alb)], and arterial blood gas readings [power of hydrogen (PH), oxygen partial pressure (PO2), partial pressure of carbon dioxide (PCO2)]. Autoantibody statuses of AChR-Ab and Titin-Ab, along with the RNS, informed on the disease’s neurophysiological aspects. Clinical manifestations such as extraocular, palpebralis, bulbar, vegetative nerve, thymoma, and thymic hyperplasia, as well as MGFA scale for disease severity and hypertension as a comorbidity, were also integrated. These indicators aimed to provide a comprehensive evaluation of potential prognostic markers.

Statistical Analyses

Data analyses were performed using R statistical software (version 4.1.2). Categorical variables are reported as counts and percentages, whereas continuous variables are expressed as means ± standard deviations for normally distributed data and medians (interquartile ranges, IQR) for skewed data. The Shapiro-Wilk test was used to assess data for normality. To evaluate the differences in clinical efficacy among patients receiving various treatment methods at a 6-month follow-up, a chi-square (χ2) test was performed, with a significance threshold set at p < 0.050. Predictive factors were initially screened using univariate logistic regression with a cut-off of p < 0.050. Subsequently, significant predictors identified from the univariate analysis were included in a multivariate logistic regression model and to construct a nomogram predictive model. The model’s predictive performance was evaluated using receiver-operating characteristic (ROC) curve analysis, calibration curves, and decision curve analysis (DCA). Note that to estimate the performance of the model, we used a multiple bootstrap procedure (n = 500 bootstraps).

Results

Patient Characteristics:

Patient recruitment for the study is presented in the flow diagram (Fig. 1). Ninety of 248 patients were enrolled. The median age at disease onset was 55 (IQR: 40.500–64) years, and 46.700% were women. A total of 30 patients were ‘unchanged’ or ‘worse’ after 6-month standard treatment (Table 1).

Fig. 1
figure 1

Flow chart of constructing and validating a nomogram model for myasthenia gravis with poor short-term efficacy. MGFA-Pis MGFA postintervention status

Table 1 Clinical characteristics

Development of the Nomogram

ROC analysis was utilized to establish the cut-off value for all count data, which were subsequently transformed into categorical variables to facilitate further analysis. Logistic regression analysis showed that there was no statistical significance in the effect of different treatment schemes on the condition of patients at 6 months (P: prednisone acetate 0.257, tacrolimus 0.150, drug combination 0.163), indicating the potential influence of other factors on this outcome. The correlation between AChR-Ab level and disease severity has been controversial. Logistic regression analysis showed that there was no significant difference between AChR-Ab level and disease severity at 6 months (P < 10 nmol/l 0.401, 10–20 nmol/l 0.352, > 20 nmol/l 0.505). Univariate logistic regression analysis revealed that factors such as gender, BMI, duration of illness, Hb, WBC, and LYM met the significance criterion of p < 0.050, as detailed in Table 2. In several studies concerning prognostic factors of MG, the quantitative myasthenia gravis (QMG) score has been identified as a critical determinant of poor prognosis [11, 14, 15]. Consequently, despite not satisfying the initial P-value criterion, QMG was included in further analyses. Considering the study’s limited sample size, we emphasized the robustness of the model. This entailed a restriction on the number of risk factors evaluated, giving precedence to those with higher odds ratios (OR), especially when similar types of factors were competing. Detailed outcomes from the multivariate analysis are presented in Table 3. Building on these insights, a nomogram model was developed, as illustrated in Fig. 2A.

Table 2 Results of univariate logistic regression analysis
Table 3 Results of multivariate logistic regression analysis
Fig. 2
figure 2

A Nomogram prediction model obtained in this study. Hb hemoglobin, WBC white blood cell, BMI body mass index, QMG quantitative myasthenia gravis. B Calibration curve, which show high accuracy; x-axis represents the predictive potential for MG exacerbation, and y-axis represents the actual incidence of it; perfect prediction based on an idealized model is represented by the diagonal dotted line; the ideal performance of the nomogram is represented by the red solid line, and the closer the red solid line fits the dotted line, the better the prediction will be. C ROC analysis of the nomogram (QMG included), the nomogram (Non-QMG), and the bootstrap test model. D and E are decision analysis curves, and the use of the model in the blue range has high clinical benefit and a wide range of application

Model Evaluation

The calibration curve is a pivotal tool for evaluating the accuracy of a nomogram model. In our study, this curve demonstrated a negligible difference between the predicted outcomes and actual observations, indicating a well-calibrated model (Fig. 2B; p = 0.707). Furthermore, ROC analysis, a benchmark technique for assessing predictive accuracy, was employed. The AUC of our model was 0.792 (95% CI 0.686–0.899), suggesting substantial discriminative ability in identifying patients likely to exhibit poor efficacy following a 6-month standard treatment regimen (Fig. 2C).

To rigorously test the model’s validity, we conducted an analysis comparing its accuracy with and without the inclusion of QMG. Removal of QMG led to a decrease in accuracy (AUC: 0.749; 95% CI 0.634–0.863); however, DeLong’s test revealed no statistically significant difference between the two models (p = 0.079). Consequently, we opted for the model inclusive of QMG because of its superior accuracy.

We further assessed the clinical utility of our prognostic model using a DCA. The blue section of Fig. 2D delineates the model’s performance threshold area, underscoring its practical applicability.

In summary, the constructed nomogram exhibits high predictive accuracy for suboptimal response to a 6-month standard treatment in patients with AChR-Ab+ GMG. The validation of the predictive model is a crucial step in establishing its reliability and applicability in a clinical setting. For this purpose, we employed a bootstrapping technique with n = 500 iterations. Bootstrapping is a resampling method used extensively in statistics to estimate the distribution of a sample statistic by randomly selecting samples with replacement from the original dataset. This approach is particularly beneficial in assessing the stability and accuracy of a predictive model. In our study, the bootstrapping method was used to validate the nomogram developed for predicting the short-term prognosis of patients with AChR-Ab+ GMG. The choice of 500 bootstrap resamples was made to balance computational efficiency with the robustness of the validation process. This conclusion is bolstered by a bootstrap test with 500 repetitions on our dataset, which confirmed the consistency of the derived equations with those of the initial nomogram model (Fig. 2D, E). Such congruence not only reaffirms the robustness of our original model but also solidifies its reliability.

Discussion

In our study, we have constructed a predictive nomogram to estimate the short-term response to treatment in AChR-Ab+ GMG patients based on six significant factors, namely gender, BMI, QMG score, disease duration, Hb, and WBC count. This nomogram represents a step towards a more personalized, data-driven approach to the management of MG.

The finding that female gender is predictive of poorer response aligns with the recognized gender disparities in autoimmune diseases, including MG. Females have been shown to have a higher susceptibility and more severe disease course [16]. Hormonal and genetic factors have been proposed to underlie these disparities, and a deeper exploration of these aspects might refine the predictive power of gender in our model and contribute to a better understanding of sex-related differences in MG outcomes.

Prior literature has consistently indicated a linkage between elevated BMI and heightened clinical symptomatology in MG patients [16, 17]. However, there is a conspicuous dearth of research elucidating its implications on short-term prognostic outcomes. Within our investigational purview, an augmented BMI emerged as a potential harbinger of improved short-term outcomes. This correlation may stem from the perturbation in the disease’s natural trajectory, likely attributable to the initiation of a more rigorous therapeutic regimen for these particular patients during the disease’s early stages. For instance, within our dataset, 33.33% of patients with a BMI > 25.610 were administered a combined regimen of steroids and tacrolimus compared to 19.610% of those with a BMI < 25.61 who received the dual therapy. Nevertheless, it remains imperative to underscore the significance of adept weight management in the holistic care paradigm for MG.

Our model’s emphasis on high QMG scores and longer disease duration as indicators of poorer response is congruent with the understanding that these factors are generally linked to disease severity and chronicity. This stresses the need for early, aggressive intervention in patients presenting with severe symptoms and those with long-standing disease to improve treatment outcomes. This result is consistent with the results of some previous studies [14], but some studies have shown that patients with higher QMG scores at initial diagnosis have better prognosis [11, 18]. These differences may be explained by the longer observation periods of these studies, during which more drug doses may be administered, and by greater attention from physicians and patients with more severe symptoms. Second, changes in the definition of outcomes according to QMG scores in these studies may have influenced the results of the statistical analyses.

The inclusion of Hb and WBC levels in our model underscores the importance of considering the overall health status of MG patients. Suboptimal Hb levels coupled with elevated WBC counts potentially indicate systemic health anomalies—encompassing inflammatory responses, infectious processes, or persistent pathological conditions—that could influence the therapeutic responsiveness in MG. The amplification in WBC count, along with derivative markers like the neutrophil-to-lymphocyte ratio (NLR), predominantly indicates an underlying systemic inflammatory condition. Notably, NLR has emerged as a prognostic marker for a plethora of pathologies, as evidenced by several research publications [19,20,21]. In a retrospective investigation by Hsu et al., leukocytosis was identified as a robust predictor for mortality in MG [22]. Complementing this, numerous studies have elucidated the association of an elevated NLR with an adverse prognosis in MG [23,24,25]. Anemia’s role as a harbinger of adverse outcomes spans various diseases including but not limited to ischemic stroke and chronic obstructive pulmonary disease [26, 27]. Notably, a plethora of clinical studies have documented the occurrence of anemia in MG cohorts [28, 29]. Certain academic contributions suggest a heightened severity of MG in anemic individuals, although the underlying mechanisms remain an area of ongoing inquiry [22, 30]. Extrapolating from ischemic stroke analyses, some scholars propose that anemia might inflict localized skeletal muscle hypoxia by compromising tissue oxygenation, subsequently diminishing muscular prowess [31]. Such a hypothesis might elucidate the observation where a decrement in Hb levels correlates with exacerbated MG presentations.

The direction our study has taken towards developing predictive models for patient outcomes mirrors broader trends in MG research and healthcare. Machine learning and other advanced analytics tools are being employed to construct robust predictive models for disease outcomes, as illustrated by the study of Huahua Zhong et al. [15]. This move towards personalized medicine allows clinicians to tailor treatment strategies based on a patient’s individual disease characteristics and potential response to therapy.

Our study is an addition to the suite of tools being developed to predict disease outcomes in MG. However, it is important to note that further validation of our model in larger, multicenter cohorts is needed to ensure its broad applicability and robustness. The aim is to develop a model that can be used across various patient populations, thereby contributing significantly to improving the quality of life for individuals living with MG.

Historical literature has posited that factors such as dysphagia, hypoproteinemia, presence of anti-titin antibodies, and an advanced age of onset might be indicative of a worsening patient prognosis [13, 32, 33]. In the current investigation, variables including bulbar symptoms, thymoma, serum albumin levels, and the presence of anti-titin antibodies were precluded because their P-values exceeded 0.2 in the univariate logistic regression analysis. It is pertinent to note that the limited sample size necessitated stringent inclusion criteria. Consequently, the ‘age of onset’ variable, failing to meet these stipulated criteria, was not incorporated into the analysis.

In our endeavor to establish a nomogram for predicting short-term efficacy in AChR-Ab+ GMG patients, several limitations merit consideration. First, the study was conducted at a single institution and, with a cohort of only 90 patients, might reflect regional-specific biases and limitation of small sample size, thereby limiting its generalizability. Additionally, the retrospective design inherently introduces potential biases, such as selection bias. Furthermore, while our statistical models are conventional, they operate under certain assumptions, and these might not always mirror the nuances of complex clinical scenarios. Lastly, the model’s external validation across diverse settings remains imperative to affirm its universal applicability. In light of these considerations, we strongly advocate for future multicentric research with larger sample sizes to enhance the model’s robustness.

Conclusion

In conclusion, the predictive nomogram developed in our study provides an additional tool for the personalized management of MG. Incorporating key demographic, clinical, and laboratory parameters, our model reflects the multifaceted nature of MG prognosis. With further validation and potential refinement, this model could play a vital role in guiding therapeutic decisions, ultimately leading to improved patient outcomes.