FormalPara Key Summary Points

Why carry out this study?

Claims databases are preferred for research on multiple sclerosis (MS) because of the low prevalence rate and long and heterogeneous disease course of this condition; however, Japanese claims databases contain no information on disease severity or disability status.

To explore the possibility of utilizing principal component analysis (PCA) to estimate the disease severity of MS, we developed a score through PCA using information from a Japanese claims database. We also analyzed the demographic information and medical status of patients with MS based on the score.

What was learned from the study?

The score-associated changes in demographic features are consistent with those reported by previous research. Our results are also in line with those of previous studies that have reported that patients with higher disease severity experience lower costs for MS treatment but higher costs for other healthcare treatments as well as higher frequency of hospital visits and hospitalization.

Considering the consistency of these results with those of previous studies, the score developed by PCA may represent MS severity to some extent; PCA may be exploited as a method to estimate MS severity using claims databases.

Further research is necessary to validate our score with established measurements of disease severity or disability burden, such as the Expanded Disability Status Scale.

Introduction

Claims databases have been widely used for research on multiple sclerosis (MS) to assess the current healthcare conditions and evaluate the effects of interventions. The utilization of claims databases allows researchers to include a large number of patients throughout the country and observe them over a long period of time. This is especially preferred with respect to research on MS, as the prevalence rate of this condition is low and patients usually experience a long disease course. However, Japanese claims databases do not include measures to evaluate disease severity or disability status, such as the Expanded Disability Status Scale (EDSS) score [1]. Although a national registry dedicated to intractable diseases such as MS is compiled by the Ministry of Health, Labour, and Welfare, the registry only consists of information concerning medical expense subsidies from applicants with intractable diseases, but does not include all patients diagnosed with MS. Moreover, only certain people (such as staff members in national and local governments and researchers of research project on intractable diseases) are allowed access to the registry. Consequently, research investigating the nature and efficacy of MS treatments according to disease severity or disability status has been insufficiently performed in Japan. Although several studies on Japanese patients with MS using claims databases have been reported to date [2,3,4,5], these only provide information on the MS population as a whole; none contains analyses grouped based on disease severity or disability status. However, if it were possible to estimate these parameters from claims databases, we could take full advantage of the data, which offer several benefits including a large sample size and long-term follow-up periods.

Unsupervised machine learning, such as principal component analysis (PCA), is considered a possible method to estimate disease severity from data that do not contain any severity information. PCA is commonly used to reduce the number of dimensions by transforming a set of correlated variables into a set of uncorrelated variables, the principal component, so as to compress the datasets [6, 7]. This method was successfully used to develop a severity score of chronic obstructive pulmonary disease (COPD) by inputting a group of variables available in claims data that reflected aspects of COPD severity [8]. Therefore, we speculated that this method may enable us to estimate the disease severity of patients with MS as well.

Consequently, this study aimed to explore the possibility of utilizing PCA to estimate the disease severity of patients with MS using a Japanese claims database. Although claims databases do not contain details concerning disease severity, they include some closely related information, such as diagnoses associated with neurological dysfunction and prescriptions for symptomatic therapeutic drugs. Utilizing PCA, we developed a score to predict disease severity using this information. We also analyzed the characteristics and treatment status of patients with MS in the database based on this score in an attempt to assess the possibility of this approach.

Methods

Study Design

This was a claims-based study to develop a score to estimate MS disease severity using a PCA based on patients’ claims data. We analyzed treatment pattern, medical resource utilization, and healthcare costs based on the score and assessed the appropriateness of the score by confirming the consistency of the results with those reported by previous research.

Data Source and Settings

The data source was the claims database (April 2008–August 2018) provided by Medical Data Vision Co., Ltd. (MDV). This database contains clinical data on anonymized patients from acute care hospitals that have adopted the Diagnosis Procedure Combination fixed-payment reimbursement system (referred to as DPC hospitals hereafter) [9]. As of June 2018, the database consisted of data from approximately 20 million patients from 329 DPC hospitals (10% of all DPC hospitals).

The study period was between 2009 and 2018. The observation period of each patient corresponded to the period between the first and last record of any medical intervention for that patient. The first diagnosis of each disease was recorded as the “FromDate” in this database, regardless of the observation period. In this study, the earliest “FromDate” for MS was defined as the date of the first diagnosis of MS, which may lie outside the study period.

Patient Definition

We defined patients with MS using the following inclusion and exclusion criteria.

Inclusion Criteria

Patients who had at least one claim for an MS diagnosis, defined as G35 by the International Classification of Diseases, 10th revision (ICD-10) code [10], and met any of the following criteria:

  1. (1)

    At least one hospitalization claim for an MS diagnosis;

  2. (2)

    At least one outpatient claim for an MS diagnosis and at least one claim for disease-modifying therapies (DMTs) (defined by a generic name);

  3. (3)

    At least one outpatient claim for an MS diagnosis and first diagnosis received before the observation period;

  4. (4)

    At least three outpatient claims for an MS diagnosis.

Exclusion Criterion

At least one claim for a diagnosis of neuromyelitis optica spectrum disorder defined as G36 by the ICD-10 code.

Development of the Score by Principal Component Analysis

We defined MS disease severity as the PC1 score, the first principal component, derived by PCA of the data related to higher disease severity. In general, a PCA produces n principal components, with the first having the minimum possible distance from all the data and explaining the greatest variance. Therefore, among the factors that reflect disease severity, the first principal component was considered to be associated with a higher disease severity.

We selected factors to apply to the analysis as follows. Diagnosis and drug prescription codes related to the functional systems of the EDSS were selected based on the Japanese MS treatment guidelines [11] and the advice of a medical expert (author). Additional claims such as physical rehabilitation fees, which are supposed to be associated with disease severity, were also included through discussion with the medical expert based on clinical experience. For each patient, the presence of claims with diagnoses (defined by the ICD-10 code), prescription of drugs (defined by generic name), and medical procedures (defined by the procedure code) were observed each year. The conceptual scheme representing the development of the score is shown in Fig. 1, and the selected factors are listed in Supplementary Material Fig. S1.

Fig. 1
figure 1

Conceptual scheme illustrating the development of the disability score

We developed a score for each year from the first diagnosis (excluding the year of the first diagnosis) of each patient during the study period based on the claims data; that is, each year of each patient was considered a datum for the analysis.

We calculated the PC1 score for each patient in each year from their first diagnosis (excluding the year of the first diagnosis). We examined the yearly change in the PC1 score using a random-effects model, expressed as follows:

$$\begin{gathered} {\text{Score}}\left( {i, \, t} \right)\sim \beta_{0} + \beta_{1} \times \, t + \gamma \left( {i \, } \right) + \varepsilon \left( {i, \, t} \right) \hfill \\ \varepsilon \left( {i, \, t} \right)\sim {\text{Normal}}\left( {0, \, \sigma^{2} } \right), \hfill \\ \end{gathered}$$

where Score(i, t) represents the PC1 score for each patient (i) in the year from the first diagnosis (t), γ(i) indicates the random effect, β0 is the intercept, and ε(i, t) represents the random error. We calculated the distribution of the coefficient of years from the first diagnosis (β1). Patients included in this model had been observed for > 7 years following their first diagnosis, excluding the year of the first diagnosis.

Treatment Status and Healthcare Costs by the Score

Patient-years (excluding the year of the first diagnosis) were classified into quartile groups using the PC1 score. Treatment pattern, healthcare resource utilization, and healthcare costs were calculated for each group. Treatment pattern represents the frequency of treatments for MS (Supplementary Material Table S1) per patient per month (PPPM); healthcare resource utilization represents the frequency of clinical tests (Supplementary Material Table S2), hospitalizations, hospital visits, and relapses (PPPM); healthcare costs represent total costs and their breakdown for MS treatment, clinical tests, and others. Relapse was estimated based on the type of treatment (Supplementary Material Table S1), except for oral administration of prednisolone. The standard error was calculated for each value using the number of patient-years as the sample size.

We used SAS version 9.4 (SAS Institute Inc., Cary, NC, USA) and Microsoft Excel 2016 (Microsoft, Redmond, WA, USA) for the analyses.

Ethical Approval

This study was approved by the Clinical Research Promotion Network Japan (CR-IRB-0094). The database includes data collected for secondary use and was provided after anonymization; therefore, informed consent was not required according to ethical guidelines in Japan [12]. Permission to access and use the data for this study was obtained from Medical Data Vision Co., Ltd.

Results

Development of the Score

A total of 7067 patients with MS were extracted from the database according to the inclusion and exclusion criteria.

We obtained the eigenvector for each variable of the first principal component, as shown in Supplementary Material Fig. S1. The scree plot of each principal component and explained variance are described in Supplementary Material Fig. S2. The first principal component explained 3.2% of the total variance.

The patients’ demographic characteristics in each quartile based on the PC1 score (Q1–Q4, from lowest to highest, respectively) are shown in Fig. 2. There was a difference in the number of patient-years among quartile groups because patients were allocated according to the PC1 score. Regarding sex, 68.4 ± 0.8% (mean ± standard error) and 64.4 ± 0.7% of patients in the lowest and highest score groups were women, respectively. The average age was 43.3 ± 0.2 and 55.4 ± 0.2 years in the lowest and highest score groups, respectively. The years elapsed from the first diagnosis were approximately 6 for the lowest score group and 8.1 for the highest score group. The distribution of PC1 scores according to quartile groups is shown in Supplementary Material Fig. S3.

Fig. 2
figure 2

Patient demographics and treatment patterns according to the PC1 score. Disability status was predicted using a principal component analysis. Patient-years were classified into four levels by their PC1 score (Q1–Q4: from lowest to highest). Each value for drugs represents the frequency per patient per month, which was calculated as the total number of frequencies in each group divided by patient months in each group. The values are shown with their respective SE. Patient-years were applied as the sample size to calculate the SE. Colors are assigned for each demographic item (percentage of women, age, or duration from the first diagnosis of MS) or for all drugs as per the color scale bar. MS multiple sclerosis, PPPM per patient per month, PC1 first principal component, SE standard error

We examined the yearly change in the PC1 score from the first diagnosis based on the coefficient of years from the first diagnosis (β1) using a random-effects model; the distribution is shown in Fig. 3. The mean of β1 was 0.069 (95% confidence interval [CI] [0.062, 0.076]), meaning that the score increased with a statistical significance that was commensurate to the yearly increase from the first diagnosis for each patient.

Fig. 3
figure 3

Distribution of the coefficient of years from first diagnosis (β1). Mean (95% confidence interval) = 0.069 [0.0618, 0.0756]; standard deviation = 0.004

Treatment Status and Healthcare Costs Based on the Score

Figure 2 describes the treatment pattern based on the PC1 score by each quartile. DMTs were most frequently prescribed for those in the lowest score group. On the other hand, in the highest score group, prednisolone was most frequently prescribed at 0.316 ± 0.008.

Figure 4 indicates that the frequency of hospitalizations, relapses, and outpatient visits showed an increasing tendency with the rise in the PC1 score. The costs for MS treatment were higher in the lowest score group, while other healthcare costs were higher in the highest score group. Total healthcare costs were highest for the highest score group.

Fig. 4
figure 4

Healthcare resource utilization and healthcare costs according to the PC1 score. Principal component analysis was used to predict disability status. Patient-years were classified into four levels by their PC1 score (Q1–Q4: from lowest to highest). Each value represents the frequency per patient per month, which was calculated as the total number of frequencies divided by patient months in each group. The values are shown with their respective SE. Patient-years were applied as sample size to calculate the SE. Colors are assigned for each item group (clinical tests, hospitalization, relapse, hospital visit, or healthcare costs) as indicated in the color scale bar. MS multiple sclerosis, JPY Japanese yen, PPPM per patient per month, PC1 first principal component, SE standard error

Discussion

We estimated MS disease severity using PCA based on the information of Japanese claims data. We developed a score (PC1) based on the first principal component as an indicator of disease severity. When dividing patients into quartiles based on the score, patients with a higher score displayed a longer period since diagnosis, older age, and a higher male-to-female ratio.

We used only the first principal component to develop the score. The analysis is often repeated for the second, third, and more components to understand the data features. However, as we aimed to generate a score to predict the severity of the disease, we used only the first principal component, which was considered to mostly represent severity.

We found that the changes in demographic features based on the PC1 score are consistent with those outlines by previous reports [13, 14]. The observation that the male-to-female ratio was higher in the group with the greatest PC1 score is consistent with previous reports showing that male sex is a predictive factor of poor prognosis, and it is associated with a more progressive and severe outcome [15]. We also confirmed that the PC1 score of each patient positively correlated with the period since diagnosis. These results suggest that the PC1 score might represent the disease severity to some extent, as we hypothesized.

The higher healthcare costs (excluding MS treatment costs) and resource utilization (frequency of hospital visits and hospitalizations) observed in patients with the highest PC1 score are in line with the findings of previous reports. In fact, it has been shown that patients with a higher disability status as assessed through the EDSS sustained higher total healthcare costs [16, 17], healthcare costs excluding MS treatment [18], and costs for inpatient care [16]. Although there may be differences in available treatments and healthcare systems among countries, a similar trend in costs and resource utilization was observed with respect to disease severity in previous studies performed in other countries. However, these differences may directly influence the treatment received by patients with MS, which, in turn, could affect the research and clinical applicability of the score in regions outside of Japan. For example, the greater use of prednisolone in patients with the highest PC1 score observed in this study has instead hardly been observed in other countries. One possible reason for this is that there was no standardized treatment for high-severity secondary progressive MS in the observational period of our study, and some of these patients might have been treated with oral steroids despite the absence of clear evidence for their effectiveness. Another possible reason is that the cohort we analyzed may contain patients with atypical presentations of MS. Such presentations are diagnosed as MS but have features that are similar to those of other demyelinating diseases, such as neuromyelitis optica spectrum disorder and myelin oligodendrocyte glycoprotein antibody-associated disease, which leads to the prescription of oral corticosteroids even for patients diagnosed with MS. A recent multicenter retrospective study in Japan reported that 27% of patients with MS were treated with corticosteroids and/or immunosuppressants, and, among these, individuals with atypical MS were more frequently observed compared to those treated with DMTs [19]. It should be mentioned that we excluded prednisolone coded within 3 months of relapse treatment drug prescription; therefore, its frequent use cannot be explained by the follow-up treatment after pulse steroid therapy. Data obtained in the same country in the same period may be required to assess the validity of the score based on treatment status.

Our results showed an increase in the frequency of relapses with the rise in the PC1 score. A previous study has reported opposite findings, as a reduction in relapse rate was observed with the rise in EDSS levels [20]. One of the possible reasons for this discrepancy could be the definition of relapse we employed in this study. Since no information concerning relapse is present in the database, we defined relapse based on the treatments, including pulse steroid therapy and plasmapheresis. Another reason for this discrepancy was probably due to the temporary rise in disease severity observed after relapses. In fact, a higher PC1 score might be associated with greater severity due to higher disease activity after relapses, alongside accumulated disability. It should be noted that we excluded the data from the year of the first diagnosis when relapse rate is high, with the intent to reduce the effect of claims associated with symptoms that transiently appear because of relapse, and not because of accumulated disability. Regarding the group with low PC1 scores, a lower frequency of relapses and a relatively frequent use of DMT were observed, suggesting that the group might include individuals that better respond to DMT.

We reviewed previous studies that defined MS severity using information obtained from claims data. Although we found some relevant studies, their procedures are different from those that we used and cannot be directly applied to the Japanese claims databases because of the differences in code definitions and settings. Johnson et al. compared the relapse rate among different treatment patterns using claims data from MarketScan Commercial and Medicare Supplemental (USA) [21]. In their study, symptoms corresponding to the Kurtzke Functional System were selected based on the ICD, 9th revision (ICD-9), and the weighted sum of these measures served as an indicator of MS severity, which was used as a propensity score matching factor to evaluate relapse rates. Although the basic idea was similar to our procedure, the method to provide a summarized score of disease severity from the set of claims was different, and we found that many of the ICD-10 codes corresponding to those in the ICD-9 that were used in their study are not available or rarely seen in claims data in Japan (e.g., difficulty in walking). Thus, to appropriately estimate disease severity from claims databases, it is necessary to select claims suited for each database depending on each specific country.

There are several limitations to this study. The score was based on the first principal component derived by PCA, and the degree of uniqueness was confirmed by quartiles of the score. Based on the nature of the method used, the numerical value of the score is not a definitive indicator of the level of disease severity. Moreover, we defined relapse based on the records suggesting relapse treatment was being used. Therefore, treatments that we defined as relapse treatments performed for other purposes were also counted as relapses, and relapses not accompanied by treatment were not counted. In addition, we defined disease duration as the period from the first diagnosis of MS since there is no exact information on its precise onset in the database. In the MDV database, the data source is limited to DPC hospitals, and there are no data on diagnoses and treatments given in other facilities. Furthermore, since the information is based on the records of diagnoses and treatments, any lack of records or inaccuracy in recording may be reflected in the study results. Finally, although we showed a possibility to estimate the severity using PCA, we did not confirm its validity compared with actual clinical measures. Further research using medical records or institutional registries should be performed to compare our score to relevant clinical measures such as the EDSS, Multiple Sclerosis Functional Composite [22], or Barthel Index function scale [23], with the intent to improve and validate the developed score.

Conclusions

We developed a score to represent MS disease severity using information obtained from a Japanese claims database by the PCA approach. The group with higher scores showed a longer disease duration since diagnosis, older age, and a higher male-to-female ratio. Moreover, patients in this group sustained lower costs for MS treatment but higher costs for other healthcare treatments and experienced a higher frequency of hospital visits and hospitalizations. As these results are mostly consistent with previous research, we considered that the score we developed may represent disease severity to some extent. Consequently, PCA may be exploited as a method to estimate MS severity, thus allowing researchers to leverage claims databases despite the possible lack of information on disease severity. Therefore, we believe this study took the first step toward fully utilizing claims databases for research on MS. Further research is necessary to validate the score developed by this method with actual disease severity or disability scores such as the EDSS.