Population-based registers are widely used in epidemiological studies. We aimed to estimate the validity of multiple sclerosis (MS) diagnoses registered in the Swedish National Patient Register (NPR) by two sequential register-based case-definition algorithms. Prevalent MS patients aged 16–64 years were identified from the in- and specialised out-patient NPR in 2001–2013, using International Classification of Diseases code G35. These identified MS diagnoses were validated through two sequential register-based case-definition algorithms, as the ‘gold-standard’ reference, by linking individual-level data longitudinally to other nationwide registers. The primary algorithm first sought to corroborate the MS diagnoses with MS-specific information in other nationwide registers. The exploratory secondary algorithm identified individuals with MS-related information in other registers and those who were unable to be followed sufficiently. Through multi-register linkage, we estimated the number of confirmed and uncertain individuals with an MS diagnosis recorded in the NPR. A total of 19,781 individuals (mean age at first visit 45.2 years; 69.5% women) had at least one MS diagnosis recorded in the NPR during 2001–2013. Using the two case-definition algorithms, 92.5% (n = 18,291) of the MS diagnoses recorded in the NPR were confirmed, while 7.5% (n = 1490) remained uncertain. Our findings indicate that a very high percentage of patients coded with an MS diagnosis in the Swedish NPR actually have MS, and supports the use of the NPR as a viable source to identify individuals with an MS diagnosis for population-based research. This exploratory methods paper suggests an alternative novel method to verify individuals’ diagnoses in register-based settings.
The Swedish nationwide population-based registers are widely used in epidemiological studies, providing a less resource demanding data source than patient records to establish representative scientific results [1,2,3,4]. Nonetheless, administrative registers may be subject to diagnostic and coding errors . Individual-level linkage across the different registers can be used to corroborate the information contained within a single register. The National Patient Register (NPR), a principal data source for research [4, 5], contains records of healthcare visits in Sweden with diagnoses registered by the International Classification of Diseases (ICD) codes. The validity of the complete NPR is high , but knowledge of data quality is lacking for some specific diseases, such as multiple sclerosis (MS).
MS is a neurodegenerative disease that typically affects younger adults, causing increasing levels of cognitive and physical impairment as the disease progresses [6,7,8]. Sweden has an especially high prevalence ; with an estimated all-age nationwide MS prevalence of 189 per 100,000 . Despite that MS diagnoses are now being set earlier and more accurately, no single clinical feature or diagnostic test can positively diagnose MS [10,11,12]. This clinical context may reduce our certainty of the recorded MS diagnoses due to the risk of misdiagnosis between MS and other degenerative neurological diseases, a risk which has been suggested to still exist with the 2017 update of the McDonald criteria [10, 13]. Therefore, it has been questioned to what extent an MS diagnosis is incorrectly assigned and recorded in administrative records during the period investigating whether a patient has MS [10, 13,14,15,16,17,18].
Besides the NPR, several other Swedish nationwide registers contain MS-specific information (i.e., MS ICD codes or medications), which could be used to identify MS cases. Accordingly, we aimed to assess the validity of MS diagnoses recorded in the NPR with a novel method using information linked from other nationwide registers in two sequential register-based case-definition algorithms.
This validation study evaluates the NPR as a relevant source for MS studies by comparing the records of all individuals aged 16–64 with an MS code during 1 January 2001–31 December 2013 in the NPR against other register-based MS records, through two sequential register-based case-definition algorithms as the ‘gold-standard’ reference.
The unique personal identity number assigned to all residents in Sweden enabled individual-level linkages across six nationwide registers , administered by the following four authorities:
National Board of Health and Welfare: The National Patient Register (NPR) includes information about healthcare visits, including main and all side diagnoses coded according to the Swedish version of the ICD at time of visit . The NPR has had nationwide coverage since 1987 of in-patient admissions and of out-patient specialist visits since 2001 . The Swedish Prescribed Drug Register (SPDR) contains nationwide records of dispensed medication prescriptions with Anatomical Therapeutic Classification (ATC) codes since 1 July 2005 . The Cause of Death Register (CDR) contains nationwide information on the date, underlying and contributory causes of death recorded by ICD codes since 1961 .
Karolinska University Hospital: The Swedish MS Register (SMSReg) is a nationwide but voluntary MS-specific clinical register that is used for nationwide pharmacological surveillance and for enhancing quality of care [23, 24]. SMSReg contains comprehensive clinical information  for included patients diagnosed with MS to support clinical decision-making, including retrospective information predating the SMSReg’s establishment in 2001 . The accuracy and completeness of the clinical data has been recently estimated to be of value for future studies .
Social Insurance Agency: The Micro-Data for Analysis of the Social Insurance System (MiDAS) contains information on sickness absence (since 2005) and disability pension benefits (since 1994) (dates and diagnoses (by ICD codes)) .
Statistics Sweden: The Longitudinal Integration Database for Health Insurance and Labour Market Studies (LISA), comprises annual socio-demographic information on all people registered as living in Sweden [28, 29].
All individuals with a main or side diagnosis of MS (ICD-10: G35) recorded when having a healthcare visit in 2001–2013, aged 16–64 years, were identified in the NPR (n = 19,781). From the date of first MS diagnosis in 2001–2013 in the NPR, both retrospective and prospective information was obtained from the above mentioned six registers, in order to confirm the MS diagnosis. If it was not possible to confirm the MS diagnosis, the individual was followed until emigration, death, or end of data extraction of each register (varies between 2013 and 2016), whichever came first (Fig. 1).
We first described the MS patients in terms of frequencies and percentages by: sex (women/men); healthcare setting (in-/out-patient); whether MS was the main diagnosis for the first record with an MS diagnosis during 2001–2013 (yes/no); number of visits with an MS diagnosis recorded in the NPR ever (< 3/≥ 3); whether in the NPR prior to 2001 with an MS diagnosis (yes/no); whether died before 31 December 2013 (yes/no); and age at first record with MS in the NPR during 2001–2013.
The annual MS prevalence (total and sex-specific) were estimated from 2001 to 2013 of all individuals aged 16–64 at the time of MS diagnosis according to the NPR. The prevalence estimates were expressed per 100,000 of the total population in Sweden aged 16–64, identified in LISA for the respective year.
Two sets (primary and secondary) of register-based case-definition algorithms were constructed by the multidisciplinary research group for this study to validate MS diagnoses recorded in the NPR with the extensive register data available. The algorithms cross-checked the MS diagnoses at an individual-level with specific information on MS from all available nationwide register sources in Sweden, including prior to and after 2001–2013 (Fig. 1). These algorithms, applied in a sequential manner, were thought to be equivalent to a diagnostic test to confirm the registered diagnoses . Accordingly, the MS diagnoses recorded in the NPR were treated as ‘provisional’ MS until validated as ‘confirmed’ MS as per the algorithms.
Figure 2 displays the primary and secondary sets of case-definition algorithms, including the order of the steps identifying confirmed MS and the ‘gold-standard’ data source used . The primary algorithm identified MS-specific codes (ICD-10 G35, ICD-9 340, or specific ATC codes for MS medications) in other registers in six steps. In the first step of this algorithm, individuals with clinical MS information, entered by their treating neurologist, in the SMSReg were identified. The secondary case-definition was then applied to the remaining provisional MS diagnoses after completing the primary algorithm. This exploratory algorithm matched to other MS-related information (MS-like symptom ICD codes), and sought to identify individuals who were unable to be sufficiently followed in the data to confirm the diagnosis as ‘plausible’ MS. The individuals who remained at the end of the last step in each set of case-definition algorithms were classified as ‘uncertain’ MS (i.e., patients who received at least one MS diagnosis recorded in the NPR, but without other register-based information corroborating the diagnosis). The confirmed MS are those identified according to both case-definition algorithms, i.e., had other register-based MS information supporting their MS diagnosis in the NPR. We then plotted the proportions of confirmed, plausible, and uncertain MS by the year of first MS record in the NPR.
Finally, in order to estimate the influence of the uncertain MS cases in the NPR if included in study populations, we profiled the descriptive characteristics of those remaining as uncertain MS after each set of algorithms and those confirmed MS (including plausible MS diagnoses). Similarity of the characteristics of uncertain MS after the secondary algorithm and those of confirmed MS were tested using Chi square tests. The mean number of years between the first and second MS record in the NPR with 95% confidence intervals (CI) were calculated. All analyses were conducted in SAS v.9.4.
Compliance with ethical standards
This project was approved by the Regional Ethical Review Board of Stockholm in accordance with the Declaration of Helsinki.
Overall, 19,781 individuals (69.5% women) with MS diagnoses recorded in the NPR when aged 16–64 were identified in 2001–2013. In all, 81.7% (n = 16,158) were first identified from specialised out-patient visits and 89.7% (n = 17,738) had MS as the main diagnosis. They together had 308,761 visits with MS as a listed diagnosis during 2001–2013.
The nationwide prevalence of MS according to the NPR in the population aged 16–64 years steadily increased over the study period (Fig. 3). In 2013, the estimated prevalence was 258.3 (women 369.1; men 150.8) per 100,000. Women had a higher prevalence than men throughout the study.
Of the identified individuals, 92.5% (n = 18,291) had their diagnosis confirmed by the validation algorithms (Table 1). The primary algorithm confirmed the majority of diagnoses (17,922; 90.6%), with fewer corroborated by the secondary algorithm (369; 1.9%). We classified 1490 (7.5%) individuals as uncertain MS after cross-checking both case-definition algorithms. After the first two steps of the primary algorithm, MS diagnosis in the SMSReg and ≥ 3 visits due to MS recorded in the NPR, 89.4% (n = 17,692) of the provisional MS diagnoses were confirmed.
The profiles of the confirmed and uncertain MS are presented in Table 2. The mean number of years between the first and second visit (2001–2013) in the NPR for the uncertain MS were 0.85 (95% CI 0.70–1.00) after the primary case-definition algorithm and 1.01 (95% CI 0.78–1.24) after the secondary case-definition algorithm. Of the individuals with confirmed MS (n = 18,291), 16,527 (90.4%) had MS as the main diagnosis and 14,921 (81.6%) first presented to out-patient healthcare. Of the uncertain MS, most of them had only one visit due to an MS diagnosis recorded in the NPR (78.5%, n = 1169) (not presented in the table).
A peak in first time NPR-registered MS diagnoses was observed when out-patient healthcare was included in 2001 (Online Resource 1).
In this study we identified all individuals aged 16–64 with MS as a main or side diagnosis in the Swedish NPR during 2001–2013. We compared these provisional MS diagnoses against individual-level MS-specific information in other nationwide registers. With this novel method, we confirmed that 92.5% of the NPR-registered MS diagnoses could be validated after the two sequential register-based case-definition algorithms. Our findings support the NPR as a valuable source to identify MS cases in epidemiological studies.
The majority of provisional and confirmed MS diagnoses first presented in out-patient settings, with MS as the main diagnosis. A slightly higher percentage of the uncertain MS diagnoses were a side diagnosis. The spike in the number of registered MS diagnoses in 2001 corresponds with the inclusion of the specialised out-patient visits, and tapers off by 2004 to reasonably consistent annual numbers of first time identified diagnoses for the remaining study years. This trend suggests caution is required when identifying newly diagnosed MS cases from the NPR prior to 2004, i.e., the first years of including out-patient visits in the NPR.
The first two steps of the primary case-definition algorithm (MS diagnosis in the SMSReg and ≥ 3 visits with MS ever in the NPR) confirmed the majority of provisional MS diagnoses. The exploratory secondary case-definition algorithm identified a small number of plausible MS diagnoses, predominantly in the later years, with insufficient follow-up time to reasonably identify MS information in the registers. The individuals with uncertain MS after both case-definition algorithms could plausibly represent patients who were initially suspected of having MS, but the diagnosis was never confirmed, or simply due to administrative error. In total, 78.5% of the uncertain MS only had one visit ever due to MS.
A previous cross-referencing of the SMSReg to NPR by the National Board of Health and Welfare found that only 4% of MS entries in the NPR in 2008 were coded as ‘possible MS’ in the SMSReg , suggesting high validity of MS diagnoses recorded in the NPR . Accordingly, when selecting a study population, the voluntary SMSReg may underestimate the MS population, especially for particular geographical regions , and we, based on this study, add that the mandated NPR may slightly overestimate the MS population (< 10%). Further studies are needed to elucidate occurrence of this.
Our prevalence estimates of MS for 16–64 year olds were higher than an all-age estimate of 189 per 100,000 at 31 December 2008 with MS identified in either the SMSReg or NPR . The improved diagnostic criteria, better awareness, more efficacious disease modifying therapies (DMTs), and ultimately extended survival could be a potential explanation [6, 7]. The high prevalence of MS among working-aged individuals and increasing treatment options necessitates representative real-world results , which the NPR enables [1, 2].
Strengths of this study include the numerous nationwide registers with MS-specific information available to corroborate the MS diagnoses in the NPR. Usually, a ‘gold-standard’ of disease is derived from clinical data, albeit for a sample of the total population. Instead, we used a novel method of register-based algorithms linking individual-level register data, including MS-specific clinical information from the SMSReg , to determine the validity of all identified MS cases in Sweden in 2001–2013. This study included years both before and after identification in the NPR, a design consistent with the chronic and progressive nature of MS  and treatment guidelines for annual healthcare visits for individuals with MS or clinically isolated syndrome (CIS) .
One limitation was that the case-definition algorithms were limited to the different coverage dates of the contributing registers. For example, the SPDR, which was available for July 2005–December 2013 and only covered dispensed prescribed drugs, therefore, excluding both indented drugs administrated at clinics and DMTs that became available after 2013. Further, both MS and CIS patients may be prescribed interferon beta therapies , therefore, these ATC codes could not be used to confirm MS diagnoses.
In conclusion, the certainty of MS diagnoses for patients aged 16-64 in the NPR is very high according to this validation using register-based algorithms. These findings strengthen the notion that the Swedish NPR is a valuable source for MS populations in nationwide epidemiological studies.
Not applicable. The data cannot be made publically available due to privacy regulations. The data used in this study is administered by the Division of Insurance Medicine, Karolinska Institutet. According to the General Data Protection Regulation, the Swedish law SFS 2018:218, the Swedish Data Protection Act, the Swedish Ethical Review Act, and the Public Access to Information and Secrecy Act, these types of sensitive data can only be made available, after legal review, for researchers who meet the criteria for access to this type of sensitive and confidential data. Readers may contact Professor Kristina Alexanderson (firstname.lastname@example.org) regarding the data.
Sorensen HT, Sabroe S, Olsen J. A framework for evaluation of secondary data sources for epidemiological research. Int J Epidemiol. 1996;25(2):435–42.
Olsen J. Register-based research: some methodological considerations. Scand J Public Health. 2011;39(3):225–9. https://doi.org/10.1177/1403494811402719.
Wennman-Larsen A, Nilsson MI, Saboonchi F, Olsson M, Alexanderson K, Fornander T, et al. Can breast cancer register data on recommended adjuvant treatment be used as a proxy for actually given treatment? Eur J Oncol Nurs. 2016;22:1–7. https://doi.org/10.1016/j.ejon.2016.02.010.
Ludvigsson JF, Almqvist C, Bonamy A-KE, Ljung R, Michaëlsson K, Neovius M, et al. Registers of the Swedish total population and their use in medical research. Eur J Epidemiol. 2016;31(2):125–36.
Ludvigsson JF, Andersson E, Ekbom A, Feychting M, Kim JL, Reuterwall C, et al. External review and validation of the Swedish national inpatient register. BMC Public Health. 2011;11:450. https://doi.org/10.1186/1471-2458-11-450.
Giovannoni G, Butzkueven H, Dhib-Jalbut S, Hobart J, Kobelt G, Pepper G, et al. Brain health: time matters in multiple sclerosis. Mult Scler Relat Disord. 2016;9(Suppl 1):S5–48. https://doi.org/10.1016/j.msard.2016.07.003.
GMS Collaborators. Global, regional, and national burden of multiple sclerosis 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 2019. https://doi.org/10.1016/s1474-4422(18)30443-5.
Confavreux C, Vukusic S. The clinical course of multiple sclerosis. Handb Clin Neurol. 2014;122:343–69. https://doi.org/10.1016/B978-0-444-52001-2.00014-5.
Ahlgren C, Oden A, Lycke J. High nationwide prevalence of multiple sclerosis in Sweden. Mult Scler. 2011;17(8):901–8. https://doi.org/10.1177/1352458511403794.
Thompson AJ, Banwell BL, Barkhof F, Carroll WM, Coetzee T, Comi G, et al. Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. Lancet Neurol. 2018;17(2):162–73. https://doi.org/10.1016/S1474-4422(17)30470-2.
McNicholas N, Hutchinson M, McGuigan C, Chataway J. 2017 McDonald diagnostic criteria: a review of the evidence. Mult Scler Relat Disord. 2018;24:48–54. https://doi.org/10.1016/j.msard.2018.05.011.
Milo R, Miller A. Revised diagnostic criteria of multiple sclerosis. Autoimmun Rev. 2014;13(4–5):518–24. https://doi.org/10.1016/j.autrev.2014.01.012.
Solomon AJ, Naismith RT, Cross AH. Misdiagnosis of multiple sclerosis: impact of the 2017 McDonald criteria on clinical practice. Neurology. 2019;92(1):26–33. https://doi.org/10.1212/wnl.0000000000006583.
Brownlee WJ, Swanton JK, Altmann DR, Ciccarelli O, Miller DH. Earlier and more frequent diagnosis of multiple sclerosis using the McDonald criteria. J Neurol Neurosurg Psychiatry. 2015;86(5):584–5. https://doi.org/10.1136/jnnp-2014-308675.
Brownlee WJ. Misdiagnosis of multiple sclerosis: if you have a hammer, everything looks like a nail? Neurology. 2019;92(1)15–16. https://doi.org/10.1212/wnl.0000000000006584.
Culpepper WJ, Marrie RA, Langer-Gould A, Wallin MT, Campbell JD, Nelson LM, et al. Validation of an algorithm for identifying MS cases in administrative health claims datasets. Neurology. 2019;92(10):e1016–1028. https://doi.org/10.1212/wnl.0000000000007043.
Wallin MT, Culpepper WJ, Campbell JD, Nelson LM, Langer-Gould A, Marrie RA, et al. The prevalence of MS in the United States: a population-based estimate using health claims data. Neurology. 2019;92(10):e1029–1040. https://doi.org/10.1212/wnl.0000000000007035.
Widdifield J, Ivers NM, Young J, Green D, Jaakkimainen L, Butt DA, et al. Development and validation of an administrative data algorithm to estimate the disease burden and epidemiology of multiple sclerosis in Ontario, Canada. Mult Scler. 2015;21(8):1045–54. https://doi.org/10.1177/1352458514556303.
Ludvigsson JF, Otterblad-Olausson P, Pettersson BU, Ekbom A. The Swedish personal identity number: possibilities and pitfalls in healthcare and medical research. Eur J Epidemiol. 2009;24(11):659–67. https://doi.org/10.1007/s10654-009-9350-y.
The National Patient Register. National Board of Health and Welfare (Socialstyrelsen). 2018. http://www.socialstyrelsen.se/register/halsodataregister/patientregistret/inenglish. Accessed 30 April 2018.
Wettermark B, Hammar N, Fored CM, Leimanis A, Otterblad Olausson P, Bergman U, et al. The new Swedish Prescribed Drug Register—opportunities for pharmacoepidemiological research and experience from the first six months. Pharmacoepidemioland Drug Saf. 2007;16(7):726–735. https://doi.org/10.1002/pds.1294.
The Cause of Death Register (Dödsorsaksregistret (in Swedish)). National Board of Health and Welfare (Socialstyrelsen). 2018. http://www.socialstyrelsen.se/register/dodsorsaksregistret#. Accessed 30 April 2018.
Swedish Neuro Registry. Swedish Neuro Registries: Multiple Sclerosis. 2018. http://www.neuroreg.se/en.html/multiple-sclerosis. Accessed 30 April 2018.
Hillert J, Stawiarz L. The Swedish MS registry—clinical support tool and scientific resource. Acta Neurol Scand. 2015;132(199):11–9. https://doi.org/10.1111/ane.12425.
Alping P, Piehl F, Langer-Gould A, Frisell T. Validation of the Swedish Multiple Sclerosis Register: further improving a resource for pharmacoepidemiologic evaluations. Epidemiology. 2019;30(2):230–3. https://doi.org/10.1097/ede.0000000000000948.
Appendices to Open Comparisons of Healthcare Quality and Efficiency 2009: Indicator Descriptions, Coverage Comparison, and Health Consumption (Bilagor till Öppna jämförelser av hälso- och sjukvårdens kvalitet och effektivitet 2009: Indikatorbeskrivningar, Täckningsgradsjämförelser, and Vårdkonsumtion (in Swedish)). Stockholm: Swedish Association of Local Authorities and Regions and National Board of Health and Welfare (Sveriges Kommuner och Landsting och Socialstyrelsen); 2009.
MiDAS: Sjukpenning och rehabiliteringspenning. (MiDAS: sickness benefit and rehabilitation benefit.) (in Swedish). Social Insurance Agency (Försäkringskassan). 2011. www.forsakringskassan.se.
Longitudinal integration database for sick leave and labour market studies (LISA) 1990–2013. (Longitudinell integrationsdatabas för sjukförsäkrings- och arbetsmarknadsstudier (LISA) 1990–2013 (in Swedish)). Örebro: Statistics Sweden (Statistika centralbyrån); 2016.
Ludvigsson JF, Svedberg P, Olen O, Bruze G, Neovius M. The longitudinal integrated database for health insurance and labour market studies (LISA) and its use in medical research. Eur J Epidemiol. 2019;34(4):423–37. https://doi.org/10.1007/s10654-019-00511-8.
Benchimol EI, Manuel DG, To T, Griffiths AM, Rabeneck L, Guttmann A. Development and use of reporting guidelines for assessing the quality of validation studies of health administrative data. J Clin Epidemiol. 2011;64(8):821–9. https://doi.org/10.1016/j.jclinepi.2010.10.006.
Foulon S, Maura G, Dalichampt M, Alla F, Debouverie M, Moreau T, et al. Prevalence and mortality of patients with multiple sclerosis in France in 2012: a study based on French health insurance data. J Neurol. 2017;264(6):1185–92. https://doi.org/10.1007/s00415-017-8513-0.
National guidelines Care for multiple sclerosis and Parkinson’s disease: support for steering and management (Nationella riktlinjer Vård vid multipel skleros och Parkinsons sjukdom: Stöd för styrning och ledning (in Swedish)). Falun: National Board of Health and Welfare (Socialstyrelsen); 2016.
Open access funding provided by Karolinska Institutet.
This work was supported by a grant from the Swedish Research Council for Health, Working Life and Welfare (Dnr: 2007-1762) and by an unrestricted research grant from Biogen. For both, Kristina Alexanderson is the Principal Investigator of the grants. The design of the study, data collection, analyses, interpretations of data, and writing of manuscript were performed without involvement of the funding body.
Conflict of interest
The authors declare the following interests based on ICMJE recommendations with respect to the research, authorship, and/or publication of this article: CM, EF, and FY received salaries partly funded by an unrestricted research grant from Biogen. KA has received unrestricted research grants from Biogen. JH has received honoraria for serving on advisory boards or steering committees for Biogen, Merck, Sanofi-Genzyme and Novartis and speaker’s fees from Biogen, Novartis, Merck-Serono, Bayer-Schering, Teva and Sanofi-Genzyme. He has served as Principal Investigator for projects, or received unrestricted research support from, BiogenIdec, Merck-Serono, TEVA, Sanofi-Genzyme and Bayer-Schering and his research is also supported by the Swedish Research Council and the Swedish Brain Foundation.
Ethical approval for the project was granted by the Regional Ethical Review Board of Stockholm, Sweden (Dnr: 2007/762-31; 2009/23-32; 2009/1917-32; 2011/1710-32; 2012/863-32; and 2014/236-32), in accordance with the Declaration of Helsinki.
Due to the register-based nature of this project and given that all data were anonymised when delivered to us, we do not have information about the identity of those included in analyses. Accordingly, there was no informed consent from the patients involved, in line with the Ethical approval.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
About this article
Cite this article
Murley, C., Friberg, E., Hillert, J. et al. Validation of multiple sclerosis diagnoses in the Swedish National Patient Register. Eur J Epidemiol 34, 1161–1169 (2019). https://doi.org/10.1007/s10654-019-00558-7
- Validation studies
- Electronic health records
- Multiple sclerosis
- Sick leave
- Administrative data
- Swedish National Patient Register