FormalPara Key Summary Points

Why carry out this study?

Achieving a timely diagnosis of a specific ILD is important to inform prognosis and guide treatment decisions, as delays in treatment may negatively impact outcomes for patients.

Greater insight into disease behaviour may help to guide diagnosis and treatment decisions in patients with ILDs; however, there is currently a lack of evidence characterising ILD behaviour and progression during the early stages of disease.

The STARLINER study (NCT03261037) aimed to characterise disease behaviour in patients with suspected ILD during the peri-diagnostic period using daily home spirometry and accelerometry.

What was learned from the study?

During the peri-diagnostic period, differences in changes in FVC measured by home and site spirometry were observed between patients with IPF and patients with non-IPF ILD. A greater decline in steps per day was observed for IPF versus non-IPF ILD, whereas an increase in 6MWD was observed for patients with IPF versus a decline in 6MWD for patients with non-IPF ILD. No clear patterns of disease behaviour were observed for IPF versus non-IPF ILD for PROs.

Home spirometry was feasible for most patients with ILD; however, technical and analytical challenges in the home-based assessments prevented firm conclusions regarding disease behaviour and highlight the need for further optimisation of the technology and analysis methods before widespread implementation.

Digital Features

This article is published with digital features, including two videos, to facilitate understanding of the article. To view digital features for this article go to https://doi.org/10.6084/m9.figshare.14578833.

Introduction

Interstitial lung diseases (ILDs) are a heterogeneous group of respiratory diseases characterised by fibrosis and/or inflammation of the lung [1]. The aetiology of ILDs is diverse. The cause of some ILDs is known, e.g., hypersensitivity pneumonitis resulting from environmental exposures or ILDs secondary to connective tissue diseases (CTD) [1]. However, in other cases, the causes are unknown, e.g., idiopathic pulmonary fibrosis (IPF) and other idiopathic interstitial pneumonias [1, 2].

Timely diagnosis of a specific ILD is important to inform prognosis and guide treatment plans [1, 3]. The introduction of two antifibrotic treatments for IPF, pirfenidone and nintedanib, has increased the importance of early and accurate diagnosis for patients with fibrotic lung disease. Pirfenidone and nintedanib have been shown to modify the course of IPF by slowing IPF progression versus placebo, but cannot reverse existing fibrotic damage [4,5,6]. Therefore, early diagnosis and initiation of treatment are integral to slowing the irreversible decline in lung function experienced by patients with IPF. In other ILDs, early treatment of the underlying disease may also reverse disease, prevent or slow down disease progression, and relieve symptoms [7,8,9].

However, the diagnosis of an ILD is a complex process and many patients wait over 12 months for a diagnosis [10,11,12]. Reasons for delays are multifactorial and may include the presence of non-specific symptoms such as cough, dyspnoea and fatigue; a lack of evidence characterising ILD behaviour and progression during the early disease stages; and limited awareness about the prognosis and severity of ILDs among clinicians [13]. Many patients report one or more misdiagnoses prior to a final diagnosis [11] or geographical barriers to accessing specialised centres and multidisciplinary teams (MDTs) [14, 15].

It has been suggested that disease behaviour can be used to inform diagnosis, particularly in cases where reaching a diagnosis is difficult [16]; however, data on ILDs collected from around the time of diagnosis are scarce. Greater knowledge about ILD disease behaviour is needed to facilitate early and accurate diagnoses and guide treatment decisions. To develop a greater knowledge of ILD disease behaviour, it is important for frequent and detailed assessment of clinical measurements to be obtained as early as possible, particularly during the pre-diagnostic period. In clinical practice, the data available regarding disease behaviour may be limited during the pre-diagnostic period, and as clinic visits tend to occur in 3- or 6-month intervals, a more pragmatic approach may be required. Technologies such as home spirometry and accelerometry can allow these frequent measurements to be obtained as early as the pre-diagnostic period and may improve access to and quality of care for patients with ILD.

The STARLINER study aimed to assess disease behaviour in patients with suspected ILD during the peri-diagnostic period. The study used real-time home-based assessments, providing insights into the feasibility and utility of these technologies in patients with ILD.

Methods

Study Design and Participants

STARLINER was an international, multicentre study conducted at community and tertiary clinical centres in Canada, France, Ireland, Italy, The Netherlands and Russia.

Eligible patients were aged ≥ 50 years with suspected ILD, defined as radiological evidence of ILD in the presence of unexplained dyspnoea on exertion and/or cough. Patients were excluded if they had a history of clinically significant heart disease that could explain their symptoms, or a known history of CTD.

Patients were followed throughout the peri-diagnostic period, which comprised a pre-diagnostic period (from enrolment to diagnosis) and a post-diagnostic period (from diagnosis to treatment initiation) [17]. Patients left the study if they did not receive a diagnosis within 12 months of enrolment or if they received a non-ILD diagnosis. Patients who received a diagnosis of IPF or non-IPF ILD remained in the study until initiation of treatment for ILD or for a maximum of 6 months if treatment was not initiated. Therefore, the total duration of observation varied for each patient (≤ 18 months). Please see Fig. 1 in the previously published study design manuscript (https://link.springer.com/article/10.1007/s12325-018-0845-3#Fig1) for further details.

Patients were managed at the discretion of their treating physician. Concomitant therapies for comorbidities were continued according to regular local practice and were documented in the electronic case report form (eCRF).

The methods of this study have been previously described [17].

Compliance with Ethics Guidelines

All patients gave written informed consent. The study conformed to the International Conference on Harmonisation (ICH) E6 and E2A guidelines for Good Clinical Practice and Clinical Safety Data Management, the principles of the Declaration of Helsinki or the laws and regulations of the country where the research was conducted, whichever afforded the greater protection to the individual. The study was approved by the master ethics committee (Stichting BEBO, Postbus 1004, 9400 BA, Assen, The Netherlands) and by ethics committees at all other participating institutes. This trial is registered with ClinicalTrials.gov (NCT03261037).

Study Visits and Procedures

Disease behaviour was assessed using a multidimensional approach that included pulmonary function, physical functional capacity, symptoms and quality of life (QoL) collected by patients at home and during site visits.

Patients performed one home measurement of forced vital capacity (FVC) each day using a portable handheld turbine spirometer (Spirobank® Smart; Medical International Research, Rome, Italy). A second measurement was permitted if the technique of the first blow was not good, as detected by the spirometer and signalled to the patient. Patients’ steps per day were continuously measured using an accelerometer resembling a watch (Steel HR; Withings, Issy-les-Moulineaux, France). There was no minimum duration that the patients were required to wear the accelerometer for during the study.

Site visits at baseline, diagnosis and end of study were the only mandatory site visits, and as the study was conducted under real-life conditions, the timing of these site visits was not pre-determined. Otherwise, patients were managed and seen according to the standard of care at the discretion of the treating clinician, which could lead to additional site visits. Site measurements included: FVC, 6-min walk test (6MWT) and patient-reported outcomes (PROs), including King’s Brief Interstitial Lung Disease questionnaire (K-BILD), the EuroQoL 5-Dimensions 5-level index questionnaire (EQ-5D-5L), the Fatigue Assessment Scale (FAS), the modified Medical Research Council (mMRC) Dyspnoea Scale and three visual analogue scales (VAS) measuring cough, urge to cough and fatigue. Any further information collected as part of local standard of care was recorded in the eCRF. See Table S1 in the supplementary material for further details regarding mandatory outcome measures.

Digital Collaboration Platform

This study was facilitated by a novel digital collaboration platform, which has been described previously. Please see Fig. 2 in the previously published study design manuscript (https://link.springer.com/article/10.1007%2Fs12325-018-0845-3#Fig2) for further details.

Each patient received a handheld spirometer, an accelerometer, a tablet computer and user manuals for each device at enrolment. Patients received training on how to use the devices and had access to a digital avatar to guide them through the use of the spirometers.

Supplementary file2 Video 1. A guide on how to assemble the home spirometer (MP4 12111 KB)

Supplementary file3 Video 2. A guide on how to perform daily home spirometry measurements (MP4 39450 KB)

The tablet computers were linked to the digital collaboration platform to enable the treating physician to view their patient’s real-time data, and patients also had the option to view their own real-time data. Additionally, the digital collaboration platform had an optional functionality to allow investigators from community centres to share clinical data and high-resolution computed tomography (HRCT) images to obtain a second opinion from investigators from specialist tertiary ILD centres through virtual MDTs. Discussions between community and tertiary centres took place at any time during the study, and all diagnoses were at the discretion of the investigator.

The study sponsor did not have access to the digital collaboration platform and patient data collected by the home devices were anonymised before being transferred to the study database.

Outcomes

Information regarding diagnoses made during the study was collected, including, but not limited to, the percentage of patients diagnosed using a local MDT or via the virtual MDT function of the digital collaboration platform.

The primary endpoint was time-adjusted semi-annual change in FVC (ml) during the peri-diagnostic period measured using daily home spirometry in patients with IPF. The outcome was also analysed in patients with non-IPF ILD as a secondary endpoint.

Other secondary endpoints evaluated in patients with IPF and non-IPF ILD over the peri-diagnostic period were: time-adjusted semi-annual change in FVC (ml) measured using site spirometry, the correlation between home and site FVC at baseline and time-adjusted semi-annual changes in physical functional capacity measured by daily home accelerometry and site 6MWT. Changes in PRO scores over the peri-diagnostic period were assessed, including EQ-5D-5L health state VAS and index scores, K-BILD total score and subscores, FAS, fatigue VAS, cough VAS, urge-to-cough VAS and mMRC Dyspnoea Scale scores. Compliance with daily home measurements was also assessed.

There was no investigational medicinal product, and therefore there were no safety objectives; however, the incidences of respiratory-related and all-cause hospitalisations, respiratory-related and all-cause mortality, and acute exacerbations were reported.

Statistical Analyses

The planned sample size of 180 patients assumed that approximately 40% of patients who enrolled in the study would be diagnosed with IPF, 50% would be diagnosed with non-IPF ILD and 10% would be diagnosed with a non-ILD condition or would not have received a diagnosis within 12 months of enrolment.

The time-adjusted semi-annual FVC change measured using home spirometry was estimated for individual patients with at least three home spirometry measurements by applying a linear regression model. The estimated FVC change for each patient was then used to calculate the group mean and median FVC change.

An algorithm within the spirometers classified blows as acceptable or unacceptable manoeuvres (see Table S2 in the supplementary material). The primary analysis included only ‘good blows’ and an additional analysis included all ‘acceptable blows’.

For secondary outcomes measuring FVC, 6-min walk distance (6MWD) and steps per day, the same methodology as for the primary endpoint was used to estimate time-adjusted semi-annual changes during the peri-diagnostic period.

Sensitivity analyses excluding patients with less than 30 days of observation were performed for the primary and secondary outcomes detailed above.

For site measurements, baseline was defined as the measurement collected at enrolment.

To account for a learning curve for using the home devices, the first 7 days corresponding to the learning curve were removed; thus, for home spirometry and home accelerometry measurements, baseline was defined as the average of the daily home measurements collected during the next 7 consecutive days.

The correlation between baseline FVC measured using home and site spirometry was analysed by calculating Pearson’s correlation taking the individual values for each patient into account.

Changes in PROs and the incidences of respiratory-related hospitalisations, all-cause hospitalisations, acute exacerbations, respiratory-related mortality and all-cause mortality, and the percentage of patients diagnosed using an MDT were summarised descriptively.

The percentage per patient of days with at least one home spirometry measurement was calculated as the ratio of the number of days with at least one measurement and the number of days the patient was observed within the study. The percentage of patients with at least one gap in daily home spirometry or accelerometry measurements (a gap in daily home measurements was defined as ≥ 7 consecutive days of missing measurements) and the duration of gaps were summarised descriptively.

Results

Study Participants

Between 25 December 2017 and 4 February 2019, 178 patients were enrolled from 37 centres (24 community and 13 tertiary centres). In total, 68 (38.2%) patients were diagnosed with IPF, 62 (34.8%) patients were diagnosed with non-IPF ILD, 9 (5.1%) patients were diagnosed with a non-ILD condition, and 39 (21.9%) patients did not receive a diagnosis because they had either not received a diagnosis within 12 months of enrolment (11/39 [28.2%]) or because they discontinued the study early before being diagnosed (28/39 [71.8%]; Fig. 1).

Fig. 1
figure 1

Patient disposition. ILD interstitial lung disease, IPF idiopathic pulmonary fibrosis. a One patient failed screening because the investigator did not consider them able to comply with the study protocol, e.g., the ability to use the provided spirometer and tablet, and the ability to fill in the required patient-reported outcomes questionnaires. The other screening failure was due to confirmation of a large B-cell malignancy

Baseline characteristics were generally similar across the diagnostic subgroups; however, a higher percentage of patients with IPF were male and had a previous history of smoking compared with patients with non-IPF ILD (Table 1).

Table 1 Baseline demographics and clinical characteristics

The virtual MDT function of the digital collaboration platform was used for diagnosis in 14.7% and 17.7% of patients diagnosed with IPF and non-IPF ILD, respectively. A local MDT was used in 70.6% of patients with IPF and 59.7% of patients with non-IPF ILD, whereas an MDT was not used in 14.7% and 22.6% of patients with IPF and non-IPF ILD, respectively. See Table S3 and Table S4 in the supplementary material for further details on the non-IPF ILD diagnoses and the assessments used to make a diagnosis of IPF or non-IPF ILD.

The overall median (minimum–maximum) time from enrolment to diagnosis was 10.9 (0.6–53.3) weeks in patients with IPF and 20.7 (0.9–52.9) weeks in patients with non-IPF ILD. For patients enrolled at community sites, the median (minimum–maximum) time from enrolment to diagnosis was 10.7 (0.6–53.3) weeks in patients with IPF and 21.3 (0.9–51.4) weeks in patients with non-IPF ILD. For patients enrolled at tertiary sites, the median (minimum–maximum) time from enrolment to diagnosis was 11.0 (3.1–38.9) weeks in patients with IPF and 19.6 (1.0–52.9) weeks in patients with non-IPF ILD.

In total, 59 (86.8%) patients with IPF and 26 (41.9%) patients with non-IPF ILD were initiated on treatment within 6 months of diagnosis. The overall median (minimum–maximum) time from diagnosis to treatment initiation was 1.4 (0.1–27.3) weeks in patients with IPF and 6.6 (0.1–27.9) weeks in patients with non-IPF ILD. For patients enrolled at community sites, the median (minimum–maximum) time from diagnosis to treatment initiation was 2.0 (0.1–26.4) weeks in patients with IPF and 0.3 (0.1–27.9) weeks in patients with non-IPF ILD. For patients enrolled at tertiary sites, the median (minimum–maximum) time from diagnosis to treatment initiation was 1.4 (0.1–27.3) weeks in patients with IPF and non-calculable (0.1–27.1) weeks in patients with non-IPF ILD. Further details regarding patient disposition and treatment initiation are presented in Table S5 of the supplementary material.

Primary Endpoint

The analysis of the home spirometry data was affected by outliers, primarily due to patients with few data points or short observation periods, resulting in individual slope estimates that were not normally distributed. Therefore, presentation of mean group values, as prespecified, was not appropriate and median was chosen as a more suitable statistic for skewed data. For completeness, the prespecified mean data are provided in Table S6 of the supplementary material. Additionally, the prespecified primary analysis plan included only ‘good blows’; however, as ‘acceptable blows’ were thought to be more representative of clinical practice, these data are discussed in the main text. The data including only ‘good blows’ can be found in Table 2 in the main manuscript and Table S6 of the supplementary material.

Table 2 Summary of semi-annual change in lung function and exercise capacity during the peri-diagnostic period estimated from a linear regression modela in patients with IPF and non-IPF ILD

Time-adjusted median (quartile [Q]1, Q3) semi-annual change in FVC measured using home spirometry during the peri-diagnostic period in patients diagnosed with IPF was – 147.7 (– 723.8, 376.2) ml when including all ‘acceptable blows’ (Table 2; Fig. 2).

Fig. 2
figure 2

Time-adjusted semi-annual FVC change during the peri-diagnostic period in patients with IPF and non-IPF ILD, measured by home spirometry including (a) only ‘good blows’ and (b) all ‘acceptable blows’. FVC forced vital capacity, ILD interstitial lung disease, IPF idiopathic pulmonary fibrosis, Q quartile

Secondary Endpoints

Time-adjusted median (Q1, Q3) semi-annual change in FVC measured using home spirometry during the peri-diagnostic period in patients with non-IPF ILD was 19.1 (– 194.9, 519.0) ml when including all ‘acceptable blows’ (Table 2; Fig. 2).

Time-adjusted median (Q1, Q3) semi-annual change in FVC measured using site spirometry during the peri-diagnostic period was – 149.0 (– 314.6, 163.9) ml in patients with IPF and – 23.4 (– 117.9, 133.5) ml in patients with non-IPF ILD (Table 2).

The correlation between baseline home and site FVC (including only ‘good blows’) was moderate/strong (Pearson’s correlation coefficient = 0.71 and 0.69 for patients with IPF and non-IPF ILD, respectively).

Time-adjusted median (Q1, Q3) semi-annual change in steps per day during the peri-diagnostic period was – 692.9 (– 5010.8, 430.6) steps in patients with IPF and – 403.6 (– 2067.9, 302.4) steps in patients with non-IPF ILD (Table 2).

Time-adjusted median (Q1, Q3) semi-annual change in 6MWD during the peri-diagnostic period was 16.1 (– 31.1, 121.2) m in patients with IPF and – 3.7 (– 47.4, 24.3) m in patients with non-IPF ILD (Table 2).

In sensitivity analyses excluding patients with < 30 days of observation, time-adjusted semi-annual FVC change measured using home spirometry, including all ‘acceptable blows’, during the peri-diagnostic period was – 114.6 (– 526.6, 376.2) ml in patients with IPF and 19.1 (– 180.6, 430.1) ml in patients with non-IPF ILD. Further details regarding sensitivity analyses of lung function and exercise capacity outcomes in patients with IPF and non-IPF ILD excluding patients with < 30 days of observation are presented in Table 2, Fig. 2 and Table S6 of the supplementary material.

During the peri-diagnostic period, mean (standard deviation [SD]) change from baseline in EQ-5D-5L health state VAS score was a deterioration of – 2.0 (15.4) in patients with IPF and an improvement of 3.1 (15.3) in patients with non-IPF ILD (Table 3). Mean (SD) change from baseline in EQ-5D index score was a deterioration of – 0.02 (0.14) for patients with IPF and an improvement of 0.01 (0.12) in patients with non-IPF ILD (Table 3). Mean (SD) change from baseline in K-BILD total score was a deterioration of – 0.45 (9.71) for patients with IPF and an improvement of 3.98 (11.85) in patients with non-IPF ILD (Table 3).

Table 3 Changes from baseline in PROs during the peri-diagnostic period in patients with IPF and non-IPF ILD

Results for K-BILD subscores, FAS, fatigue VAS, cough VAS and urge-to-cough VAS are also presented in Table 3. Results for mMRC Dyspnoea Scale scores are presented in Table S7 of the supplementary material.

The incidence rate of non-elective hospitalisations was low. In total, six (8.8%) and two (2.9%) patients diagnosed with IPF experienced at least one all-cause hospitalisation during the pre-diagnostic (11 events) and post-diagnostic (two events) period, respectively. The corresponding values for respiratory-related hospitalisations were four (5.9%) patients (five events) and 0 patients. In total, three (4.8%) and 0 patients diagnosed with non-IPF ILD experienced at least one all-cause hospitalisation during the pre-diagnostic (six events) and post-diagnostic period, respectively. Respective values for respiratory-related hospitalisations were three (4.8%) patients (five events) and 0 patients. One patient with IPF experienced a single acute exacerbation during the pre-diagnostic period. No deaths were recorded during this study.

Quality of Home Assessments

The total number of blows performed during the study per cohort was 8647 for patients with IPF and 12,167 for patients with non-IPF ILD. Of these, 54.0% and 60.2%, respectively, were categorised as ‘good blows’; 19.6% and 19.5%, respectively, were categorised as ‘acceptable blows except good blows’; and 26.4% and 20.2%, respectively, were categorised as ‘bad blows’ by the spirometer algorithm. The median (Q1, Q3) duration of observation was 15.4 (8.5, 30.6) weeks in patients with IPF and 28.5 (13.3, 46.7) weeks in patients with non-IPF ILD.

Compliance with Home Assessments

Compliance with home spirometry, including all blows, is presented for IPF and non-IPF ILD in Fig. S1 of the supplementary material. During the peri-diagnostic period, the median (Q1, Q3) adherence to home spirometry, defined as the percentage per patient of days with ≥ 1 home spirometry blow, was 84.8% (73.2, 93.2) for IPF and 87.5% (72.6, 95.2) for non-IPF ILD. During the peri-diagnostic period, 50.0% of patients with IPF and 53.2% of patients with non-IPF ILD had at least one gap in daily home spirometry measurements (a gap in daily home measurements was defined as ≥ 7 consecutive days of missing measurements). The median (Q1, Q3) duration of gaps was 20.6 (11.0, 41.0) days in patients with IPF and 21.0 (16.0, 43.0) days in patients with non-IPF ILD. During the peri-diagnostic period, 60.3% of patients with IPF and 75.8% of patients with non-IPF ILD had at least one gap in daily home accelerometry measurements. The median (Q1, Q3) duration of gaps was 27.5 (18.0, 52.0) days in patients with IPF and 26.0 (14.0, 66.0) days in patients with non-IPF ILD. The mean (95% confidence interval [CI]) percentage of days with at least one blow was 77.7% (72.0, 83.3) for patients with IPF and 78.1% (72.1, 84.1) for patients with non-IPF ILD.

Discussion

The STARLINER study aimed to utilise a digital ecosystem to investigate disease behaviour during the peri-diagnostic period in patients with suspected ILD. This study is the first to collect prospective data on ILD disease behaviour during the time period before diagnosis and treatment initiation. Daily home spirometry and accelerometry allowed more frequent data collection and minimised the number of mandatory site visits required to measure lung function and exercise capacity. During the peri-diagnostic period, differences in changes in FVC measured by home and site spirometry were observed between patients with IPF and patients with non-IPF ILD. A greater decline in steps per day was observed for IPF versus non-IPF ILD, whereas an increase in 6MWD was observed for patients with IPF versus a decline in 6MWD for patients with non-IPF ILD. No clear patterns of disease behaviour were observed for IPF versus non-IPF ILD for PROs. Technical and analytical difficulties in the home-based assessments prevented firm conclusions regarding disease behaviour in these diagnostic subgroups and highlight the need for further optimisation of the technology.

Similar to previous studies utilising home spirometry, a number of technical and analytical difficulties were encountered during this study [18]. Connectivity issues between the patients’ devices and the digital platform were encountered, which either prevented patients from performing the home-based measurements or prevented the transfer of data from the devices to the digital platform. There was also no direct helpdesk support available to help patients overcome these issues, and clinicians had to contact the helpdesk on behalf of patients. Additionally, the algorithm within the home spirometers determined blow quality based on technique rather than the obtained value, thus allowing clinically implausible values to be accepted as ‘good blows’. Furthermore, some patients only had a small number of observations collected over a short time period during the study, which led to problems in applying the prespecified linear regression model to analyse the home FVC data, leading to clinically implausible predicted changes in FVC for individual patients, with a subsequent effect on group means. Therefore, whilst home spirometry can be a useful tool, the technical and analytical issues experienced in this study highlight a clear need for further optimisation of the technology. It is important to determine the most accurate type of spirometer, improve the algorithms within spirometers to identify and discard implausible lung-function values and ensure proper calibration of home spirometers to prevent inaccurate measurements [19, 20], and further explore how to analyse home spirometry data in clinical trials [21]. The availability of direct helpdesk support for patients in their local language may help to reduce any technical issues and increase compliance with home measurements in future studies [22]. Furthermore, it is necessary to develop standardised criteria for home spirometry, including frequency of measurements, how many blows may be required per day and suitable acceptability and repeatability criteria for uncontrolled environmental conditions.

Despite the technical issues described above, we demonstrated that in line with previous studies, daily home spirometry was feasible for most patients; however, compliance was variable [22,23,24,25,26]. Home measurements can allow more frequent data collection and minimise the number of mandatory site visits required to measure lung function and exercise capacity. During the COVID-19 pandemic, several components of the diagnosis and management process in patients with ILD have been impacted as a result of minimising face-to-face interactions between healthcare workers and patients with ILD. Moreover, many clinical consultations have taken place via videoconferencing, and pulmonary function tests, which have a high risk of spreading infection, have been restricted [27]. Therefore, optimisation of smartphone technologies, home spirometry technologies and virtual platforms, like the one used in this study, is crucial to maintain or improve the level of care surrounding the diagnosis, monitoring and management of patients with ILD, particularly during the COVID-19 pandemic [21, 27, 28].

A number of insights were gained from this study. First, when measured by home spirometry, patients with IPF experienced a decline in FVC during the peri-diagnostic period, whereas an increase in FVC was observed for patients with non-IPF ILD. In line with the home spirometry data, patients with IPF experienced a decline in FVC during the peri-diagnostic period, when measured by site spirometry. A decline in FVC was also observed for patients with non-IPF ILD when measured by site spirometry; however, patients with IPF experienced a greater decline in FVC versus patients with non-IPF ILD. A greater decline in steps per day was observed for IPF versus non-IPF ILD, whereas an increase in 6MWD was observed for patients with IPF versus a decline in 6MWD for patients with non-IPF ILD. Although the results cannot be directly compared, the observation that the deterioration in steps per day was not mirrored by the 6MWD results may reflect that the 6MWT is a measure of what patients can do in a clinical environment, whereas daily accelerometry is a measure of what patients do under normal circumstances at home. No clear patterns of disease behaviour were observed for IPF versus non-IPF ILD for PROs. In patients with IPF, EQ-5D-5L health state VAS score and K-BILD total score tended to worsen over time, but these changes did not exceed previously reported ranges of the minimal clinically important difference (MCID) [29, 30]. In patients with non-IPF ILD, the improvements from baseline in EQ-5D-5L health state VAS score and K-BILD total score were not greater than the previously reported range of MCIDs [29, 30]. However, in patients with IPF, the deterioration from baseline in EQ-5D index score was within the reported MCID range of 0.005–0.054 [29] and the same applied for the improvement from baseline in EQ-5D index score in patients with non-IPF ILD [29]. Therefore, these data suggest that patient symptoms and well-being, measured by structured PROs, may also provide insights into disease behaviour during the diagnostic trajectory, mimicking clinical practice where a combination of these factors is considered when forming a diagnosis.

Whilst technical and analytical issues prevent firm conclusions about differences in disease behaviour between IPF and non-IPF ILD in this study, further investigations may be able to clarify differences in disease behaviour between these diagnostic subgroups during the peri-diagnostic period and their additive value in classifying ILDs and forming diagnoses in the future [31]. Although disease behaviour alone would not form the basis of a diagnosis, it could, alongside other clinical and radiological information, help clinicians to form working diagnoses and to re-evaluate them when longitudinal disease behaviour is discordant with the working diagnosis [16, 31].

A second key observation was that the median time from enrolment to diagnosis for non-IPF ILD was twice that reported for IPF (20.7 weeks vs. 10.9 weeks, respectively). This finding may reflect that the diagnosis of IPF can be relatively straightforward when patients present with classical symptoms, HRCT images and no relevant exposures. In comparison, the radiological picture can be less clear in non-IPF ILDs, and there may be a need for further diagnostic assessments, e.g., bronchoalveolar lavage or biopsy, or involvement of other specialists, e.g., rheumatologists in patients with suspected CTD. In clinical practice, diagnostic delays and misdiagnosis of ILDs have been well documented [10,11,12] and can have many negative effects on patients, including the need for consultations with multiple clinicians, repetition of diagnostic tests and poorer long-term outcomes [11, 12]. Importantly, delays in diagnosis can prevent the initiation of treatments, deny patients the chance to take part in clinical trials and cause anxiety for patients and their families.

A number of limitations should be considered when interpreting the results of this study. It should be noted that it was not possible to analyse any data for calorie expenditure during this study as technical difficulties with the devices used for functional capacity measurements led to unreliable data being collected. Additionally, some problems were experienced when trying to upload or view HRCT images on the digital collaboration platform. These problems may have prevented full use of the virtual MDT function and potentially impacted the number of patients who were diagnosed. The percentage of patients diagnosed with a non-ILD, or who received no diagnosis, exceeded the 10% assumed in the sample-size calculation. Particularly, the number of patients with no diagnosis (N = 39) was higher than expected, which may have impacted the median time to diagnosis; however, only 11 patients were excluded from the study because they had not received a diagnosis after 12 months in the study—the remaining 28 patients with no diagnosis discontinued the study prematurely and may have received a diagnosis had they remained in the study. The presence of a subgroup of patients without diagnosis after 12 months highlights the unmet need for an earlier and accurate diagnosis in these patients. Furthermore, it was not possible to perform correlation analyses between home and site FVC in patients with IPF and non-IPF ILD over time, as the only mandatory site visits were at baseline, diagnosis and end of study, and these site visits did not have a prespecified schedule. Additionally, there may have been a selection bias for patients who were more interested in, or more familiar with, online technologies, as only 1 of the 180 eligible patients failed screening because of not being able to use the provided equipment and complete the PRO questionnaires. It is, however, unlikely that this influenced the diagnostic trajectory of the patients included in the study. Similarly, only community sites interested in home monitoring in ILD would have participated in this study, which may have led to a selection bias; however, this bias is also present in many other clinical trials, and the inclusion criteria in this study were broad, which may have resulted in a patient population and diagnostic practices that are more representative of the ILD field. Finally, the widespread availability of the technology used in this study may be limited by financial barriers in some healthcare systems; however, the COVID-19 pandemic is forcing policy makers to consider new and sustainable care models.

In conclusion, this is the first study to investigate disease behaviour in patients with suspected ILD during the peri-diagnostic period using real-time, home-based assessments. During this study, differences in FVC change measured by home and site spirometry were observed between patients with IPF and patients with non-IPF ILD. A greater decline in steps per day was observed for IPF versus non-IPF ILD, whereas an increase in 6MWD was observed for patients with IPF versus a decline in 6MWD for patients with non-IPF ILD. It is not clear why the results for steps per day and 6MWD were discordant; however, it may reflect that these measurements capture different aspects of physical activity. No clear patterns of disease behaviour were observed for IPF versus non-IPF ILD for PROs. Technical and analytical difficulties in the home-based assessments prevented firm conclusions regarding disease behaviour in these diagnostic subgroups and highlight the need for further optimisation of the technology. Nevertheless, use of the technology in this study was feasible for most patients and centres. Further optimisation of the home-based assessments and virtual MDT platforms for clinicians and patients, such as the one used in this study, is required before these can be used in clinical practice to facilitate collaboration among patients, community centres and tertiary centres to improve the timely access to diagnosis and treatment for patients.