Introduction

Bronchoscopy with bronchoalveolar lavage (BAL) is a commonly performed clinical procedure that yields fluid containing cells and other substances that populate the lung mucosa and alveolar surface, assisting in diagnosis and treatment of many lung diseases. A flexible video bronchoscope introduced into a participant’s airway, advanced until gently wedged in a target lung lobe or segment, allows warmed sterile saline to be instilled and then aspirated through the working channel of the bronchoscope (Fig. 1). The procedure is not without risk, as often the patient needs intravenous (IV) sedative agents, there may be mechanical trauma from the scope or endobronchial procedures which can cause bleeding or pneumothorax, or residual fluid which cannot be aspirated may, rarely, cause problems such as infection1. These risks can be mitigated and therefore BAL for research alone, without any envisioned clinical benefit to the participant, is becoming a common procedure. The fluid is used for both qualitative and quantitative measurements to study structural and immune cells, soluble mediators, mucus, and features of tissue remodelling, at the site of disease2. Researchers have an ethical duty to ensure that the procedure is as safe as possible for the volunteer, and that the BAL specimen retrieved is of sufficient quantity and quality to be of scientific value. The European Respiratory Society (ERS) in 1999 and the American Thoracic Society (ATS) in 2012 published guidance documents (the latter specifically for patients with suspected interstitial lung disease), in an attempt to standardise BAL procedures, processing and analysis3,4. Research studies may, however, require deviations from these recommendations, and many other factors can influence the outcomes of a BAL procedure beyond these parameters. For example, underlying chronic obstructive pulmonary disease (COPD) is associated with lower BAL volume yield than that from controls, and the yield is worse with declining lung function and disease severity on imaging5,6; tobacco and cannabis smoking both have been shown to increase BAL cell yields7,8; and a manual aspiration technique improves volume and cell yield over wall suction use9. Moreover, there are likely to be many factors which influence outcomes that are yet unknown to us. At the time of the bronchoscopy, the only measures of success or failure the physician (operator) has, is the wellbeing of their patient and the BAL volume yield, or percentage of instilled volume returned. It is only after processing of the specimen that the feasibility of using the BAL for specific research assays can be established, as determined by the cell quantity, quality, viability and differential count.

Figure 1
figure 1

Bronchoscopy and bronchoalveolar lavage. (A) A research participant is about to have a Fujifilm flexible videoscope with 3.2 mm working channel inserted through her nose while under conscious sedation. Clinicians performing the procedure are authors STM and JAS. (B) Sequential photos from within the airways (from top to bottom: vocal cords before intubation, main carina, and right middle lobe medial segment subsegmental bronchi) taken with the bronchoscope. (C) Results of the bronchoalveolar lavage in a non-smoker, aliquots of saline with foamy surfactant on the surface. (D) An example of significant staining of the lavage fluid with black particulate matter from tobacco smoking.

The aim of this study was to optimise bronchoscopy with BAL for immunology research, by identifying factors which affected the main BAL outcomes when procedures are performed according to current recommendations: BAL volume yield, total BAL cell yield and cell concentration, BAL cell viability, pellet colour (to identify blood contamination) and differential cell counts. In addition, procedure-related adverse events during the bronchoscopy were investigated. Special attention was given to investigating the effects (if any) of infections on BAL outcomes, specifically tuberculosis (TB), HIV infection and recent SARS-CoV-2 infection.

Methods

Study design

Participants in this study were recruited from five separate National Institutes of Health (NIH)-funded projects in which BAL was performed between 1 January 2019 and 15 March 2022, for research into host immune responses to Mycobacterium tuberculosis (Mtb) by the Biomedical Research Institute (BMRI) Clinical Team and the Immunology Research Group (IRG) laboratory of Stellenbosch University, Cape Town, South Africa. These studies included participants recruited through contact with BMRI community-based research workers from subdistricts in the Cape Town metropolitan region, as well as from the Drakenstein and Saldanha municipal subdistricts of the Western Cape. Data on clinical parameters were collected by research clinicians and nurses as part of the studies in which the participants were enrolled, while the laboratory processing data were collected by the research assistants on the day of processing. All participants provided written informed consent for the parent study into which they were enrolled. Each parent study was approved by the Stellenbosch University Health Research Ethics Committee (reference numbers: N16/03/033, N18/10/118, N19/07/093, N19/10/150 and N20/07/079). Research was performed in accordance with the Declaration of Helsinki. The participant and authors in Fig. 1 have provided informed consent to publish the image in an online open access publication.

In general, participants were included if they met the following criteria: (1) age 18–65 years; (2) weight 35–120 kg; (3) absence of current source of infection apart from those required for specific groups; (4) consented to bronchoscopy with BAL, Positron Emission Tomography-Computed Tomography (PET-CT) scan where applicable, and nasopharyngeal swab for SARS-CoV-2 reverse transcription quantitative real time-Polymerase Chain Reaction (RT-qPCR); and (5) had a verifiable address or residence location for duration of the study. Exclusion criteria included: (1) pregnancy or breastfeeding; (2) haemoglobin < 9.0 g/dl; (3) any severe systemic condition/co-morbidity that may affect the safety of the participant or the performance of the assays; (4) diabetes (point of care HbA1c ≥ 6.5%, random glucose ≥ 200 mg/dl (or 11.1 mmol/L), fasting plasma glucose ≥ 126 mg/dl (or 7.0 mmol/L), or the concomitant use of any anti-diabetic agent); (5) immunosuppressive medication use, including inhaled corticosteroids, within the past 2 weeks; (6) substance or alcohol abuse that in the opinion of the investigator may interfere with the participant’s adherence to study procedures; or any person that the physician/study nurse feels is not appropriate for the study, or that the study is not in the person’s best interest; (7) known allergy or intolerance to drugs used in procedural sedation or a known history or family history of malignant hyperthermia; or (8) history of taking antibiotics with anti-TB activity in the past 4 weeks (non-TB groups).

Participants were divided into the following groups according to history, clinical assessment, imaging and laboratory investigations:

  1. 1.

    TB groups: categorised according to treatment duration into ‘Pre-Treatment’ (treatment not yet initiated), ‘Early Treatment’ (less than one month) and ‘End of Treatment’ (completed 6 months of standard treatment). Participants with drug resistant TB or extrapulmonary TB (without predominant intrathoracic TB component) were excluded.

  2. 2.

    Household Contacts: close contact with an individual with active untreated TB within three months; active TB disease ruled out on sputum testing, chest radiograph and symptom screen. Contacts were excluded if they had previous TB disease in the last three years, or current TB symptoms.

  3. 3.

    Community Controls: no active TB disease in previous three years, no recent known exposure to TB, active TB ruled out on sputum testing, chest radiograph and symptom screen.

  4. 4.

    People living with HIV (PLHIV): known or newly diagnosed HIV infected individuals, active TB disease ruled out on sputum testing, chest radiograph and symptom screen.

Research procedures

Mtb detection

All participants had sputum testing for Mtb with Auramine smear, liquid culture [Mycobacterial Growth Indicator Tubes (MGIT), BACTEC 960, Becton Dickinson], and GeneXpert MTB/RIF Ultra® assay (Cepheid, Sunnyvale, CA, USA), hereafter ‘Xpert Ultra’. A diagnosis of TB was assigned if a symptomatic individual had a positive MGIT culture or Xpert Ultra (graded as medium and higher), or positive Xpert Ultra graded as lower than medium but with consistent chest radiograph features.

All participants except those with active TB disease were also tested with an interferon gamma release assay (IGRA), in this case QuantiFERON-TB Gold Plus® (QFT) assay (Qiagen, Carnegie, Australia).

SARS-CoV-2 detection

The SARS-CoV-2 positive subgroup was defined as a previous verifiable positive SARS-CoV-2 RT-qPCR in a local reference laboratory, or positive serology on the Abbott SARS-CoV-2 IgG assay (Abbott Laboratories, South Africa, Pty Ltd) or the Abbott SARS-CoV-2 IgG II Quant (Abbott Laboratories, South Africa, Pty Ltd), both done on the Architect i System (Abbott Laboratories, Illinois, USA). All SARS-CoV-2 tests were carried out by the National Health Laboratory Service (NHLS) of South Africa (https://www.nhls.ac.za/). The strategy interpreting the serology results is described in Supplementary Table S1 in accordance with study protocol10. Participants who were previously vaccinated against SARS-CoV-2 who only had a positive anti-S antibody on serologic testing, and no previous positive RT-qPCR (and therefore no evidence of infection-acquired immunity or previous infection), were not included in the SARS-CoV-2 positive subgroup.

Bronchoscopy with bronchoalveolar lavage (BAL)

Procedures were performed in accordance with current recommendations3,4. In brief, all participants were pre-screened for fitness for bronchoscopy according to predefined criteria by a study clinician with knowledge of the procedure. A dedicated sedation practitioner achieved conscious sedation using either a propofol/fentanyl or midazolam/fentanyl combination. The participant received pre-oxygenation and continuous supplemental oxygen throughout the procedure, with continuous cardiac, non-invasive blood pressure and peripheral saturation monitoring until full recovery. Topical 1% lignocaine spray was applied to the throat and an anaesthetic gel in the nose. Topical lignocaine was applied to the vocal cords and large airways through the working channel of the flexible video bronchoscope which was then gently wedged in the target segment. Sequential aliquots of 60 ml were individually instilled then manually aspirated with a 20 ml syringe after ten seconds ‘dwell time’, to a total of 240 ml. The target lobe was chosen by the team prior to the procedure, based on the protocol of the parent study. In general, the right middle lobe (RML) was targeted in Community Controls and Household Contacts with normal chest imaging (with lingula as backup), and the area of most involvement with pulmonary TB or an imaging abnormality in keeping with subclinical TB in a Household Contact was targeted. Two of the studies included bronchial brushings (a single use brush with fine silicone bristles was used to harvest cells and microbiological material by lightly abrading the bronchial mucosa when inserted through the bronchoscope working channel) as part of their procedure, done either before or after the BAL depending on the standard operating procedure at the time. A thoracic ultrasound was performed in the recovery room on participants who underwent brushings, to exclude complications.

Adverse events

Data on adverse events were collected from an additional 20 study participants, resulting in 357 BAL procedures being included in these analyses. Information on adverse events was collected during the procedure and in recovery by clinicians using standardized patient record sheets. Participants were also contacted by study staff at 72 h and 14 days after the procedure to gather information regarding adverse events and persistent symptoms using a standardised list of questions, captured directly onto the electronic database. If needed a follow up review was arranged.

Processing of BAL fluid

The BAL fluid was placed on ice immediately after aspiration. Processing was initiated within two hours of harvest. Steps differed between studies, but in general the processing steps included: pooling of all aliquots, two sterile filtration steps (Falcon Cell Strainers 70 μm and 40 μm, Corning Inc., NY), then centrifugation for ten minutes at 350 × g at + 8 °C to pellet the cells. Two washing steps were performed using supplemented Roswell Park Memorial Institute Medium (RPMI 1640; Biowest, Nuaillé, France), that is 46.5 ml RPMI 1640 with 500 μl of 1% L-glutamine (MilliporeSigma, Massachusetts, USA), 2.5 ml of 5% Human AB serum (MilliporeSigma, Massachusetts, USA), and 500 μl of a 1% Penicillin/Streptomycin/Amphotericin B mixture [100 IU/ml Penicillin, 100 µg/ml Streptomycin and 0.25 µg/ml Amphotericin B in 0.85% saline (MilliporeSigma, Massachusetts, USA)]. If the pellet was judged contaminated with blood by visual inspection, an additional red cell lysis step was performed in 1 ml of Lonza ACK lysis buffer (1x) [Whitehead Scientific (Pty) Ltd, SA]. The total cell count was done with a haemocytometer and viability check by Trypan Blue exclusion method. A fraction (1% of the total cell count) was placed in the cytocentrifuge for the Cytospin (Simport, Saint-Mathieu-de-Beloeil, Canada) and differential count procedure. The Cytospin slide (ELITechGroup, Paris, France) was fixed with a fixative and stained RAPI-DIFF Stain 1 (contains Eosin Y dye in phosphate buffer) and using RAPI-DIFF Stain 2 (Clinical Sciences Diagnostics, Johannesburg, South Africa) containing polychromed methylene blue in phosphate buffer for 30 s each. Cytospin slides were counted manually (to a total of 100 or 200 cells). The BAL was finally fractionated according to the study protocol and cryopreserved in 10% dimethyl sulfoxide (MilliporeSigma, Massachusetts, USA) and 90% fetal bovine serum (Cytiva, Massachusetts, USA) and, by gradual cooling to − 80 °C in a Nalgene Mr Frosty™ container [Sigma Aldrich (Pty) Ltd, Gauteng, South Africa] with isopropanol for 24 h followed by long term preservation in liquid nitrogen.

PET-CT scans

Household Contacts, PLHIV, and a subset of the Community Controls had PET-CT imaging performed in addition to the routine chest radiographs, according to their study protocol. PET-CT imaging was done at the Nuclear Medicine Research Institute Node for Infection Imaging (NII), Stellenbosch University, South Africa with a Vereos PET-CT scanner (Philips, Amsterdam, Netherlands)10. 18F-fluorodeoxyglucose (FDG) radiotracer was administered at a dose of 2.8 mBq per kg, 60 min before imaging. Results were reported by consensus between a nuclear medicine physician and a radiologist.

Analyses

Analyses were conducted in R11. Simple descriptive statistics were used to describe the observations, with percentages and proportions for categorical variables, and means with standard deviations or medians with interquartile ranges for continuous data that had a normal and skewed distribution respectively. Robust versions of statistical algorithms as implemented in the R package WRS212 were used for analyses since the skewed outcome variables would result in non-normality of residuals.

The predetermined outcomes of interest in the analysis were:

  1. 1.

    BAL volume yield: reported in millilitres of fluid or as the percentage of instilled volume returned with aspiration. An optimal yield from BAL is > 30% of the instilled volume (i.e. > 70 ml from a 240 ml BAL)4.

  2. 2.

    Total BAL cells (reported as counts) and BAL cell concentration (total cells per millilitre of BAL fluid), and viability (percentage of live cells in total cell count).

  3. 3.

    Pellet colour: categorised as bloody, clear, or black. Of particular importance is the bloody pellet, which implies contamination with blood and therefore trauma to the mucosal surface (a minor adverse event). A bloody pellet requires a cell lysis step during processing to lyse erythrocytes, as the presence of red blood cells may reduce the viability of the immune cells after thawing, and as the cells originate in the peripheral blood rather than the lung, they potentially undermine the results of the immunological end assays done on the BAL fluid.

  4. 4.

    Differential cell count in BAL fluid: reported as the relative proportions of macrophages (normal > 85%), lymphocytes (normal 10–15%), neutrophils (normal ≤ 3%) and eosinophils (normal ≤ 1%). The pattern of abnormality may provide insight into the underlying immune dysfunction4.

Data were cleaned and the outcome variables BAL yield and BAL concentration were log transformed and winsorized using a robust median-based adjustment implement in RobustHD13 to make linear models tractable. To reduce the number of explanatory variables included in models, the variable selection algorithm Boruta was applied. Variables included in the initial Boruta algorithm were selected based on availability of data, previous evidence of their effect on BAL yields, and the authors’ hypothesis of a possible effect. The following variables were subject to selection with the Boruta algorithm for all outcomes: clinical group, smoking status, pellet colour (not included in the analysis for the outcome ‘pellet colour’), anatomical lung lobe used for the procedure, age category, gender, recreational substance use, imaging category, Karnofsky score, IGRA status, SARS-CoV-2 status, observed airway abnormality during the procedure, comorbidity, and BAL procedure duration. A stringent cut-off for variable selection (P < 0.005), and 3,000 iterations were used to select variables. The final variable list included: anatomical lung lobe, observed airway abnormalities, age, pellet colour, gender, recreational substance use, clinical group and smoking status for BAL volume yield; anatomical lung lobe, pellet colour, age, smoking status, clinical group, imaging category, IGRA status, and gender for BAL total cell yield; smoking status, pellet colour, anatomical lung lobe, clinical group, age, IGRA status, and gender for BAL cell concentration; clinical group, IGRA status, and imaging category for BAL cell viability; and clinical group, Karnofsky score, and SARS-CoV-2 status for pellet colour. These variables were analysed separately using robust ANOVA methods and post-hoc t-tests if the ANOVA was significant. An ANOVA on the medians was also performed for comparison to the robust ANOVA on trimmed means (5% trim). All P values were corrected for multiple testing using the Benjamini and Hochberg14 false discovery procedure.

Cell differential counts were analysed differently to the other outcomes. Since the total cell counts varied widely between individuals, cell type counts were converted to percentages of the total count to standardize within subjects and ensure comparability. Robust ANOVA, followed by post-hoc testing and alpha of 0.05, with correction for multiple testing using the Benjamini and Hochberg procedure. Since cell type numbers were correlated, e.g., where there was a higher proportion of lymphocytes, there was a corresponding lower proportion of one or all of the other cell types, a variable that was found to significantly influence the proportion of one cell type, was interpreted as a variable influencing the pattern or overall distribution of the cell differential count.

Adverse events were counted and classified as those occurring during the procedure or in recovery, and those occurring after the procedure, assessed at 72 h and 14 days after the bronchoscopy. Adverse events in different clinical groups were compared using a two-sided Cochran-Armitage test for trend in one-against-the-rest scenarios.

Results

Participant characteristics

A total of 337 bronchoscopy procedures were performed solely for research purposes, on 336 participants, as part of five NIH-funded studies from 1 January 2019 to 15 March 2022 and were included in the analysis. The pertinent characteristics of the study population are described in Table 1.

Table 1 Characteristics of the study population stratified by clinical group.

The most common comorbidity was hypertension (eight in Community Controls, six in Household Contacts, one in the TB groups and four in PLHIV). Four participants had asthma, four had previous chest trauma, and three had allergic rhinitis. The remaining comorbidities were a wide range of minor ailments. Approximately one fifth of the population used recreational substances, of whom 59 (85.5%) smoked cannabis (only five of these did not also smoke tobacco); four (5.8%) used both cannabis and another substance such as methamphetamine or methaqualone (all current tobacco smokers); and six (8.7%) used methamphetamine or methaqualone alone or in combination (all current tobacco smokers).

Of the 78 participants with active TB (the TB Pre-Treatment and TB Early Treatment groups), the mean body mass index (BMI) was 19.5 (14.8–29.0), with 39 (50%) of participants classifying as ‘underweight’ and only four (5.1%) as ‘overweight’. On study entry, 39 (50.0%) participants were sputum smear positive, with scores as follows: five (6.4%) ‘scanty’ (less than 10 AFB in all fields); 17 (21.8%) ‘1+’ (less than 1 AFB per field); 13 (16.7%) ‘2+’ (1–10 AFB per field); and four (5.1%) ‘3+’ (more than 10 AFB per field). Of the 76 who had sputum MGIT culture, 71 (93.4%) were culture-positive, with a mean time to positivity of 10.6 days. Of the 70 that had sputum Xpert Ultra results, 66 (84.6%) were positive, with the following ranking: 17 (24.3%) ‘high’; nine (12.9%) ‘medium’; 13 (18.6%) ‘low’; six (8.6%) ‘very low’; three (4.3%) ‘trace’; 18 unknown. All participants were symptomatic of TB, but well enough to undergo bronchoscopy. Chest imaging findings are presented in Table 1. Of note is that 11 (19.3%) of the Household Contacts group had chest imaging (which included both PET-CT and chest radiographs) which was suggestive of TB. This may be either related to previous TB episodes, or implying possible subclinical TB, as participants were asymptomatic and tested negative for TB on sputum. There were 28 PLHIV included, one of whom had two procedures. Three (10.7%) of these also had chest imaging results consistent with TB despite their sputum negative and asymptomatic status. All PLHIV were on antiretroviral therapy (ART) at the time of bronchoscopy, for a median of 10 years duration (range 0.5–17 years). Of the 18 participants who had CD4 counts within six months before the bronchoscopy, the median was 602 cells/μL (range 149–842; normal range approximately 500–2000 cells/μL). Of the 16 participants with HIV viral loads within 6 months, the median was < 20 copies per ml (range 0–242 copies per ml).

SARS-CoV-2 subgroup

The distribution of SARS-CoV-2 participants across clinical groups is shown in Table 1. Of the 42 participants (43 procedures), 31 (72.1%) were females, and the mean age was 36 years. There were similar proportions of current (20; 46.5%) and never (18;41.9%) smokers. The majority (37; 86.0%) did not use recreational substances, four (9.3%) used cannabis, and two (4.7%) used methamphetamines. Fourteen (32.6%) had a comorbidity, of which the most common was hypertension. Using a composite of both chest imaging modalities, 15 (34.9%) were normal, six (13.9%) were classified as in keeping with active TB, six (13.9%) had subclinical TB, and 16 (37.2%) were abnormal but not in keeping with active TB. Forty participants (95.2%) tested either anti-N antibody positive, anti-S antibody positive, or both. Four (9.5%) tested RT-qPCR positive in the 3 weeks prior to the bronchoscopy, however none of them had symptoms of coronavirus disease-2019 (Covid19) at the time of testing. Only 16 participants (38.1%) reported symptoms of Covid19 at any time (mean time between symptoms and bronchoscopy was 6.4 months; in 18 symptom status was unknown); two of these (4.8% of the group) required hospital admission; none of them had active TB.

BAL yields

Overall, the median BAL volume collected from the procedures was 130 ml (IQR 49 ml), which represents 54.2% of the instilled volume. The median number of BAL cells collected from the procedures was 30.2 × 106 (IQR 38.7 × 106). The median BAL cell concentration (cells/ml) was 0.24 × 106 (IQR 0.34 × 106). The median cell viability (proportion of live cells in the total cell yield) was high, at 96.8% (IQR 2.8%). The analysis showed no significant variability in BAL cell yield over time overall, or when stratified by parent study. There was no significant difference in BAL cell yields between operators performing the bronchoscopy, and no significant variability in their performance over time (Supplementary Fig. S1). We investigated the effect of the BAL volume on the total BAL cell yield and found that when the whole sample was considered there was no significant relationship between the two outcomes. However, when we performed group-specific testing, we found a log-linear relationship in current smokers from the Household Contacts and Community Controls (P = 0.04) and in the TB treatment groups (P < 0.0001 for TB Early Treatment and P = 0.044 for TB Pre-Treatment. This suggests that the true relationship was confounded in sample population analysis, as shown in Fig. 2. There were 333 data points available for the pellet colour analysis: 215 (64.6%) pellets were black, 54 (16.2%) were clear, and 65 (19.5%) were bloody.

Figure 2
figure 2

Scatter plot with regression trends showing the relationship between BAL total cell count and BAL volume yield. (A) The correlation between the log cell count and BAL volume in the Household Contacts and Community Control groups (the two groups without any current active infection), stratified by smoking status, either ‘current’ or ‘other’ (prior smokers and never smokers). The trend line for current smokers indicates an increase of 0.61 × 106 cells collected per ml increase in volume (SE 0.21 × 106 cells, P = 0.004) (B) Correlation between the log cell count and BAL volume among TB groups. The cell yield increases by 0.32 × 106 cells (SE 0.15 × 106, P = 0.044) per ml BAL volume among participants before TB Pre-Treatment (TB pre-treatment), by 0.54 × 106 cells (SE 0.14 × 106, P < 0.001) per ml among TB Early Treatment participants, and only 0.10 × 106 cells (SE 0.13 × 106, P = 0.451) per ml BAL volume among participants at End of Treatment (TB EOT).

Factors affecting BAL yields

Both the Boruta and Random Forest classification identified anatomical lung lobe as having the highest impact on BAL volume yield. In a simple linear model the RML yielded significantly higher volumes compared to all other lobes except the lingula (with RML as reference, RUL -35.4 ml difference in mean, SE 6.4, P < 0.0001; LUL − 22.5 ml difference in mean, SE 6.8, P = 0.001; LLL -30.8 ml difference in mean, SE 13.4, P = 0.022; RLL − 25.6 ml difference in mean, SE 12.5, P = 0.042; lingula -12.0 ml difference in mean, SE 9.3, P = 0.198). Increasing age was associated with a lower volume yield when considered as a continuous variable in a linear model with BAL volume as the outcome (− 0.7 ml / year, SE 0.2, P < 0.001), as was male gender (− 8.2 ml, SE 4.1, P = 0.0447) though the latter became non-significant after correction for multiple testing. When participant age was divided into three categories, pairwise comparisons (robust median-based ANOVA) showed a relationship between these categories and BAL volume, cell yield and concentration, which remained significant after correction for multiple testing (Fig. 3). The Boruta algorithm identified three variables that were most highly associated with total BAL cell yields: smoking status, gender, and pellet colour. On a simple linear model, current smoking caused a 26 × 106 difference in mean cells from never smokers (SE 7.5 × 106, P < 0.0001); male gender a 16 × 106 cells difference in mean cells from female gender (SE 5.9 × 106, P = 0.007); bloody pellets had a − 19.6 × 106 cells difference in mean cells from black pellets (SE 7.3 × 106, P = 0.0119); and clear pellets had -20.1 × 106 cells difference in mean cells from black pellets (SE 9.0 × 106, P = 0.027). The findings for BAL cell concentration were similar. The effect of gender did not stand up to correction for multiple testing, and further interrogation found that the three variables smoking, gender and pellet colour, were highly correlated, explained by the fact that significantly more males were smokers, and smoking is the commonest cause of a black BAL pellet (Fig. 4). To further investigate the effects of smoking on BAL characteristics, pairwise comparisons (robust median-based ANOVA with robust post-hoc t-tests) were performed, showing clearly that although current smokers give the lowest BAL volume yields, they have the highest overall cell yields (Fig. 5). These effects were seen for the black BAL pellet category as well (Supplementary Fig. S2).

Figure 3
figure 3

Bee swarm plots showing the effect of participant age on BAL cell yield, volume, and cell concentration. Age was binned into three categories (18–30, 30–50 and 50–80 y). (A) BAL volume yield; (B) BAL cell yield; (C) BAL cell concentration. PDF, a distance metric based on the probability density function showing distance from the mean. ANOVA on medians: (A) F = 4.2957, P = 0.0195; (B) F = 8.202, P = 0.001; (C) F = 5.2743, P = 0.004.

Figure 4
figure 4

Bee swarm plots showing the relationship between BAL pellet colour, smoking status and gender.

Figure 5
figure 5

Bee swarm plots showing the effect of smoking on BAL volume, cell yield and cell concentration. (A) BAL volume yield; (B) BAL cell yield; (C) BAL cell concentration. PDF, see Fig. 3. ANOVA on medians: (A) F = 6.6511, P = 0.0075; (B) F = 25.754, P < 0.001; (C) F = 31.0721, P < 0.001.

After feature selection identified clinical group as important in the BAL yields, pairwise comparisons (one-way ANOVA on the medians with robust post-hoc t-test) found that the PLHIV group was very different from the other clinical groups (Supplementary Fig. S3). To further interrogate differences between other clinical groups which might have been obscured by the analysis with the PLHIV group, we performed pairwise comparisons (robust 2-way ANOVA on the medians) on the other groups. This suggested some interesting trends in BAL volume and cell yields, from participants’ TB treatment status (Fig. 6). The TB End of Treatment group yielded lower BAL volumes than the Community Controls group and the Household Contacts group: -60.5 ml difference in modified M-estimator (MOM) (P < 0.001) and − 71.3 ml difference in MOM (P = 0.010) respectively. The TB Pre-Treatment group had higher cell yields than the Community Control group (a difference of 41.9 × 106 in MOM of cell counts, P < 0.001), the Household Contacts group (a difference of 29.8 × 106 in MOM of cell counts, P = 0.038), the TB Early Treatment group (a difference of 30.4 × 106 in MOM of cell counts, P = 0.038) and the TB End of Treatment group (a difference of 45.6 × 106 in MOM of cell counts, P = 0.030). The results for comparisons of concentrations were similar. These findings suggest the TB treatment duration affected the cellular content of the BAL. The trend was confirmed on direct comparison of paired Early Treatment and End of Treatment samples (Supplementary Table S2). We performed a two-way ANOVA on MOM between groups with smoking status taken into account, and found that smoking did not affect the interactions reported for BAL volume yield, BAL cell yield or concentrations (Supplementary Table S3). This additional analysis also suggested that differences in smoking status might have confounded a real interaction in the BAL cell yields between the TB End of Treatment group and the Community Controls and Household Contacts (Supplementary Table S3).

Figure 6
figure 6

The effect of participant TB clinical group on BAL yields with smoking status taken into account. (A) BAL cell yield; (B) BAL cell concentration; (C) BAL volume yield. Red dots are current smokers; blue dots are prior smokers and never smokers. P values < 0.005 for interactions between groups (including both smoking categories) are shown as ****. The PLHIV group is not included in this analysis.

Variables which had no effect on the outcomes in the linear models performed after the feature selection suggested they were important, included SARS-CoV-2 status, imaging findings (using a composite of chest radiograph and PET-CT when performed, categorised as ‘normal’, ‘abnormal, in keeping with active TB’, or ‘abnormal but not active TB’), any airway abnormalities observed during the procedure, IGRA result, recreational substance use, and the presence of other comorbidities. Interestingly, one-way ANOVA on the medians (with post-hoc robust t-test) revealed that participants who had airways which collapsed on minimal suction during the procedure (suggesting underlying airways disease) or abnormal secretions (suggesting chronic lung disease or current infection) had a lower volume yield than those with no observable airways abnormalities (− 10.83 ml, SE 4.14, P = 0.037). Both abnormal imaging categories (‘abnormal but not active TB’, and ‘abnormal, in keeping with active TB’) were associated with a lower BAL volume yield than those with normal imaging results (P = 0.004) before correction for multiple testing, but this became non-significant after correction.

Factors affecting BAL pellet colour

ANOVA testing showed that the highest cell counts are found in black pellets, followed by bloody pellets, then clear pellets (Supplementary Fig. S2). If the bloody pellet’s higher cell count represents contamination with blood immune cells rather than a higher number of lung immune cells, this has potentially negative implications for the post-thaw BAL cell viability and the interpretation of any BAL immunoassays. Logistic regression analysis found that the PLHIV group had a 70% greater chance of producing a bloody pellet compared to the Community Controls (Odds ratio 0.3 for bloody pellet in Community Controls compared to PLHIV, P = 0.013). As the only two parent studies to include PLHIV also had brushings during their procedures, we investigated whether the order in which the procedures were performed had any effect on the pellet colour and found that performing the brushing after the BAL (in the same lung lobe) was associated with a significantly lower rate of bloody BAL pellets compared to performing the brushing before the BAL (26.9% bloody pellets and 69.4% bloody pellets respectively, P = 0.001). As the reduced visibility and enhanced cough stimulus after the BAL may compromise participant safety when performing a brushing, our group opted to try a third technique in sampling the ipsilateral lower lobe before the BAL, but this was not successful in reducing the proportion of bloody pellets (Supplementary Fig. S4).

BAL cell differential count

It has been shown that smaller instilled BAL volumes sample only the distal airways whereas larger volumes sample the alveolar compartment, resulting in differences in measured protein and cellular contents of BAL fluid3. We therefore investigated whether the BAL volume yield affected cell differential counts, but found that there was no relationship between the two. On pairwise ANOVA of the means, several significant differences in the differential cell counts between groups were found: (1) older age categories were associated with higher neutrophil proportions (P = 0.003) though the effect size was small; (2) the TB Early Treatment group had the highest proportions of neutrophils of the clinical groups (P = 0.03); (3) clear BAL pellets had the highest proportions of lymphocytes (P = 0.002), black pellets had the highest proportions of neutrophils (P = 0.01), and bloody pellets did not have any significantly enriched cell populations; (4) current smokers had significantly higher proportions of neutrophils and eosinophils; and (5) SARS-CoV-2 positive participants had higher proportions of BAL lymphocytes than SARS-CoV-2 negative participants (P = 0.023). These results are depicted in Fig. 7, and Supplementary Fig. S5. These interactions became non-significant after correction for multiple testing. The BAL differential cell counts in each clinical group are presented in Table 2.

Figure 7
figure 7

The effect of participant clinical group and smoking status on BAL cell differential counts. (A) The effect of participant smoking status on BAL cell differential counts. Orange is never-smokers; red is prior smokers; blue is current smokers. (B) The effect of participant clinical group on BAL cell differential count. Clinical groups are represented by colours, in order: orange is Community Controls; red is Household Contacts; blue is TB Pre-Treatment; forest green is TB Early treatment; olive green is TB End of Treatment; yellow is PLHIV. Figures show the relative proportions of each cell type within the total number of cells counted (which was either 100 or 200 for each sample).

Table 2 Cell differential counts in clinical groups.

Adverse events

The mean procedure duration (counted from intubation with the bronchoscope to extubation) was 11.5 min (range 4–22), with a mode of 10 min. Procedures in which a brushing was done with the BAL were significantly longer than those without (1.23 min; 99.5% CI 0.40–2.04; p < 0.001); the two studies including brushings had more procedures lasting > 12 min (P < 0.05), > 14 min (P < 0.05) and > 15 min (Supplementary Fig. S6).

There were no severe adverse events in this study. Minor adverse events were recorded for 118 (33.1%) participants. Supplementary Fig. S7 shows the minor adverse events, stratified by clinical group. Minor bronchial mucosal bleeding was significantly more common in the PLHIV group (P < 0.001), all of whom also had brushings in the same procedure. Overall, 69 (23.5%) of 293 participants with available data reported still experiencing symptoms at 72 h after bronchoscopy (Supplementary Fig. S8). ‘Throat discomfort’ and post procedure cough were significantly different between groups, being less frequently reported in the TB Pre-Treatment group (P = 0.0264), TB Early- and End of Treatment groups (P = 0.044) respectively. PLHIV were more likely to experience dizziness than other groups (P = 0.010).

Of those who experienced any symptom, eight (2.7%) sought medical attention for the problem. One (0.3%) study participant was admitted overnight to hospital, three (1.0%) received antibiotics, and six (2.0%) received other medications (including anti-inflammatories, antihistamines, steroids or analgesics). At the day 14 assessment, no participants reported any additional problems. Importantly, at the 72 h assessment, 234 (96.7%) of the 242 participants surveyed stated they would be willing to have a repeat procedure; 236 (97.5%) reported that their overall experience of the bronchoscopy was tolerable; and 239 (98.8%) participants reported that the bronchoscopy procedure was better than they expected. Results were similar at the day 14 assessment.

Discussion

Our findings show that in a sample of over 330 research BAL procedures across diverse clinical groups made up of predominantly current tobacco smokers, the research team may expect a median yield of approximately 130 ml BAL fluid, or 54% of instilled volume, containing 30 million immune cells, with some exceptions. Using our methods, the pre-cryopreservation cell viability was consistently high between procedures, laboratory personnel and bronchoscopy operators. The procedures conducted in a fully equipped, dedicated bronchoscopy suite were safe, with no severe adverse events, with only 2.7% of participants experiencing symptoms which required medical attention after the bronchoscopy. Performing brushings before the BAL, in the same segment or in the ipsilateral lower lobe, is not advised because of the high rate of blood contamination compared to performing it after the BAL, when the risk of serious adverse events is theoretically higher. Our current practice—informed by these results—is to do the brushing after the BAL, with predetermined safety checks both during the procedure and afterwards, including both a thoracic ultrasound and prolonged observation period.

Consistent with the existing guidelines, the RML yielded the highest BAL volume. However, the factors determining how many cells were present in that volume of fluid were more complex. Smoking appeared to be the single most important factor affecting yields from these procedures. Although current smokers gave consistently lower volume yields than never smokers, their immune cell yields were significantly higher—presumably across all cell types except lymphocytes (which were highest in never smokers)—and increased proportionately with volume yield. Participant age appears to be another important factor. Increasing age was associated with not only lower volume yield but also lower cell yield (as well as lower cell concentration, indicating that the drop in cells is independent of the drop in volume), and a higher proportion of neutrophils within their cell population. Our findings confirm those of Koda and colleagues15 who identified an anatomical lung lobe target other than the RML or lingula, having a high pack year smoking history and older age as being negatively correlated with BAL volume yield, which they defined as a less than 30% return of instilled BAL volume in their group of 401 patients with diffuse parenchymal lung disease. Olsen et al.16 found that there was a similar age-dependent decrease in the BAL volume yield, but, in their group of 295 never-smoking healthy volunteers, the cell yield and differential counts were independent of age, gender, season and anatomical lung lobe. Karimi et al.7 compared the cell yields in 132 current smokers, 44 prior-smokers and 295 never smokers and concluded that BAL cell recovery was dependent on cumulative smoking history, which they also quantified in pack years. Whilst the pack-years information was available to us in this analysis, in our experience this metric is highly unreliable as an objective measure of smoking intensity, and we therefore chose to exclude it. Karimi et al.7 also found that BAL volume recovery was negatively correlated with age, and interestingly that smoking men had a lower BAL volume recovery than smoking women7. Men also yielded lower volumes and cells in our results, though neither of these stood up to correction for multiple testing and the high correlation between gender and smoking status suggested this finding was more likely attributable to the difference in smoking status between genders.

The effects of TB on these outcomes were interesting: participants with TB who were untreated or early in treatment yielded the highest cell counts, and there was a steep proportional increase in the number of cells with increasing volume. The practical outcome is that a participant with TB may disappoint the bronchoscopist with a mere 80 ml volume yield, but within that volume are more immune cells than a ‘decent’ 130 ml yield from a healthy Community Control participant, particularly if that Community Control is a non-smoker. This was not the case for participants who had reached the end of their TB treatment. In general, those at the end of treatment had both lower volume and lower cell yields, possibly attributable to chronic TB-related airway inflammation and damage reducing the volume return, but with little residual active inflammation at the time of lavage, and so fewer immune cells. A previous publication from our group which examined the performance and immune characteristics of BAL in 58 participants with active TB, other lung diseases, household contacts and healthy controls, found a similar significantly lower BAL volume and higher BAL cell yield in the TB group compared to the others, with higher proportion of neutrophils17, but the current study is the first to provide information on the effects of treatment on these outcomes. An important caveat is that the observed differences in cell differential counts may not translate into differences in immune cell phenotypes and function, and that smoking is still the factor with the highest impact on the cell yields.

From the data one might infer that HIV infection results in a high volume yield with low immune cell content, which has a high proportion of lymphocytes, but there may be a selection bias to consider, as the group of PLHIV that was included were mainly non-smoking middle-aged women who were well-controlled on ART. The lung is a known reservoir for HIV, resulting in pulmonary immune dysregulation, chronic inflammation, and a higher rate of infectious and non-infectious lung diseases18. Since the dysregulation is only partially restored by ART, our findings would need to be confirmed by a comparison to participants with untreated HIV, smokers living with HIV, and never-smoking community controls. Performing BAL on participants with untreated HIV may, however, result in a higher rate of post-procedural infections, representing a real safety risk for these people, and it should therefore probably be avoided outside of a clinical context19.

Reassuringly for research projects in the post- Covid-19 pandemic era, there were no significant observable effects of recent mild SARS-CoV-2 infection on our BAL yield outcomes, other than a slightly higher proportion of lymphocytes in the BAL fluid. Importantly though, we cannot make the same conclusion across all severities of SARS-CoV-2 infection, and since no functional assays were included in this analysis it remains unclear how the host lung immune profile is affected by recent SARS-CoV-2.

Overall, our study was limited by the need to select generally healthy candidates for bronchoscopy, potentially excluding important groups such as those with advanced HIV infection and the elderly. As demonstrated above, our results are confounded by numerous important and interrelated variables including the participant’s smoking status, recreational drug use, whether a brushing was done in the same procedure, and even age and gender in some groups. Teasing apart the individual effects of these variables on the BAL outcomes is complex, and there may be additional confounders which we have not accounted for. Regarding our assessment of factors affecting the BAL total cell yield and viability, the use of the red cell lysis buffer ACK may adversely affect both these outcomes. As red cell lysis was only included as an optional step in our BAL processing, this may have affected the results in some samples. There was a significant reduction in the cell yield from red pellets compared to black pellets, but not compared to clear pellets. This is reassuring, but there is a possibility that some pellets which were both black and bloody were labelled as bloody only, meaning that the effect size of the red cell lysis step could be minimized by the fact that black pellets start out with more cells. Lastly, our BAL cell viability was only assessed pre-cryopreservation, and post-thaw viability is yet to be established for these samples.

Conclusions

Bronchoscopy with BAL is an invaluable research tool in the study of host immune responses to respiratory infectious disease. Our results show that BAL for research purposes alone, without anticipated clinical benefit, is safe, and even in the presence of current or recent infection with TB, HIV and SARS-CoV-2, the BAL volume yield, cell yield, concentration and viability are consistent and reasonably predictable. Researchers specifically need to consider their target population’s smoking status (and possibly gender), age range, HIV status, and duration on TB treatment when selecting end assays which require a large number of immune cells. Clinicians performing the bronchoscopy should wherever possible aim for the RML (with lingula as an acceptable alternative), anticipate a lower volume yield and consider a higher instilled BAL volume wherever a different lobe is targeted. Additionally, if a brushing is planned during the same procedure, then a strategy to avoid contaminating the BAL with blood whilst optimising participant safety, should be identified beforehand. Whether the clinical variables which this study has identified as important in determining BAL volume yield, BAL cell yield, and pre-cryopreservation viability also impact the immune cell phenotype, function, and post-thaw viability, is yet to be established.