Introduction

Lung cancer is the second most common cancer and is by far the leading cause of cancer death in both men and women. In 2010, 201,144 new cases of lung cancer were diagnosed in the USA, with a death toll of 158,248 persons [1].Footnote 1 Despite progress in lung imaging and advances in radio-chemotherapies, the 5-year survival rate of patients with non-small cell lung cancer (NSCLC) remains low: 16 % in the US and even less in the UK (7.8 %) [2]. Surgical resection is still the best therapeutic option that can be offered in 10–25 % of patients with early stages of NSCLC. In the remaining patients, the disease is considered too advanced or comorbidities are judged too severe, precluding any benefit in terms of postoperative survival and quality of life [3, 4]. The current rate of operative mortality ranges between 1.6 and 7 % with a high incidence of cardiopulmonary complications (20–40 %) resulting in prolonged hospital stay and increased health care costs [5, 6]. Scoring tools predicting operative mortality emphasize the importance of age, performance status, procedure type, lung function and comorbidities (Table 1) [7∙, 8, 9∙].

Table 1 Prediction score of 30-day mortality after lung surgery (Thoracoscore)

Cardiac Assessment

Supra-ventricular arrhythmias are the most frequent postoperative complications (5–25 %), particularly after pneumonectomy and in elderly patients [10]. Other less frequent cardiac complications, such as myocardial infarction, heart failure and ventricular fibrillation (2–3 %), are the second cause of operative death after respiratory failure [11, 12]. As a consequence, cardiac risk stratification should always be performed preoperatively. Based on the guidelines of the American College of Cardiology and the American Heart Association [13], surgery should be deferred (or cancelled) in patients with major risk factors (e.g., recent myocardial infarction, ongoing ischemia, untreated heart failure), whereas those with intermediate risk factors should undergo further medical assessment and therapeutic optimization (Fig. 1). The revised cardiac risk index entails six independent risk predictors of cardiac complications (major surgery, coronary artery disease, heart failure, diabetes mellitus, renal dysfunction and history of stroke). A modified version has been proposed and validated specifically for thoracic surgical patients [14, 15].

Fig. 1
figure 1

Algorithm for preoperative assessment of patient undergoing lung surgery

Patients with coronary stents requiring aggressive antiplatelet therapy represent a major challenge given the risks of perioperative bleeding and intra-stent thrombosis [16]. Patients with pulmonary hypertension (PH) also deserve special consideration given the increased right ventricular load resulting from one-lung ventilation and surgical resection. Although increased mortality has been demonstrated in patients with moderate-to-severe PH undergoing noncardiac surgery [17, 18], favourable outcomes have recently been reported following lobectomy in patients with moderate PH and normal right ventricular function [19].

Pulmonary Assessment

Multivariate analysis in large cohorts of non-cardiac surgical patients revealed that the occurrence of postoperative pulmonary complications was mainly associated with the severity of the lung disorder [e.g., fibrosis, asthma, chronic obstructive lung disease (COPD), recent pneumonia, pulsed oxygen saturation less than 93 %], heart failure, morbid obesity, impaired physiological condition (poor exercise capacity, recent weight loss, anemia, hypoalbuminemia) as well as corticosteroid treatment, smoking habits and alcohol consumption [6, 12] [2022].

Lung Volumes, Gas Flow and Airway Resistance

With modern pneumotachographs, inspiratory and expiratory airflow rates can easily be measured and digitally converted into lung volumes (\( {\text{V }} = \int {{\text{Vd}}t} \)). Flow/volume curves are displayed, and a tabular chart includes the following items: forced or slow vital capacity (FVC, SVC), forced expiratory volume over the first second (FEV1), ratio of FEV1:FVC, peak airflow (PEF) and forced expiratory flow at 50, 75 or 25–75 % of lung vital capacity (FEF50, FEF75, FEF25–75) [23]. These physiological parameters are expressed in raw values and percent predicted as compared with subjects having similar characteristics (body weight, height, age and gender) [24].

Whole-body plethysmography and insoluble gas dilution techniques (using nitrogen or helium) are common methods that are used to assess “non-mobilizable” volumes, [functional residual volume and the residual volume (RV)]. The difference in measured lung volume between the two techniques is useful to estimate the volume of emphysematous bullae.

In restrictive pulmonary disorders (e.g., interstitial fibrosis, morbid obesity, neuromuscular disease), all lung volumes are decreased but the airway conductance is minimally altered. In obstructive pulmonary disorders, limitation in maximal airflow rates is best characterized by reductions in FEV1 and the FEV1:FVC ratio with RV and FRC being increased in emphysema while remaining relatively unchanged in chronic bronchitis [25, 26∙, 27]. Repetition of spirometric tests following inhalation of bronchodilating or bronchoconstrictive agents may detect hyperreactive airways and ascertain the efficacy of treatment with β2-adrenergic agonists, anticholinergic agents or steroids. Analysis of inspiratory/expiratory flow-volume curves may be helpful to document the severity and the site of airway obstruction (small/large airways, intra-/extrathoracic). Diagnosis and assessment of the severity of COPD are largely based on clinical signs (dyspnea, sputum production) and post-bronchodilator FEV1/FVC and FEV1 measurements (Table 2) [26∙, 28]. Monitoring the disease progression and predicting life expectancy can best be achieved using multidimensional instruments such as the BODE index, which takes into account the severity of airway obstruction, dyspnea, age as well as nutritional status and exercise tolerance [29, 30].

Table 2 GOLD classification of chronic obstructive pulmonary disease

Lung Gas Exchange Assessment

The diffusion capacity of carbon oxide (DLCO) reflects the overall ability of the lung to transport gas into and out of the blood. The strong affinity of hemoglobin for carbon monoxide (CO), combined with the enormous capacity of the red cell mass to absorb CO, makes the uptake of CO less dependent on cardiac output. During the test, the patient quickly and deeply inhales an air mixture of 0.3 % CO and 10 % helium, holds the breath for 10 s, and then exhales in less than 4 s [31]. After discarding the first 0.75 l (dead space), alveolar samples of the exhaled gas are continuously analyzed to calculate CO uptake, which depends on the alveolar capillary barrier transfer (DM) and the volume of capillary blood (Vc). Footnote 2The test is repeated after 4 min, and values are corrected for hemoglobin concentration and alveolar ventilation (KCO); these repeated measurements should not differ by more than 2 ml × min−1 mmHg−1 [32].

Typically, DLCO and KCO are reduced in pathological conditions where the endothelial-alveolar barrier is impaired owing to fluid overload, inflammatory infiltrates or collagen deposits (e.g., cardiogenic edema, capillary leak syndrome, fibrosis) [33].

Arterial blood gases are usually analyzed at the time of DLCO measurement or cardio-pulmonary exercise testing. A landmark cohort study in the mid-1970s showed that preoperative hypercapnia was associated with prolonged postoperative ventilation [34]. More recent studies have failed to identify impaired CO2 clearance as a risk factor for unplanned admission to the ICU or need for mechanical ventilator support after nonthoracic surgery [35]. As hypercapnia (PaCO2 > 7.5 kPa) is a key feature of respiratory failure from any cause, these patients are usually not considered for major lung resection and are referred to non-surgical treatments and/or rehabilitation programs.

In contrast with other functional parameters, arterial oxygen pressure (PaO2) is weakly influenced by aging [36]. Hypoxemia is a common marker of respiratory and/or cardiovascular impairments. In the recent Spanish cohort ARISCAT study [20], a low preoperative value of arterial pulsed oxygen saturation (SpO2 < 94 %) was identified as a strong predictor of pulmonary complications after non-cardiac surgery. The associated increase in the alveolar-arterial PO2 gradient (AaPaO2 > 10 mmHg) often reflects regional heterogeneities in ventilation/perfusion that are associated with COPD or restrictive lung diseases [37]. Hypoxemia refractory to O2 therapy is a characteristic feature of a true shunt or “venous admixture” that can be related to congenital vascular abnormalities, paraneoplasic syndrome, pleural effusion, large atelectasis or liver failure [38].

There is a poor correlation between FEV1 and DLCO. For example, low DLCO values (<80 %) are encountered in up to 50 % of patients with normal FEV1 [39]. Hence, guidelines from the British Thoracic Society (BTS) [40], the American Chest Physician (ACCP) and the European Respiratory Society (ERS) [41] all recommend that lung volumes and DLCO should be measured in patients undergoing lung resection.

In the early stages of interstitial infiltrative lung diseases, spirometric results are barely affected, whereas DLCO already exhibits significant impairment [42]. The reverse holds true in COPD where FEV1 and the FEV1:FVC ratio both reflect the severity of the disease, while impairment of the diffusion capacity occurs at the more advanced stages. The DLCO also helps in the differential diagnosis of restrictive lung disease: normal values are reported in patients with extrapulmonary causes (e.g., obesity, neuromuscular disease, pleural effusion), and low DLCO values are found in interstitial pulmonary disorders.

Estimation of Postoperative Respiratory Function

A simple method of estimating the predicted postoperative FEV1 and DLCO (ppoFEV1 and ppoDLCO, respectively) is based on counting the number of functional segments (nine in the left lung and ten in the right lung) or subsegments. Assuming that all lung areas contribute equally to the overall lung function, the preop FEV1 value (or preop DLCO) is multiplied by the number of segments (or subsegments) remaining after surgery [43]. Correction can be made by subtracting the number of obstructed segments as seen at CT scan or bronchoscopy.

Quantitative perfusion scans represent the reference methods for assessing the contribution of each lung (or part of the lung) to overall function [44]. After administration of intravenous macroaggregates labelled with technetium, the perfusion of the radiotracer(s) is measured through the lungs, and specific formulas are used to calculate ppoFEV1 or ppoDLCO.Footnote 3 More recently, quantitative computed tomography, magnetic resonance scanning, single photon emission computed tomography (SPECT) or a combination of them have yielded acceptable results in predicting ppoFEV1 as compared with perfusion scintigraphy [45, 46].

The BTS recommends that imaging-based calculation of residual lung function should only be performed in patients with an FEV1 < 80 % and scheduled for pneumonectomy [40]. In contrast, the more recent ERS and ACCP guidelines recommend that all patients with borderline function (FEV1 or DLCO < 80 % and limited exercise capacity) and scheduled for any major resection (lobectomy or pneumonectomy) should undergo imaging-based calculation of ppoFEV1 and ppoDLCO [41, 47∙].

The maximal decline in FEV1 (approximately −20 to 40 % after lobectomy, −40 to 60 % after pneumonectomy) is reached in the immediate postoperative period when the depressive respiratory effects of anesthesia are added to the effects of lung resection [48]. This functional impairment can be attenuated by minimally invasive surgical approaches, intraoperative ventilatory protective strategies and fast-tracking anesthetic regimens [22, 4952]. Functional recovery continues for approximately 6 months after lobectomy (FEV1 up to 84–91 % of preoperative baseline values, DLCO up to 89–96 %, VO2max up to 89–100 %), while it is usually limited to 3 months after pneumonectomy (FEV1 up to 64–71 %, DLCO up to 77–80 %, VO2max up to 72–80 %) [53]. Importantly, an inverse relationship has been found between baseline severity of airflow obstruction and postoperative reduction in FEV1. Patients with severe heterogeneous emphysema eligible for lung volume reduction surgery represent a unique subset of individuals who will tolerate curative-intent lobar or segmental resection with acceptable perioperative mortality and long-term survival [54].

Assessment of Physical Fitness

Cardiopulmonary Exercise Testing (CPET)

Formal CPET represents the “gold standard” method to assess the patient’s physiological reserve. The scientific rationale is that “unfit” subjects might not be able to sustain the postoperative physiological impairments and the neuroendocrine and inflammatory responses induced by the surgical stress, this subset of patients being therefore at highest risk of complications [55].

This non-invasive test is routinely performed on a bicycle ergometer or on a treadmill. CPET explores the functional interactions among the lungs (gas exchange), cardiovascular system (oxygen transport) and skeletal muscles (O2 diffusion and utilization, aerobic metabolism). After baseline measurements and a 3-min “warm-up” period, the workload is progressively incremented (25-W steps) over 10–15 min until the limits of exertion have been reached or the predicted maximal heart rate achieved. The CPET is interrupted earlier if myocardial ischemia, high blood pressure or cardiac arrhythmias are detected or if the patient complains of dizziness, fainting or angina pectoris [56].

During CEPT, the physiological reserves of the heart, lungs and skeletal muscles are quantified by monitoring ECG changes, heart rate and blood pressure responses, by measuring respiratory volumes, breath-by-breath O2 uptake and CO2 production as well as grading dyspnea and subjective feelings [57]. The maximal oxygen consumption (VO2max) achieved at the highest workload is expressed in relation to the total/ideal body weight (in ml × min−1×kg−1), as a percentage of a predicted normal value (% VO2max pred), and it can be converted in the metabolic equivalent task (MET = VO2max/VO2rest). The predicted normal VO2max takes into account patient’s age, gender and height. In addition to VO2max, the peak heart rate (peakHR) and HR reserveFootnote 4, O2 pulsed saturations (VO2/HR), maximal minute ventilation, breathing equivalents (VE/VCO2 and VE/VO2), anaerobic threshold (AT) and respiratory gas exchange ratio (VCO2:VO2) are calculated to characterize the patient’s aerobic capacity and physical fitness [58].

Poor tolerance to aerobic physical activities is a good indicator of general health status, particularly among COPD patients [59]. Interpretation of abnormal results during CPET is helpful for detecting significant coronary artery disease, ventricular or valvular dysfunction, gas exchange or ventilation abnormalities as well as muscle deconditioning [60, 61]. Physically “inactive” subjects and those with infectious and inflammatory disorders (e.g., cancer, atheromatosis, COPD, congestive heart failure) are prone to develop skeletal muscle dysfunction characterized by reduced muscular mass (sarcopenia), poor muscular strength and early release of lactate during exercise (AT < 70 %) [62]. Both VO2max and AT are enhanced following endurance physical training [63].

Alternative Assessment of Exercise Capability

Complete laboratory CPET is time consuming; it requires technical equipment and specialized resources, which are not readily available in many hospitals. Therefore, low technology exercise tests have been developed to provide surrogates for assessing patient’s physical fitness [64].

Stair climbing is the simplest test where the patient is asked to climb the stairs as quickly as possible—and without pause—until fatigue, maximal dyspnea (Borg scale) or uncomfortable symptoms are achieved. Physical fitness is rated as the number of stairs or preferably the height of ascent (meters). The speed of ascent (>15 m/min) and the ability to climb at least 20–22 m (or 5–6 flights of stairs) correlates with a VO2max > 20 ml × min−1×kg−1 (Table 4) [65, 66]. The HR and pulsed O2 saturation (SpO2) are monitored throughout the test. The inability to climb three flights of stairs or the occurrence of oxygen desaturation (>−4 %) is associated with poor prognosis and warrants further investigation.

The six-minute walk test (6MWT) consists of walking indoors as far as possible in 6 min. In many hospitals and ambulatory settings, the test is performed under the supervision of a physical therapist, along a flat straight corridor with markers to indicate the distance. Although the 6-min walking distance has been shown to provide reliable estimates of VO2max in healthy subjects and transplant candidates, equivocal results have been obtained regarding outcome following thoracic surgery [47∙].

The shuttle walk test (SWT) is an adaptation of the 6MWT where the patient walks back and forth around two cones placed 9 m apart, so that the total course approximates 10 m. Audio signals are given to the patient to stimulate the walking pace, and the test is interrupted when the patient is too breathless or after 12 min. The SWT test is more reproducible and more correlated with VO2max than the 6MWT although it tends to underestimate VO2max in the lower range [67]. Several studies have shown that walking distances of 200 and 400 m relate to VO2max of 10 and 15 ml × min−1×kg−1, respectively [68].

CPET, SWT and symptom-limited stair climbing might not be possible in some patients owing to musculoskeletal or neurological disabilities. Questionnaires related to daily living activities [MET, Karnofsky index, Duke Activity Status Index (DASI)] are valuable surrogates of exercise capacity testing although data involving surgical patients are still lacking. The DASI was found to correlate with VO2max [69], and a physical activity level exceeding eight MET was associated with a lower risk of cardiovascular events in the general population (Table 3) [70].

Table 3 Duke Activity Status Index (DASI)

In elderly and frail patients, simple tests of active mobilization such as the gait speed test (time needed to walk 5 m distance), recording of all movement through the day (podometer) and the mini-mental test have been shown to increment the predictive power of clinical risk algorithms [71]. Recent advances in monitoring technology have also provided non-invasive methods of estimating cardiac output (finger cuff technique), which can be used on patients unable to take part in traditional exercise regimes. These devices may in the future play a role in estimating cardiac reserve during limited ward based exercise.

Predictive Power of Functional Tests for Postoperative Mortality and Morbidity

Using absolute values of FEV1, DLCO or VO2max is inappropriate, particularly in the elderly, who present an age-related decline in the physiological reserve, as well in females and in patients with short stature. These key functional parameters should be indexed to ideal body weight or referred to subjects with similar demographic features (% of predicted).

Although a low FEV1 value (40–80 % of predicted) is a good marker of COPD, contradictory results have been published regarding its predictive power for major postoperative complications [6, 72]. In contrast, a low diffusion capacity (DLCO < 60 or 80 % predicted value) has been consistently associated with increased mortality and morbidity, higher admission rates in intensive care units and a worse long-term quality of life in patients undergoing major resection through thoracotomy but not through thoracoscopy [73, 74].

For pneumologists and thoracic surgeons, ppoFEV1 and ppoDLCO play a pivotal role regarding the decision to proceed to further investigations and type of treatment. Taking into account the lung functional condition and extent of the resection [73], the ppoFEV1 or ppoDLCO should not be lower than 30 % to maintain an acceptable quality of life. In emphysematous patients, a favorable “volume reduction” effect may take place resulting in unchanged or even improved FEV1 and DLCO values [75]. Mortality rates ranging from 5 to 60 % have been reported when ppoFEV1 was lower than 40 %, and this large mortality range underlines the implication of other confounding factors in relation to concomitant organ dysfunctions and perioperative optimizing interventions.

For the anesthesiologists, quantitative or qualitative assessment of aerobic exercise capacity is of paramount importance in risk stratification. Patients with preserved physiological reserves are better able to face stressful perioperative situations such as acute lung injuries, nosocomial infections, acute bleeding, broncho-pleural fistula and myocardial ischemia, which are mainly incriminated in perioperative death. Conversely, “unfit” patients are less capable to provide a satisfactory increase in O2 transport to meet the increased surgical stress-induced metabolic demand. Pre-existing disturbances in cardiac performance, respiratory function and oxygen utilization within the skeletal muscles may even worsen during the early postoperative period as a result of inadequate pain control, residual effects of anesthesia, fluid overload, ventilator-induced lung injury, surgical nerve damage, reflex inhibition of diaphragmatic function and muscular fatigue associated with systemic inflammation and enhanced protein catabolism [7678].

So far, several prospective studies have demonstrated that poor exercise tolerance, low VO2max and reduced AT (<11 ml × kg−1×min−1) are all independent predictive factors of cardiovascular and pulmonary complications as well as early and late mortality following various surgical procedures (e.g., transplantation; bariatric, visceral, urological surgery interventions) [7982]. Moreover, the evaluation of physical fitness (in MET) has become a reliable indicator of health status and a good predictor of cardiovascular events [83].

In patients undergoing lung cancer surgery, determination of VO2max outperforms all other functional tests (e.g., FEV1, DLCO, left ventricular ejection fraction) regarding outcome prediction and plays a key role in identifying patients deemed unsuitable for surgery and those who might benefit from therapeutic interventions [8486]. A VO2max of 15–16 ml × kg−1×min−1 is a reliable threshold to discriminate between patients at low-moderate risk and those at high-very high risk of major postoperative complications. A preoperative VO2max > 20 ml × kg−1×min−1 (or >22-m ascent at stair climbing) is associated with a low risk of serious adverse events whereas patients with VO2max < 10 ml × kg−1×min−1 (or <3–5 m ascent at stair climbing) are at a very high risk for postoperative complications (Table 4) [84, 8689].

Table 4 Aerobic physical capacity and perioperative risk of complications

Conclusions

So far, only a minority of patients with lung cancer undergo curative surgical resection (<25 %) as the diagnosis is established too late and in some patients the risk of perioperative complications outweighs the benefits of resecting a “curable” tumor.

The goal of preoperative functional evaluation is the following:

  1. (1)

    To estimate the loss of pulmonary function and the consequent limitations in quality of life following lung resection.

  2. (2)

    To predict the risk of early mortality and major complications as well as long-term survival.

  3. (3)

    To identify patients who might benefit from risk-reducing interventions.

Postoperative lung function (ppoFEV1 and ppoDLCO) can be evaluated by combining the results of chest-CT scans with measurements of lung volumes (FEV1) and diffusion capacity (DLCO). With the recent advances in minimal-invasive surgery, the early reduction in pulmonary function can be partially attenuated while fast-track anesthesia and rehabilitation programs may further speed up the recovery process.

Assessment of cardiac function and aerobic capacity is of paramount importance for clearing the patient for surgery and also for guiding further optimization therapy. Strong evidence indicates that a VO2max > 20 ml × kg−1×min−1 (or >22 m climbing ascent) qualifies the patient for any major lung resection whereas a VO2max < 10 ml × kg−1×min−1 identifies the “very high-risk” patient. Preliminary reports lend support to the benefits of a preoperative short-term rehabilitation program. Indeed, significant improvements in VO2max and in muscular strength have been demonstrated among patients with NSCLC undergoing a high-intensity training program [90]. Further prospective randomized trials are warranted in this field in order to increase the number of candidates for curative surgical resection.