Introduction

Acromegaly is almost always caused by a growth hormone (GH)-secreting pituitary tumor and is associated with increased morbidity and mortality [1]. Prompt treatment is essential in order to abrogate potentially life-threatening complications. Clinical practice guidelines advocate a multi-component therapeutic approach which includes lowering of both serum GH and insulin-like growth factor 1 (IGF-1) levels, tumor volume reduction, and amelioration of signs, symptoms and co-morbidities [13]. Somatostatin analogues, such as long-acting formulations of octreotide (octreotide LAR) and lanreotide (lanreotide Autogel), are recommended as first-line medical treatment for patients with acromegaly [2]. Octreotide LAR and lanreotide Autogel have been evaluated for the treatment of acromegaly in numerous clinical trials, with heterogeneity in reported biochemical response rates [4]. Studies of octreotide LAR prior to 2006 suggested that 50–70 % of patients with acromegaly respond to octreotide LAR [59]. In line with this, in a meta-analysis by Freda et al. [10] examining trials published before 2004, overall response rates for octreotide LAR were determined to be 57 % in terms of GH control and 67 % in terms of normalization of IGF-1. Additional studies of octreotide LAR [11, 12] and lanreotide Autogel [13, 14] published shortly after reported biochemical response rates as high as 70–80 %. More recently, however, response rates in prospective clinical trials of octreotide LAR, lanreotide Autogel and the multireceptor-targeted somatostatin analogue pasireotide have been substantially lower (17–41 %) [1523].

The purpose of this review is to examine and attempt to account for the differences in response rates reported in the literature for first-generation somatostatin analogues. Where appropriate, examples will be provided to illustrate how various factors may affect reported outcomes. Clinical study results will also be discussed in order to help practitioners treating patients with acromegaly put results from published clinical trials into context.

Selection of studies for review

PubMed was searched for English-language studies evaluating patients with acromegaly treated with octreotide LAR or lanreotide Autogel from 1990 (i.e., prior to the development of either drug formulation) to March 2015. Search terms used were “acromegaly and octreotide” and “acromegaly and lanreotide”; results were filtered for “clinical trial”. Studies that were included had ≥10 patients, a minimum duration of 24 weeks, used octreotide LAR or lanreotide Autogel as either first-line therapy or after previous surgery, radiotherapy or medical therapy, and reported GH and/or IGF-1 data as the primary objective of the study.

Of the 190 “clinical trials” found using “acromegaly and octreotide”, 18 studies fulfilled our criteria for analysis (Table 1) [59, 11, 12, 1419, 2428]. Of the 83 “clinical trials” found using “acromegaly and lanreotide”, 15 studies fulfilled our criteria for analysis (Table 2) [13, 14, 2023, 2936].

Table 1 Summary of octreotide LAR studies in acromegaly: study parameters and biochemical endpoints
Table 2 Summary of lanreotide Autogel studies in acromegaly: study parameters and biochemical endpoints

Factors that may impact on response rates

A recent meta-analysis by Carmichael et al. [37] that looked into how various aspects of acromegaly clinical trial methodology impact on reported response rates to somatostatin analogues concluded that year of publication, study duration and prior somatostatin analogue use significantly affected response rates. This analysis looked at how single factors might affect outcomes, rather than a combination of factors. There are many aspects to clinical studies that can potentially affect reported response rates, including the definition of response, the characteristics of the enrolled patients, whether or not all of the patients recruited into the study were included in the analysis, and whether the study was prospective or retrospective, among others. Most studies will be associated with a number of these factors, all of which can affect reported response rates.

Patient population

Clinical trials of any drug in any disease area select patients for enrollment into the study through a series of inclusion and exclusion criteria. These criteria are necessary to ensure that the correct patient population is treated for the hypothesis being examined, and in acromegaly may include such factors as sex, age, previous treatment history (e.g., transsphenoidal surgery, radiotherapy or medical therapy), and the presence or absence of other medical conditions.

Evaluating a drug effect in different subpopulations of patients with acromegaly may result in differing outcomes. In early studies of octreotide LAR and lanreotide Autogel, most studies included patients who were known responders to the drug (Tables 1, 2). As octreotide LAR and lanreotide Autogel were new formulations of the available drugs, this inclusion criterion was chosen to show that the new formulations were as effective as the older formulations in order to gain market approval. However, if taken out of context, the reported response rates may appear relatively high when compared with more recent results in which patients were not selected for prior responsiveness. For example, early studies of octreotide LAR (Lancranjan et al. [8], [9]) and lanreotide Autogel (Caron et al. [29]), which included patients known to be responsive to subcutaneous (sc) octreotide immediate release and lanreotide slow release (SR), respectively, showed GH response rates of 55–69 % [8, 9, 29] and IGF-1 response rates of 48–65 % [8, 9, 29]. Most of the patients in these studies had also received prior transsphenoidal surgery and/or radiotherapy.

More recently, somatostatin analogues have been evaluated as first-line therapy, in which the enrolled patients had not previously received any treatment for acromegaly. The percentage of patients achieving the composite endpoint of both GH and IGF-1 control after 1 year of treatment in prospective studies of patients with previously untreated acromegaly was 17–27 % with octreotide LAR [1618] and 33–54 % with lanreotide Autogel [22, 33]. Additionally, the recent randomized, double-blind study of pasireotide LAR versus octreotide LAR by Colao et al. [19] in patients who were either de novo or had received transsphenoidal surgery (but no medical therapy and no radiotherapy within 10 years) reported that 19 % of octreotide LAR recipients and 31 % of pasireotide LAR recipients achieved both GH and IGF-1 control after 1 year of treatment. When patients were stratified by prior treatment, the response rates in de novo patients were 17 and 26 % for octreotide LAR and pasireotide LAR, respectively, compared with 22 and 39 % in patients who had received prior surgery. Interestingly, in a 52-week study of lanreotide Autogel, 55 % of patients who had received prior treatment with a somatostatin analogue achieved biochemical control, compared with 20 % of somatostatin-analogue-naïve patients [20].

Although there are other factors that may have contributed to the relatively low response rates in these more recent studies, these results suggest that response rates in studies of patients with de novo acromegaly are generally lower than response rates in prospective studies of patients who have previously received transsphenoidal surgery and/or radiotherapy and/or somatostatin analogues. This is consistent with a number of studies that showed that by reducing tumor mass and, therefore, decreasing basal GH secretion, subsequent control of GH and IGF-1 levels with somatostatin analogues may be improved [3842].

Heterogeneity in the definition of biochemical control

The two parameters of biochemical control, suppression of excess GH secretion to predefined threshold levels and normalization of serum IGF-1 levels, are used as the primary markers of efficacy in most clinical trials of somatostatin analogues in patients with acromegaly. From the studies fulfilling the search criteria (Tables 1, 2), target threshold levels are generally similar across all the studies (the majority use GH ≤ 2.5 µg/L and normal IGF-1 for age and sex). However, studies published before 2006 generally reported response rates for GH and IGF-1 separately [59, 24, 25, 30], rather than the percentage of patients who achieved control of both GH and IGF-1 levels (Fig. 1).

Fig. 1
figure 1

Biochemical response rates to octreotide LAR and lanreotide Autogel reported in the medical literature by publication year and stratified according to whether GH and IGF-1 were reported as separate efficacy endpoints or as a composite efficacy endpoint. M = prior surgery but medically naive; N = treatment naive; T = previously treated with surgery and/or radiotherapy and/or medical therapy

Evaluation of GH or IGF-1 separately, rather than as a composite endpoint, was employed in ‘early’ clinical studies (1996–2006) [59, 24, 25, 30] and resulted in response rates that were relatively high compared with those in studies reported after 2006 (Fig. 1). The literature around this ‘early’ period began to report that patients with GH levels ≤2.5 µg/L did not necessarily achieve normalization of IGF-1 levels [43]. Indeed, approximately 25 % [6, 43] of patients were suggested to have discrepant GH and IGF-1 control (control of GH without IGF-1 control, or vice versa). As a consequence, the measurement of a composite efficacy endpoint of control of both GH and IGF-1 levels was advocated [44] and subsequently recommended by clinical practice guidelines for assessing biochemical response [1]. These findings likely prompted a shift away from only reporting GH and IGF-1 separately towards the more stringent composite measure and, as might be expected, the majority of the more recent studies (2007–2014) [6, 11, 12, 1519] are generally associated with relatively low response rates (Fig. 1). Thus, heterogeneity in the definition of biochemical control may be a contributing factor giving rise to differences in reported response rates.

Study design and nuances of the study protocol

Prospective versus retrospective

Although both prospective and retrospective studies provide valuable data, retrospective studies are more likely to contain elements of bias and confounding. One of the main elements of bias associated with retrospective studies is the inclusion of only those patients who completed a certain duration of treatment and were not lost to follow-up or did not discontinue treatment because of adverse events or other reasons. For example, the retrospective study by Cozzi et al. [6], which reported a GH response rate of 72 % and an IGF-1 response rate of 75 %, excluded an unreported number of patients who received octreotide LAR for less than 18 months. Had all patients who started on octreotide LAR been included in the analysis, the reported response rates may have been lower, as patients not reaching 18 months of therapy may have discontinued treatment because of non-response. Similarly, the retrospective comparison by Tutuncu et al. [14] of octreotide LAR and lanreotide Autogel in patients who had failed surgery reported that 64 % of octreotide LAR patients and 78 % of lanreotide Autogel patients had biochemical control after 18 months of treatment. However, the analysis retrospectively excluded all patients who had required a second operation or who had received additional medical treatment within 18 months of the initial surgery (i.e., probable non-responders to octreotide or lanreotide), thus artificially increasing response rates. As can be seen in Fig. 1, retrospective studies generally result in higher response rates, and the main reason is usually the retrospective exclusion of non-responders from the analysis.

Per protocol or intention to treat

However, many prospective studies also exclude patients who did not complete the study. For example, the prospective, 1-year follow-up study of lanreotide Autogel by Caron et al. [29] enrolled 130 patients previously treated with surgery and/or lanreotide Autogel, yet nine patients were excluded from the final analysis of biochemical control. Although the reasons for exclusion from the analysis may appear valid from a clinical practice perspective, for example, patients dropped out before the first efficacy analysis or withdrew consent to remain on treatment, these patients should be included as non-responders in an intention-to-treat (ITT) analysis. Therefore, although the study reported that 43 % (52/121) of the ITT population achieved biochemical control [29], the actual response rate was 40 % (52/130) (Table 2).

An efficacy analysis conducted on either the per-protocol (PP) or ITT population can produce significantly different outcomes. In an ITT analysis, all patients who were enrolled are considered part of the study, whether they receive treatment, complete the study or not. A PP analysis, however, is based on the population of patients who completed the clinical trial without any major protocol violations. For example, the study by Mercado et al. [18] reported results based on the PP population. This prospective, multicenter study evaluated octreotide LAR as first-line therapy in 98 previously untreated patients for 1 year. At month 12, 30 patients were excluded from the efficacy analysis because of either major protocol violations or early discontinuation; therefore, 68 patients formed the PP population. Seventeen patients achieved biochemical control (GH ≤ 2.5 μg/L and normalization of IGF-1 levels), which corresponds to a response rate of 25 % (17/68) based on the PP analysis, but 17.3 % (17/98) based on analysis of the ITT population (Table 1). Thus, analysis of the PP population usually leads to the reporting of higher response rates than an analysis of all patients who were intended to be treated.

Definition of responders and non-responders

Open-ended studies that report patients’ response at last follow-up generally result in a higher overall response rate than studies with a fixed time point to evaluate response. Although the definition of response may be the same with respect to the biochemical parameters (e.g., GH ≤ 2.5 µg/L and normal IGF-1), the time point at which biochemical control is measured may differ. In the study by Cozzi et al. [12], 56.7 % of previously untreated patients were reported to have achieved biochemical control (GH ≤ 2.5 µg/L and normal IGF-1). Patients in this study were followed up for 6–108 months, and their response at last follow-up was considered. As such, the 56.7 % of patients who were considered responders had achieved a response at some point between months 6 and 108 and so should not be compared with studies of a similar patient population that reported response rates at a specific time point, for example, at month 12 of treatment.

Furthermore, the pre-specified definition of biochemical control should be adhered to during analysis of the results. Like most studies, the randomized, double-blind study of pasireotide LAR versus octreotide LAR by Colao et al. [19] defined biochemical control as GH < 2.5 µg/L and normal IGF-1 at month 12. In this study, a patient with IGF-1 below the lower limit of normal was not considered a responder for the primary analysis because IGF-1 was abnormal. The consequence of this protocol definition is best exemplified by a post hoc analysis which included these patients with IGF-1 below the lower limit of normal as responders: response rates for octreotide LAR and pasireotide LAR increased from 19.2 to 20.9 % and from 31.3 to 35.8 %, respectively [19]. These examples emphasize the importance of appreciating the definitions of responders in study protocols and how they are treated in the primary analysis.

Assay variability

A wide range of immunoassays are used for the assessment of GH and IGF-1 levels. However, considerable heterogeneity in assay characteristics exists, which may lead to variability in results. This assay variability can be largely classified into two categories: cross-sectional variation, where different assays give differing results [45, 46]; and longitudinal variation, in which assays are unstable over time [47]. The lack of standardization between assays is of such concern that a 2011 consensus statement on the standardization and evaluation of GH and IGF-1 assays was developed as part of an international effort to harmonize GH and IGF-1 assays [48]. For the purposes of the current review, an analysis of the assays used in each study has not been undertaken. However, it is likely that some of the variability in GH and IGF-1 response rates seen in studies in this review is due to assay variability.

Response rates in a clinical trial setting: summary

Using representative examples of different clinical trial protocols evaluating octreotide LAR and lanreotide Autogel as a treatment for acromegaly, it is evident that multiple factors are capable of impacting on reported response rates. Prospective studies reporting an ITT analysis that evaluated medically naïve patients, used the composite endpoint of both GH and IGF-1 control, and used a fixed time point to evaluate response were associated with lower response rates. The use of less stringent non-composite biochemical control endpoints, heterogeneous patient populations, study protocols that exclude treatment non-responders from the efficacy analysis, and prior responsiveness to medical therapy are just a few of the factors identified that likely contribute to higher success rates. This review does not attempt to place emphasis on any one particular factor over another, but it serves instead to raise awareness of the dangers of interpreting biochemical control response rates without first carefully considering how the study design, patient population and statistical analysis might impact on the data. Multivariate statistical analyses exploring the differences in response rates between studies would help to quantify the impact these factors may have on reported outcomes, and such an analysis is encouraged. With this in mind, it is perhaps unsurprising that response rates to somatostatin analogues, or to agents of any class of drug, in published clinical trials vary considerably.

Response rates in clinical practice

In clinical practice, biochemical response rates to somatostatin analogues are likely to fall somewhere between the rates observed in the early clinical trials and the rates observed in more recent trials. Many patients seen in the clinic are post-surgical or have had successful prior treatment with a somatostatin analogue, rather than being treatment naïve. Prior to treatment with somatostatin analogues, patients may also be ‘pre-selected’ for potential responsiveness with an acute octreotide suppression test [49]. The clinical setting allows for dose adjustments at the discretion of the physician and long-term treatment, both of which provide opportunities for higher response rates, similar to long-term, open-label studies. The choice of parameters used to gauge clinical response also plays a significant role in perceived treatment success; both GH and IGF-1 levels together as a composite measure should be used to assess biochemical response [2, 50].

Finally, similar to the differences in clinical trials of reporting data from patients who discontinue treatment (e.g., ITT vs. PP analysis), clinicians (including those from specialist pituitary centers) sometimes might remember only those patients who have responded to treatment, rather than those who have transitioned to another treatment or care provider. As a result, clinicians should be cautious of comparing response rates in real-world clinical scenarios with response rates obtained from a controlled trial setting.

As the mechanisms involved in the resistance to somatostatin analogues become better understood, it should be possible to predict which patients will respond to different medical therapies based on biomarkers. Therefore, a more successful outcome may be observed if individualized treatment is based on information such as the somatostatin receptor subtype profile, aryl hydrocarbon receptor-interacting protein (AIP) expression, and T2 intensity on magnetic resonance imaging, among others [51, 52]. This would result in higher response rates, because those patients who would not have responded to treatment with a particular agent would not be unnecessarily treated.

Concluding remarks

As acromegaly is a rare disorder, even experienced endocrinologists may seldom treat a patient with acromegaly and therefore rely on the medical literature to inform treatment decision making. However, the wide range of reported response rates with octreotide LAR and lanreotide Autogel may be confusing and could lead to misinterpretation by both the patient and the physician in certain situations. Understanding the factors that potentially drive the discordance in response rates, as reported in this review, should allow clinicians to better gauge treatment expectations in specific patients.