Background

Attention-deficit/hyperactivity disorder (ADHD) is a disorder characterized by inattention, impulsivity, and hyperactivity that affects 3–7% of school-age children [1]. ADHD is usually associated with significant impairment of cognitive and psychosocial functioning [2, 3] and can have a significant impact on the emotional well-being [46] and the quality of life (QoL) of both patients and their families [712].

Psychostimulants and behavioral therapy are known to be effective in the treatment of ADHD, as reported in the MTA study [13]. Atomoxetine is a non-stimulant treatment option for ADHD [14, 15], for which efficacy and tolerability in children and adolescents have been demonstrated in a number of randomized, placebo-controlled trials [1619], supported by a recent meta-analysis [20]. In most of these studies, questionnaires such as the ADHD-Rating Scale (ADHD-RS) [21, 22] or the Clinical Global Impression (CGI) [23, 24] have been used as outcome measures for the core symptoms of ADHD. Other questionnaires such as the Child Health Questionnaire (CHQ) [25] or the Child Health and Illness Profile, Child Edition (CHIP-CE) [26] assess aspects of ADHD that go beyond the core symptoms of the disorder and reflect various dimensions of health-related quality of life. However, such questionnaires are often rated by the investigator alone, resulting in an assessment from one perspective only. Therefore, several studies have attempted to compare the perspectives of the various individuals involved, such as the patient, the parent, or the physician, as these perspectives have been shown to differ [12, 27]. The newly devised Global Impression of Perceived Difficulties (GIPD) is one such instrument with which the three perspectives can be compared [28, 29]. The Pediatric Adverse Event Rating Scale (PAERS) also allows the comparison between patient, parent, and physician perspectives, although it was designed to capture the tolerability of medication rather than efficacy [30].

This report is based on a secondary analysis of data from two almost identical multi-center, single-arm, open-label studies in two different age groups (children and adolescents). These studies were designed to investigate the quality of life in patients with ADHD treated with atomoxetine as reflected by the degree of difficulties perceived by patients, parents and physicians [28, 29]. The two studies were undertaken to address the need for further research on evidence-based psychopharmacological treatments in children and adolescents [31]. One of the aims of the two studies on which this post-hoc analysis is based [28, 29] was to assess the tolerability of atomoxetine in these patients and compare the tolerability as perceived from the three perspectives (patient, parent, physician) using the Pediatric Adverse Event Rating Scale (PAERS). The PAERS is a 48-item questionnaire designed to assess any type of adverse event occurring in pediatric patients who are treated with psychotropic medication, especially as participant in clinical trials, and was developed as part of the Child and Adolescent Psychiatry Trials Network (CAPTN) [30, 3234]. The response captures the severity of 48 adverse event items on a five-point Likert scale (0–4).

The main assumption of this post-hoc analysis was that 10 of the 48 items of the PAERS are directly related to the patients' emotional state and can therefore be considered to reflect the patient's emotional well-being. Based on this assumption, the hypothesis of this analysis was that the emotional well-being of children and adolescents with ADHD responds well to treatment with atomoxetine as reflected by the 10 items of the PAERS directly related to the patient's emotional state. Differences between the three perspectives (patient, parent, physician) were also explored.

Methods

Study design and procedures

This is a secondary analysis of data from two almost identical multi-center, single-arm, open-label studies in two different age groups (children and adolescents) that were designed to investigate the quality of life in patients with ADHD treated with atomoxetine as reflected by the degree of difficulties perceived by patients, parents and physicians [28, 29]. Patients were recruited from child and adolescent psychiatric and pediatric practices and outpatient clinics throughout Germany. Patients aged 6–17 years with ADHD as defined in the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) [1] were eligible for the studies. The diagnosis was confirmed using the "Diagnose-Checkliste Hyperkinetische Störungen" (Diagnostic Checklist for Hyperkinetic Disorders), a structured instrument which is routinely used for the diagnostic assessment of ADHD in Germany [35]. The items of this instrument correspond to those of the ADHD-RS [21, 22]. Patients had to have an IQ of ≥70 based on the clinical judgment of the investigator. The exclusion criteria included clinically significant abnormal laboratory findings, acute or unstable medical conditions, cardiovascular disorder, history of seizures, pervasive developmental disorder, psychosis, bipolar disorder, suicidal ideation, any medical condition that might increase sympathetic nervous system activity, or the need for psychotropic medication other than study drug. Patients already being treated with atomoxetine were also excluded. Other previous treatments were allowed, provided they were discontinued prior to enrolment in the study. The protocol was approved by an ethics committee, and the study was conducted in accordance with the principles of the Declaration of Helsinki. Following a wash-out period, baseline assessments were carried out with all the instruments used. During the first week of treatment, the patients received atomoxetine at a dose of approximately 0.5 mg/kg body weight (BW) per day. During the following 7 weeks, the recommended target dose was 1.2 mg/kg BW per day, but could be adjusted within a range of 0.5–1.4 mg/kg BW per day, depending on effectiveness and tolerability. Medication was given once a day in the morning. Assessments were carried out weekly during the first two weeks of treatment, and every two weeks thereafter. After the 8 week treatment period, the physicians decided in accordance with the patients and their parents whether the patient was to continue treatment for additional 16 weeks. Those who participated in this extension period continued on the same atomoxetine dose which again could be adjusted within a range of 0.5–1.4 mg/kg BW per day as considered appropriate by the physician. During the extension period, three assessments were carried out, after 12, 16, and 24 weeks after baseline. The following instruments were used to assess efficacy: Global Impression of Perceived Difficulties (GIPD), Attention-Deficit/Hyperactivity Disorder Rating Scale (ADHD-RS), Clinical Global Impression-Severity (CGI-S), and the Weekly Rating of Evening and Morning Behavior – Revised (WREMB-R). The results from these scales have been published elsewhere [28, 29].

In order to assess the tolerability of atomoxetine in more detail than is possible using spontaneous adverse event reports, the Pediatric Adverse Event Rating Scale (PAERS) [30] was used in the two studies reported here. The data from both studies were combined and analyzed together. Tolerability assessments included monitoring vital signs at every visit and recording all spontaneously reported adverse events, followed by a systematic elicitation of any further adverse events using the PAERS. First, the physician asked an open question as to any adverse events. Then, the patient and the parent (or other primary caregiver) filled out the PAERS independently and without any interference by the physician. Both the patient and the parent had to rate each adverse event in terms of how bothersome or how much of a problem it was during the past week on a scale from 0 (= not present) to 4 (= a lot). If the patient was unable to fill out the scale all by him or her self, an independent person (e. g. a study nurse, but not the parent or physician) was allowed to provide assistance. As the spontaneous adverse event reports captured by the physician preceded the elicitation of any further adverse events using the PAERS, the number of adverse events captured by these two methods potentially differed.

Although the PAERS was designed to measure adverse events, some items are reflecting ADHD symptoms and difficulties associated with ADHD rather than adverse events. These items can be expected to improve but not worsen during ADHD treatment. Of these, all ten items of the PAERS that were thought to reflect emotional well-being by face validity were selected for this post-hoc analysis to measure changes over time.

Noncompliance was defined as missing intake of study drug on more than five consecutive days, failure to take at least 70% of study medication for at least two weeks, or repeated intentional intake of more than the prescribed dose.

Sample size and statistical analysis

Details on the sample size calculation for the two studies have been published elsewhere [28, 29]. The data of all patients were evaluated (Full Analysis Set, FAS) using SAS version 8. The dataset for all analyses of changes from baseline to endpoint consisted of all patients with a baseline measurement and at least one post-baseline measurement during the 8-week treatment phase.

Evaluation was largely descriptive. All tests of statistical significance were carried out at a nominal level of 5% using two-tailed test procedures. Two-sided confidence intervals (CIs) were computed using a 95% confidence level. All inferences regarding statistical significance were based on comparisons of the 95% confidence intervals (CI). This is equivalent to significance tests with p-values and a two-sided α-level of 5%. To avoid correlations of imputed values, only observed cases (OC) analysis were performed. No imputation of missing values like last observation carried forward (LOCF) was applied as the intention was to describe the patterns for patients still on medication.

Spearman's correlation coefficients were computed between all items within each perspective in order to identify patterns of interdependency among items. This analysis was based on all visits. A sensitivity analysis was done using the baseline visit only. 95% confidence intervals for the correlation coefficients were computed based on Fisher's z-transformation.

Results

Patient population and disposition

Of the 425 patients screened, 421 patients (100%) were enrolled in the two studies and treated with atomoxetine [28, 29]. All patients were diagnosed with ADHD according to DSM-IV criteria. The mean age of the patients was 11.1 years, 338 (80.3%) were boys, 83 (19.7%) were girls. The 8-week treatment period was completed by 355 (84.3%) patients. 27 (6.4%) of these did not continue into the extension period because of physician decision. 68 (16.1%) patients discontinued the study between week 8 and week 24. The extension period was completed at week 24 by 260 (61.8%) patients. The reasons for discontinuation at any time during the 24-month observation period were lack of efficacy (12.4%), parent decision (6.9%), adverse event (4.8%), protocol violation 3.6%, patient decision (2.4%), entry criteria exclusion (0.7%), physician decision (0.7%), and patient lost to follow-up (0.5%). The patient disposition is shown in Figure 1.

Figure 1
figure 1

Patient disposition.

Table 1 shows the patient characteristics. Boys and patients with the combined subtype ADHD according to DSM-IV [1] tended to be younger and tended to be diagnosed earlier than girls or patients with predominantly inattentive subtype. 239 (70.7%) of the boys and 39 (47.0%) of the girls were diagnosed with the combined subtype. The predominantly inattentive subtype was diagnosed in 86 (25.4%) of the boys and 38 (45.8%) of the girls. The subgroups "predominantly hyperactive-impulsive subtype" and "ADHD, not otherwise specified" were small (6 and 13 individuals, respectively). The mean ADHD-RS total score at baseline was 32.6 [CI 31.5 to 33.6] points. This score decreased to 16.3 [CI 15.1 to 17.5] points at week 8, and was 14.5 [CI 13.1 to 15.8] points at the end of week 24.

Table 1 Patient characteristics

Pre-existing comorbid conditions were reported for 310 (73.6%) patients, the most frequent being psychiatric comorbidities, specifically conduct disorder (19.7%), oppositional defiant disorder (17.6%), enuresis (4.3%), tic disorder (2.4%), emotional disorder of childhood (2.6%), and depression (1.4%). Physical comorbidities that were reported at a rate of >2% were headache (5.7%), seasonal allergy (4.3%), asthma (3.3%), neurodermatitis (2.6%), acne (2.4%), upper respiratory tract infection (2.1%), and rhinitis (2.1%).

349 (82.9%) of the 421 patients had previously been treated for ADHD. The percentage was similar for the predominantly inattentive subtype (N = 101, 81.5%) and the combined subtype (N = 231, 83.1%). Medications most frequently used before study entry were short-acting methylphenidate (N = 290, 68.9%), long-acting methylphenidate (N = 196, 46.6%), amphetamines (N = 56, 13.3%), antipsychotic drugs (N = 12, 2.9%) and herbal/complementary therapies (N = 10, 2.4%). Commonly reported non-drug therapies prior to study were: occupational therapy (N = 48, 11.4%), "other" psychotherapy (N = 31, 7.4%), structured psychotherapy (N = 42, 10.0%), and remedial education (N = 10, 2.4%). The most frequent reason for discontinuation of previous therapy in patients with pre-treatment was inadequate response (N = 216, 61.9%). N = 68 patients (16.2%) discontinued previous therapy because of adverse events.

The mean atomoxetine dose given during the first week of treatment was 0.50 mg/kg per BW day (SD 0.07, range 0.40 – 0.80 mg/kg BW per day). Thereafter, the mean dose for the respective visit intervals ranged between 1.17 and 1.18 mg/kg per day (min. 0.40, max. 1.50 mg/kg per day). Compliance as defined above was present in 91.2% of all patients over the course of the entire study.

Concomitant medication was taken by 272 (64.6%) of the patients. Cough and cold remedies, analgesics, antibiotics and herbal/complementary medicines were given most frequently. Whilst continuous medication with any psychotropic compound other than the study medication led to discontinuation of the patient in the study, 3.8% (N = 16) of patients did receive a psychotropic medication at least once over the entire course of the 24-week study. The medication included compounds such as St. John's Wort, imipramine or a benzodiazepine. Concomitant behavioral therapy was given to 27 (6.4%) patients, and 20 (4.8%) patients received additional occupational therapy.

Results from ten items of the PAERS

The following ten items of the Pediatric Adverse Event Rating Scale (PAERS) were selected to investigate emotional well-being: "feeling withdrawn or numb" (item 8), "nervous, tense, or uptight" (item 16), "trying to hurt him or her self" (item 20), "feeling restless or keyed up" (item 26), "sad or low mood/unhappy" (item 32), "drowsy or 'out of it"' (item 37), "unusually good mood/super happy" (item 38), "not interested/no enthusiasm" (item 39), "angry or irritable/in a bad mood" (item 42), and "thinking about or wanting to hurt self" (item 43). Each of these items was rated from three perspectives (patient, parent, and physician) like all PAERS items. If present, the severity of the respective behavior or emotional state was rated on a 5 point Likert scale (0 = not present, 1 = mild, 2 = moderate, 3 = severe, and 4 = extreme).

The following five groups of items were identified, whose correlations were larger than 0.4 between items: (a) items relating to depressed mood (items 8, 32, and 39), (b) items relating to self-harm (items 20 and 43), (c) items relating to irritability/agitation (items 16, 26, and 42), (d) one item relating to drowsiness (item 37), and (e) one item relating to euphoria (item 38) (Table 2). The correlation of the various items in the five groups is shown in Table 3.

Table 2 Groups of items of the Pediatric Adverse Event Rating Scale (PAERS) used to assess emotional well-being
Table 3 The five groups of items from the Pediatric Adverse Event Rating Scale (PAERS) whose Spearman's correlation coefficients for the physician rating were larger than 0.4 between items (all visits pooled; 3611 observations).

In general, the correlations were moderate to high only for parent and physician ratings, but not for patient ratings. The pattern of moderate to high correlations was similar between parent and physician ratings. In the sensitivity analyses using baseline ratings only (rather than all ratings), the correlations were generally lower, but confirmed the overall pattern of correlations based on the other points in time. A total score of all the items was not calculated as correlations were low between items that belonged to different groups (Table 2). These correlations are not reported here.

Items relating to depressed mood

Based on the confidence intervals at baseline, the parent ratings of the items "feeling withdrawn or numb" (item 8), "sad or low mood/unhappy" (item 32), and "not interested/no enthusiasm" (item 39), were significantly higher compared to both the patients and the physician ratings, which were similar. However, mean scores for all items were below 0.81 (Table 4). The scores for the items relating to depressed mood decreased over time. The mean change from baseline was statistically significant for all three items and all three perspectives both at week 8 and at week 24 (Table 4). Generally, there was a tendency for mean scores to decrease further the longer patients stayed on medication. Moreover, the decrease in scores was generally most pronounced in parent ratings, followed by patient and physician ratings.

Table 4 Baseline ratings and change from baseline at weeks 8 and 24

Items relating to self-harm

The scores for the items relating to self-harm were extremely low at baseline and stayed low over time (Table 4). The scores were comparable in terms of the three perspectives. No significant changes were observed compared to baseline.

Items relating to irritability/agitation

At baseline, a similar pattern was observed for items "nervous, tense, or uptight" (item 16), "feeling restless or keyed up" (item 26), and "angry or irritable/in a bad mood" (item 42). Based on the confidence intervals, the parent rating was significantly higher than the physician rating, which was again significantly higher than the patient rating for all three items. The scores for the items relating to irritability/agitation decreased over time (Table 4). The decreases from baseline were largest for parents, followed by physicians and patients: the mean changes from baseline were statistically significant for all perspectives, all three items, and both at week 8 and week 24, as shown by non-overlapping confidence intervals.

Item relating to drowsiness

Based on the confidence intervals at baseline, the item "drowsy or 'out of it"' (item 37) was scored similarly by parents and patients, and significantly higher than by physicians. The scores for the item relating to drowsiness decreased over time (Table 4). Also the changes from baseline were more pronounced in parent and patient ratings than in physician ratings. The mean changes from baseline were statistically significant for all perspectives and both at week 8 and week 24, as shown by non-overlapping confidence intervals.

Item relating to euphoria

Based on the confidence intervals at baseline, the item "unusually good mood/super happy" (item 38) was scored significantly higher by patients than by parents, and significantly higher than by physicians. The scores for the item relating to euphoria decreased over time (Table 4). The changes from baseline were scored similarly by patients and physicians, but the decreases were smaller in the physician rating. The mean changes from baseline were statistically significant in terms of all three perspectives and were significant both at week 8 and week 24, as shown by non-overlapping confidence intervals.

Discussion

The aim of this post-hoc analysis was to evaluate the scores of the ten items of the Pediatric Adverse Event Rating Scale (PAERS) that were considered to be related to emotional well-being by face validity. Each of these ten items was rated from three perspectives: the patient, the parent, and the physician perspective. These perspectives were subsequently compared in terms of the height of scores and changes in the scores over time.

Emotional well-being as reflected by the scores on the respective ten items of the PAERS showed both similarities as well as differences both regarding the course of the scores over time and comparisons between the three perspectives (patient, parent, physician). Generally, scores for all items rated from all three perspectives decreased over the 24-week duration of the two studies. Thus, emotional well-being as reflected by the scores on the ten items of the PAERS was seen to improve during treatment with atomoxetine.

Correlations between parent and physician scores were generally higher than correlations between parent and patient scores as well as correlations between physician and patient scores. There were, however, distinct differences between the patterns observed for the five groups of items relating to depressed mood, self-harm, irritability/agitation, drowsiness, and euphoria.

For the items relating to depressed mood, parent ratings resulted in higher scores than either patient or physician ratings at baseline. This may be due to parents being particularly concerned about the emotional well-being of their children. Over time, however, there is a reduction in the scores for these items from all three rater perspectives. Obviously, all individuals concerned recognize an improvement in the emotional well-being of the patients over time. Surprisingly, both patient and physician ratings on the PAERS were similar, although children and adolescents seemed to dissimulate their difficulties or failed to perceive their difficulties correctly according to the Global Impression of Perceived Difficulties (GIPD), whilst physicians perceived the child's difficulties as being considerably greater [28, 29]. Mean changes for most items and ratings were approximately one third of a standard deviation (Table 4). This can be considered a moderate change, given the low scores at baseline.

For the items relating to self-harm, scores from all three perspectives were very low at baseline and did not change significantly whilst the child or adolescent is being treated with atomoxetine. Mean changes from baseline for these items and ratings were negligible. This finding is encouraging, because it suggests that attempts to self-harm or thoughts of self-harm are not aggravated by treatment with atomoxetine.

For the items relating to irritability/agitation, scores differed significantly depending on the rater. Whilst baseline scores rated by parents were the highest, baseline scores rated by patients were lowest, and baseline scores rated by physicians were in between. These findings may be due to the high impact that a child's irritability/agitation may have on the parents. The physician may be less likely to observe irritability/agitation than a parent might, and children or adolescents seemed to dissimulate their difficulties in these two studies [28, 29]. Mean changes for most items in this dimension and for physician and parent ratings were greater than one half of the standard deviation (Table 4). This is a considerable change. In contrast, the patient ratings for these items changed to a smaller degree. This finding may be a result of the lower patient-rated scores at baseline.

The scores for the item relating to drowsiness as rated by patients and parents were similar whilst the scores rated by the physicians differed from the scores rated either by the patients or the parents at baseline. Physician-rated baseline scores were lower, which may be due to the physicians not having as much opportunity to witness any drowsiness as parents may do. Patients can be expected to experience this well-known adverse event related to atomoxetine [36]. Mean changes for this item were just below one third of the standard deviation. This can be considered a moderate change, given the low scores at baseline.

The scores for the item relating to euphoria differed between all three perspectives (patient, parent, physician). Whilst patient ratings resulted in the highest scores for euphoria, the scores from physician ratings were the lowest and scores from parent ratings were in between. The greater euphoria experienced by the patients compared to the euphoria seen by the parents or physicians seems to correspond to the lower degree of ADHD-related difficulties perceived by patients compared to the parent or physician perspectives as measured by the GIPD in these two studies [28, 29]. The rating of euphoria by the parents may have resulted in scores that more objectively reflect the actual situation, whilst the physicians may not have had adequate opportunity to witness the euphoria before carrying out their rating. Mean changes for this item were approximately one third of the standard deviation. This can be considered a moderate change, given the low to moderate scores at baseline.

This study has several limitations. Most importantly, the study did not include a placebo control, so that the degree to which the results reflect drug-specific effects cannot be determined definitely. More specifically, placebo-controlled studies would be needed to distinguish direct medication effects on emotional well-being from indirect effects caused by improvement of core symptoms. Furthermore, the age-distribution of the sample does not reflect the age-distribution of individuals with ADHD in the general pediatric population. This is due to the fact that this analysis is based on two studies, one in children and one in adolescents. Whilst the age-distribution is normal within each of the two otherwise identical studies, the age-distribution of the combined samples is not quite normal, as it shows two peaks. Due to the open-label design, unspecific factors such as rater bias, expectation effects, and time effects cannot be ruled out. However, this does not automatically compromise the validity of the results [37]. Furthermore, although both mean symptom reduction and improvement in emotional well-being were considerable, the results do not allow direct comparison against changes of these parameters upon treatment with other ADHD medications. Treatment emergent adverse events occurring in the two studies on which this analysis is based have been reported and discussed elsewhere in more detail [28, 29]. For evaluating the adverse event profile, it needs to be taken into account that only those patients for whom the physician decided to continue atomoxetine treatment at week 8 were followed for additional 16 weeks until week 24.

Taken together, these findings could be expected, as cognition and the regulation of emotion are known to influence one another [38]. Furthermore, cognitive control of emotion involves frontal structures of the brain [39], areas of the brain that play an important role in the pathophysiology of ADHD [3]. Thus, any pharmacological treatment that is effective on the core symptoms of ADHD and executive functioning can also be expected to improve the emotional regulation and thus the emotional well-being of patients with ADHD. This hypothesis is supported by the findings from this secondary analysis. These findings appear particularly relevant in face of the important role that emotional regulation plays in children, adolescents, and adults with ADHD [4, 5, 3943].