Background

The term psychosis refers to a serious psychological disorder characterized by obvious and long-lasting abnormalities in understanding, emotion, cognition, behavior, and other psychological activities. Psychoses such as schizophrenia often involve cognitive impairment and comorbid anxiety and depression [1]. Pharmacological therapy often has a limited effect or produces side effects [2, 3]. Non-pharmacological therapies (such as psychotherapy, occupational therapy, and art therapy) for patients with psychiatric disorders are useful, adaptable, and potentially cost-effective approaches to improve outcomes and quality of life [4].

Chinese calligraphy therapy (CCT) is a branch of art therapy that involves visual–spatial patterning of characters. This type of calligraphy is more than an art therapy; in essence, it involves culture, health, behaviour treatment and rehabilitation. The art nature is only one of its varied roles and functions. It necessitates exercising motor control of the brush to follow specific character configurations based on a projection of the cognitive images of the characters [5]. CCT requires the use of a soft-tipped brush to reproduce Chinese glyphs. It combines physical, mental, and personal processes and integrates visual performance, spatial abilities, and cognitive planning [6].

Recent empirical studies have shown that the practice of calligraphy may improve behavioral and psychosomatic disorders and may have a therapeutic effect on attention and emotional stability [7]. CCT has been scientifically investigated within the contexts of psychology, cognitive science, and cognitive neuroscience, and the findings suggest that it can reduce neuropsychiatric symptoms [8].

There have been many systematic reviews of other art therapies, such as painting therapy, music therapy, and gardening therapy. However, there are no systematic reviews or meta-analyses of CCT’s effect in reducing neuropsychiatric symptoms. Thus, the aim of this systematic review and meta-analysis was to explore the efficacy of CCT for people with neuropsychiatric symptoms.

Methods

Search strategy

A systematic search for articles was made in December 2016 using the databases MEDLINE, EMBASE, PsycINFO, Cochrane Central Register of Controlled Trials, and Wanfang Data. We searched for keywords and/or controlled vocabulary, such as medical subject headings and Emtree terms. Keywords were “handwriting,” “Chinese calligraphy,” “Chinese calligraphy therapy,” “Calligraphy exercise,” and “Calligraphy training.”

Study selection

We included in the analysis studies that met the following criteria: (1) randomized controlled trial (RCT), cohort study, or case-control study, (2) published in Chinese or English, (3) subjects were either healthy or psychiatric patients, (4) experimental group intervention measures for Chinese calligraphy; the control group intervention could be general care, health education, or no care, and (5) measurement of physiological or mental indicators. The exclusion criteria were as follows: (1) not relevant to Chinese calligraphy/calligraphy therapy, (2) commentary, case report, or a review article, (3) experimental interventions other than calligraphy treatment, (4) and lack of a control group and/or overlapping populations.

Quality assessment

We used the guideline suggested by Queen’s Joanna Briggs Collaboration critical appraisal checklist, Version 4.0 [9]. This scale contains 10 appraisal criteria that assess whether the assignment to treatment groups was truly random, participants were blinded to treatment allocation, allocation to treatment groups was concealed from the allocator, the outcomes of people who withdrew were described and included in the analysis, those assessing the outcomes were blind to the treatment allocation, the control and treatment groups were comparable at entry, the groups were treated identically other than for the named interventions, the outcomes were measured in the same way for all groups, the outcomes were measured in a reliable way, and the statistical analysis was appropriate. Two researchers extracted information and screened the quality of the articles independently. A third researcher was used to determine the quality of the studies in cases where it was difficult to reach a consensus.

Data analysis

Meta-analysis was performed using RevMan 5.3 software (Review Manager (RevMan) [Computer program]. Version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014.). The mean and standard deviation of each group was calculated to determine the overall effect of the intervention. Chi-square was used to determine heterogeneity. In addition, I2 values were calculated to test the heterogeneity among the studies. When I2 was < 50%, a fixed-effects model was used to determine the homogeneity among studies [10]. However, if there were differences between the studies (such as study location, population, and intervention program), a random-effects model analysis was used to avoid underestimation of treatment variability. Odds ratios and mean differences were used to compare different measurement indexes, to obtain standardized mean differences (SMD), and to estimate the combined effect level. Based on the different psychiatric symptoms and diseases included in the studies, we divided the meta-analysis into six groups: group 1: psychosis, group 2: anxiety, group 3: depression, group 4: cognition, group 5: neurofeedback, and group 6: schizophrenia.

Results

Characteristics of the studies

Figure 1 shows the process of study selection. Our initial search strategy yielded 299 citations, 233 of which were ineligible based on our screening of titles and abstracts. Thus, we retrieved the full text of 66 studies. Of these, 24 were excluded because they were commentaries; 4 were case reports; 8 lacked a control group; 6 had study populations that overlapped with other included studies; and 3 did not meet the eligibility criteria because they were not intervention studies. Consequently, 21 eligible studies were analyzed.

Fig. 1
figure 1

Flow diagram of study selection

Table 1 shows study characteristics and patient demographic data from each of the 21 studies included in the review. These studies were published between 2000 and 2016, and had sample sizes ranging from 16 to 224. Ten studies measured psychosis or general psychosis (the meta-analysis results and forest map are shown in Fig. 2). Nine studies on anxiety were analyzed; one study was removed [11] because the sensitivity analysis indicated large heterogeneity (Fig. 3). Seven studies on depression were analyzed (Fig. 4). Two studies on cognitive impairment were analyzed (Fig. 5) and three studies on neurofeedback were analyzed (Fig. 6).

Table 1 Characteristics of studies that fulfilled inclusion criteria
Fig. 2
figure 2

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of psychosis

Fig. 3
figure 3

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of anxiety

Fig. 4
figure 4

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of depression

Fig. 5
figure 5

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of cognitive function

Fig. 6
figure 6

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of neurofeedback

Eight studies evaluated patients with schizophrenia, of which four measured positive psychopathology of schizophrenia and four measured negative syndromes of schizophrenia (see Figs. 7 and 8 ).

Fig. 7
figure 7

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of schizophrenia-psychopathy

Fig. 8
figure 8

Forest plot of comparison: Experimental (Chinese calligraphy therapy, etc.) versus Control. Outcome: index of schizophrenia-negative syndrome

Table 2 shows the methodological quality of the 21 included studies. Two studies clearly documented the use of random allocation [12, 13]. The remaining quality criteria were scored based on the narrative of the studies. The scores for risk of bias ranged from 6 to 9 points. Agreement between the two reviewers was assessed using Cohen’s kappa coefficient and was 0.891 (p < 0.001), indicating a high degree of consistency.

Table 2 Methodological quality assessment of the included studies (JBI)

Figure 2 shows the meta-analysis results for group 1. Using a fixed-effects model, the heterogeneity test (I2) result for the seven studies was 0%, indicating homogeneity among the studies. The SMD was used to estimate the combined effect of measurements using different scales. The results showed that CCT can significantly reduce psychosis (10 studies, 965 subjects, SMD = − 0.17, 95% CI [− 0.30, − 0.04], Z = 2.60, p = 0.009).

Figure 3 shows the meta-analysis results for group 2. Using a fixed-effects model, the heterogeneity test (I2) was 0%, indicating that the nine studies were homogeneous. The SMD indicated that calligraphy treatment significantly reduced anxiety symptoms (9 studies, 579 subjects, SMD = − 0.78, 95% CI [− 0.95, − 0.61], Z = 8.98, p < 0.001).

Figure 4 shows the meta-analysis results for group 3. Using a fixed-effects model, the heterogeneity test (I2) was 0%, indicating that the seven studies were homogeneous. The SMD indicated that calligraphy treatment significantly reduced depressive symptoms (7 studies, 456 subjects, SMD = − 0.69, 95% CI [− 0.88, − 0.50], Z = 7.11, p < 0.001). Figure 5 shows the meta-analysis results for group 4. Using a fixed-effects model, the heterogeneity test (I2) was 0%, indicating that the two studies were homogeneous. The estimated combined effect showed that CCT significantly improved cognitive function (2 studies, 55 subjects, MD = 2.17, 95% CI [− 0.03, 4.38], Z = 1.93, p = 0.05). Figure 6 shows the meta-analysis results for group 5. Using a fixed-effects model, the heterogeneity test (I2) was 0%, indicating that the three studies were homogeneous. The estimated combined effect showed that CCT significantly improved neurofeedback (3 studies, 148 subjects, SMD = − 1.09, 95% CI [− 1.44, − 0.73], Z = 6.01, p < 0.001). Figure 7 shows the meta-analysis results for group 6–1 (schizophrenia-psychopathy). Using a fixed-effects model, the heterogeneity test (I2) was 0%, indicating that the four studies were homogeneous. The estimated combined effect showed that CCT significantly reduced positive psychopathological expression of schizophrenia symptoms (4 studies, 287 subjects, SMD = − 0.35, 95% CI [− 0.59, − 0.12], Z = 2.96, p = 0.003). Figure 8 shows the meta-analysis results for group 6–2 (schizophrenia-negative syndrome). Using a fixed-effects model, the heterogeneity test (I2) was 15%, indicating that the four studies were homogeneous. The estimated combined effect showed that CCT significantly reduced negative symptoms of schizophrenia (4 studies, 276 subjects, SMD = − 1.39, 95% CI [− 1.65, − 1.12], Z = 10.23, p < 0.001).

Discussion

The purpose of this study was to explore the effectiveness of calligraphy therapy in improving symptoms of psychiatric disorders by reviewing and analyzing relevant literature. Twenty-one studies met the inclusion criteria and were reviewed. Most of the evidence suggests that CCT can change targeted behaviors in individuals with neuropsychiatric symptoms and that CCT is associated with improvements in objective measurements of psychiatric performance.

Evidence from these kinds of studies is needed before CCT can be considered effective for neuropsychiatric symptoms. The study findings reviewed here suggest that, used as psychiatric therapy, CCT can significantly improve selected neuropsychiatric symptoms. The use of CCT in compensation-focused interventions and selected psychotherapeutic interventions may lead to neuropsychiatric changes and thus improve daily life.

RCTs provide the best evidence of the efficacy of CCT. In RCTs with large samples, there is more balance between the characteristics of participants in the treatment and control groups. In small-sample RCTs, some characteristics may not be balanced between groups. Table 1 shows that most of the sample sizes in each group were between 15 and 30. Only about half of the included studies had quality scores greater than 8 (Table 2). Therefore, more high-quality RCTs are needed to strengthen the evidence for CCT’s effect in reducing neuropsychiatric symptoms. If more systematic reviews are produced to establish clinical guidelines, this could increase the clinical use of CCT.

A funnel plot of the CCT literature was symmetrical (i.e., showed no positive or negative relations between effect size and standard error). This indicates that there was no publication error. The best way to avoid publication bias is to begin with a rigorous examination of the literature.

As this is the first review of the efficacy of CCT for neuropsychiatric symptoms, we are unable to compare the findings with other similar studies. Compared with other non-pharmacological therapies like music therapy, painting therapy, or gardening therapy, CCT is perhaps a more culture-specific therapy, as it requires participants to learn Chinese writing and use a special soft calligraphy brush.

Our findings show that calligraphy therapy can significantly enhance cognitive function and relieve neuropsychiatric and depressive symptoms. However, there were some study limitations. First, studies varied markedly in their intervention approaches and selected outcome measures, and were frequently hampered by design limitations. Second, the pattern of effects on specific neuropsychiatric domains was inconsistent across studies. Additionally, some important outcomes, such as daily functioning, quality of life, and neuropsychiatric symptom severity, were assessed infrequently in the reviewed studies. Moreover, handwritten communication has largely been replaced by typed communications (and, in more recent years, by mobile phone calls and texts). This may have affected the results.

Overall, the results from trials are promising but inconclusive. Additional well-designed and adequately powered trials are warranted. However, this evidence must be treated with caution because of methodological limitations. To better assess the value of non-pharmacological interventions for this population, we recommend the following: (1) RCTs should have a large size of over 30 subjects; (2) standards for cognitive/neuropsychiatric rehabilitation must be established for treatment of diseases, such as stroke, Alzheimer’s disease and MCI; (3) general character templates for adaptation and modification in diverse fields of clinical trials must be standardized; (4) character form design must be set; (5) authority for standards of template designs, protocols for CCT and a large database for cases and trial follow ups must be established.

Conclusion

This study shows that CCT exerts a curative effect on neuropsychiatric symptoms, but the evidence remains insufficient. A large number of RCTs are needed to facilitate additional systematic reviews of evidence for CCT. In general, we hope that this paper offers a method for a systematic CCT review and meta-analysis, which may provide a basis for establishing standards for CCT in clinical trials and applications in the future.