Background

In recent years, increased violent events were reported in the general hospitals of China [1, 2], which has threatened medical workers’ safety and even their lives, and has also jeopardized the relationship between medical worker and patient. A meta-analysis by Lu et al. 2018 covering 81,771 health-care professionals reported that the prevalence of workplace violence against health-care professionals was 62.4% [3]. Previous studies showed that non-psychiatric patients with mental health disorders were more likely to have a tendency to exhibit violence [4, 5]. For example, individuals with paranoid personality disorder (PPD), emotionally unstable personality disorder (EUPD) or histrionic personality disorder (HPD) had an increased tendency to suffer from comorbid psychological and physical problems and atypically exhibited a higher use of medical resources [6,7,8]. At the same time, there was a lack of flexibility in adapting to social, environmental and interpersonal communications [9]. The core phenomenological feature of the above personality disorders is the difficulty in emotional regulation [10]. Many individuals with such disorders have a biological susceptibility to experience emotional disturbances and are more likely to have a strong reaction to the experience. Their impulsive and angry behavior toward others, is regarded as a bad strategy they have adopted to alleviate or avoid intense negative emotional experiences [11]. PPD, EUPD and HPD are strongly associated with anxiety and depression, and the individuals with these disorders are more prone to suicide or interpersonal violence [12,13,14]. In this study, we hypothesized that individuals with demonstrated in the above-mentioned mental health disorders are more likely to present with a tendency to be violent.

However, the recognition rate of mental health disorders by non-psychiatric medical workers was very low in China [15]. The currently commonly used psychological-assessment tools, such as Kessler Psychological Distress Scale (K10) [16], Hospital Anxiety and Depression Scale (HADS) [17], were related to emotional disorders (e.g., anxiety and/or depression). These scales are rarely used in non-psychiatric routine clinical practice, and personality disorders are not assessed in non-psychiatric clinical setting, which may weaken the effectiveness of the mental health disorder screening. Patients with mental disorders are prone to suicidal ideation or suicidal behavior, while non-psychiatric medical workers pay little attention to this situation [18]. The “Huaxi Emotional-distress Index” (HEI) [19], a Chinese questionnaire, including 9 items for screening of depression and anxiety, was developed for use in non-psychiatric routine clinical practice of general hospitals. However, it failed to integrate personality disorders as screening items and thus had not reached high efficiency for patient selection of mental health disorders (e.g, anxiety and depression).

There were two main factors making it difficult to employ mental health disorders screening: first, the lack of necessary knowledge or skills in non-psychiatric departments for mental-health and related personality disorder assessment [20]; second, the complexity of existing assessment tools. To begin with, most of existing psychological-assessment tools were designed for the use by psychiatric-departments in hospitals. Non-psychiatric doctors and nurses usually do not gain enough professional-training in psychiatric assessment and care, and therefore it is difficult for them to use psychological-assessment tools. Furthermore, the existing assessment tools (e.g., HADS, Self-rating Anxiety Scale, Self-rating Depression Scale) usually contain too many items and patients may lose their attention during the assessment and give inaccurate answers [21]. Last but not least, the items and standard scores of the tools were specially designed for used in Westerner society. However, the translated version of the tools is rarely used in China hospitals due to the cultural and language barriers [22].

To address these issues, in this study we made efforts to learn from the design of existing assessment tools, and also integrate the clinical experience used in China, and develop the two questionnaires according that showed effective and easy-to-use in Chinese patients. IPEQ-1 is for the screening of anxiety, depression and suicidality; and IPEQ-2 is for the screening of PPD, EUPD and HPD. This article reported on the development of the initial questionnaire items, selection of the final questionnaire items, and evaluation of two new IPEQs. The entire study flow diagram was shown in Fig. 1.

Fig. 1
figure 1

Study flow diagram

Materials and methods

Participants

To develop and evaluate the proposed two IPEQs, we recruited 1210 participants, who were inpatients from different department of First Hospital of Shanxi Medical University (3A-grade general hospital, Taiyuan, Shanxi, China). All inpatients met the following inclusion criteria: (1) age ≥ 18 years, (2) being ability to complete the questionnaires independently, and (3) willing to participate in this study. Inpatients were excluded if they were unable to complete the questionnaire. The 1210 participants were separated into three groups for item selection and evaluation, with the sample size of 160, 450 and 600, respectively. The sample size of first-stage was calculated according to the ratio of subjects to items (the maximum number of items in the scale), which was suggested as 5:1 in the pre-investigation stage [23]; and the sample size to item ratio was set as 10:1 for the second and third stage [24]. The recruitment of these samples was as follows.

The first group was used for item pre-selection, which contains 160 inpatients recruited from July 2018 to August 2018. Specifically, 40 were from the psychiatric department, and 120 from non-psychiatric departments, including respiratory, neurology, traditional Chinese medicine, oncology and general surgery. The questionnaires with incomplete or biased answers were recognized as invalid and excluded. Finally, there were 149 inpatients completed IPEQ-1 and were included in the statistical analysis. For IPEQ-2, the number was 130.

The second group was used for the item-revaluation based on the second-stage item selection, which contains 450 inpatients, recruited from non-psychiatric departments in August and September, 2018, from several departments respiratory, neurology, traditional Chinese medicine, oncology, general surgery, nephrology, hematology, geriatrics, cardiology, gastroenterology, endocrinology, rheumatology, plastic surgery, urology, orthopedics, otolaryngology, neurosurgery and gynecology. For IPEQ-1, there were 423 valid questionnaires collected, and the number for IPEQ-2 was 419.

The third sample group was used for the evaluation of the proposed IPEQs. This group was composed of 600 non-psychiatric inpatients recruited from September 2018 to October 2018. There were 547 completed questionnaires collected for IPEQ-1and 523 for IPEQ-2.

We present participants demographics of three sample groups in Table 1.

Table 1 Participants demographics of three sample groups

Item generation of questionnaires

From a literature review, we designed two questionnaires for the screening of violence-related six mental health disorders: IPEQ-1: screening anxiety, depression and suicidality; IPEQ-2: screening PPD, EUPD and HPD. (Fig. 2).

Fig. 2
figure 2

Study design and the network between violence and six mental health disorders. IPEQ-1: Questionnaire for screening mental health disorders related to anxiety, depression and suicidality; IPEQ-2: Questionnaire for screening mental health disorders related to paranoid personality disorder, emotionally unstable personality disorder and histrionic personality disorder

The item pool of IPEQs were generated according to International Statistical Classification of Diseases and Related Health Problems 10th Revision (ICD-10) [25], which is used to provide official criteria in clinical diagnosis. ICD-10 codes: anxiety: F41.1, depression: F32, suicidality: F32 (e); paranoid personality disorder: F60.0, emotionally unstable personality disorder: F60.3, histrionic personality disorder: F60.4. And we drafted the items by reference of existing self-completed screening scales. The main reference in this study included the Symptom Checklist-90 (SCL-90) [26], HADS [17], Self-rating Anxiety Scale (SAS) [27], Beck Anxiety Inventory (BAI) [28], Generalized Anxiety Disorder-7 (GAD-7) [29], Self-rating Depression Scale (SDS) [30], Beck Depression Inventory (BDI) [31], Patient Health Questionnaire-9 items (PHQ-9) [32], HEI [19], Beck Scale for Suicide Ideation-Chinese Version (BSI-CV) [33], and Personality Diagnostic Questionnaire-4+ (PDQ-4+) [34].

The candidate item of IPEQs were reviewed by a panel of 16 senior experts that consisted of professors, clinicians and researchers in the field of psychology and psychiatry from 5 cities in China (Shanxi, Beijing, Hebei, Zhejiang and Jiangsu). These experts had more than 10-year experiences in the field. They rated each item in the pool as “strongly approved,” “revise slightly,” “revise moderately,” “revise entirely or delete,” or “delete”; and put forward specific suggestions for revision.

Initially, 37 and 30 items were selected for IPEQ-1 and IPEQ-2, respectively (Table 2 and Table 3), which were evaluated following an approach described in the following sections.

Table 2 The results of PCC, EFA and IRT obtained from the 37 (initial version) and 18 (second version) items of IPEQ-1
Table 3 The results of PCC, EFA and IRT obtained from the 30 (initial version) and 21 (second version) items of IPEQ-2

Item selection of questionnaires

A two-stage item selection has been conducted with the first two aforementioned groups. The survey data were analyzed by employing both classical test theory (CTT) (including Pearson’s correlation coefficient (PCC) and exploratory factor analysis (EFA)) and item response theory (IRT).

In the first stage of item selection, the participants in the first sample group (n = 160) were asked to answer all 67 questions of the two initial questionnaires (37 questions in IPEQ-1, and 30 in IPEQ-2; see Table 2 and Table 3, respectively). We used PCC to test the strength of relation between the disorder types and their related question-items within each questionnaire. A question was recognized as valid if it demonstrated strong correlation with its corresponding disorder (PCC > 0.40 and P < 0.05), otherwise, it was excluded [35]. Then EFA was used to evaluate the overall significance of a question to its corresponding disorder and to the questionnaire as a whole. A principal-component method with varimax rotation was performed. The number of factors were decided according to the study design, the eigenvalue and the contribution ratio of cumulative variance. Items with low factor loading for its corresponding disorder (factor loading < 0.40) or cross-loading on two or more disorders were excluded [36].

In addition, IRT was used to test the relationship between subjects’ disorders and their responses to questions by the discrimination (a) of each item [37, 38]. For IPEQ-1, Samejima’s hierarchical response model was employed as the questionnaire was in the form of five-point Likert; for IPEQ-2, a two-parameter logistic model was employed as the questionnaire accepted binary answers. An item was excluded if it did not represent its corresponding disorder (a < 0.40) [39]. Based on the analysis of the results, 18 and 21 items were selected for IPEQ-1 and IPEQ-2, respectively. (Table 2 and Table 3).

For the second stage of item selection, the second inpatient group (n = 450) were required to answer the 18-item version of IPEQ-1 and 21-item version of IPEQ-2 from the first-stage item selection. The survey data were analyzed following the same statistical workflow (CTT and IRT) as used in the first stage item selection. IPEQs aimed to include the items that were not only efficient but also brief. The selection of an item was based on its consistency with primary symptoms of its corresponding disorder in ICD-10 and its statistical significance to the questionnaire. After the two-stage item selection, 12 questions were selected for both IPEQ-1 and IPEQ-2, which were used as the final version IPEQs (Supplementary Table 1 and Supplementary Table 2).

The total score of final version IPEQ-1 ranged among 0 ~ 48, with higher scores indicate greater severity of emotional distress and violence-tendency. The following cutoff scores have been suggested: ≤11 = normal, 12 ~ 16 = mild, 17 ~ 21 = moderate, ≥22 = severe. The total score of the final version IPEQ-2 range was 0 ~ 12, with higher scores indicate an increased likelihood of having a personality disorder and violence-tendency. The score range of each personality disorder was 0 ~ 4, the cutoff score of 3 or greater was adopted for screening of each personality disorder.

Evaluation of final questionnaires

Reliability

The third group (n = 600) were used to evaluate the reliability and validity of both questionnaires. The internal disorder-specificity were measured by using Cronbach’s alpha coefficient (threshold ≥0.50) [40]. The stability of the questionnaires was tested by a test-retest process: four days after the initial test, 62 patients out of the 600 inpatients were randomly selected to answer two IPEQs one more time and the replication of the answers were calculated and used to evaluate the stability of the questionnaires. The acceptable reliability was set as correlation coefficient ≥ 0.70 and P < 0.05 [41].

Criterion-related validity

The efficacy of the propose questionnaires was evaluated by measuring the consistency between the results of using the proposed IPEQs and the three commonly used existing psychometric assessment-tools: 238 out of 600 were randomly picked to completed self-evaluation using GAD-7 [29], PHQ-9 [32], and PDQ-4+ [34]. Survey data were collected and compared with that from the two proposed IPEQs [42].

The GAD-7 was a 7-item anxiety scale, that score ranged from 0 to 21. Cronbach’s alpha was 0.92 for the GAD-7. The PHQ-9 was a 9-item depression questionnaire, which the ninth item pertains to self-harm. The PHQ-9 score ranged from 0 to 27 and Cronbach’s alpha was reported by developers to be 0.89 and 0.86. The PDQ-4+ was a 107-item questionnaire, designed to assess the 12 personality disorders (e.g., paranoid personality disorder, borderline personality disorder of emotionally instable personality disorder, histrionic personality disorder, etc.). The cutoff score of 4 or 5 was for screening of personality disorders. Cronbach’s alpha was 0.51 ~ 0.74 for the PDQ-4 + .

Construct validity

The constructional validity was measured by evaluating the relation between each item and it corresponding disorder using confirmatory factor analysis (CFA), with criteria set as: 1), factor loading > 0.50 and P < 0.05; 2), root mean square error of approximation (RMSEA) < 0.08; 3), goodness-of-fit index (GFI) > 0.90; 4), adjusted goodness of fit index (AGFI) > 0.90; 5), normed fit index (NFI) > 0.90; 6), non-normed fit index (NNFI) > 0.90; and 7), comparative fit index (CFI) > 0.90 [43].

Feasibility analysis

The feasibility of the IPEQs was evaluated by examining completion rate and completion time. When the questionnaire can be understood and completed by participants easily, it is considered acceptable feasibility.

The statistical analysis was conducted by using SPSS software version 20.0, MULTILOG software version 7.03 and LISREL software version 8.70.

Results

Item selection results

The results of the statistical analysis for the two-stage item selection were provided in Table 2 and Table 3, for IPEQ-1 and IPEQ-2, respectively. According to the selection criteria, 19 items from IPEQ-1 (37-item) and 9 items from IPEQ-2 (30-item) were excluded in the first stage of item selection, then items 6 and 9 were excluded for IPEQ-1 and IPEQ-2, respectively (Supplementary Fig. 1 and Supplementary Fig. 2).

Some items were to be excluded based on statistical results. For example, item 29 in IPEQ-1 was excluded, because the correlation coefficient was < 0.40. Item 12 in IPEQ-1 was excluded, because it was showed weak correlation with its corresponding disorder in PCC. The correlation coefficient between item 12 and suicidality was 0.71, which was higher than correlation with anxiety (coefficient = 0.65). Item 18 in IPEQ-2 was excluded, because it was also demonstrated weak correlation with its corresponding disorder in PCC. The correlation coefficient between item 18 and HPD was 0.78, which was higher than correlation with EUPD (coefficient = 0.76). Item 23, item 30 and item 32 in IPEQ-1 and item 1, item 2, item 13 and item 16 in IPEQ-2 were excluded, because there was no correlation between these items and their corresponding disorder in EFA.

Some items were to be excluded based on its consistency with primary symptoms of its corresponding disorder in ICD-10 in order to achieve two brief questionnaires. For example, item 5 was excluded because it was about physical symptoms. Item 15 was excluded because it had a lower factor loading compared to other items in depression in EFA, and its language expression was close to the item of suicidality.

Eventually, for each of two IPEQs, 12 items passed the selection scrutiny and were used as final versions of the questionnaires. As seen in Supplementary Table 1 and Supplementary Table 2, within IPEQ-1, items 1 to 4 were related to anxiety, 5 to 8 to depression and 9 to12 to suicidality; within IPEQ-2, items 1 to 4 were related to PPD, 5 to 8 to EUPD and 9 to 12 to HPD.

Evaluation results of final IPEQs

Reliability

Cronbach’s alpha coefficient analysis indicated that items within both questionnaires demonstrated sufficient internal disorder-specificity (> 0.50). Test-retest reliability analysis indicated that they showed strong stability (correlation coefficient > 0.70 and P < 0.01). (Table 4).

Table 4 Cronbach’s alpha coefficient and correlation coefficient of test-retest reliability obtained from two IPEQs

Criterion-related validity

The correlation coefficients between IPEQs and GAD-7, PHQ-9, PDQ-4+ were significant (P < 0.01), indicated acceptable criterion-related validity. (Table 5).

Table 5 Correlation coefficients of criterion-related validity obtained between IPEQs and GAD-7, PHQ-9, PDQ-4+

Construct validity

As seen in Table 6, the CFA analysis indicated good construct validity in IPEQ-1. Factor loadings for each of the 12 items were above 0.50 (P < 0.05). The values of GFI, NFI, NNFI and CFI were met the acceptance criteria.

Table 6 CFA obtained from the 12 items of IPEQ-1

As seen in Table 7, the CFA analysis indicated good construct validity in IPEQ-2. Factor loadings for each of the 12 items were above 0.50 (P < 0.05), except for items 1, 9 and 12. However, these three items were recommended for retention by the results of CTT and IRT analysis. The values of RMSEA, GFI, AGFI, NFI, NNFI and CFI were met the acceptance criteria.

Table 7 CFA obtained from the 12 items of IPEQ-2

Feasibility analysis

The completion rate for IPEQ-1 and IPEQ-2 were 91.17% (547/600) and 87.17% (523/600), respectively. The average completion time were 3.19 ± 3.09 min and 3.71 ± 3.39 min, respectively.

Discussion

China legislated against violence to medical workers [44]. Unfortunately, increased violence events are still occurring in the general hospital in China, and doctors and nurses are usually caught un-prepared [45]. 2015–2016 Chinese physician-patient relationship blue book showed that in analyzing the characteristic of perpetrators showing in violence to medical workers, impulsiveness took 65.05%, especially in injury event and killing event, the proportions reached 78.60 and 84.21% respectively [46]. Some patients with certain mental health disorders are more likely to occur impulsiveness and violence [4, 5]. Therefore, by using clinical symptoms or personality traits associated with violence-related mental health disorders, patients with violent tendencies would be screened. However, the recognition rate of mental health disorders was very low. Existing psychiatric assessment tools are too complicated for non-psychiatric department to use. Therefore, it is of urgent need to develop an easy-to-use and yet effective screening tool for certain mental health disorders screening. In this study, we developed and evaluated two IPEQs, which were designed for the screening of six mental health disorders. We hypothesize that a patient (not necessarily with mental health disorders) with symptoms of these mental health disorders would present a higher risk of becoming violent.

Compared with three commonly used tools for psychological-assessment tools, which are usually used by professionals in psychiatric departments, our IPEQs demonstrated strong consistence(P < 0.01). These three tools are rarely directly used by non-psychiatric departments for multiple reasons. First, these tools were specifically designed for the use by psychiatric department and non-psychiatric doctors and nurses usually lack professional training for the effective use of those tools. Second, these tools usually contain too many question items and inpatients usually are short of patience to answer [47]. For example, both items in GAD-7 and PHQ-9 were combined to reach a total of 16 questions to cover only anxiety and depression. In PDQ-4+, 27 items were employed to cover three personality disorders. Thus, an inpatient has to answer 43 questions in total to test 6 mental health disorders. In comparison, each of our IPEQs contains only 12 items and showed similar efficacy in the screening of 6 mental health disorders. Doctors and nurses in the non-psychiatric department could easy to use our proposed IPEQs.

These two IPEQs were developed based on the ICD-10, experts’ opinions and relevant scale reviews. Three statistical methods (PCC, EFA and IRT) were considered in the two-stage item selection. Moreover, our statistical analysis showed that the items within each IPEQ showed high internal consistency (The Cronbach’ s alpha coefficients were 0.91 and 0.78 for IPEQ-1 and IPEQ-2, respectively) and strong stability (Test-retest replication ratios were 0.95 and 0.87 for IPEQ-1 and IPEQ-2, respectively). The final IPEQs also well-matched with the original IPEQs in terms of structure (Factor loadings were > 0.50 and P < 0.05). These results showed, as simple as our IPEQs (12 questions only), it is effective in screening the proposed six mental health disorders, which were implicated to be associated with violence tendency [40, 41, 43].

In our study, normal subjects were suggested P25 (cut-off score of 12) for IPEQ-1 cut-off by the percentile of the questionnaire total score, according to literature reports [48, 49]. ICD-10 showed that for diagnosing most of the subtypes’ personality disorders, clear evidence is usually required of the presence of at least three of the traits or behaviors given in the clinical description. So, IPEQ-2 cut-off score was set as 3 in our study. Our investigation research showed that the incidence of emotional distress among non-psychiatric inpatients was 23.81%. The incidence of paranoid personality disorder was 10.89%, emotionally unstable personality disorder was 11.87% and histrionic personality disorder was 9.05%.

Medical workers have traditionally assessed violence risk on an individual basis when it has happened. It is important to screen for potentially violent inpatients according to the underlying psychiatric motive, such as impulsiveness, through relatively accurate predicable tool. Previous studies showed that there were positive association between six mental disorders (anxiety, depression, suicidality, PPD, EUPD and HPD) and violence [12,13,14]. Many inpatients with those mental disorders can exhibit impulsiveness. The characteristics of this impulsive violence involves a reactive or emotional response with a loss of behavioral control, coupled with a failure to consider consequences [50]. This supported the rationality of this study. Therefore, by using our proposed IPEQs, medical workers in the non-psychiatric department could collect the necessary information to assess six mental health disorders of an inpatient, and thereby better prepare themselves to provide individual medical care accordingly. The communication barriers between patients and medical workers will be reduced, and a friendly patient-doctor relationship will be rebuilt up.

There are several limitations that we will addressed in future studies. First, all participants were from only one general hospital in Taiyuan, Shanxi Province. The proposed IPEQs should be validated in non-psychiatric departments and data should be collected for further evaluation of the tool. Second, we selected six mental disorders that were implicated to have strong association with violence. Adding other violence-related mental health disorders (e.g. dissocial personality disorder) for testing may improve the accuracy of the IPEQs. Third, we did not use golden standard in the cut-off study. A main future research would be to recruit participants with psychiatrists’ diagnoses, which will be suggested as golden standard to calculate sensitivity and specificity. In that way, we can improve the cut-off setting for IPEQs. Forth, our study only used three statistical methods to select items. Other statistical methods (e.g. Velicer’s minimum average partial (MAP) Test combined with the parallel analysis in EFA [51]) may help in the item evaluation and selection.

Conclusion

We developed and evaluated two IPEQs that demonstrated satisfactory reliability and validity. By using clinical symptoms or personality traits associated with violence-related mental health disorders, patients with violent tendencies would be screened. However, further studies are needed in different participants to revise and improve IPEQs. This initial study guaranteed further validation of the two proposed IPEQs in non-psychiatric departments of the general hospitals in China.