Background

There is general agreement that health literacy goes beyond the ability to read, write and understand the meanings of words and numbers in health care settings [1]. Health literacy comprises various competencies and depends on individual and community factors. These factors include different issues ranging from cultural issues to health care, public health and other relevant systems and settings in which people obtain and use health information [2].

Health literacy is a complicated concept [3] that is a global issue [4] and its existence is a way to achieve good health. One common definition of health literacy is offered by the World Health Organization (WHO) and reads as follows: ‘the cognitive and social skills which determine the motivation and ability of individuals to gain access to, understand and use information in ways which promote and maintain good health’ [5]. It is believed that people with inadequate health literacy might suffer from poor health, have little information about disease prevention, participate less in clinical preventive services against chronic illnesses, and have trouble comprehending health instructions or interpreting them correctly [6, 7]. Furthermore, limited health literacy is associated with poor adherence to medical treatment and inappropriate communication with health professionals, more hospitalization, increased medical and health expenditure, higher mortality and morbidity and poorer self-care [8]. Therefore, the measurement of health literacy is an essential component of any effort to prevent consequences of limited health literacy and health care discrepancies [9].

The number of instruments developed to address and evaluate health literacy is growing rapidly. The most widely and frequently used instruments are the Rapid Estimate of Adult Literacy in Medicine (REALM) [10], the Test of Functional Health Literacy in Adults (TOFHLA) [11], and the Newest Vital Sign (NVS) [12]. These instruments have been criticized for several reasons, including for assessing only a few domains of health literacy, not being suitable for use in intervention studies or not having been developed with a health promotion perspective [13]. Furthermore, most of these scales were developed and used in clinical settings [14]. Haun et al. in a comprehensive review of the literature from 1999 through 2013 identified 51 instruments and reported that most instruments represent a narrow set of conceptual dimensions with limited modes of administration and missing information on key psychometric properties. They recommended that as researchers develop new measures, a full range of conceptual dimensions of health literacy and better validation studies should be included to establish sound evidence for measuring health literacy [15]. As such, instruments that were developed recently are improved greatly [16, 17]. More recently The WHO Regional Office for Europe provided a comprehensive review of older and the recent instruments (31 instruments) and concluded that at the policy level, frameworks and indicators that cover various domains are needed to enable consistent and comparable population monitoring and evaluations [18].

However, it should be noted that there are three types of health literacy instruments: general health literacy instruments, condition (disease or content) specific measures, and instruments that are developed for specific populations [15]. The WHO Regional Office for Europe also followed a very similar categorization for health literacy instruments [18]. Of these, most investigators usually use the general measures that are applicable to different conditions and populations. Thus, the focus of this study is on developing a general instrument for measuring health literacy. In doing so we provided a short list of existing measures that includes the advantages and disadvantages of existing well-known general health literacy instruments (Table 1; for a more comprehensive list see [15, 18]).

Table 1 A short list of some selected instruments for measuring general health literacy

The aim of this study was to develop a rigorous and valid instrument for measuring health literacy for adults, yet easy to use, and multidimensional. Although the current study is not unique, perhaps could contribute to the existing knowledge on the topic as the instrument reported here was developed in a non-western country.

Methods

The conceptual framework

The core conceptual model presented by Ratzan and Parker inspired the current study’s conceptual framework of health literacy. The concept comprised the ability to obtain health information (access); understand health information (understanding); ability to assess and evaluate the health information (appraisal); and use the information to make a decision (apply or use health-related information) [27].

Item generation

We used a similar methodology that previously was described in details elsewhere [28]. In brief, first a review of existing health literacy measures was conducted. The review was performed in 2012 and has since been updated. Consequently a panel of specialists in public health, health education/health promotion, health care management, mental health, oral health, maternal and child health, and community medicine was held. The selection of experts was based on their research interest, previous works on health literacy and managerial responsibility in the health care system. Panelists’ characteristics are presented in Table 2. Following ten sessions, of at least 3 h, an item pool of 400 statements was generated. Items were generated using brainstorming in each session and a member of the panel was responsible for listing the items. The panelists were limited to suggesting items related to the potential subscales (reading, access, understanding, appraisal, and behavioral intention). Then items were checked for duplicates and excluded when they were inconsistent with the intended subscales. Accordingly, the panel reduced the number of items to 66. Upon initial agreement on selected items a 5-point Likert scale (never =1, rarely =2, sometimes = 3, usually = 4, always = 5) was used to indicate the lowest to highest level of the respondents’ abilities. In the next step, content validity and face validity of the questionnaire was evaluated.

Table 2 The characteristics of panelists

Content validity

A group of 15 experts in public health was invited to assess the questionnaire. First, we asked the experts to check the items for relevance, clarity and simplicity on a 4-point Likert scale that comprised the options very relevant, relevant, relatively relevant, and not relevant. The experts were then asked to indicate necessity of each item by rating a 3-point Likert scale of essential, useful but not essential, and not essential. Next, experts were asked to comment on wording, and grammar. As a result, 19 items were removed and a provisional version of the questionnaire with 47 items was provided.

Face validity

To determine the face validity of the questionnaire, 10 individuals aged 18–65 years were selected using a purposive sampling to verify the clarity, relevance and difficulty of each item. None of items were removed or changed at this stage and the Health Literacy Instrument for Adults (HELIA) was prepared for psychometric analysis.

Main study

  1. i.

    Design and data collection: Psychometric properties of the HELIA were examined by conducting a cross sectional study. In this regard, a random sample of adults aged 18 to 65 and living in Tehran was recruited. The sample size was calculated as the number of items in the questionnaire multiplied by 5 [29]. The participants completed the HELIA at their homes under trained interviewers’ supervision. Demographic data including the participants’ age, gender, education, job and source of health information, were also recorded.

  2. ii.

    Statistical analysis: The structural validity and reliability of the HELIA were examined using Exploratory Factor Analysis (EFA) with varimax rotation and internal consistency respectively. The Kaiser-Meyer-Olkin (KMO) and Bartlett’s Test of Sphericity were used to determine the appropriateness of the sample for factor analysis [30, 31]. Eigenvalues above 1 and factor loadings greater than or equal to 0.50 were considered appropriate to verify the number of possible underlying factors. Furthermore, confirmatory factor analysis was performed while a five-factor model (access, reading, understanding, appraisal and behavioral intention) was specified. Several goodness-of-fit indicators including: chi-square ratio (χ2/df), goodness of fit index (GFI), the root mean square error of approximation (RMSEA), standardized root mean square residual (SRMR), normed fit index (NFI) and comparative fit index (CFI) were selected for reporting the analysis outcomes. The following thresholds were considered to verify the model’s goodness of fit: χ2/df < 2.0, CFI, NFI, NNFI, and GFI ≥ 0.90–0.95, SRMR ≤0.05–0.08, and RMSEA ≤0.05–0.06 [32,33,34,35]. Finally, the Cronbach’s alpha coefficient (acceptable level of 0.7) for each dimension and the whole scale was calculated to examine internal consistency [36]. Additionally the Intraclass Correlation Coefficient (ICC) was calculated by performing test-retest analysis to establish instrument stability. For this purpose, a convenience sample of 30 individuals aged 18 to 65 (mean age 34.9 ± 10.1, 18 female and 12 male) attending to a health center in Tehran was recruited. They have completed the questionnaire twice with a 1-week interval. The correlations of 0.75 or higher were considered satisfactory [37]. All statistical analyses were performed using SPSS version 17.0. The confirmatory factor analysis was performed using LISREL 8.8 software.

Results

Demographic characteristics

In all 336 adults aged 18 to 65 took part in the study and completed the questionnaire. Of these, 13 individuals were excluded due to incomplete response to the questionnaire. Thus, the data obtained from 323 participants were analyzed. The mean age of participants was 37.89 ± 13.31 years, 54% were females, and 39% were housewives. The characteristics of participants are shown in Table 3.

Table 3 Demographic characteristics of the study participants (n = 323)

Factor structure

1. Exploratory factor analysis: The adequacy of sample size was confirmed by KMO and Bartlett’s Test of Sphericity (KMO = 0.919 and χ2 = 4101.78, p <  0.0001). The initial analysis indicated a 9-factor solution with eigenvalues greater than 1 that jointly accounted for 58.9% of the variance observed. After careful assessment, four factors were excluded for the following reasons:

a. There was a factor with two items more relevant to behavioral intension and thus the factor was excluded and the items conjugated to factor 1 (behavioral intension subscale).

b. Item loading on three factors did not satisfy the expected threshold. Examples of some low loading items read as follows: I can find health information about physical activity such as walking and exercise; I can understand health information on diet and obesity; I can fill-in medical forms when needed.

Thus after deletion of the low loading items (with one exception), eventually 33 items were loaded on 5 factors: access to information (4 items), reading (6 items), understanding (7 items), appraisal (4 items) and behavioral intention (12 items), that jointly accounted for 52.9% of the variance observed. The detailed results are shown in Table 4 [Additional file 1].

Table 4 The results obtained from exploratory factor analysis for the HELIA

2. Confirmatory factor analysis: The result obtained from the confirmatory factor analysis is depicted in Fig. 1. The results provided a good fit to the data. The fit indexes were as follows: χ2 = 778.33; χ2/df = 1.60; SRMR = 0.049; RMSEA (90% CI) = 0.043 (0.038–0.049); CFI = 0.98; NFI = 0.95; and NNFI = 0.98; GFI = 0.87 (Table 5). The correlations between latent factors are also presented in Table 6.

Fig. 1
figure 1

The result obtained from confirmatory factor analysis for the HLIA

Table 5 Fit indices and their acceptable thresholds in confirmatory factor analysis
Table 6 Correlations between latent factors obtained form confirmatory factor analysis

Reliability

Reliability was assessed by estimating the Cronbach’s alpha coefficient. The results showed that all factors had acceptable internal consistency. The Cronbach’s alpha coefficient for each subscale and the questionnaire as a whole ranged from 0.72 to 0.89 (Table 7). Further analysis indicated that the alpha coefficient could not be improved if any further items deleted. The stability of the HELIA and its sub-scales as measured by the Intraclass Correlation Coefficient (ICC) was also found to be satisfactory. All ICCs were above acceptable threshold (Table 7).

Table 7 Cronbach’s α coefficient and ICC for the HELIA and its subscales

Health literacy

The mean health literacy score for the study sample was 76.3 (SD = 15.1). Overall 78.6% of the respondents showed adequate health literacy while the remaing 21.4% had limited health literacy. There was no significant difference in health literacy among male and female respondents (P = 0.33), although women scored higher compared to male respondents (77.5 vs. 74.9 respectively). However, health literacy score was sginficatly different among people who differed in age, education, and employment status as expected. These are presented in Table 8. The scoring manual for the HELIA is supplemented (Additional file 2).

Table 8 Health literacy based on demographic variables

Discussion

The findings showed that the HELIA is a valid instrument for measuring health literacy among adult populations and could be considered as a useful measure along with other recently developed instruments [16, 17, 38, 39]. However, it is important to note that although the methods we used were scientific, they were traditional and not as strong as the methods of two recently developed measures [16, 17]. For instance, for development of the Health Literacy Questionnaire (HLQ) Osborne et al. followed a validity-driven method that involved systematic grounded approaches in which existing theories were not considered until later in the development process of the questionnaire [16]. In fact, they first focused on individuals’ and professionals’ lived experiences and then used the definition of health literacy proposed by the World Health Organization.

The HELIA has a multidimensional structure that can be easily used for public health purposes. Although not identical to the HELIA, the European Health Literacy Survey is also a relatively comprehensive instrument. It has two sections, a core health literacy section and a section on determinants and outcomes associated with health literacy. Indeed in the European Health Literacy Survey model ‘health literacy refers to an evolving set of competencies that do not remain static over time and can be regarded as a means to an end rather than a fixed state, to which a person should aspire’ [17]. However, the very well known and popular instruments cover only a few dimensions of health literacy and none assess the broad range of abilities such as access to information, reading, understanding, appraisal, and decision making (behavioral intention). For example, the Test of Functional Health Literacy in Adults (TOFHLA) assesses only reading, comprehension and numeracy skills and it seems that completion of this test would be difficult for those who are not well educated to complete. The Rapid Estimate of Adult Literacy in Medicine (REALM), another well-known scale for measuring health literacy, is also examine only reading and recognition of medical words. Although it is a brief and easy to use instrument, it comprises just one dimension.

The HELIA contains five subscales (dimensions), which we believe is an important feature of this instrument covering the basic concept and constructs that build the meaning of health literacy. Additionally the items that included in the HELIA are relevant to public health in general and to healthy life styles in particular. In fact, underlying concepts included in the instrument cover the three most important global public health topics, which are issues related to cardiovascular diseases (nutrition items), cancers and accidents. These topics were arranged in a way that people with both limited literacy and a high level of education could easily respond to items. We did not want to test people’s knowledge but rather were interested in examining skills relevant to health literacy. Indeed, we believe a range of people with education ranging from primary to higher could relate the items to themselves and provide honest responses to the questionnaire. However, one should notice that the current version of the HELIA has some limitations. For instance, because numeracy skill is an important issue in a health care context, it is necessary to add a few more items to the questionnaire.

Nevertheless all these efforts should be greeted because relying on one measure might not properly help policy and practice. It is argued that since social, environmental and cultural factors influence health literacy in different populations, the need for integration of definitions and models of health literacy are essential [40]. In this respect, it seems that items addressing numerical literacy and media literacy might also be necessary in new versions of existing health literacy instruments including the HELIA. Furthermore, to measure how valid the invented instrument is, it is necessary to compare the results of measurements with other recognized instruments and to show that at least some scales show comparable results (concurrent or criterion validity). The current study did not include such analysis and future studies should therefore employ a previously validated instrument and report on concurrent or criterion validity. One more limitation is the fact that the instrument was tested in one location using a cross-sectional approach and stability (test-retest analysis) was examined in a separate sample. Finally, it is important to remember that no external assessment was applied for the HELIA to objectively assess different skills. For instance when the respondent says that she or he is always understands the content, all the answers the respondents give are self-estimated skills and not objective ones. Thus, some measures should be integrated into the questionnaire to assess actual skills.

Conclusion

The results showed that the Health Literacy Instrument for Adults (HELIA) is a valid and reliable measure for assessing health literacy among adults.