An investigation of the construct validity of the ICECAP-A capability measure

Purpose: To investigate the construct validity of the ICECAP-A capability wellbeing measure.
Methods: A face-to-face interview-administered survey was conducted with 418 members of the UK general population, randomly sampled from the Postcode Address File. Pre-specified hypotheses were developed about the expected associations between individuals’ ICECAP-A responses and their socio-economic circumstances, health and freedom. The hypotheses were investigated using statistical tests of association.
Results: The ICECAP-A responses and scores reflected differences across different health and socioeconomic groups as anticipated, but did not distinguish individuals by the level of local deprivation. Mean ICECAP-A scores reflected individuals’ perceived freedom slightly more closely than did measures of health and happiness.
Conclusion: This study suggests that the ICECAP-A measure can identify expected differences in capability wellbeing in a general population sample. Further work could establish whether self-reported capabilities exhibit desirable validity and acceptability in sub-groups of the population such as patients, social care recipients and informal carers.


Introduction
The capability approach advocates assessing wellbeing in terms of individuals' 'functionings' and 'capabilities'. Functionings are the things that an individual 'is' or 'does' and can be broadly defined, potentially ranging from elementary aspects of their life such as 'being adequately nourished' and 'having good health' to more complex aspects such as 'achieving self-respect' or 'being socially integrated' [1]. Capabilities represent an individual's freedom to carry out these functionings, whether or not the individual chooses to do so. Interest in using the capability approach in the health field has grown in recent years, with authors proposing it as a framework for conceptualising health [2,3] and disability [4,5], measuring intervention outcomes [6,7] and assisting in decisions about healthcare resource allocation [8,9].
Despite much interest in the capability approach, few practical measures of capability have been developed. Indeed, some authors question the degree to which capability measurement is possible with such a rich array of potential functionings and disagreement on the functionings that constitute a 'good life' [10]. Nevertheless, recent work has been conducted to develop measures of capability as a way of operationalising the capability approach. One stream of work has sought to develop capability 'indicators' using existing survey questions [11], relating to Martha Nussbaum's list of central human capabilities [12]. Another approach to operationalising the capability approach is in using interviews with the public to generate a set of core capabilities, which can then be assessed using short, self-completion questionnaires, such as the ICECAP measures [13,14].
An important challenge in the development of all measures is the assessment of validity. If it can be demonstrated that measures reflect what they purport to, then greater confidence can be placed in results generated. Capability is a particularly challenging trait for which to develop valid measures. First, the scope of capability measures is potentially quite broad. In principle, one could demand that any capability measure needs to demonstrate responsiveness to a huge array of factors before it can be considered valid. Second, capability measurement implies the quantification of something that is unobservable [15]: the freedom or opportunities available to an individual. It requires an ex ante assessment, focusing on what an individual has the freedom/potential to do, rather than an ex post assessment of what they do in fact do.
Some validation work has been conducted with the ICECAP-O capability measure, developed for older populations [16,17], and this has focused on examining factors anticipated to be associated with functioning per se rather than freedom to function. The aim in the study described here was to investigate the construct validity of a new instrument, recently developed to measure capability wellbeing for the general adult population: the ICECAP-A (ICEpop CAPability measure for Adults) [13]. The ICECAP-A (reproduced in 'Appendix') has been designed to capture capability to function across five attributes of life: 'stability', 'attachment', 'autonomy', 'achievement' and 'enjoyment'. Individuals are asked to select the level of capability (from four options) that corresponds with their situation across each of the five attributes. This paper reports a series of investigations of the construct validity of the ICECAP-A descriptive system and index scores, focussing on associations between reported capability and individuals' socio-economic circumstances and health status. The paper also reports an investigation of the more challenging issue of whether the ICECAP-A measure appears to reflect individuals' perceptions of their freedom in life.

Methods
The data for this study come from a face-to-face interview-administered survey, conducted by the National Centre for Social Research (NatCen) in the UK. The survey comprised the ICECAP-A measure and a range of contextual questions. The contextual questions covered: (1) sociodemographics, (2) measures of material wellbeing (income, home ownership), (3) major life events (bereavement, relationship break-up, etc.) in the last 6 months, (4) happiness and religiosity, (5) health, (6) use of healthcare, (7) perceptions of freedom.
Respondents were randomly selected for the survey from the Postcode Address File (PAF) in Great Britain using a two-stage stratified random sample design (the PAF was stratified on the basis of geographic area and socioeconomic deprivation). The sample of 802 addresses was selected with the aim of obtaining at least 400 responses for the valuation survey, based on prior experiences of response rates from NatCen surveys. At each selected address, one adult was randomly selected to take part in the survey. Each address was sent a postal invitation to participate, which the designated interviewer followed up in person. Up to nine attempts were made to make contact and confirm whether the selected individual wished to participate. The survey was administered by NatCen interviewers using computer-assisted personal interviewing (CAPI) software. Interviewers received specific training on the content and purpose of the survey and procedures to use when making contact, gaining consent and interviewing participants. The study protocol was approved by the University of Birmingham's Life and Health Sciences Ethical Review Committee (ERN_08-93).
Good practice in validating measurement tools demands that hypotheses are developed in advance regarding the (expected) relationship between the trait (capability) and relevant contextual factors [18]. In this study, a network of constructs identifying factors likely to be associated with each of the five capabilities was developed. These hypotheses drew partly on the qualitative research to develop the ICECAP-A measure [13]. This qualitative work set out to go beyond identifying influences on wellbeing, such as work or income, to examine why these factors were important in individuals' lives. As a result, the qualitative data provide a rich source of information to identify hypotheses about the anticipated relationship between the influences on wellbeing that individuals tended to discuss in interview (for instance, work) and the ultimate capability that this helped to facilitate (such as achievement or stability). Hypotheses also drew on research relating to the validity of related quality of life measures [16,17,19,20] and on the general wellbeing literature [21].
Associations between the five capability responses and the background variables were investigated using chi-squared tests for categorical variables and one-way analysis of variance for continuous variables. For categorical variables, where a number of cell counts were < 5, exact tests were used when computationally feasible; where they were not possible, variables were re-coded to increase cell counts. Alongside the direction of the relationship, the statistical strength of the evidence for each relationship was noted using significance levels of 5 % and 1 %. All analyses were undertaken using Stata version 10.
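As an illustration of the categorical test described above, the following is a minimal sketch of a Pearson chi-squared statistic for a contingency table, written in plain Python rather than the Stata code used in the study. The function name `chi_squared`, the example counts and the choice to flag observed (rather than expected) cell counts below 5 are assumptions made for illustration only; a p-value would be obtained from a statistics package.

```python
from typing import List, Tuple


def chi_squared(table: List[List[int]]) -> Tuple[float, int, bool]:
    """Return (statistic, degrees of freedom, any_small_cell) for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    # Assumption: flag any observed count below 5 as a cue to use an exact
    # test or to re-code the variable into fewer categories.
    any_small_cell = any(count < 5 for row in table for count in row)
    return stat, dof, any_small_cell


# Hypothetical counts: rows = employed / not employed,
# columns = higher / lower capability on one attribute.
example = [[30, 10], [20, 40]]
stat, dof, small = chi_squared(example)
```

When the flag is raised, the table would be passed to an exact test or its categories merged, mirroring the fallback described in the text.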
Investigation 1: Do measured capabilities reflect socio-economic circumstances?
Drawing on Sen's conceptual framework for the creation of capabilities, it is clear that capability can be limited by poor socio-economic circumstances and enhanced by good circumstances [22]. For investigation 1, a table was drawn up showing the expected association between various indicators of socio-economic status and response to each of the ICECAP-A capabilities. All members of the research team contributed to this table. The section below details the conceptual capability in bold, the lay terms in the measurement instrument (also reproduced in 'Appendix') in italics and the anticipated associations with this capability.
• A capability for stability (able to feel settled and secure) relates to the absence of stress and dramatic changes in life and an ability to assign meaning to life. It was therefore anticipated that recent major negative life events (e.g. bereavement, relationship break-up, financial problems and serious ill health) were likely to be associated with reduced capability in this area. It was also predicted that being employed, being in a permanent relationship, having a good income and living in a low crime area were all likely to be associated with higher capability in this domain.
• A capability for attachment (able to have love, friendship and support) relates to the ability to interact with others and have high quality relationships. It was therefore anticipated that capability would be lower on this domain for individuals who reported recent relationship problems, separation or bereavement and higher for individuals who had a partner.
• A capability for autonomy (able to be independent) relates to being able to look after oneself, make one's own decisions, and secure privacy and identity. It was anticipated that autonomy would be lower for those who were in relationships but higher for those with more education, those who were employed, those with higher income and home owners.
• A capability for achievement (able to achieve and progress) reflects individuals' abilities to move forward in their life and attain their goals. This attribute also reflects perceptions of satisfaction and recognition. It was therefore anticipated that capability for achievement would be higher for individuals in employment, with more education, with higher incomes (and no recent financial worries) and those who had no recent break-up with a partner.
• A capability for enjoyment (able to have enjoyment and pleasure) reflects opportunities for the 'quiet pleasures' in life, such as enjoying nature, as well as things that are perceived to be 'fun' or 'exciting'. As such, it was anticipated that the capability for enjoyment would be lower for individuals who reported negative recent life events, suffered unemployment or lived in an area with high crime rates, and a capability for enjoyment would be higher amongst individuals in relationships, those with higher incomes and those who reported high happiness levels.
Investigation 2: Do measured capabilities reflect variations in health status?
The ICECAP capability measures were developed with an initial aim of measuring the effectiveness of health and social care interventions. The degree to which variations in health and health care usage are reflected in individuals' capabilities is therefore of crucial interest and importance. The capability literature is fairly clear that poor health and disability play an important role in limiting human capability [23]. Based on the qualitative work to develop the measurement tool and evidence from the ICECAP-O instrument [14,17,24,25], it was anticipated that impairments to physical health would reduce capability for stability, autonomy, achievement and enjoyment, while impairments in mental health would limit all five capabilities (including attachment). It was also anticipated that proxy measures of poor physical health, such as the presence of a long-standing illness, receipt of hospital care and unpaid (informal) care, would be associated with impairments across stability, autonomy, achievement and enjoyment. Analysis to investigate these hypotheses proceeded in the same way as in 'Investigation 1'.

Investigation 3: Do measured capabilities reflect individual perceptions of freedom?
The investigations outlined in 'Investigation 1' and 'Investigation 2' would be relevant whether a measurement tool focuses on functioning or capability. Since this study focuses on the measurement of capability, we investigated the degree to which responses to the measure reflected an individual's perceptions of their freedom in life. This was achieved by examining the association between responses to the capability measure and the three statements below about individuals' perceived freedom.
• Life is full of opportunities (often/sometimes/not often/never)
• What happens to me is out of my control (often/sometimes/not often/never)
• I can do the things in life I want to do (often/sometimes/not often/never)
These questions were not intended to represent a 'gold standard' assessment of freedom. However, at the very minimum, it was anticipated that individuals who indicated higher levels of freedom would indicate higher levels of capability. This was investigated by calculating the magnitude and strength of evidence of the associations between responses to the freedom questions and an individual's ICECAP-A index score [26]. Since one may expect individuals reporting greater freedom and control to report higher levels of wellbeing (however measured), a more exacting test of a capability measure is whether it correlates more closely with measures of freedom than alternative outcome measures that focus on functioning. To examine this issue, the pair-wise (Pearson) correlation coefficients between the freedom questions and the ICECAP-A measure were compared to those between the freedom questions and two prominent techniques for measuring outcomes in health economics: the EQ-5D health measure [27] and a global subjective 'happiness' question. The EQ-5D is a self-report measure of generic health status, focussing on five attributes of health: mobility, self-care, usual activities, pain/discomfort and anxiety/depression. The happiness question required respondents to select one of three statements that referred to their situation ('Taking all things together, how would you say you are these days - would you say you're very happy, fairly happy, or not too happy these days?'). It was hypothesised that the correlations between the freedom variables and ICECAP-A (capability) measure would be stronger than those between the freedom variables and the EQ-5D and happiness measures (which could both be conceived more as measures of functioning [22]).
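The comparison of pair-wise correlations described above can be sketched as follows. This is an illustrative example only: the `pearson` helper, the numeric coding of the freedom item (1 = never to 4 = often) and all data values are hypothetical and are not taken from the survey.

```python
from math import sqrt
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical responses for eight individuals.
freedom = [1, 2, 2, 3, 3, 4, 4, 4]  # assumed coding: 1 = never ... 4 = often
icecap_index = [0.40, 0.55, 0.60, 0.70, 0.80, 0.85, 0.90, 0.95]
eq5d_index = [0.70, 0.50, 0.80, 0.60, 0.90, 0.75, 1.00, 0.85]

r_capability = pearson(freedom, icecap_index)
r_health = pearson(freedom, eq5d_index)

# The hypothesis is supported in this toy data if the capability measure
# correlates more strongly with the freedom item than the health measure does.
supports_hypothesis = abs(r_capability) > abs(r_health)
```

In the study the same comparison was made for each of the three freedom questions, against both the EQ-5D and the happiness question.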
Do different capabilities measure different things?
Since the capability approach generally (and the ICECAP-A instrument specifically) is multidimensional, we investigated whether different capabilities were tapping into different constructs. We hypothesised that certain capabilities should be more highly associated with specific characteristics than other capabilities. Therefore, while both autonomy and achievement might correlate with both self-care problems (on the EQ-5D) and educational level, we hypothesised that the stronger relationships would be between self-care and autonomy and between education and achievement. Drawing on the key influences on each attribute noted in the qualitative research (and based on variables that were available), one background variable was selected for each of the five ICECAP-A capabilities and hypothesised to correlate more highly with a selected capability than the other four. These hypotheses were examined through calculating correlation coefficients between the background variables and the capabilities.
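The logic of this check can be sketched in a few lines: for each background variable, find the capability with the largest absolute correlation and compare it with the hypothesised one. The two pairings below (self-care with autonomy, education with achievement) are those named in the text; the helper name `strongest_capability` and all correlation values are hypothetical.

```python
from typing import Dict

CAPABILITIES = ["stability", "attachment", "autonomy", "achievement", "enjoyment"]


def strongest_capability(correlations: Dict[str, float]) -> str:
    """Return the capability whose correlation has the largest absolute value."""
    return max(correlations, key=lambda name: abs(correlations[name]))


# Hypothesised strongest pairing for each background variable.
hypothesised = {"self-care problems": "autonomy", "education": "achievement"}

# Hypothetical correlation coefficients, one row per background variable.
observed = {
    "self-care problems": dict(zip(CAPABILITIES, [-0.20, -0.10, -0.45, -0.30, -0.25])),
    "education": dict(zip(CAPABILITIES, [0.10, 0.05, 0.12, 0.18, 0.08])),
}

supported = {
    variable: strongest_capability(observed[variable]) == hypothesised[variable]
    for variable in hypothesised
}
```

Absolute values are compared because some background variables (such as self-care problems) are expected to correlate negatively with capability.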

Results
Survey interviews were completed between March and June 2010. From the 802 addresses selected, 422 (52 %) individuals responded, and of these 418 (99 %) produced complete interviews. Descriptive statistics for the sample are provided in Table 1. All 418 individuals who reached the end of the survey fully completed the ICECAP-A capability measure (Table 2). The modal response of the ICECAP-A was the top or second level of capability across each of the five attributes. Nevertheless, many individuals indicated that their capability was highly limited (little capability or no capability) on each of the five attributes. This ranged from 37 individuals (8 %) on attachment to 120 individuals (28 %) on achievement.
Investigation 1: Do measured capabilities reflect socio-economic circumstances?
Table 3 shows the associations between contextual characteristics of individuals' lives and their capabilities. Associations that were hypothesised a priori are highlighted in italics. The remaining associations are listed for completeness. Of the 55 hypothesised associations: twenty-nine (53 %) were in the expected direction and had p < 0.01; five (9 %) were in the expected direction and had p < 0.05 (but > 0.01); twenty-one (38 %) had p ≥ 0.05; none were in an unexpected direction and had p < 0.05. Broadly speaking, there were statistically significant associations, where anticipated, between measured capability and employment, education, relationship status, home ownership, income, major life events (with some exceptions) and happiness.
However, associations were not found, although they were anticipated, between capability and indicators of local deprivation, religiosity or having a recent bereavement, household job loss or accident. No relationship was hypothesised between sex (gender) and capability, and none was found.
Investigation 2: Do measured capabilities reflect variations in health status?
Table 4 shows the expected association between various indicators of health (and health care use) and each of the ICECAP-A capabilities. As for 'Do measured capabilities reflect socio-economic circumstances?', associations that were hypothesised are highlighted in italics, with remaining associations listed for completeness. To summarise, for the 42 hypothesised associations: thirty-two (76 %) were in the expected direction and had p < 0.01; one (2 %) was in the expected direction and had p < 0.05 (but > 0.01); eight (19 %) had p ≥ 0.05; one (2 %) was in an unexpected direction and had p < 0.01.
The results indicate strong evidence for all but one of the hypothesised associations between the five capabilities and the EQ-5D attributes (and index score), in the anticipated direction. As hypothesised, stability, autonomy, achievement and enjoyment were associated with the four physical health attributes of the EQ-5D. Attachment (along with the other four capabilities) was associated with the answers to the mental health question about anxiety and depression. There was also evidence for hypothesised associations between the capabilities and the presence of a long-standing illness and receipt of care. Although associations between capabilities and inpatient/outpatient appointments and the provision of informal care were hypothesised, there was no evidence for them in this data set.
Investigation 3: Do measured capabilities reflect individual perceptions of freedom?
Table 5 shows the mean ICECAP-A index score (with 0 indicating no capability on any attributes and 1 indicating full capability on all attributes) for individuals responding to the three questions about their freedom. Across each freedom question, higher levels of reported freedom are associated with higher capability scores. The effect is more pronounced when individuals are differentiated by their perceptions of their opportunities and ability to do what they want to do in life. Table 6 confirms the strong statistical evidence of an association between individuals' capability in general, and their perceptions of freedom. The table also indicates that health (as measured by the EQ-5D) and happiness are associated with perceptions of freedom. As hypothesised, freedom is slightly more closely correlated with the capability measure than the functioning measures. Table 7 shows the effect sizes for these differences in correlation. The only comparison demonstrating evidence of a difference is that between the capability and 'doing things that I want' correlation and the happiness and 'doing things that I want' correlation. Nevertheless, for four of the other five pairwise comparisons of correlation coefficients, there is a 'small' [28] difference (effect size of approximately 0.1) in favour of the capability measure in the correlation coefficients. Table 8 reports the correlation coefficients between the responses to the freedom questions and the five capability questions individually (all have p < 0.05 at least). From the table, it appears that perceptions of freedom are most strongly associated with capabilities for achievement and enjoyment and least strongly associated with the capability for attachment.
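One common way to express the size of a difference between two correlation coefficients is Cohen's q, the difference between their Fisher z-transformed values, for which a value of about 0.1 is conventionally labelled 'small'. The sketch below assumes that this is the effect size referred to above, which the text does not state explicitly; the correlation values are hypothetical and are not taken from Table 6 or Table 7.

```python
from math import atanh


def cohens_q(r1: float, r2: float) -> float:
    """Difference between two correlations on the Fisher z scale (Cohen's q)."""
    return atanh(r1) - atanh(r2)


# Hypothetical correlations of a freedom item with a capability measure
# and with a functioning measure.
r_capability = 0.50
r_functioning = 0.42

q = cohens_q(r_capability, r_functioning)
```

With these illustrative values q is close to 0.1, which is the order of magnitude of the differences described above.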
Do different capabilities measure different things?
Table 9 shows the pairwise correlations between the five selected background variables and the five ICECAP-A capabilities. The hypothesised strongest correlations (by row) are in the cells on the diagonal from top left to bottom right. In all cases, these correlations are in the expected direction and have p < 0.01. For three of the five capabilities (stability, autonomy and achievement), the selected background variable correlates more closely with that capability than the other four. In two cases (attachment and enjoyment), the correlation with the identified background variable is the second strongest correlation in the row (in both cases after the correlation of the background variable and stability). Table 10 reports the differences between the hypothesised strongest correlation and the other correlation coefficients in the row. It can be seen that financial worries correlated more strongly with stability than any of the other capabilities. Similarly, self-care is more strongly correlated with autonomy than any of the other four capabilities. Education is most strongly correlated with achievement as expected, but the differences in correlation are small. In the case of marital status and happiness, stability, rather than the hypothesised capability (attachment and enjoyment, respectively), is marginally more strongly related to the background variable, but there is no evidence that this difference is greater than expected by chance.

Discussion
This study represents a first investigation of whether capability wellbeing can be captured in a valid manner through a simple generic measure for the (UK) adult population. Although measuring 'capability' is challenging [10,14], the findings indicate that capabilities, self-reported through the ICECAP-A measure, are associated with other indicators of freedom and, in general, socio-economic and health characteristics that were anticipated to be associated with an individual's capability. The findings therefore provide encouraging evidence of the construct validity of the ICECAP-A measure in this setting.
The correlations between the freedom questions and self-reported capabilities suggest that capability questions 'capture' freedom to a greater degree than measures of happiness and health do. This may be important when selecting outcome measures in contexts where expanding individuals' freedoms is a key policy goal. For example, current health policy in England seeks to expand patient choice through the use of personal budgets and involving patients in decisions about the location of their care [29]. It must be noted that the 'gain' offered by the capability measure, relative to measures of health and happiness, is, in general, small and (from the p values) supported by only weak statistical evidence. Further investigation is recommended in this area, in particular to employ larger sample sizes and to examine the effect of phrasing questions in terms of capability as opposed to functioning.
A large number of hypotheses were tested, relating to the relationships between individuals' socio-economic and health characteristics and their responses to capability questions. Although a minority of associations would be expected by chance, 69 % of stated hypotheses (67/97) were found to have p values less than 5 %. This provides evidence that the ICECAP-A measure reflects expected differences between individuals in the general population defined by their health and education, extending the findings of previous studies of capabilities for older people [16,17]. Furthermore, this study demonstrated the sensitivity of self-reported capability responses to characteristics such as employment, income, relationships and a range of major life events, which provides encouraging evidence for the use of the ICECAP-A as a measure of wellbeing outside (as well as inside) the health setting. Despite supportive evidence for the majority of hypotheses, a number of anticipated associations were not detected in this sample. Two such areas were between local deprivation [as measured by the domain indices of the IMD (Index of Multiple Deprivation)] and capability, and health care access and capability. One explanation is that the IMD domain indices may be relatively poor proxies for the underlying construct of interest (individual deprivation). On healthcare access, only two of eight hypothesised associations between a recent inpatient/outpatient appointment and capability were detected. One explanation may be that the relationship between health care appointments and wellbeing is complex: although those in 'need' of health care may report lower wellbeing than those who do not, those who receive health care are likely to report higher wellbeing than those who do not but have similar 'need'. Examining the relationship between health care access and wellbeing is confounded by these two relationships.
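The 67/97 figure quoted at the start of this part of the discussion can be reproduced from the counts reported for Investigations 1 and 2. The short sketch below shows the arithmetic; the dictionary keys are labels chosen for illustration, and the final line gives the number of associations that would be expected to reach p < 0.05 if all 97 null hypotheses were true, which is an idealised benchmark rather than a figure reported in the study.

```python
# Counts of hypothesised associations as reported in the Results.
investigation_1 = {
    "expected direction, p < 0.01": 29,
    "expected direction, 0.01 <= p < 0.05": 5,
    "p >= 0.05": 21,
    "unexpected direction, p < 0.05": 0,
}
investigation_2 = {
    "expected direction, p < 0.01": 32,
    "expected direction, 0.01 <= p < 0.05": 1,
    "p >= 0.05": 8,
    "unexpected direction, p < 0.05": 1,
}

total = sum(investigation_1.values()) + sum(investigation_2.values())

# Hypotheses supported: expected direction with p < 0.05.
supported = sum(
    count
    for counts in (investigation_1, investigation_2)
    for label, count in counts.items()
    if label.startswith("expected direction")
)

share_percent = round(100 * supported / total)

# Idealised benchmark: expected number of p < 0.05 results (in either
# direction) if every one of the null hypotheses were true.
expected_by_chance = 0.05 * total
```

The single association found in the unexpected direction is excluded from the supported count, which is why the total is 67 rather than 68.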
As this is the first study to report on the validity of the ICECAP-A measure, there are a number of caveats and research opportunities that are worth noting. First, the method of sampling ensured that individuals had an equal probability of being approached for the survey. However, due to higher response rates in certain groups, older people [...] recipients and informal carers. Finally, it should be noted that this study was conducted through face-to-face interviews. Although the intention is that the ICECAP-A measure would be used in paper-based and internet formats too, further work to assess the validity of the measure in these settings would be useful. This study does not provide a definitive judgement that capability measurement is valid, since there is substantial debate surrounding the interpretation of capabilities. It does, however, offer a body of evidence suggesting that policy-relevant differences in wellbeing can be identified by the ICECAP-A measure and thus that it offers promise as a tool for capturing outcomes for economic evaluations.
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Appendix 1: ICECAP-A capability wellbeing measure
About your overall quality of life
Please indicate which statements best describe your overall quality of life at the moment by placing a tick (✓) in ONE box for each of the five groups below