Interpreting mutual adjustment for multiple indicators of socioeconomic position without committing mutual adjustment fallacies
Research into the effects of Socioeconomic Position (SEP) on health will sometimes compare effects from multiple, different measures of SEP in “mutually adjusted” regression models. Interpreting each effect estimate from such models equivalently as the “independent” effect of each measure may be misleading, a mutual adjustment (or Table 2) fallacy. We use directed acyclic graphs (DAGs) to explain how interpretation of such models rests on assumptions about the causal relationships between those various SEP measures. We use an example DAG whereby education leads to occupation and both determine income, and explain implications for the interpretation of mutually adjusted coefficients for these three SEP indicators. Under this DAG, the mutually adjusted coefficient for education will represent the direct effect of education, not mediated via occupation or income. The coefficient for occupation represents the direct effect of occupation, not mediated via income, or confounded by education. The coefficient for income represents the effect of income, after adjusting for confounding by education and occupation. Direct comparisons of mutually adjusted coefficients are not comparing like with like. A theoretical understanding of how SEP measures relate to each other can influence conclusions as to which measures of SEP are most important. Additionally, in some situations adjustment for confounding from more distal SEP measures (like education and occupation) may be sufficient to block unmeasured socioeconomic confounding, allowing for greater causal confidence in adjusted effect estimates for more proximal measures of SEP (like income).
Keywords: Socioeconomic position, Education, Occupation, Income, Regression, DAGs, Causal inference
DAG: Directed Acyclic Graph
Socioeconomic position (SEP) can be defined as “the social and economic factors that influence what position individuals or groups hold within the structure of a society” and is widely recognised as associated with health [2, 3]. Understanding causal effects of SEP on health can be problematic as SEP is a broad and heterogeneous concept, difficult to operationalise [1, 4], and it may act via a multitude of mechanisms or pathways. Various SEP measures are often used interchangeably despite differences in their theoretical grounding and interpretation, and without regard to the level (e.g. individual, household, area) or life-stage at which they are measured [1, 4, 5, 6, 7, 8, 9]. Nevertheless, researchers sometimes try to differentiate the importance of different SEP indicators. While a range of more advanced methods have been developed in recent years that could be employed in this context, we have noticed that many researchers still default to mutually-adjusted regression models of health, which require careful interpretation. Regression models may be seen as ‘good enough’, while more advanced methods are seen as offering diminishing returns to effort. With this in mind, the aim of this article is to highlight potential interpretive pitfalls when mutually adjusting for multiple measures of SEP, explaining best practice and the limitations and assumptions that are implicit in such models. Using directed acyclic graphs (DAGs), we emphasise that causal interpretations of such analyses are based on assumptions about causal relationships between SEP measures.
Multi-dimensionality of SEP
SEP measures are often correlated but can rarely be treated as proxies for each other. Where SEP is viewed as a confounder for effects of other exposures, this multi-dimensionality may be a minor nuisance: adjustment for a single measure of SEP may leave residual confounding, and best practice would include adjustment for multiple (or all available) measures of SEP [7, 16]. At least, this is true as long as SEP is really a confounder and not a mediator of the exposure; if any of the SEP measures could be considered mediators of the exposure effects, then adjusting for these could induce collider bias via unobserved SEP [17, 18]. However, when SEP is the causal exposure of interest this multi-dimensionality is more critical, and may even be informative.
Consistent associations with health across a range of SEP measures suggest a relationship between the overall ‘core’ construct of SEP and health. Some even investigate this ‘core’ effect of SEP by aggregating information from multiple SEP measures, e.g. as a latent variable, or as an index counting the disadvantages experienced [20, 21].
However, investigation of heterogeneity across multiple measures may help identify which characteristics or resources associated with SEP are most important for health, and which factors could best be intervened on in order to alleviate health inequalities [9, 16, 22, 23]. For example, one study found inequalities in a range of health outcomes were stronger for occupational class when stratifying by education than vice versa, concluding adult occupational class was “a better discriminator of socioeconomic differentials”. Authors of a US study showing clearer associations with mortality for income than for education and occupation argued for the systematic collection of administrative income data for the monitoring and analysis of health inequalities. A Finnish study finding independent associations with smoking for a range of socioeconomic variables concluded that alleviation of inequalities in smoking requires “efforts directed at various dimensions of socioeconomic position”. Numerous studies have compared SEP measures across the lifecourse to draw inferences about the lifecourse mechanisms leading to health inequalities [16, 22, 24, 25].
Certainly, more advanced methods are becoming more popular for addressing such issues, with examples including: applications of machine-learning methods to identify the most important predictors of health outcomes, path analyses or structural equation models to test hypotheses regarding lifecourse pathways [19, 27], and Bayesian approaches where observed data are used to indicate the most likely causal structure linking a set of variables [28, 29]. Nevertheless, many studies have used regression-based models where SEP measures are mutually adjusted [13, 15, 16, 22, 24, 25, 30, 31], and it is not hard to find even very recent examples (e.g. [32, 33, 34, 35]). We focus therefore on explaining best practice for such models, highlighting potential interpretative pitfalls and their implicit assumptions and limitations.
The “mutual adjustment” fallacy
Interpretation of mutually adjusted regression models requires care and conceptual clarity (as exemplified in many of the citations given above). It is tempting to interpret the mutually adjusted regression coefficients for each SEP measure as the independent effects of the unique aspects of SEP represented by that measure (i.e. the effects of the areas labelled ‘a’ in Fig. 1). However, if this interpretation is carried across all the included measures of SEP, then aspects of SEP that are common across measures (i.e. the areas labelled ‘b’ in Fig. 1) have been assumed to have no effect. With only three coefficients and each interpreted as the effect of its respective ‘a’ area, the overlapping information in ‘b’ is not represented by a coefficient. Worryingly, if this area of conceptual overlap represents the ‘core’ of the SEP construct, then this assumes ‘core SEP’ has no effect and that all effects are due to the unique characteristics of the different SEP measures employed.
This can be considered a case of the “Table 2” fallacy, where mutually adjusted coefficients are treated as if they all have an equivalent interpretation. The idea was so named because mutually adjusted coefficients have often been presented in the second results table of a paper. We prefer the term “mutual adjustment” fallacy as this is a little more explicit about the error being committed, which could occur in the first, third, fourth or any other results table.
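The overlap argument can be made concrete with a small simulation. The sketch below is purely illustrative and rests on invented assumptions: three hypothetical SEP measures each equal a shared ‘core’ (the ‘b’ information) plus independent unique noise (the ‘a’ information), and only the core affects a continuous health score. A linear model fitted with NumPy stands in for the regression; none of the names or numbers come from the study data.

```python
import numpy as np


def simulate_shared_core(n=200_000, seed=0):
    """Hypothetical data: three SEP measures that all reflect one shared core.

    Assumption for illustration only: the unique part of each measure has
    no effect on health; all of the effect runs through the shared core.
    """
    rng = np.random.default_rng(seed)
    core = rng.normal(size=n)                            # shared 'b' information
    measures = core[:, None] + rng.normal(size=(n, 3))   # core + unique 'a' parts
    health = 1.0 * core + rng.normal(size=n)             # depends on the core only
    return measures, health


def ols_slopes(X, y):
    """Least-squares slopes of y on the columns of X (intercept dropped)."""
    design = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    return beta[1:]


measures, health = simulate_shared_core()
mutually_adjusted = ols_slopes(measures, health)

# Each slope is clearly non-zero (about 0.25 under these invented settings),
# even though no measure's unique component has any effect on health.
print(np.round(mutually_adjusted, 2))
```

Reading each of these coefficients as the effect of that measure's unique aspects would attribute to the ‘a’ areas an effect that, by construction, belongs entirely to the shared core.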
Causal interpretation of a mutually-adjusted model
Table 1 Illustrative example of mutual adjustment: interpretation under Fig. 2a of ORs for poor health (95% CI)
Education
- Unadjusted OR (2.43): Total effect of education
- Partially adjusted OR: Direct effect of education, not mediated via occupation
- Mutually adjusted OR (1.44): Direct effect of education, not mediated via occupation and income
Occupation
- Unadjusted OR: Total effect of occupation, confounded by education
- Partially adjusted OR (2.33): Total effect of occupation, not confounded by education
- Mutually adjusted OR: Direct effect of occupation, not mediated via income, or confounded by education
Income
- Unadjusted OR (2.48): Total effect of income, confounded by education and occupation
- Mutually adjusted OR (1.68): Total effect of income, not confounded by occupation and education
Under Fig. 2a, the mutually adjusted OR for income represents the total effect of income on health, after adjusting for confounding by occupation and education. However, the mutually adjusted OR for occupation does not have the same interpretation; it is not a total effect of occupation, but rather a direct effect of occupation, i.e. the portion of its total effect on health which is not mediated via income, after adjusting for confounding by education. Similarly, the mutually adjusted OR for education represents a direct effect, not mediated via occupation or income. Thus, a comparison of the mutually adjusted ORs for education and income is not a like-for-like comparison, but a comparison of a total effect (for income) with a direct effect (for education). If the intent was to compare the total effects of education and income then this would be an incorrect comparison: a mutual adjustment fallacy.
Given Fig. 2a, a question about the relative magnitude of the total effects of these SEP measures could be answered by comparing the mutually adjusted OR for income (1.68), with the partially adjusted OR for occupation (2.33), and the unadjusted OR for education (2.43), i.e. adjusting for confounders but not mediators. This highlights quite a different pattern in the magnitude of effects than would be obtained by straight comparison of the mutually adjusted (or the unadjusted) ORs.
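The distinction between total and direct effects under Fig. 2a can be checked on simulated data where the true effects are known. The following is a minimal sketch under stated assumptions, not a reanalysis of the study: all variables are continuous, all path coefficients are invented, and linear regression is used in place of logistic regression so that coefficients can be compared across models without the complication of non-collapsible ORs.

```python
import numpy as np


def simulate_fig2a(n=200_000, seed=1):
    """Hypothetical data following the causal ordering assumed in Fig. 2a.

    education -> occupation -> income, education -> income, and all three
    affect health. Every coefficient below is invented for illustration.
    """
    rng = np.random.default_rng(seed)
    edu = rng.normal(size=n)
    occ = 0.5 * edu + rng.normal(size=n)
    inc = 0.3 * edu + 0.5 * occ + rng.normal(size=n)
    health = 0.2 * edu + 0.2 * occ + 0.2 * inc + rng.normal(size=n)
    return edu, occ, inc, health


def ols_slopes(y, *predictors):
    """Least-squares slopes of y on the predictors (intercept dropped)."""
    design = np.column_stack([np.ones(len(y)), *predictors])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    return beta[1:]


edu, occ, inc, health = simulate_fig2a()

unadjusted_edu = ols_slopes(health, edu)[0]
partial_edu, partial_occ = ols_slopes(health, edu, occ)
mutual_edu, mutual_occ, mutual_inc = ols_slopes(health, edu, occ, inc)

# True values implied by the invented coefficients:
#   total effects:  education 0.41, occupation 0.30, income 0.20
#   direct effects: 0.20 for each measure
print(f"education:  unadjusted {unadjusted_edu:.2f}, mutually adjusted {mutual_edu:.2f}")
print(f"occupation: partially adjusted {partial_occ:.2f}, mutually adjusted {mutual_occ:.2f}")
print(f"income:     mutually adjusted {mutual_inc:.2f}")
```

In this simulated setting the three mutually adjusted slopes are all close to 0.20, which would suggest equal importance, whereas the total effects (unadjusted for education, partially adjusted for occupation, mutually adjusted for income) differ by a factor of about two.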
Figure 2a is of course not the only set of assumptions that could be made about the relationships between these variables, and analyses may be sensitive to different assumptions. Figure 2b shows an alternative set of (less plausible) causal assumptions to those shown in Fig. 2a. In this case, education is now considered the more proximate and income the more distal cause. Under Fig. 2b, a question about the relative magnitude of the total effects of education and income would require comparison of the mutually adjusted OR for education with the unadjusted OR for income (i.e. 1.44 and 2.48 respectively in Table 1). Thus, different assumptions about the causal relationships between socioeconomic variables can lead to different conclusions about which are more important.
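The dependence of interpretation on the assumed ordering can be written down as a small rule. The helper below is a hypothetical sketch, assuming a fully ordered causal chain with no unmeasured confounding: measures earlier in the ordering are treated as confounders of later ones, and measures later in the ordering as mediators. The function name and output keys are invented, and placing occupation in the middle of the Fig. 2b ordering is also an assumption.

```python
from typing import Dict, List, Sequence


def interpret_mutual_adjustment(causal_order: Sequence[str]) -> Dict[str, dict]:
    """Interpret each coefficient of a mutually adjusted model.

    causal_order lists the measures from most distal to most proximal.
    Assumes every earlier measure may cause every later one and that
    there are no unmeasured common causes (a strong assumption).
    """
    result: Dict[str, dict] = {}
    for i, measure in enumerate(causal_order):
        confounders: List[str] = list(causal_order[:i])     # earlier measures
        mediators: List[str] = list(causal_order[i + 1:])   # later measures
        result[measure] = {
            # Adjustment set needed to estimate this measure's total effect.
            "adjust_for_total_effect": confounders,
            # What the mutually adjusted coefficient represents.
            "mutually_adjusted_is": "direct effect" if mediators else "total effect",
            "not_mediated_via": mediators,
        }
    return result


fig2a = interpret_mutual_adjustment(["education", "occupation", "income"])
fig2b = interpret_mutual_adjustment(["income", "occupation", "education"])

for name, diagram in (("Fig. 2a", fig2a), ("Fig. 2b", fig2b)):
    for measure, info in diagram.items():
        print(name, measure, info)
```

Under the first ordering the total effect of education needs no adjustment and the mutually adjusted coefficient for income is a total effect; under the second ordering these roles are reversed.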
If Fig. 2a and b were both equally plausible, one might want to compare total effect sizes under competing causal assumptions: e.g. the total effect of education with income and occupation viewed as confounders, vs. the total effect of income with education and occupation viewed as confounders. In this case, a comparison of the mutually adjusted ORs for education and income would be appropriate. However, this would only be warranted in situations with multiple plausible competing sets of causal assumptions. In such cases, interpretation could be facilitated by clear description of the competing assumptions, explicitly linking the reported results to those assumptions. We consider ‘mutual adjustment’ to be poor short-hand for such a nuanced interpretation.
Further, we would suggest the causal ordering in Fig. 2a rather than Fig. 2b is more plausible and more consistent with theory regarding the development of SEP over the lifecourse, with education usually preceding and determining occupational level, which in turn generates income [10, 41]. However, while we would favour interpretation under Fig. 2a over interpretation under Fig. 2b, it would be poor practice to assume all other researchers share our assumptions. Reporting the findings in Table 1 as mutually adjusted without explicitly linking them to the assumptions in Fig. 2a could lead to misinterpretation. A naïve reader would have no guidance as to which effects should be interpreted as total and which as direct effects, or a reader who strongly subscribed to Fig. 2b might interpret the results according to those assumptions. Phrases such as “the independent effect of occupation” do not specify whether the “independence” is from confounders or mediators and are thus less informative than they could be with further explication.
However, under Fig. 3b the estimate of the total effect of income from the mutually adjusted model in Table 1 remains unbiased. The confounding influence of unmeasured parental SEP on the effect of income is sufficiently blocked by adjustment for own education and occupation, though the direct effects of occupation and education are still subject to bias from the confounding influence of unmeasured parental SEP and from conditioning on a collider [17, 18]. Thus, if Fig. 3b were at all plausible relative to Fig. 3a we might have more confidence in the total effect estimate for income from Table 1 than in the direct effect estimates for occupation and education. This total effect estimate requires less stringent assumptions to be valid (i.e. Figure 3b) than those for the direct effect estimates (i.e. Figure 2 with no unmeasured confounding).
Logistic regression predicting low income from parental occupation, with and without adjustment for own education and occupation
Manual Parental Occupation
- Unadjusted OR for Low Income (95% CI)
- Adjusted OR for Low Income (95% CI)
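The reasoning about unmeasured parental SEP can also be illustrated by simulation. The sketch below assumes one possible structure of the kind described for Fig. 3b: parental SEP affects education, occupation and health, but has no direct arrow to income. All coefficients are invented, variables are continuous, and linear regression replaces the logistic models used in the article, so this is an illustration of the logic and not a reproduction of any reported result.

```python
import numpy as np


def simulate_fig3b(n=200_000, seed=2):
    """Hypothetical data with parental SEP as a common cause.

    Assumed structure (invented coefficients): parental SEP affects
    education, occupation and health, but not income directly.
    """
    rng = np.random.default_rng(seed)
    parent = rng.normal(size=n)                       # unmeasured in practice
    edu = 0.5 * parent + rng.normal(size=n)
    occ = 0.5 * edu + 0.5 * parent + rng.normal(size=n)
    inc = 0.3 * edu + 0.5 * occ + rng.normal(size=n)  # no parent -> income arrow
    health = (0.2 * edu + 0.2 * occ + 0.2 * inc
              + 0.4 * parent + rng.normal(size=n))
    return parent, edu, occ, inc, health


def ols_slopes(y, *predictors):
    """Least-squares slopes of y on the predictors (intercept dropped)."""
    design = np.column_stack([np.ones(len(y)), *predictors])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    return beta[1:]


parent, edu, occ, inc, health = simulate_fig3b()

# Mutually adjusted model that omits the unmeasured parental SEP.
mutual_edu, mutual_occ, mutual_inc = ols_slopes(health, edu, occ, inc)

# Check analogous to the table above: does parental SEP predict income
# once own education and occupation are adjusted for?
parent_unadjusted = ols_slopes(inc, parent)[0]
parent_adjusted = ols_slopes(inc, parent, edu, occ)[0]

# The true effect of each measure in the health equation is 0.20.
print(f"income {mutual_inc:.2f}, education {mutual_edu:.2f}, occupation {mutual_occ:.2f}")
print(f"parent -> income: unadjusted {parent_unadjusted:.2f}, adjusted {parent_adjusted:.2f}")
```

Under these assumptions the income coefficient stays close to its true value while the education and occupation coefficients are inflated, and the association between parental SEP and income disappears after adjustment for own education and occupation.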
We focused here on the relatively simple method of mutually adjusted regression, without tackling issues such as interactions between SEP measures, effect heterogeneity, measurement error or statistical mis-specification. As mutually adjusted regression is still commonly used in epidemiology, we hope this discussion will aid best practice, highlight implicit assumptions and limitations, and provide an introductory step towards the ever-developing literature on more advanced causal inference methods, which might include Bayesian analyses [28, 29] or more precisely decomposed definitions of direct and indirect effects that can deal appropriately with interactions between SEP measures [42, 43, 44, 45, 46].
We examined four possible causal diagrams linking three SEP measures, but many more are possible, and interpretations of analyses should be altered under differing causal assumptions, as illustrated here. Indeed, the causal structure linking SEP indicators to each other and to health may vary between different social contexts, welfare regimes etc., as may the degree of heterogeneity both within and between socioeconomic categories, so researchers may be unsure of what causal model to interpret their data under. Causal direction may also be ambiguous where there are feedback loops (e.g. where a better job leads to a higher income, which then leads to an even better job and so forth). In some cases, such longitudinal ambiguity may be resolved by including repeated measures from different life-stages and interpreting analyses in light of the causal links between measures over time, but researchers will not always have the luxury of such detailed lifecourse data. Where there is ambiguity over the causal structure linking socioeconomic variables, Bayesian methods offer a promising avenue for empirically determining the most likely structure given the data [28, 29]. Indeed, such exploratory research is important for developing a good understanding of the causal relationship between socioeconomic variables. Nevertheless, data-driven approaches such as these may be sensitive to idiosyncrasies of the data under study, and/or there may be multiple causal diagrams that would fit the data more or less equally well. Thus, it may still be worthwhile to assess how sensitive analytical conclusions are to alternative theoretically informed causal diagrams.
We highlight the importance of a full causal diagram that includes all relevant common causes of included variables, which might include distal and unmeasured aspects of SEP. Given difficulties associated with gathering inter-generational socioeconomic data, our illustration of residual confounding from parental SEP is a pertinent one. It demonstrates how, in some instances, adjustment for some socioeconomic confounders may be sufficient to block confounding from other more distal and unmeasured aspects of socioeconomic background, and give greater confidence that effect estimates from a proximal measure of SEP are causal. On the other hand, we have shown how estimates of direct effects for more distal measures of SEP can be biased by residual unmeasured confounding, and/or collider bias. Where there is the likelihood of residual unmeasured confounding (i.e. almost always) it would seem advisable to test sensitivity of conclusions to different strengths of unmeasured confounding.
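One commonly used summary for this kind of sensitivity analysis is the E-value of VanderWeele and Ding, offered here only as an illustration and not necessarily the approach intended in the work cited above. For an observed risk ratio RR (taking the reciprocal if RR is below 1), it is RR + sqrt(RR × (RR − 1)): the minimum strength of association, on the risk ratio scale, that an unmeasured confounder would need with both exposure and outcome to fully explain away the estimate. The function below expects a risk ratio; converting an OR to a risk ratio requires further assumptions that are not made here.

```python
import math


def e_value(risk_ratio: float) -> float:
    """E-value for an observed risk ratio (point estimate).

    Hypothetical helper for illustration. Estimates below 1 are inverted
    first so that the formula is applied on the side of the null >= 1.
    """
    if risk_ratio <= 0:
        raise ValueError("risk ratio must be positive")
    rr = max(risk_ratio, 1.0 / risk_ratio)
    return rr + math.sqrt(rr * (rr - 1.0))


# Example with an arbitrary risk ratio of 2.0 (not a result from the article).
example = e_value(2.0)
print(round(example, 2))
```

A null estimate (risk ratio of 1) gives an E-value of 1, meaning no unmeasured confounding is needed to explain it; larger E-values indicate estimates that are more robust to unmeasured confounding.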
Estimating “effects” of SEP measures presupposes intention to intervene on and manipulate SEP [23, 46]. Assuming Fig. 2a to be the more plausible set of assumptions regarding the causal relationships between education, occupation and income (which may not hold in all contexts), it follows that we could have greater causal confidence in published estimates of the effects of income that are adjusted for education and occupation (or other more distal socioeconomic variables), than we might have in published estimates of the effects of occupation or education (whether or not they are adjusted for other socioeconomic measures). This is fortuitous, as of the three, it could be argued that income would also be the simplest to intervene on, e.g. through welfare or taxation policies. Estimates of effects for education or occupation that are adjusted for income are not without their merit however, and, assuming no unmeasured confounding, could be interpreted for example as the remaining effect of occupation, if we were to somehow intervene and equalise income across occupational strata.
The principles set forth here are not limited to applications involving education, occupation and income but will be applicable whenever researchers are using mutual regression adjustment to compare the effects of different SEP measures, including measures taken at different levels or life-stages. We have explained how effects for multiple socioeconomic measures should not be interpreted equivalently as the “independent” effect of that measure, ignoring the effects of the common information they provide. Estimates of the total causal effects of more distal SEP measures will be biased by adjustment for more proximate SEP measures, which act as mediators. A good causal understanding of relationships between socioeconomic variables can improve interpretation and lead to different conclusions about which measure is most important for health than naïve comparisons of “independent” effects. Such understanding is therefore foundational to understanding the causal processes by which SEP influences health over the lifecourse, and to identifying effective points of intervention.
MG and FP are funded by the Medical Research Council (MC_UU_12017/13) and the Scottish Government Chief Scientist Office (SPHSU13). Funders had no active role in the design of the study, collection, analysis or interpretation of data, or in writing the manuscript.
Availability of data and materials
The datasets generated and analysed during the current study are available for use by application. Details of how to apply can be found on the study website: http://2007study.sphsu.mrc.ac.uk/Information-on-data-sharing.html
MG had the initial idea, performed all analyses and drafted the manuscript. FP was a major contributor in writing the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Ethical approval was gained for each wave of the Twenty-07 Study from the National Health Service and/or Glasgow University Ethics Committees. Participants gave informed consent. We had access to the data as researchers at the MRC/CSO Social and Public Health Sciences Unit, where the study is based.
Consent for publication
Competing interests
The authors declare that they have no competing interests.
- 3. Department of Health and Social Security. Inequalities in Health: Report of a Working Group Chaired by Sir Douglas Black. London; 1980.
- 9. McCartney G, Bartley M, Dundas R, Katikireddi SV, Mitchell R, Popham F, Walsh D, Wami W. Theorising social class and its application to the study of health inequalities. SSM - Population Health. 2018. https://doi.org/10.1016/j.ssmph.2018.10.015.
- 10. Lynch J, Kaplan G. Socioeconomic Position. In: Berkman LF, Kawachi I, editors. Social Epidemiology. Oxford: Oxford University Press; 2000.
- 11. Bartley M. Health inequality: an introduction to theories, concepts and methods. Cambridge: Polity Press; 2004.
- 20. Degerud E, Ariansen I, Ystrom E, Graff-Iversen S, Høiseth G, Mørland J, Davey Smith G, Næss Ø. Life course socioeconomic position, alcohol drinking patterns in midlife, and cardiovascular mortality: analysis of Norwegian population-based health surveys. PLoS Med. 2018;15(1):e1002476.
- 39. Office of Population Censuses and Surveys. Classification of occupations. London: HMSO; 1980.
- 41. Graham H. Unequal lives: health and socioeconomic inequalities. Maidenhead: Open University Press; 2007.
- 47. Green MJ, Popham F. Life course models: improving interpretation by consideration of total effects. Int J Epidemiol. 2017;46(3):1057-1062.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.