We epidemiologists have long recognized the importance of using rigorous causal inference approaches to design and analyze our studies. Causal diagrams comprise one such tool for formalizing assumed data-generating processes. And indeed, the ubiquity and importance of causal diagrams within epidemiology are evidenced by four articles presented in this issue of the European Journal of Epidemiology [1–4]. As epidemiologic studies are often used to inform clinical and policy decision-making, we have also understood the need to unambiguously communicate our studies’ findings amongst ourselves and across the disciplines with whom we collaborate. While others have taught and espoused how causal diagrams can guide and improve our study designs and analyses [5–7], perhaps one of the most transformative aspects of the current “revolution”—in the words of Porta et al. [4]—is that we have adopted tools that enhance the clarity of our study conclusions and the premises on which they rest.

Causal diagrams as (formal) story-telling

When and why are causal diagrams useful? One of the most evident successes of causal diagrams is in supplementing story-telling. With a few arrows and letters, an investigator can tell a story of a data-generating process. For a reader fluent in causal diagrams, even a dauntingly complex story can now be quickly and fully digested. In this way, we have seen a series of “paradoxes” demystified, including proposed explanations for the so-called Berkson’s [8], birth-weight [9], obesity [10], and Simpson’s [11] paradoxes. Similarly, causal diagrams have focused our attention on the structures of oft-overlooked potential biases, such as biases due to time-dependent confounding in stratification-based analyses [12], mediator-outcome confounding in mediation analyses [13], selecting on treatment in instrumental variable analyses [14], and naïve per-protocol restrictions in randomized trial analyses [15]. Readers familiar with causal diagrams will recognize that many of these examples can be described as collider-stratification biases, and that, while some encompass previously recognized threats to validity, these potential biases were infrequently mentioned until their associated causal diagrams were drawn.
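
To make the collider-stratification structure concrete, here is a minimal simulation sketch in Python (all variable names and parameters are hypothetical, not drawn from any cited study) of the structure commonly invoked for Berkson’s paradox: two independent diseases each raise the probability of hospitalization, and restricting to the hospitalized induces an association between them.

```python
# Minimal sketch of collider-stratification bias (the structure behind
# Berkson's paradox). All names and parameters are illustrative only.
import numpy as np

rng = np.random.default_rng(0)
n = 1_000_000

disease_a = rng.binomial(1, 0.1, n)   # one cause of hospitalization
disease_b = rng.binomial(1, 0.1, n)   # an independent second cause
# Hospitalization is a collider: either disease raises admission risk.
admitted = rng.binomial(1, 0.05 + 0.4 * disease_a + 0.4 * disease_b)

def risk_diff(x, y):
    """Risk of y among those with x == 1 minus risk among those with x == 0."""
    return y[x == 1].mean() - y[x == 0].mean()

# Marginally, the diseases are unassociated (risk difference near 0) ...
print(risk_diff(disease_a, disease_b))
# ... but among the hospitalized, a negative association appears.
hosp = admitted == 1
print(risk_diff(disease_a[hosp], disease_b[hosp]))
```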

Beyond demystifying perplexing patterns or illuminating subtle problems that exist across many studies, causal diagrams can also facilitate debates regarding a specific study’s conclusions. Consider two investigators who disagree over whether a specific study’s analysis and conclusions were appropriate. If these two investigators “speak DAG” (directed acyclic graph), then they may seamlessly convey their assumptions and ideas to one another with little fear of miscommunication. Perhaps the two investigators will realize they had different causal diagrams in mind, and that favoring one analytic approach over another depends on which causal diagram is drawn—and thus on particular assumptions that, undrawn, might have suggested favoring a different analysis. Perhaps they will even be able to collect further data to help settle which causal diagram—which set of assumptions—is more reasonable. Such discussions, which can be cumbersome and confusing without a formal language, can take place quickly and explicitly when supplemented with causal diagrams.
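
To illustrate how the preferred analysis can hinge on which causal diagram is drawn, the sketch below (a hypothetical linear data-generating process, not tied to any real study) generates data under an “M-structure” in which a measured covariate looks like a confounder but is in fact a collider; under this diagram the crude estimate is unbiased while the covariate-adjusted estimate is not. Under a diagram in which the covariate truly confounds, the conclusion would reverse.

```python
# Sketch: the "right" analysis depends on the assumed DAG. Data are
# generated under a <- u1 -> l <- u2 -> y (an M-structure), with no
# effect of the exposure a on the outcome y. All names are hypothetical.
import numpy as np

rng = np.random.default_rng(1)
n = 1_000_000

u1 = rng.normal(size=n)            # unmeasured cause of a and l
u2 = rng.normal(size=n)            # unmeasured cause of l and y
l = u1 + u2 + rng.normal(size=n)   # measured covariate: a collider here
a = u1 + rng.normal(size=n)        # exposure (true effect on y is 0)
y = u2 + rng.normal(size=n)        # outcome

def ols(y, covariates):
    """Least-squares coefficients of y on the given columns (plus intercept)."""
    X = np.column_stack([np.ones(len(y))] + covariates)
    return np.linalg.lstsq(X, y, rcond=None)[0]

print(ols(y, [a])[1])      # crude estimate: near the true null, 0
print(ols(y, [a, l])[1])   # adjusted for l: biased (about -0.2 here)
```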

In these ways, a causal diagram, like a picture, is worth a thousand words. Unlike artwork, however, where the “thousand words” convey a subjective perspective, a causal diagram should convey exactly the thousand words its creator and all other fluent readers would attribute to it. Causal diagrams are useful because they facilitate precise communication, but ignoring the formal rules that govern them can lead to miscommunication. For some examples of this, we can turn to an article in this issue of the European Journal of Epidemiology in which Greenland and Mansournia [3] caution that failing to read a causal DAG as encoding only structural (not random) confounding, or failing to be explicit about faithfulness when it is presumed, can lead readers of a causal diagram to perceive a different “thousand words” than intended.
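
To see why being explicit about faithfulness matters, consider a small sketch (with parameters deliberately rigged, purely for illustration) in which confounding exactly cancels a real causal effect: the arrow in the diagram is real, yet the marginal association is null, so a reader who tacitly assumes faithfulness would be misled.

```python
# Sketch of a faithfulness violation: coefficients are chosen by hand so
# that positive confounding exactly offsets a negative causal effect.
import numpy as np

rng = np.random.default_rng(2)
n = 1_000_000

u = rng.normal(size=n)                        # unmeasured common cause
a = u + rng.normal(size=n)                    # exposure
y = -0.5 * a + 1.0 * u + rng.normal(size=n)   # true effect of a is -0.5

# The crude slope is (asymptotically) exactly zero ...
print(np.polyfit(a, y, 1)[0])
# ... while adjusting for u recovers the true effect of -0.5.
X = np.column_stack([np.ones(n), a, u])
print(np.linalg.lstsq(X, y, rcond=None)[0][1])
```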

As with any tool that can streamline communication, there is also a danger of causal diagrams providing a false sense of security when they are constructed without investigators applying deep thought and subject-matter knowledge. To see this, consider the use of causal diagrams in the context of instrumental variable analyses. Many epidemiologic studies with instrumental variable analyses redraw the same textbook instrumental variable causal diagram to justify their analysis, yet the story is rarely as straightforward as the one depicted in that causal diagram. Hernán and Robins [16], Swanson et al. [14], and VanderWeele et al. [17] have presented expanded versions of this standard graph that illustrate relatively subtle yet potentially common ways in which bias could arise. Thus, redrawing the textbook version of a causal diagram may oversimplify the likely data-generating process and even offer false comfort when applied to a specific study. Of note, some have argued that causal diagrams are not useful in the context of instrumental variable analyses because “the” DAG seems so simple that drawing it does not add to our understanding of the process [18]. While causal diagrams (arguably) add less to our understanding of what is a true instrument, we have seen many examples of causal diagrams adding substantially to our understanding of what is not an instrument.
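
As a concrete (and deliberately simplified) illustration, the sketch below contrasts the standard Wald-type instrumental variable estimator under a textbook-valid instrument with the same estimator when the exclusion restriction fails through a direct instrument-outcome path; the data-generating process and its parameters are hypothetical.

```python
# Sketch of instrumental variable estimation under a hypothetical linear
# model: true effect of a on y is 1.0, confounded by unmeasured u.
import numpy as np

rng = np.random.default_rng(3)
n = 1_000_000

u = rng.normal(size=n)               # unmeasured confounder of a and y
z = rng.normal(size=n)               # candidate instrument
a = 0.5 * z + u + rng.normal(size=n) # exposure

def wald(y, a, z):
    """Wald ratio estimator: cov(z, y) / cov(z, a)."""
    return np.cov(z, y)[0, 1] / np.cov(z, a)[0, 1]

# Valid instrument (no z -> y path except through a): recovers 1.0.
y_valid = a + u + rng.normal(size=n)
print(wald(y_valid, a, z))

# Exclusion restriction violated by a direct z -> y effect: biased (~1.6).
y_invalid = a + u + 0.3 * z + rng.normal(size=n)
print(wald(y_invalid, a, z))
```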

If two epidemiologists “speak DAG” fluently and think deeply while constructing causal diagrams, they can cleanly convey their premises and ideas to one another with little fear of miscommunication. However, many of us are not fluent in causal diagrams. Moreover, fluency or even familiarity with causal diagrams is currently rare among the broad range of medical researchers, clinicians, and policy-makers with whom we work. While our field would undoubtedly benefit from having more fluent speakers, we as a field ought to ask ourselves: should fluency in causal diagrams be a requirement in our training and communication standards?

The case for causal inference “multilingualism”

Causal diagrams are attractive because they facilitate clear communication. Of course, the same argument can be made for other formal representations, including the counterfactual outcome framework that DAGs are linked with in this issue [2]. Should epidemiologists favor one framework over another? Ultimately, translations between these representations are achievable, as evidenced by the mathematical equivalencies between the DAG-based do-calculus and the counterfactual-based g-formula [2, 19–21]. Nonetheless, in our day-to-day work as epidemiologists, an argument could be made that learning to both “speak DAG” and “speak counterfactuals” will deepen our own comprehension of our subject matter.
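
As one concrete point of translation: assuming consistency, positivity, and that a set of measured covariates L blocks all backdoor paths between treatment A and outcome Y (equivalently, conditional exchangeability given L), the do-calculus adjustment formula and the counterfactual g-formula name the same quantity:

\[ \Pr\bigl[Y^{a} = y\bigr] \;=\; \Pr\bigl[Y = y \mid \mathrm{do}(A = a)\bigr] \;=\; \sum_{l} \Pr\bigl[Y = y \mid A = a, L = l\bigr]\,\Pr\bigl[L = l\bigr]. \]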

Being well-versed in multiple formal representations of causality can lead to not just clearer but also more efficient communication. For example, some assumptions (e.g., directionality or monotonicity of treatment effects) are readily stated via counterfactuals but require augmentations to our causal diagrams (monotonicity is written out below). Indeed, defining causality without mention of counterfactual outcomes—as counterfactuals are not immediately apparent in DAGs, although they do take center stage in single-world intervention graphs [22]—may seem at times like learning a language with one less tense. On the other hand, particularly in high-dimensional data, translating a data-generating process from a causal diagram to a list of independencies expressed with counterfactuals can be onerous—why do we need to use so many phrases to express something that would otherwise be succinctly (and appropriately) stated in a diagram? Each representation has advantages, and being fluent in multiple formal representations allows us to capitalize on the benefits of each.
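
For instance, monotonicity of a binary treatment is a one-line statement in counterfactual notation,

\[ Y_i^{a=1} \;\ge\; Y_i^{a=0} \quad \text{for every individual } i, \]

yet no arrow, or absence of an arrow, in a standard causal diagram encodes it without further augmentation.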

Considering the benefits of causal inference “multilingualism” raises another question we should ponder: should every epidemiologist learn every language? As a corollary, what would be the benefits of a causal inference Esperanto that explicitly combines the best of graphical and counterfactual language? Perhaps the future of succinct and clear communication in epidemiology lies in single-world intervention graphs [22].

Conclusion

Regardless of the framework in which it is couched, inferring causality comes down to combining data and assumptions. As epidemiologists, we make causal inferences all the time. Consequently, it is our responsibility to communicate effectively the assumptions we are making and the way in which we combine assumptions with data. Science benefits when communication is flawless—i.e., when our premises are precisely and transparently stated, and our results are accurately interpreted. In embracing causal diagrams, we are indicating our commitment to unambiguous communication.