Skip to main content
Log in

A Review of the Techniques Used to Control Confounding Bias and How Spatiotemporal Variation Can Be Controlled in Environmental Impact Studies

  • Published:
Water, Air, & Soil Pollution Aims and scope Submit manuscript

A Correction to this article was published on 30 January 2020

This article has been updated

Abstract

Inferring causality has long been a challenging task in environmental impact studies and monitoring programs, mostly because of the problem of confounding bias, i.e. the difficulty of separating impact from natural variation. Traditional approaches for dealing with confounding, despite improvements in study design and statistical analysis, are inadequate. Using aquatic biota as a case study, this review explains the limitations of traditional methods used to separate the impact of human-made pollution from natural variation in the environment. Advantages and disadvantages of the traditional and novel techniques are enumerated. Bayesian networks (BNs) and structural equation modelling (SEM) as causal modelling techniques are introduced as approaches to improve environmental impact monitoring.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Change history

  • 30 January 2020

    Unfortunately, the Figure 1b was incorrectly captured in the published online paper.

  • 30 January 2020

    Unfortunately, the Figure 1b was incorrectly captured in the published online paper.

Notes

  1. Lowercase symbols (e.g. xj, paj) indicate the particular observed value for the corresponding variables (e.g. Xj, PAj).

References

  • Aguilera, P. A., Fernández, A., Fernández, R., Rumí, R., & Salmerón, A. (2011). Bayesian networks in environmental modelling. Environmental Modelling & Software, 26, 1376–1388.

    Google Scholar 

  • ARMCANZ/ANZECC. (2000). Australian guidelines for water quality monitoring and reporting. Canberra: Australian and New Zealand environment and conservation council/ agriculture and resource management council of Australia and New Zealand.

    Google Scholar 

  • Barton, D. N., Kuikka, S., Varis, O., Uusitalo, L., Henriksen, H. J., Borsuk, M., Hera de la, A., Farmani, R., Johnson, S., & Linnell, J. D. (2012). Bayesian networks in environmental and resource management. Integrated Environmental Assessment and Management, 8, 418–429.

    Google Scholar 

  • Bizzi, S., Surridge, B., & Lerner, D. N. (2009). Analysing biological, chemical and geomorphological interactions in rivers using structural equation modelling (pp. 1837–1843). Cairns: International Congress on Modelling and Simulation.

    Google Scholar 

  • Bizzi, S., Surridge, B. W., & Lerner, D. N. (2013). Structural equation modelling: a novel statistical framework for exploring the spatial distribution of benthic macroinvertebrates in riverin ecosystems. River Research and Applications, 29, 743–759.

    Google Scholar 

  • Byrne, B. M. (2010). Structural equation modeling with AMOS: Basic concepts, applications, and programming (2rd ed.). New York: Routledge.

  • Campbell, D. T., & Stanley, J. C. (1963). Experimental and quasi-experimental designs for generalized causal inference. Houghton Mifflin.

  • Carey, R. O., & Migliaccio, K. W. (2009). Contribution of wastewater treatment plant effluents to nutrient dynamics in aquatic systems: a review. Environmental Management, 44, 205–217.

    Google Scholar 

  • Chainho, P., Costa, J. L., Chaves, M. L., Lane, M. F., Dauer, D. M., & Costa, M. J. (2006). Seasonal and spatial patterns of distribution of subtidal benthic invertebrate communities in the Mondego River, Portugal, a poikilohaline estuary. Hydrobiologia, 555, 59–74.

    Google Scholar 

  • Christenfeld, N., Sloan, R., Carroll, D., & Greenland, S. (2004). Risk factors, confounding, and the illusion of statistical control. Psychosomatic Medicine, 66, 868–875.

    Google Scholar 

  • Clements, W. H., Farris, J. L., Cherry, D. S., & Cairns, J. (1989). The influence of water quality on macroinvertebrate community responses to copper in outdoor experimental streams. Aquatic Toxicology, 14, 249–262.

    CAS  Google Scholar 

  • Clements, W. H., Carlisle, D. M., Lazorchak, J. M., & Johnson, P. C. (2000). Heavy metals structure benthic communities in Colorado mountain streams. Ecological Applications, 10, 626–638.

    Google Scholar 

  • Cook, T. D., Shadish, W. R., & Wong, V. C. (2008). Three conditions under which experiments and observational studies produce comparable causal estimates: new findings from within-study comparisons. Journal of Policy Analysis and Management, 27, 724–750.

    Google Scholar 

  • DeAngelis, D. L., & Waterhouse, J. C. (1987). Equilibrium and nonequilibrium concepts in ecological models. Ecological Monographs, 57, 1–21.

    Google Scholar 

  • Downes, B. J., Barmuta, L. A., Fairweather, P. G., Faith, D. P., Keough, M. J., Lake, P. S., Mapstone, B. D., & Quinn, G. P. (2002). Monitoring ecological impacts: concepts and practice in flowing waters. Cambridge University Press.

  • Druzdzel, M. J., & Simon, H. A. (1993). Causality in Bayesian belief networks. Proceedings of the Ninth International Conference on Uncertainty in Artificial Intelligence (pp. 3–11). Morgan Kaufmann Publishers Inc.

  • Freedman, D. (1999). From association to causation: some remarks on the history of statistics. Journal de la Société Française de Statistique, 140, 5–32.

    Google Scholar 

  • Freedman, D. A. (2005). On specifying graphical models for causation, and the identification problem. Identification and Inference for Econometric Models, 56–79. New York: Cambridge University Press.

  • Freedman, D. A. (2009). Statistical models: theory and practice. New York: University Press.

  • Frenette Dussault, C., Shipley, B., & Hingrat, Y. (2013). Linking plant and insect traits to understand multitrophic community structure in arid steppes. Functional Ecology, 27, 786–792.

    Google Scholar 

  • Friedman, J., Hastie, T., & Tibshirani, R. (2001). The elements of statistical learning. Berlin: Springer.

    Google Scholar 

  • Goldthorpe, J. H. (2001). Causation, statistics, and sociology. European Sociological Review, 17, 1–20.

    Google Scholar 

  • Grace, J. B. (2006). Structural equation modeling and natural systems. Cambridge University Press.

  • Grace, J. B., & Bollen, K. A. (2005). Interpreting the results from multiple regression and structural equation models. Bulletin of the Ecological Society of America, 86, 283–295.

    Google Scholar 

  • Grace, J. B., Schoolmaster, D. R., Jr., Guntenspergen, G. R., Little, A. M., Mitchell, B. R., Miller, K. M., & Schweiger, E. W. (2012). Guidelines for a graph-theoretic implementation of structural equation modeling. Ecosphere, 3, 1–44.

    Google Scholar 

  • Green, R. H. (1979). Sampling design and statistical methods for environmental biologists. Ontario: John Wiley & Sons.

    Google Scholar 

  • Greenland, S., & Brumback, B. (2002). An overview of relations among causal modelling methods. International Journal of Epidemiology, 31, 1030–1037.

    Google Scholar 

  • Greenland, S., Robins, J. M., & Pearl, J. (1999). Confounding and collapsibility in causal inference. Statistical Science, 14, 29–46.

    Google Scholar 

  • Grimes, D. A., & Schulz, K. F. (2002). Bias and causal associations in observational research. The Lancet, 359, 248–252.

    Google Scholar 

  • Grimshaw, J., Campbell, M., Eccles, M., & Steen, N. (2000). Experimental and quasi-experimental designs for evaluating guideline implementation strategies. Family Practice, 17, 11–16.

    Google Scholar 

  • Gupta, S., & Kim, H. W. (2008). Linking structural equation modeling to Bayesian networks: decision support for customer retention in virtual communities. European Journal of Operational Research, 190, 818–833.

    Google Scholar 

  • Hair, J. F., Black, W. C., Babin, B. J., Anderson, R. E., & Tatham, R. L. (2006). Multivariate data analysis. Upper Saddle River: Pearson Prentice Hall.

    Google Scholar 

  • Hatami, R. (2017). Development of protocols for environmental impact studies using causal modelling, with special reference to wastewater discharges to freshwater systems. La Trobe University, Albury-Wodonga

  • Hatami, R. (2018a). Development of protocols for environmental impact studies using causal modelling. Water Research, 138, 206–223.

    CAS  Google Scholar 

  • Hatami, R. (2018b). A practical method developed to control spatiotemporal confounding in environmental impact studies. MethodsX, 5,710-716.

    Google Scholar 

  • Hennekens, C. H., Buring, J. E., & Mayrent, S. L. (1987). Epidemiology in medicine. Boston: Little Brown and Company.

    Google Scholar 

  • Hewitt, J. E., Thrush, S. F., Dayton, P. K., & Bonsdorff, E. (2007). The effect of spatial and temporal heterogeneity on the design and analysis of empirical studies of scale-dependent systems. The American Naturalist, 169, 398–408.

    Google Scholar 

  • Hickey, C. W., & Golding, L. A. (2002). Response of macroinvertebrates to copper and zinc in a stream mesocosm. Environmental Toxicology and Chemistry, 21, 1854–1863.

    CAS  Google Scholar 

  • Holland, P. W. (1986). Statistics and causal inference. Journal of the American Statistical Association, 81, 945–960.

    Google Scholar 

  • Horrigan, N., Choy, S., Marshall, J., & Recknagel, F. (2005). Response of stream macroinvertebrates to changes in salinity and the development of a salinity index. Marine and Freshwater Research, 56, 825–833.

    CAS  Google Scholar 

  • Hoyle, R. H. (2011). Structural equation modeling for social and personality psychology. Sage.

  • Hurlbert, S. H. (1984). Pseudoreplication and the design of ecological field experiments. Ecological Monographs, 54, 187–211.

    Google Scholar 

  • Irvine, K. M., Miller, S. W., Al-Chokhachy, R. K., Archer, E. K., Roper, B. B., & Kershner, J. L. (2015). Empirical evaluation of the conceptual model underpinning a regional aquatic long-term monitoring program using causal modelling. Ecological Indicators, 50, 8–23.

    Google Scholar 

  • Isham, V. (1981). An introduction to spatial point processes and Markov random fields. International Statistical Review, 49, 21–43.

    Google Scholar 

  • João, E. (2002). How scale affects environmental impact assessment. Environmental Impact Assessment Review, 22, 289–310.

    Google Scholar 

  • Karaouzas, I., Skoulikidis, N. T., Giannakou, U., & Albanis, T. A. (2011). Spatial and temporal effects of olive mill wastewaters to stream macroinvertebrates and aquatic ecosystems status. Water Research, 45, 6334–6346.

    CAS  Google Scholar 

  • Korb, K. B., & Nicholson, A. E. (2010). Bayesian artificial intelligence. New York: CRC Press.

    Google Scholar 

  • Krogh, M., Dorani, F., Foulsham, E., McSorley, A., & Hoey, D. (2013). Hunter catchment salinity assessment. Sydney: NSW Environment Protection Authority.

    Google Scholar 

  • Leduc, A., Drapeau, P., Bergeron, Y., & Legendre, P. (1992). Study of spatial components of forest cover using partial Mantel tests and path analysis. Journal of Vegetation Science, 3, 69–78.

    Google Scholar 

  • Legendre, P., & Troussellier, M. (1988). Aquatic heterotrophic bacteria: modeling in the presence of spatial autocorrelation. Limnology and Oceanography, 33, 1055–1067.

    Google Scholar 

  • Leunda, P. M., Oscoz, J., Miranda, R., & Ariño, A. H. (2009). Longitudinal and seasonal variation of the benthic macroinvertebrate community and biotic indices in an undisturbed Pyrenean river. Ecological Indicators, 9, 52–63.

    Google Scholar 

  • Likens, G. E. (1989). Long-term studies in ecology. Springer.

  • Linke, S., Bailey, R. C., & Schwindt, J. (1999). Temporal variability of stream bioassessments using benthic macroinvertebrates. Freshwater Biology, 42, 575–584.

    Google Scholar 

  • Malchow, H., Petrovskii, S. V., & Venturino, E. (2008). Spatiotemporal patterns in ecology and epidemiology: theory, models, and simulation. London: Chapman & Hall/CRC Press.

    Google Scholar 

  • Miltner, R. J. (1998). Primary nutrients and the biotic integrity of rivers and streams. Freshwater Biology, 40, 145–158.

    CAS  Google Scholar 

  • Paul, W. L. (2011). A causal modelling approach to spatial and temporal confounding in environmental impact studies. Environmetrics, 22, 626–638.

    Google Scholar 

  • Paul, W. L., & Anderson, M. J. (2013). Causal modeling with multivariate species data. Journal of Experimental Marine Biology and Ecology, 448, 72–84.

    Google Scholar 

  • Paul, W., Cook, R., Shackleton, M., Suter, P., & Hawking, J. (2013). Investigating the distribution and tolerances of macroinvertebrate taxa over 30 years in the River Murray. Final report prepared for the Murray-Darling Basin authority by the Murray-Darling freshwater research centre. Victoria: Wodonga.

    Google Scholar 

  • Paul, W., Rokahr, P., Webb, J., Rees, G., & Clune, T. (2016). Causal modelling applied to the risk assessment of a wastewater discharge. Environmental Monitoring and Assessment, 188, 1–20.

    Google Scholar 

  • Pearl, J. (1988). Probabilistic reasoning in intelligent systems: networks of plausible inference. San Francisco: Morgan Kaufmann Publishers, INC.

    Google Scholar 

  • Pearl, J. (1995). Causal diagrams for empirical research. Biometrika, 82, 669–688.

    Google Scholar 

  • Pearl, J. (1998). Graphs, causality, and structural equation models. Sociological Methods & Research, 27, 226–284.

    Google Scholar 

  • Pearl, J. (2000). Causality. USA: Cambridge University Press.

    Google Scholar 

  • Pearl, J. (2009). Causal inference in statistics: an overview. Statistics Surveys, 3, 96–146.

    Google Scholar 

  • Pearl, J. (2012). The causal foundations of structural equation modeling. Handbook of structural equation modeling. New York: Guilford Press.

  • Pearl, J., & Paz, A. (2010). Confounding equivalence in observational studies.(or, when are two measurements equally valuable for effect estimation?). Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (pp. 433–441). CiteseeX.

  • Pearl, J., & Verma, T. (1991). A theory of inferred causation. San Mateo: Morgan Kaufmann.

    Google Scholar 

  • Peeters, E. T., Gylstra, R., & Vos, J. H. (2004). Benthic macroinvertebrate community structure in relation to food and environmental variables. Hydrobiologia, 519, 103–115.

    Google Scholar 

  • Pitt, M. A., & Myung, I. J. (2002). When a good fit can be bad. Trends in Cognitive Sciences, 6, 421–425.

    Google Scholar 

  • Pugesek, B. H., Tomer, A., & Von Eye, A. (2003). Structural equation modeling: applications in ecological and evolutionary biology. Cambridge University Press.

  • Reichenbach, H. (1956). The direction of time. Berkeley: University of California Press.

    Google Scholar 

  • Sargeant, B. L., Gaiser, E. E., & Trexler, J. C. (2011). Indirect and direct controls of macroinvertebrates and small fish by abiotic factors and trophic interactions in the Florida Everglades. Freshwater Biology, 56, 2334–2346.

    Google Scholar 

  • Shipley, B. (2000). Cause and correlation in biology: a user's guide to path analysis, structural equations and causal inference. Cambridge University Press.

  • Shipley, B. (2016). Cause and correlation in biology: a user's guide to path analysis, structural equations and causal inference with R. Cambridge University Press.

  • Simon, H. A. (1977). Causal ordering and identifiability. Models of discovery. Springer.

  • Spirtes, P., Glymour, C. N., & Scheines, R. (1993). Causation, prediction, and search. New York: Springer-Verlag.

    Google Scholar 

  • Stewart-Oaten, A., & Bence, J. R. (2001). Temporal and spatial variation in environmental impact assessment. Ecological Monographs, 71, 305–339.

    Google Scholar 

  • Stewart-Oaten, A., Murdoch, W. W., & Parker, K. R. (1986). Environmental impact assessment: "pseudoreplication" in time? Ecology, 67, 929–940.

    Google Scholar 

  • Steyvers, M., Tenenbaum, J. B., Wagenmakers, E. J., & Blum, B. (2003). Inferring causal networks from observations and interventions. Cognitive Science, 27, 453–489.

    Google Scholar 

  • Underwood, A. (1991). Beyond BACI: experimental designs for detecting human environmental impacts on temporal variations in natural populations. Marine and Freshwater Research, 42, 569–587.

    Google Scholar 

  • Vandekerckhove, J., Matzke, D., & Wagenmakers, E. J. (2015). Model comparison and the principle of parsimony. Oxford handbook of computational and mathematical psychology. New York: Oxford university press.

    Google Scholar 

  • Wiens, J. A. (2013). Oil in the environment: legacies and lessons of the Exxon Valdez oil spill. Cambridge University Press.

Download references

Acknowledgments

Warren Paul provided guidance, and Michael Shackleton and John Morgan helped improve early versions of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rezvan Hatami.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

1.1 Bayesian Network and a Theory of Inferred Causation

Bayesian networks (BNs) are graphical models for reasoning under uncertainty that describe the relationships among variables, using probabilistic expressions (Barton et al. 2012). A directed acyclic graph (DAG) is a causal structure of a set of variables, in which the nodes represent variables and arcs represent direct connections between the variables. These direct connections are often causal connections (Pearl 2000; Korb and Nicholson 2010).

A simple yet typical BN is illustrated in Fig. 4. It shows relationships among season (X1), catching a cold (X2) or suffering from hay fever (X3), sneezing (X4) and reaching for a tissue (X5). The lack of a direct link between X1 and X5 shows that the impact of seasonal changes on the need for tissues is mediated by other variables. To describe the relationship among variables in Fig. 4, once we know the season, it is revealed that the sneeze is because of a cold or hay fever (assuming that a cold is more common in winter and hay fever happens more in spring). In modelling language, by knowing the value of X1 (season) and conditioning on it, the middle variables, X2 (cold) and X3 (hay fever), are independent. On the other hand, the case where two causes have a common effect acts the opposite way, e.g. X2 (cold) and X3 (hay fever), will be dependent if the person is sneezing and reaches for a tissue in that by rejecting one of these explanations, the probability of the other increases (Pearl 2000).

In nature, there are arbitrary functional relationships between each effect and its cause(s) imposed by stable causal mechanisms. If these relationships are perturbed by arbitrary disturbances introduced by nature, then it results in hidden or unmeasurable conditions that can be summarised in some probability function. Scientists attempt to identify these mechanisms, organised in the form of acyclic directed graphs (Pearl and Verma 1991). Directed graphs are an intuitive way of expressing causal knowledge (Steyvers et al. 2003), in which each variable is modelled as a function of its direct causes (parents). By definition, a directed acyclic graph can form the causal structure of a set of variables, with each node representing a variable of V, and each link relating to a direct functional relationship among the corresponding variables. The causal structure serves as a blueprint for a causal model which, in turn, specifies how each variable is influenced by its immediate causes (known as Markovian parents in a DAG) (Pearl 2000).

It is important to know how strongly the variables are related, and this can be obtained by utilising the quantitative component of the BN (Aguilera et al. 2011). Taking into account a BN as a carrier of conditional independence relationships, it holds that joint probability distribution over all the variables is equal to the product of the conditional distributions attached to each node, as follows:

$$ p\left({x}_1,\dots .,{x}_n\right)={\prod}_{i=1}^np\left({x}_i| pa\left({x}_i\right)\right) $$
(11)

where pa(xi) is a set of predecessors of xi.

Suppose we have a set of variables as in V = {X1,  … , Xn}, and p(v) is the joint probability distribution on these variables. Then, a set of variables PAj is said to be Markovian parents of Xj if PAj is a minimal set of predecessors of Xj that renders Xj independent of all its other predecessorsFootnote 1 (Pearl 2000). In simple terms, the Markov condition is the state of any variable being probabilistically independent of its non-descendants given the state of its parents (Steyvers et al. 2003). The Markov condition guides us to decide when a set of parents is complete in terms of including all the relevant immediate causes of a variable. Once a causal model is formed, it defines a joint probability distribution of the variables in the system in which each variable must be independent of its ancestors given its parents (Pearl and Verma 1991). At this stage, the scientist has a chance to inspect a subset of the observed variables and the probability distribution over those observed variables, but the underlying causal model and causal structure are still not revealed (Pearl 2000).

As the causal structure of a set of variables is unknown, there is no unique model that would fit a given distribution of those variables. There are numerous models that each might fit data with different sets of hidden variables, and each might connect the observed variables through different causal relationships. The solution to the problem of having no unique model is to follow standard norms of scientific induction and find a simpler, less-elaborate model that is equally consistent with the data. The model which is selected using this process is referred to as ‘minimal’ (Pearl and Verma 1991). From this, inferred causation can be defined as follows: a variable X is said to have a causal influence on a variable Y if a directed path from X to Y exists in every minimal latent structure consistent with the data (Pearl 2000).

In situations where all of the variables in a causal model are observable, conditional independencies are sufficient to infer causal relationships. However, this relationship cannot be attributed to the unobservable variables known as latent or hidden variables (Pearl and Verma 1991). The Markov condition permits us to exclude some of the causes out of the set of parents (to be summarised by probabilities), but not if they are also the cause of other variables in the model (Pearl 2000). For example, disturbances, dictated by nature, that perturb the causal relationships and influence several families of parent–child in the model, will be latent variables (Pearl and Verma 1991). Under the assumption of model-minimality and stable distribution, causal relationships can be uncovered. This method does not claim to identify stable physical mechanisms in nature, but it can identify the mechanisms that may be used for causal inference, plausibly from non-experimental data (Pearl 2000).

1.2 The Relationship Between BNs and SEM and a General Framework for Causal Modelling

Similar to the BN approach, SEM is a causal modelling approach for inferring causal relationships from observational data (Gupta and Kim 2008). SEM is a family of causal modelling (Pearl 2000) and is defined as the process of developing and evaluating structural equation models (Grace et al. 2012). This method provides a framework for learning about causal processes by combining the cause–effect information and statistical data to determine the quantitative relationship among studied variables (Gupta and Kim 2008). Structural equation models can be shown with related mathematical equations, e.g. regression equations in that regression models illustrate the influence of variables on another variable. This influence can be shown using an arrow pointing to the variable of interest from the variable that is influencing it (Byrne 2010). Generally, a structural causal model consists of a set of equations, and can be written as:

$$ {x}_i={f}_i\left(p{a}_i,{u}_i\right)\kern0.5em i=1,\dots, n, $$
(12)

where pai is the set of variables that directly determine the value of Xi, and Ui is representative of errors. Structural equation models are schematically depicted in causal diagrams. Given a causal model in Eq. 12, if we draw an arrow to each variable from its immediate causes, the resulting graph will be a causal diagram. If the causal diagram is acyclic and the errors are jointly independent, the model is a Markovian model. Markovian models connect causation and probabilities via Markov condition (Spirtes et al. 1993; Pearl 2000). Two assumptions need to be met for justification of causal Markov condition. First, every variable that is a cause of two or more variables should be included in the model. Second, ‘common cause principle’ should be met which states that if two variables are dependent, either one is the cause of the other or there is a third variable that they share as a common cause (Reichenbach 1956). Based on the causal Markov condition, a child–parent relationship can be specified as a deterministic function instead of the usual conditional probability used in BNs. This leads to the same distribution that characterises BNs (Pearl 2000). In other words, for every BN with a distribution P (as in Eq. 11), there is a structural equation model (as in Eq. 12) that produces a distribution identical to P (Druzdzel and Simon 1993).

SEM models not only have the graphical feature and the logic of causality that accompanies BNs, but also they have a number of advantages over BNs (Gupta and Kim 2008). Pearl (2000) states some reasons for preferring functional models. First, functional models are more general than stochastic models. Using functional models, one can go beyond predicting the effect of interventions to analysing counterfactuals. For example, functional models can answer questions such as ‘what would be the concentration of algae if we could stop discharging effluent to the streams’, whereas SEM can answer questions like ‘what would have been the algal concentration if discharging had not been occurring’ (Paul et al. 2016). Another advantage of functional representation is that standard statistical methods and software can now be used for the building and testing of SEM models (Paul and Anderson 2013).

Graph theoretical SEM is an approach for translating a causal diagram into a structural causal model and is the most common method of testing causal models. This process is shown in Fig. 4 (Paul et al. 2016). The process of structural causal modelling starts with drawing a causal diagram. The next part of the process involves translating the causal diagram into a set of structural equation and d-separation statements. d-Separation is used as a translation tool for translating the language of causality to the language of probability distribution (Shipley 2000). The statistical models and independence relationships underpinning SEM can be fitted and tested using the structural equation and d-separation statements (Pearl 1998). The third step of the process is to fit and check the statistical models for each of the structural equations, and to test all the independencies entailed in the causal structure (Pearl 2000; Shipley 2000). In the process of building a causal diagram, based on the results from model testing, the causal diagram might need to be adjusted in an iterative process. As SEM is strictly confirmatory, the hypothesised model might be accepted or rejected. Based on the results of the hypothesised model and how perfect it fits the data, the model might be modified and re-estimated (Byrne 2010). Finally, the model is used to predict effects of interventions and to inform management and policy (Pearl 2000).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hatami, R. A Review of the Techniques Used to Control Confounding Bias and How Spatiotemporal Variation Can Be Controlled in Environmental Impact Studies. Water Air Soil Pollut 230, 132 (2019). https://doi.org/10.1007/s11270-019-4150-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11270-019-4150-9

Keywords

Navigation