In this section we present our findings regarding the source reliability models shown in Figs. 4 and 5. We find that belief polarisation is not possible on the hypothesis in such models given only one piece of evidence. However, given multiple pieces of evidence, belief polarisation can arise due to differential weighting of the evidence produced by differences in the priors on reliability of the sources of evidence.
Simple model with one piece of evidence
Updating probability for hypothesis
First, let us consider a simple model where there is just one piece of evidence E (see Fig. 4). In this case, there is no polarisation on \(\mathrm {H}\). This can be seen by considering the likelihood ratio for \(\mathrm {H}\), given a positive report \(\mathrm {E}\), which can be computed as:
$$\begin{aligned} l^{+}&=\frac{p(\mathrm {E}|\lnot \mathrm {H})}{p(\mathrm {E}|\mathrm {H})}\nonumber \\&=\frac{\sum _{R}p(\mathrm {E}|\lnot \mathrm {H},R)\,p(R)}{\sum _{R}p(\mathrm {E}|\mathrm {H},R)\,p(R)}\nonumber \\&=\frac{a\,{\overline{\rho }}}{a\,{\overline{\rho }}+\rho }\nonumber \\&=\frac{1}{1+\frac{\rho }{a\,{\overline{\rho }}}} \end{aligned}$$
(1)
It is clear that this expression is always less than one, since \(\frac{\rho }{a\,{\overline{\rho }}}\) is positive. This means that the posterior probability for \(\mathrm {H}\) always increases, given a piece of positive evidence \(\mathrm {E}\). Furthermore, we can see that the size of the update is governed by \(\frac{\rho }{a\,{\overline{\rho }}}\) for a fixed h. \(\Delta ^{\mathrm {H}}\) is greater when \(\rho \) is larger and/or when a is smaller. When \(\rho \) is larger, the agent has initially more trust in the reliability of the source, and thus is more responsive to what it says. The update is also larger when a is small, because this means that the chance that the positive report arises because the source is actually unreliable but erroneously gives a positive report is small.
Similar calculations show that when the evidence is negative \(\lnot \mathrm {E}\), the likelihood ratio for \(\mathrm {H}\) is:
$$\begin{aligned} l^{-}&=\frac{{\overline{a}}\,{\overline{\rho }}+\rho }{{\overline{a}}\,{\overline{\rho }}}~~~~~~~~~\nonumber \\&=1+\frac{\rho }{{\overline{a}}\,{\overline{\rho }}} \end{aligned}$$
(2)
Since \(\frac{\rho }{{\overline{a}}\,{\overline{\rho }}}\) is positive, this is always greater than one. This means that the posterior probability for \(\mathrm {H}\) always decreases, given a piece of negative evidence \(\lnot \mathrm {E}\). Furthermore, the size of the update \(\Delta ^{\mathrm {H}}\) for a given h is larger when the source is initially more trusted (high \(\rho \)). The update is also larger when a is large, because this means that the chance that the negative report arises because the source is actually unreliable but erroneously gives a negative report is small.
No matter what priors Alice and Bob start with then, either they both update to a higher posterior probability (\(\Delta _{A}^{\mathrm {H}}>0\) and \(\Delta _{B}^{\mathrm {H}}>0\)), when the piece of evidence is positive, or they both update to a lower posterior (\(\Delta _{A}^{\mathrm {H}}<0\) and \(\Delta _{B}^{\mathrm {H}}<0\)) when the piece of evidence is negative. Thus, there can be no belief polarisation with respect to the hypothesis given a single piece of evidence.
Since the likelihood ratio for \(\mathrm {H}\) does not depend at all on the prior h for the hypothesis, the prior h has no effect on the direction of the update. If Alice and Bob have different priors for \(\mathrm {H}\), they will still update in exactly the same direction regardless of whether they have different priors for the reliability of the source, though the size of their update \(\Delta ^{\mathrm {H}}\) depends on their individual priors \(h_{A}\) and \(h_{B}\) as well as on how reliable they consider the source to be.
Updating probability for reliability
It is also of interest to consider how the agents update their probabilities for the reliability of the source. Again, the update is determined by the likelihood ratio. For a positive piece of evidence \(\mathrm {E}\) this is given by:
$$\begin{aligned} r^{+}&=\frac{p(\mathrm { \mathrm {E} }|\lnot \mathrm {R})}{p(\mathrm {E}|\mathrm {R})}\\&=\frac{\sum _{H}p(\mathrm {E}|\lnot \mathrm {R},H)\,p(H)}{\sum _{H}p(\mathrm {E}|\mathrm {R},H)\,p(H)}\\&=\frac{a\,h+a\,{\overline{h}}}{h+0\,{\overline{h}}}\\&=\frac{a}{h} \end{aligned}$$
Thus, the likelihood ratio for the reliability does depend on the prior h, but it does not depend on the prior \(\rho \). If \(h<a\), the likelihood ratio for a positive piece of evidence will be greater than one, and hence the posterior for reliability is lower than the prior, \(\Delta ^{\mathrm {R}}<0\). If \(h>a\), on the other hand, \(\Delta ^{\mathrm {R}}>0\). This is illustrated in Fig. 6. This makes sense if we think that someone who initially sees the probability of \(\mathrm {H}\) as low will take a positive report \(\mathrm {E}\) as an indication that the source is unreliable. Whereas, if the probability of \(\mathrm {H}\) is initially high, a positive report will reinforce the view that the source is reliable.
For a negative piece of evidence \(\lnot \mathrm {E}\), the likelihood ratio is:
$$\begin{aligned} r^{-}&=\frac{p(\mathrm { \mathrm {\lnot E} }|\lnot \mathrm {R})}{p(\mathrm {\lnot E}|\mathrm {R})}\\&=\frac{\sum _{H}p(\mathrm {\lnot E}|\lnot \mathrm {R},H)\,p(H)}{\sum _{H}p(\mathrm {\lnot E}|\mathrm {R},H)\,p(H)}\\&=\frac{{\overline{a}}\,h+{\overline{a}}\,{\overline{h}}}{0\,h+1\,{\overline{h}}}\\&=\frac{{\overline{a}}}{{\overline{h}}} \end{aligned}$$
In this case, if \(h<a\), the posterior probability for the reliability increases, and if \(h>a\), the posterior probability for the reliability decreases. If someone thinks that the probability of \(\mathrm {H}\) is very low, then a piece of negative evidence is taken as an indication that the source is reliable. Whereas, if someone thinks that the probability of \(\mathrm {H}\) is high, then a piece of negative evidence is taken to indicate that the source is not so reliable.
Thus, in this model, even though there cannot be polarisation regarding the hypothesis itself, polarisation regarding the reliability of the source is possible. This could, for example, happen in the legal case if Alice has a lower prior for the reliability of a source of evidence than Bob, \(\rho _{A}<\rho _{B}\), and she also thinks that the prior probability that the accused is guilty is low (\(h_{A}<a\)), while Bob considers it to be high (\(h_{B}>a)\). In that case, a positive piece of evidence will make Alice think the source is even less reliable \((\Delta _{A}^{\mathrm {R}}<0)\), whereas Bob will think it is more reliable \((\Delta _{B}^{\mathrm {R}}>0)\).
Multiple pieces of evidence from independent sources
In some of the canonical experiments on belief polarisation, the subjects were presented with mixed evidence—that is, multiple pieces of evidence where some of the evidence appears to be in favour of the hypothesis, whereas some goes against it. As we saw in Sect. 3, we can represent this as a situation where the agents are presented with n pieces of evidence \(E_{1},...,E_{n}\), all coming from different and independent sources.Footnote 4 For example, in a legal case, the jury might receive a police report, a report from forensic investigation, and eyewitness reports. In principle, each of these reports should be independent of the others. The graphical structure of a source reliability model representing these cases is depicted in Fig. 5. This structure together with the Markov condition guarantees the independence of the variables \(R_{1},...,R_{n}\) and, thus, the independence of the n sources. Again, we assume the probabilistic constraints in the table in Fig. 5 in order to keep things simple and to guarantee that the variables \(R_{1},...,R_{n}\) represent the reliability of the different sources assigned by the agents.
Updating probability for hypothesis
The direction of updating on the probability for the hypothesis \(\mathrm {H}\) is determined, as we saw in Sect. 2, by the likelihood ratio:
$$\begin{aligned} l=\frac{p(E_{1},E_{2},...,E_{n}|\lnot \mathrm {H})}{p(E_{1},E_{2},...,E_{n}|\mathrm {H})} \end{aligned}$$
On the basis of the DAG in Fig. 5, the denominator \(p(E_{1},E_{2},...,E_{n}|\mathrm {H})\) is
$$\begin{aligned} p(E_{1},E_{2},...,E_{n}|\mathrm {H})=\prod _{i}\sum _{R_{j}}p(E_{i}|R_{j},\mathrm {H})\,p(R_{j}) \end{aligned}$$
and likewise for the numerator \(p(E_{1},E_{2},...,E_{n}|\lnot \mathrm {H})\). Thus, the likelihood ratio for \(\mathrm {H}\) factorises as
$$\begin{aligned} l&=\prod _{i}l_{i}\nonumber \\&=\prod _{+}l_{i}^{+}\prod _{-}l_{i}^{-} \end{aligned}$$
(3)
where \(l_{i}\) is the likelihood ratio for observation \(E_{i}\). This is a product of the likelihood ratios for the positive pieces of evidence (for which \(l_{i}=l_{i}^{+}=\frac{p(\mathrm {E_{i}|\lnot \mathrm {H})}}{p(\mathrm {E_{i}|H)}}=\frac{1}{1+\frac{\rho _{i}}{a\,{\overline{\rho }}_{i}}}\)) and the likelihood ratios for the negative pieces of evidence (for which \(l_{i}=l_{i}^{-}=\frac{p(\mathrm {\lnot E_{i}|\lnot \mathrm {H})}}{p(\mathrm {\lnot E_{i}|H)}}=1+\frac{\rho _{i}}{{\overline{a}}\,{\overline{\rho }}_{i}}\)).
As in the model in Subsect. 4.1, the likelihood ratio does not depend at all on the prior h for the hypothesis \(\mathrm {H}\). It does, however, depend on the priors for the reliabilities of the sources, namely \(\rho _{i}\). If Alice and Bob assign different priors for the reliability of sources, they may update to different extents on each piece of evidence. In some situations this may give rise to belief polarisation. Suppose, for example, that Alice and Bob receive two pieces of evidence. Alice starts with a higher prior for \(\mathrm {H}\) than Bob. She also assigns a higher prior to the reliability of the source of the first piece of evidence than to the source of the second piece of evidence. Bob, on the other hand, assigns a higher prior to the reliability of the source of the second piece of evidence than the first. Suppose Alice and Bob now receive a positive piece of evidence from the first source and a negative piece of evidence from the second. Then because Alice initially trusts the first source more than the second, she is more responsive to the positive evidence than the negative. The overall effect of updating on the mixed evidence is that Alice’s probability increases, \(\Delta _{A}^{\mathrm {H}}=p_{A}(\mathrm {H}|\mathrm {E_{1},\mathrm {\lnot E_{2})}- p_{A}\mathrm {(H)} }>0\). Bob, on the other hand, is more responsive to the negative evidence than to the positive evidence, and so his probability decreases, \(\Delta _{B}^{\mathrm {H}}=p_{B}(\mathrm {H}|\mathrm {E_{1},\mathrm {\lnot E_{2})}- p_{B}\mathrm {(H)} }<0\). Thus there is belief polarisation. Notice that given the same reliability priors, if Alice had started with a lower prior for \(\mathrm {H}\) than Bob, the same updating can result in convergence of their posterior probabilities. Both cases are illustrated in Fig. 7.
In general, the probability update \(\Delta ^{\mathrm {H}}\) depends on the reliability priors for the different sources, the value of the parameter a, and the relative number of pieces of positive and negative evidence. The probability update \(\Delta ^{\mathrm {H}}\) is negative when the overall contribution of the negative evidence outweighs the overall contribution of the positive evidence in making the overall likelihood ratio greater than one. The condition for this is:
$$\begin{aligned} \prod _{i}^{-}\left( 1+\frac{\rho _{i}}{{\overline{a}}\,{\overline{\rho }}_{i}}\right) >\prod _{i}^{+}\left( 1+\frac{\rho _{i}}{a{\overline{\rho }}_{i}}\right) \end{aligned}$$
Here, the product on the left hand side is over all negative pieces of evidence and the product on the right is over all positive pieces of evidence. For the special case where we have just two pieces of evidence, one positive and one negative, the likelihood ratio is
$$\begin{aligned} l^{+-}=\left( \frac{1}{1+\frac{\rho _{1}}{a\,{\overline{\rho }}_{1}}}\right) \left( 1+\frac{\rho _{2}}{{\overline{a}}\,{\overline{\rho }}_{2}}\right) \end{aligned}$$
which is greater than one when:
$$\begin{aligned} \left( 1+\frac{\rho _{2}}{{\overline{a}}\,{\overline{\rho }}_{2}}\right) >\left( 1+\frac{\rho _{1}}{a{\overline{\rho }}_{1}}\right) \end{aligned}$$
This occurs when:
$$\begin{aligned} \frac{\rho _{2}}{{\overline{\rho }}_{2}}>\frac{{\overline{a}}\,\rho _{1}}{a\,{\overline{\rho }}_{1}} \end{aligned}$$
For any given a, there is thus a region on the \(\rho _{1}-\rho _{2}\) plane where it is possible that the size of the update on the negative evidence is greater than the size of the update on the positive evidence. In these cases, the probability for H decreases given both pieces of evidence (\(\Delta ^{\mathrm {H}}<0\)).
Suppose now that \(h_{A}<h_{B}\). Polarisation will occur exactly when Alice’s priors for reliability fall in the region where \(\Delta ^{\mathrm {H}}\) is negative, and when Bob’s priors for reliability fall in the region where \(\Delta ^{\mathrm {H}}\) is positive. Such a case is illustrated in Fig. 8.
The effect of varying a on the regions is shown in Fig. 9. We see that the region where \(\Delta ^{\mathrm {H}}<0\) grows for higher a. This is because, as we have seen, for higher a, a positive piece of evidence has less effect on the probability update and a negative piece of evidence has more. An interesting case is where \(a=0.5.\) In this case, the probability of getting positive evidence when the hypothesis is false is equal to the probability of getting negative evidence when the hypothesis is true. In this balanced situation, the probability update \(\Delta ^{\mathrm {H}}\) is positive on exactly half of the \(\rho _{1}-\rho _{2}\) plane. Since \(\Delta _{A}^{\mathrm {H}}\) is negative in half of Alice’s parameter space of reliability priors, and \(\Delta _{B}^{\mathrm {H}}\) is positive in half of Bob’s parameter space, belief polarisation will occur in one quarter of the total parameter space of Alice and Bob’s reliability priors. This is the maximum proportion of the parameter space on which belief polarisation can occur. Moving a away from 0.5 reduces the size of the region where belief polarisation occurs, since it produces an imbalance between the size of the \(\Delta ^{\mathrm {H}}<0\) and \(\Delta ^{\mathrm {H}}>0\) regions. Similarly, as we increase the amount of evidence beyond two pieces of evidence, an imbalance between the number of pieces of positive and negative evidence will also produce such an imbalance between the size of the \(\Delta ^{\mathrm {H}}<0\) and \(\Delta ^{\mathrm {H}}>0\) regions, and hence reduce the region in which belief polarisation occurs. In the case where the amount of evidence is increased, but the evidence remains balanced (i.e., equal numbers of pieces of positive and negative evidence), the proportion of the parameter space where belief polarisation occurs still cannot be increased beyond the maximum of one quarter.Footnote 5
Updating probability for reliability
We will now look at the direction of update of the posterior probabilities for the reliabilities when there are multiple pieces of evidence. We compute the likelihood ratio for one of the reliabilities, say \(\mathrm {R}_{1}\):
$$\begin{aligned} r=\frac{p(\mathrm { E }_{1},E_{2},...,E_{n}|\lnot \mathrm {R}_{1})}{p(E_{1},E_{2},...,E_{n}|\mathrm {R}_{1})} \end{aligned}$$
(4)
On the basis of the DAG in Fig. 5, the denominator \(p(E_{1},E_{2},...,E_{n}|\mathrm {R}_{1})\) can be computed as:
$$\begin{aligned} p(E_{1},E_{2},...,E_{n}|\mathrm {R}_{1})&=\sum _{H,R_{2},R_{3},...,R_{n}}\,p(H)\,p(E_{1}|H,\mathrm {R}_{1}) p(E_{2}|H,R_{2})\,p(R_{2})\,...\,p(E_{n}|H,R_{n})\,p(R_{n})\nonumber \\&=\sum _{H}p(H)\,p(E_{1}|H,\mathrm {R}_{1})\prod _{j=2}^{n}p(E_{j}|H)\nonumber \\&=h\,p(E_{1}|\mathrm {H},\mathrm {R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {H})+{\overline{h}}\,p(E_{1}|\mathrm {\lnot H},\mathrm {R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {\lnot H}) \end{aligned}$$
(5)
A similar calculation gives the numerator:
$$\begin{aligned} p(E_{1},E_{2},...,E_{n}|\mathrm {\lnot R}_{1}) =h\,p(E_{1}|\mathrm {H},\mathrm {\lnot R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {H})+{\overline{h}} p(E_{1}|\mathrm {\lnot H},\mathrm {\lnot R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {\lnot H}) \end{aligned}$$
(6)
Substituting into Eq. 4 the expressions given by Eqs. 5 and 6 thus gives the likelihood ratio:
$$\begin{aligned} r&=\frac{h\,p(\mathrm { E }_{1}|\mathrm {H},\mathrm {\lnot R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {H})+{\overline{h}}\,p(\mathrm { E }_{1}|\mathrm {\lnot H},\mathrm {\lnot R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {\lnot H})}{h\,p(E_{1}|\mathrm {H},\mathrm {R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {H})+{\overline{h}}\,p(\mathrm { E }_{1}|\mathrm {\lnot H},\mathrm {R}_{1})\prod _{j=2}^{n}p(E_{j}|\mathrm {\lnot H})}\nonumber \\&=\frac{h\,p(\mathrm { E }_{1}|\mathrm {H},\mathrm {\lnot R}_{1})+{\overline{h}}\,p(\mathrm { E }_{1}|\mathrm {\lnot H},\mathrm {\lnot R}_{1})\prod _{j=2}^{n}l_{j}}{h\,p(\mathrm { E }_{1}|\mathrm {H},\mathrm {R}_{1})+{\overline{h}}\,p(\mathrm { E }_{1}|\mathrm {\lnot H},\mathrm {R}_{1})\prod _{j=2}^{n}l_{j}} \end{aligned}$$
(7)
Consider the special case where \(n=2\) and suppose the first piece of evidence is positive, \(\mathrm {E_{1}},\) and the second piece of evidence is negative, \(\lnot \mathrm {E_{2}}.\) Then the likelihood ratio
$$\begin{aligned} r^{+-}=\frac{p(\mathrm {\mathrm {E}}_{1},\mathrm {\lnot E}_{2}|\lnot \mathrm {R}_{1})}{p(\mathrm {E}_{1},\mathrm {\lnot E}_{2}|\mathrm {R}_{1})} \end{aligned}$$
is given by substituting \(l_{2}=l^{-}\), given by the expression in Eq. 2, into Eq. 7:
$$\begin{aligned} r^{+-}=\frac{a}{h}\left( h+{\overline{h}}\,(1+\frac{\rho _{2}}{{\overline{a}}\,{\overline{\rho }}_{2}})\right) \end{aligned}$$
Thus the updating of the probability for \(\mathrm {R_{1}}\) depends on the prior h and the prior \(\rho _{2}\), as well as a. Again, there is a region of parameter space where \(\Delta ^{\mathrm {R_{1}}}>0\) and a region where \(\Delta ^{\mathrm {R_{1}}}<0\). These regions are shown in Fig. 10a for the case where \(\rho _{1}=0.1\) and \(a=0.1\). There can be cases of polarisation on \(R_{1}\) where Alice has priors in the \(\Delta ^{\mathrm {R_{1}}}<0\) region and Bob has priors in the \(\Delta ^{\mathrm {R_{1}}}>0\) region. In Fig. 10b we show how these regions intersect with the regions where \(\Delta ^{\mathrm {H}}>0\) and \(\Delta ^{\mathrm {H}}<0\). The parameter space is then divided into four regions. It is then possible that Alice and Bob’s probabilities for both the hypothesis and the reliability update in the same direction (when they both have priors in the same region). But it is also possible for certain choices of priors that Alice and Bob polarise on the hypothesis, or on the reliability, or both.