1 Introduction

This article presents a formalisation of the Popov v. Hayashi case in an instance of Prakken’s (2010) framework for argument-based inference with structured arguments. This framework further develops Amgoud et al. (2006)’s attempt to integrate work on rule-based argumentation of Pollock (1994), Vreeswijk (1997), Prakken and Sartor (1997) and others. It defines arguments as inference trees formed by applying two kinds of inference rules, strict and defeasible rules. This naturally leads to three ways of attacking an argument: attacking a premise, attacking a conclusion and attacking an inference. To resolve such conflicts, preferences may be used, which leads to three corresponding kinds of defeat: undermining, rebutting and undercutting defeat. The framework is abstract in that it applies to any set of inference rules, as long as it is divided into strict and defeasible ones, and to any logical language with a contrary relation defined over it. Moreover, since it associates each knowledge base with a set of arguments ordered by a binary relation of defeat, the acceptability status of arguments can be defined in terms of Dung’s (1995) well-understood abstract approach to argument acceptability.

The choice for tree-structured arguments based on two types of inference rules arguably is very natural, not only in light of logic and argumentation theory but also when looking at argumentation as it occurs in human thinking and dialogue. The notion of arguments as trees of inferences is very common in standard logic and in argumentation theory, and it is the basis of many software tools for argument visualisation. Moreover, in actual argumentation humans often express their arguments as claims supported with one or more premises, which can in turn be supported with further premises, and so on, which results in tree structures. The tree form is therefore very suitable for capturing the dependency relations between the various elements of the court’s decision. Finally, the framework is very suited for modelling reasoning with argumentation schemes, by regarding such schemes as defeasible inference rules.

The instance of the framework assumed in this paper will draw on my previous work (partly with others) on rule-based argumentation (Prakken and Sartor 1997), reasoning with argument schemes (Bex et al. 2003; Prakken 2005a), accrual of arguments (Prakken 2005b) and reasoning under burden of proof (Prakken and Sartor 2009).

The main purpose of the present reconstruction is theoretical, namely, to test the adequacy of the framework of Prakken (2010) for giving a formal reconstruction of the Popov v. Hayashi case. More precisely, the object of the formalisation is the relevant facts and rules stated by judge McCarthy and the arguments constructed by him on the basis of these facts and rules. The reconstruction will not be descriptive but rational: the aim is to reconstruct the judge’s reasoning as being, as far as possible, rationally sound. Assuming that the judge’s reasoning in the case is by and large sound, the extent to which such a reconstruction is possible indicates the suitability of the formal framework.

A secondary aim of the case study is more practical, namely, to give a reconstruction as could be made by lawyers or law students with so-called sensemaking software. With such software, e.g. Reed and Rowe’s (2004) Araucaria or Van den Braak’s (2010) Avers, users can draft their own arguments or analyse given arguments, in order to improve the quality of their arguments or to better grasp the issues in a case. The present case study is also meant to test the suitability of the framework of Prakken (2010) as a theoretical basis for the design of sensemaking software for legal applications. Since users of such software will not be experts in knowledge representation, the main focus of this case study is on the general structure of arguments and their relations instead of on detailed knowledge representation issues, such as time, action, causation and intention. The only representation issues that will be discussed in detail are the representation of the validity of rules and of rule-exception structures, since these issues turn out to be crucial for capturing judge McCarthy’s reasoning. The focus on sensemaking applications also means that the reconstruction of the case is ideally as close to the judge’s actual formulation as possible, within the bounds defined by the primary aim to give a rational instead of purely descriptive reconstruction of the case.

This article is organised as follows. First in Sect. 2 the instance of Prakken’s (2010) framework used in this article will be semi-formally presented. Then in Sect. 3 the formalisation of the case will be described in two parts: the facts (Sect. 3.1) and the legal analysis (Sect. 3.2). This section only presents the facts and rules discussed by judge McCarthy; the arguments that he constructs from them are presented in Sect. 4. The formalisation will be evaluated in Sect. 5.

2 The formalism

In this section the instance of Prakken’s (2010) framework used in this article will be semi-formally presented. For the full details the reader is referred to the technical report.

2.1 The basic definitions

The basic notion of the present framework is that of an argumentation system, which extends the familiar notion of a proof system with a distinction between strict and defeasible inference rules and a preference ordering on the defeasible inference rules. More precisely:

An argumentation system is a tuple \(AS = (\mathcal{L},{}^{-}, \mathcal{R},\leq)\) where

  • \(\mathcal{L}\) is a logical language,

  • is a contrariness function from \(\mathcal{L}\) to \({2^{\mathcal{L}}, }\)

  • \(\mathcal{R} = \mathcal{R}_s \cup \mathcal{R}_d\) is a set of strict \((\mathcal{R}_s) \)and defeasible \((\mathcal{R}_d)\) inference rules such that \(\mathcal{R}_s \cap \mathcal{R}_d = \emptyset, \)

  • ≤ is a partial preorder on \(\mathcal{R}_d\).

In this article the logical language \(\mathcal{L}\) will be that of first-order logic augmented with a defeasible conditional \(\leadsto \). To avoid excessive formalism, the facts and rules will be expressed in a semi-formal way. The contrariness function specifies which formulas cannot be accepted together. In the present paper it will be used to capture classical negation (formulas \(\varphi\) and \(\neg \varphi\) are each other’s contraries) and to express some domain-specific conflict relations (e.g. ‘x must return the baseball to y’ and ‘x and y must equally share the value of the baseball’ will be each other’s contraries).

Strict rules are of the form \(\varphi_1,\,\ldots,\,\varphi_n\,\rightarrow\,\varphi,\) interpreted as ‘if the antecedents \(\varphi_1, \ldots,\varphi_n\) are acceptable, then the consequent \(\varphi\) must be accepted, no matter what’. Defeasible rules are written as \(\varphi_1,\,\ldots,\,\varphi_n\,\Rightarrow\,\varphi,\) meaning ‘if the antecedents are acceptable, then the consequent must be accepted if there is no good reason not to accept it’. Although the general framework allows for domain-specific inference rules, as used by e.g. Prakken and Sartor (1997), the present instance will only use inference rules that express general inference patterns; domain-specific rules will instead be represented in \(\mathcal{L}\) with the material implication \(\supset\) (if exceptionless) or with the connective \(\leadsto\) (if subject to exceptions). The set \(\mathcal{R}_s\) of strict inference rules will be assumed to contain all deductively valid propositional and first-order inferences, while the set \(\mathcal{R}_d\) of defeasible inference rules contains a defeasible modus ponens rule for \(\leadsto\) and some argument schemes specified below in Sect. 2.2. The preference ordering ≤ will not be needed and is therefore not specified.

Arguments are constructed from a knowledge base \(\mathcal{K}\) which, inspired by Gordon et al. (2007), is assumed to contain four kinds of formulas:

  • \(\mathcal{K}_n\) is a set of (necessary) axioms. Intuitively, arguments cannot be attacked on their axiom premises.

  • \(\mathcal{K}_p\) is a set of ordinary premises. Intuitively, arguments can be attacked on their ordinary premises, and whether this results in defeat must be determined by comparing the attacker and the attacked premise.

  • \(\mathcal{K}_a\) is a set of assumptions. Intuitively, arguments can be attacked on their assumptions, where these attacks always succeed.

  • \(\mathcal{K}_i\) is a set of issues. Intuitively, arguments of which the premises include an issue are never acceptable: an issue must always be backed with a further argument.

In the present paper only ordinary premises and assumptions will be needed. In the general framework a partial priority ordering is assumed on \(\mathcal{K} \setminus K_n\). Since it will not be needed for the present formalisation, it is not specified here.

Arguments can be built by chaining inference rules into trees, starting with elements from the knowledge base. The following definition is adapted from Vreeswijk (1997). In this definition, for any argument A, the function \(\tt{Prem}\) returns all the formulas of \(\mathcal{K}\) (called premises) used to build \(A, \,\tt{Conc}\) returns A’s conclusion and \(\tt{Sub}\) returns all of A’s sub-arguments.

An argument A on the basis of a knowledge base \((\mathcal{K},\leq')\) in an argumentation system \((\mathcal{L},{}^{-},\mathcal{R},\leq)\) is:

  1. 1.

    \(\varphi\) if \(\varphi\,\in\,\mathcal{K}\) with: \(\tt{Prem}(A) = \{\varphi\};\,\tt{Conc}(A)\,= \varphi;\,\tt{Sub}(A)= \{\varphi\}\).

  2. 2.

    \(A_1, \ldots\,\, A_n \rightarrow/\Rightarrow \psi\) if \(A_1, \ldots, \, A_n\) are arguments such that there exists a strict/defeasible rule \(\tt{Conc}(A_1), \ldots,\tt{Conc}(A_n) \rightarrow/\Rightarrow \psi\) in \(\mathcal{R}_s/\mathcal{R}_d\).

    $$ \begin{aligned} \tt{Prem}(A&= \tt{Prem}(A_1) \cup \ldots \cup \tt{Prem}(A_n),\\ \tt{Conc}(A)&= \psi,\\ \tt{Sub}(A)&= \tt{Sub}(A_1) \cup \ldots \cup \tt{Sub}(A_n) \cup \{A\}. \end{aligned} $$

Consider by way of example a knowledge base in an argumentation system with

$$ \begin{aligned} {\mathcal{R}}_s &=\{p,q \rightarrow s; u,v \rightarrow w \}; {\mathcal{R}}_d =\{p \Rightarrow t; s,r,t \Rightarrow v \}\\ {\mathcal{K}}_n &= \{q\}; {\mathcal{K}}_p = \{p,u\}; {\mathcal{K}}_a = \{r\} \end{aligned} $$

An argument for w is displayed in Fig. 1. The type of a premise is indicated with a superscript.

Fig. 1
figure 1

An argument

Formally the argument and its subarguments are written as follows:

$$ \begin{array}{ll} A_1{:}\, p & A_5{:}\, A_1 \Rightarrow t\\ A_2{:}\, q & A_6{:}\, A_1,A_2 \rightarrow s\\ A_3{:}\, r & A_7{:}\, A_5,A_3,A_6 \Rightarrow v\\ A_4{:}\, u & A_8{:}\, A_7,A_4 \rightarrow w\\ \end{array} $$

We have that

$$ \begin{aligned} \tt{Prem}(A_8) &= \{p,q,r,u\}\\ \tt{Conc}(A_8) &= w\\ \tt{Sub}(A_8) &= \{A_1, A_2, A_3,A_4,A_5,A_6,A_7,A_8\}\\ \end{aligned} $$

Arguments can be attacked on their premises (except on axioms) and on their applications of defeasible inference rules. The latter can be done in two ways: by rebutting the argument, i.e, by providing an argument for a contrary conclusion, or by undercutting the argument, i.e, by concluding that the attacked inference rule does not apply to the case at hand. Rebutting and undercutting attacks can be directed towards an argument’s final conclusion or towards one of its intermediate conclusions, that is, to a conclusion of a subargument. In our example, argument A 8 can be attacked on its premises p and r but not on q. It can be rebutted on s and w and undercut on the inferences of t and v. In general, premise attacks and rebuttals must be resolved with the priority orderings on \(\mathcal{R}_d\) and \(\mathcal{K}\) to see whether the attack results in defeat. However, since in the present reconstruction these orderings are not needed, all three forms of attack will result in defeat.

Having thus defined a set of arguments ordered by a binary defeat relation, any of Dung’s (1995) semantics can be used to define the acceptability status of arguments. It will turn out that for present purposes their differences do not matter and their application is straightforward. For this reason these semantics will not be discussed here.

2.2 The argument schemes

Because of its distinction between strict and defeasible inference rules, the framework is suitable for modelling reasoning with argument schemes, which currently is an important topic in the computational study of (legal) argument (Walton et al. 2008; Gordon et al. 2007; Atkinson and Bench-Capon 2007). Argument schemes are stereotypical non-deductive patterns of reasoning, consisting of a set of premises and a conclusion that is presumed to follow from them. Uses of argument schemes are evaluated in terms of critical questions specific to the scheme. A natural way to formalise reasoning with argument schemes is to regard them as defeasible inference rules and to regard critical questions as pointers to counterarguments; cf. (Verheij 2003; Bex et al. 2003). Since attacks on premises and conclusions are captured by the general definitions of rebutting and premise attack, below only undercutters of argument schemes will be listed.

First two schemes needed for reasoning with and about rules will be defined. As said above, domain-specific defeasible rules will be represented in the object language \(\mathcal{L}\) with a connective \(\leadsto\). (Such domain-specific defeasible rules must not be confused with the defeasible inference rules from R d , which are specified in the metalanguage of \(\mathcal{L}\). Below the context will disambiguate the use of the term ‘rule’). Since in the reconstruction of the case the validity of (domain-specific) rules will turn out to be crucial, Hage’s (1996) conceptual distinction (though not his formalisation) between the general validity of rules and their applicability to a case at hand will be adopted. To capture this distinction, an argument scheme will be used that is inspired by recent work of Sartor (2008) and Bex (2009). (The following schemes assume that the rule in \(\mathcal{L}\) of the form \(\varphi \, \leadsto \, \psi\) is named by term \(\mathcal{L}\)-term r.)

Rule validity scheme:

AS rv ::

rule r is valid \(\rightarrow\,\varphi \, \leadsto \, \psi\)

(Since this is a strict rule, it has no undercutters.) The Popov v. Hayashi case contains several explicit arguments about rule validity but most rules are simply stated; in those cases their validity will below be expressed as an assumption-type premise. It will turn out that some rules of the case are legal while others are empirical commonsense rules. Both types of rules will be represented with \(\leadsto;\) their different nature can be captured with different validity conditions.

Once a rule has been derived with the rule validity scheme, it can be applied with the following defeasible modus ponens argument scheme. This scheme has one undercutter (U), namely that the rule is not applicable, which can be used to express exceptions to the rule.

Defeasible modus ponens:

AS dmp ::

\(\varphi \, \leadsto \, \psi, \,\varphi\,\Rightarrow\,\psi\)

U::

rule r is not applicable

Note that while thus all rules are applicable by default (since inapplicability means that there is an exception to the rule), their validity can only be assumed when warranted by a particular assumption-type premise for a particular rule. It seems too strong in general to assume that rules are valid by default.

Later in Sect. 3.2 the need for an analogical rule application scheme will become apparent.

Next some argument schemes are needed to reconstruct the arguments about the facts, namely, the witness testimony scheme familiar from the literature on argument schemes, plus a scheme for arguments from videotape taken from Wyner et al. (2007). The latter scheme will be used to model judge McCarthy’s factual inferences from the video recording that was made of the incident. The witness testimony scheme has a general undercutter in case the witness is not credible. Further rules can describe cases in which witnesses are not credible; these further rules then correspond to the familiar lists of critical questions for this scheme (both schemes below assume a naming convention for elements of \(\mathcal{L}\)).

Witness testimony scheme:

AS wt ::

Witness w says that \(\varphi\,\Rightarrow\,\varphi\)

U::

Witness w is not credible

The argument scheme from video tape has a similar form.

Video tape scheme:

AS vt ::

Video recording v shows that \(\varphi\,\Rightarrow\,\varphi\)

U::

Video recording v is not reliable

Finally, Prakken’s (2005b) scheme for accrual of arguments (acc) will be used. If there are multiple defeasible arguments for the same conclusion, this scheme combines them in a new accrual argument, for which a new strength can be determined. Since the use of this scheme is not crucial for the case, the reader is referred to Prakken (2005b) for the formal details.

3 The formalisation

This section presents the reconstruction of the case. The reconstruction will not be purely descriptive but to some extent ‘rational’ in that incomplete lines of reasoning will be completed in a way that makes them fit the theoretical model. Since this theoretical model is intended to formulate principles for rational argumentation, these completions will therefore make the incomplete arguments ‘valid’ in the sense defined by the theoretical model. However, completions must still be close to the actual text of the decision; speculations as to what the judge could have had in mind will be avoided as much as possible.

3.1 The facts

In the formula names below e is used for evidence, f for material facts and l for legal (or ultimate) facts. Furthermore, rules are named with r or, if proposed by legal experts or the judge, with their initials.

Judge McCarthy observes that the following facts are all proven by the video tape and supporting witness statements. McCarthy explains in some detail that the video tape is reliable evidence since it was was shot from a position with clear view on the events. In two cases he makes statements on the reliability of witnesses, namely, if he wants to reject their testimony. One of these cases is formalised below in Fig. 3. Since apart from these (still minor) issues the proof of the facts was not an issue in the case, their derivation from the evidence with the witness and video tape schemes will be left implicit. The main facts concerning what happened during the baseball match are then as follows.

f 1::

Popov stopped the forward motion of the baseball

f 2::

the baseball was in the glove of Popov

f 3::

Popov was in motion

f 4::

an out of control mob of unknown persons illegally attacked Popov

f 5::

Hayashi did not attack Popov

f 6::

Popov lost contact with the baseball during the violent attack

f 7::

Hayashi retrieved the baseball

f 8::

Popov intended to control the baseball

f 9::

the baseball was hit into the stands

As a matter of fact, McCarthy briefly discusses the proof of f 4. After observing that several witnesses contradicted this fact, he says that the videotape clearly establishes its truth and (footnote 4) that these witnesses made no true statements “in other particulars”, because of which their entire testimony must be rejected. His arguments can be easily reconstructed with the schemes from witness testimony and video tape but since the truth of f 4 was just a minor issue in the case, they will for reasons of space not be formalised.

Later it will turn out that further material facts are needed; they will not be listed in the text but indicated in the argument graphs of Sect. 4. The same holds for procedural and legal facts unless listed below.

3.2 Legal analysis

A central issue in the case is whether certain rules should be adopted. This will be modelled in terms of rule validity. In the case three sources of rule validity are relevant.

v 1::

rule r was used in precedent case \(c\,\leadsto\) rule r is valid

v 2::

rule r is proposed by legal expert \(e\,\leadsto\) rule r is valid

v 3::

rule r is adopted by relevant practice \(p\,\leadsto\) rule r is valid

(The reason why these rules are not modelled as argument schemes is that they are not general patterns of reasoning but specific rules about the validity of legal rules and as such themselves debatable.) This list is not meant to be exhaustive in general but (with a further rule to be specified below) suffices for the present case. An interesting philosophical question is how the validity of these rules can be supported and whether this leads to an infinite regress. However, for present purposes their validity can simply be assumed as as assumption-type premises.

The main legal issue is whether Popov achieved possession or the right to possession as he attempted to catch and hold on to the ball (p. 5). If Popov (who has the burden of persuasion), succeeds in proving this, then further rules justify liability of Hayashi if he does not return the baseball on demand of Popov. Leaving the precise nature of these further rules implicit, this can be formalised as follows:

r 1::

(x has possession of y ∨ x has a right to possession of y) \(\land\,z\) holds \(y\,\land\,x\) demands z to return \(y\,\leadsto\,z\) must return y to x

r 1::

x does not have possession of \(y\,\land\,x\) has no right to possession of \(y\,\land\,z\) holds \(y\,\land\,x\) demands z to return \(y\,\leadsto\,z\) does not have to return y to x

(Since both formulas are based on the same rule, they are given the same name.) According to footnote 14, this rule is apparently based on a precedent:

l 1::

r 1 was used in Metropolitan Life Insurance Company v. San Francisco Bank

With this fact, the validity of r 1 can be derived with validity rule v 1.

3.2.1 Possession

McCarthy first discusses whether Popov has achieved possession of the ball (the first disjunct of the first condition of r 1). The following thee rules state that spectators of baseball games who catch or retrieve a baseball gain possession of the ball.

r 2::

a baseball is hit into the stands \(\leadsto\) the baseball becomes abandoned property

r 3::

a baseball is abandoned property \(\land\,x\) catches the baseball \(\leadsto\,x\) has possession of the baseball

r 4::

a baseball is abandoned property \(\land\,x\) retrieves the baseball \(\leadsto\,x\) has possession of the baseball

Apparently, these rules are generally accepted by baseball fans. Then combined with an obvious implicit commonsense rule, their validity can be derived from rule v 3.

l 2::

rule r 2 confirms to the long-standing expectations of baseball fans

l 3::

rule r 3 confirms to the long-standing expectations of baseball fans

l 4::

rule r 4 confirms to the long-standing expectations of baseball fans

\(\mathit{cs}_{1}{:}\,\) :

rule x confirms to the long-standing expectations of baseball fans \(\leadsto\) rule r is adopted by the relevant practice of baseball

Rule r 4 has an exception in case the ball is not loose when retrieved, for instance, when someone else achieved a right to the ball but involuntarily lost control. The effect of this exception is shown in Fig. 2, which displays why Hayashi cannot win by rules r 4 and r 1. (Alternatively, such cases could be modelled as giving rise to an argument that the ball was not abandoned property any more when Hayashi picked it up. In this interpretation of the concept ‘abandoned property’ the counterargument in Fig. 2 must be targeted at the premise ‘The baseball was abandoned property’.)

r 5::

the ball is not loose when x retrieves it \(\leadsto\) rule r 4 is not applicable

r 6::

(x′ has possession of the baseball ∨ x′ has a right to possession of the baseball ∨ x′ has a qualified right to possession of the baseball) \(\land\,x'\) involuntarily lost control of the baseball \(\land\,x' \not= x\,\leadsto\) the ball is not loose when x retrieves it

(The point of the second and third disjunct of the first condition will become apparent in Sect. 3.2.2.) The validity of r 5 and r 6 was not discussed and will therefore be stated as assumptions. Now while r 2r 3 and r 4 were uncontroversial in the case, a closer analysis of the decision shows that r 3 in fact compresses a more detailed line of reasoning, as will be explained below after rules G 2a and G 2b .

Fig. 2
figure 2

An overruled argument why Hayashi does not have to return the baseball

Since it is not in dispute that Hayashi retrieved the ball, Popov must provide an argument that the ball was not loose when Hayashi retrieved it, for instance, that he had gained possession but then lost control because of the illegal attack by the mob. Since it is not in dispute that he was illegally attacked by the mob and that he at least attempted to catch the ball in order to take possession, the crucial issue is the definition of possession.

At this point McCarthy takes his departure in Professors Bernhardt and Brown’s general definition of possession (not restricted to baseball games):

BB 1a ::

x has physical control over item \(y \,\land x\) intends to control \(y \,\leadsto\)

 :

x has possession of y

BB 1b ::

x has no physical control over item y ∨ x does not intend to control \(y\,\leadsto\,x\)

 :

x does not have possession of y

Assuming that professors Bernhardt and Brown are legal experts, these rules are valid by validity rule v 2 (likewise below for the further rules proposed by professors of law). Popov’s intention to physically control the ball (f 8) was not in dispute. The factual issue was whether he had obtained physical control by his ‘snowcone’ catch. In addition, there is the legal issue whether physical control is indeed a necessary condition for possession.

McCarthy next discusses two alternative ways to use Bernhardt and Brown’s rule to “craft” a definition of possession that is specific to the context of baseball games. He first presents Professor Gray’s opinion, who first adopts r 3 above and then states:

G 1a ::

x has complete control of the baseball when momentum of x and baseball cease \(\leadsto\,x\) catches the baseball

G 1b ::

x does not have complete control of the baseball when momentum of x and baseball cease \(\leadsto\,x\) does not catch the baseball

How can we see these rules as a specialisation of Bernhardt and Brown’s rule to a baseball context? An obvious way is to add the rules

G 2a ::

x catches the baseball \(\leadsto\,x\) has physical control over the baseball

G 2b ::

x does not catch the baseball \(\land\,x\) does nothing else to gain physical control \(\leadsto\,x\) does not have physical control over the baseball

The specialisation thus takes the form of specifying a way to derive the condition of BB 1a or BB 1b in the context of a baseball game. Note that on this account r 3 turns out to be a compression of a line of reasoning, namely, by hypothetically assuming its conditions, then applying G 2a to them and then applying BB 1a to the thus derived consequent of G 2a . Such compressions were analysed by Loui and Norman (1995) as a kind of case rationale and recently by Bench-Capon and Prakken (2010) as a way to generate tests in US Supreme Court oral arguments. Alternatively, r 3 can be retained in its original form: the result is then the same but the logical connection with BB 1a and BB 1b is lost.

Professor Gray then says that a baseball which is dislodged by incidental contact with an inanimate object or another person is not possessed. This can be regarded as specifying a case in which the first condition of G 1a is not satisfied. On this account, Professor Gray again compresses a hypothetical line of reasoning, this time using the following rule:

G 3::

x lost the baseball before momentum of x and baseball ceased \(\land\,x\) lost the baseball by incidental contact \(\leadsto\,x\) does not have complete control of the baseball when momentum of x and the baseball cease

The hypothetical line of reasoning starts by assuming the conditions of G 3 and then applies G 3 to them to derive that x does not have complete control of the baseball when momentum of x and the baseball cease. This conclusion then satisfies the condition of G 1b , which leads to the conclusion that x does not catch the baseball. Hence G 2a cannot be applied so there is no way to apply BB 1a to conclude that Popov gained possession of the ball. It is this hypothetical line of reasoning that is arguably compressed in professor Gray’s statement that a baseball which is dislodged by incidental contact with an inanimate object or another person is not possessed.

McCarthy next discusses professors Bernhardt and Finkelman’s alternative definition of possession, according to which possession can also result if no complete physical control is achieved:

BF 1::

x is actively and ably engaged in efforts to establish complete control of item \(y\,\leadsto\,x\) possesses item y

BF 2::

x intends to take control of a baseball \(\land\,x\) manifests this intention by stopping the forward motion of the ball \(\leadsto\,x\) is actively and ably engaged in efforts to establish complete control of the baseball

These rules are valid since they are the rationale of a set of precedents concerning the fishing and hunting of wild animals and the salvage of sunken vessels (such as the famous triple Pierson v. Post, Keeble v. Hickeringill and Young v. Hitchens, widely discussed in AI and Law). In addition, McCarthy gives a further reason why the exception is valid, namely that this rule is influenced by the custom and practice in each of these industries.

Since McCarthy explicitly discusses which rule he should adopt in the present case, an analysis in terms of applicability of valid rules seems more suitable than an analysis in terms of priority between conflicting valid and applicable rules. More precisely, McCarthy says that BF 1 applies in contexts where it is impossible to achieve complete physical control of an item, as in the fishing and hunting of wild animals and the salvage of sunken vessels, while Gray’s analysis applies in contexts where such control is possible. Since the undercutter for defeasible modus ponens make rules applicable by default, the following inapplicability rules suffice to capture this analysis:

a 1::

full physical control is impossible \(\leadsto\) applying BB 1b is unworkable and unreasonable

a 2a ::

applying rule r is unworkable and unreasonable \(\leadsto\) rule r is not applicable

a 2b ::

full physical control is possible \(\leadsto\) rule BF 1 is not applicable

McCarthy leaves the validity of these rules implicit.

After thus having adopted professor Gray’s analysis, McCarthy observes that Popov did not lose control of the ball by incidental contact but by an intended attack by a violent mob, so that Gray’s rule G 3 cannot be used to argue that Popov did not achieve possession of the baseball: one condition of this rule is not fulfilled; not even an argument can be built for it. He then discusses whether Popov would have gained control over the ball had he not been attacked. Here he seems to read a counterfactual element into Gray’s rules; this reading can be captured by modifying G 1a and G 1b as follows:

G 1a'::

x has complete control of the baseball when momentum of x and baseball cease ∨ (x lost control of the baseball during a violent attack \(\land\,x\) would have had complete control of the baseball when momentum of x and baseball ceased if x had not been attacked) \(\leadsto\)

 :

x catches the baseball

G 1b'::

x does not have complete control of the baseball when momentum of x and baseball cease ∨

 :

(x lost control of the baseball during a violent attack \(\land\)

 :

x would not have had complete control of the baseball when momentum of x and baseball ceased if x had not been attacked) \(\leadsto\)

 :

x does not catch the baseball

McCarthy next applies the counterfactual element of this rule to the facts, observing that

Mr. Popov has not established by a preponderance of evidence that he would have retained control of the ball after all momentum ceased and after any incidental contact with people or objects. Consequently, he did not achieve full possession. (p. 8)

McCarthy here observes in passing that Popov has the burden of persuasion for his claim that he would have retained control. It therefore seems justified to interpret his remark that Popov did not achieve full possession as saying that Popov has not been able to prove full possession. This is a metalevel observation about what can be concluded from the available arguments. Now there was some evidence for Popov’s claim, namely the facts f 1 (Popov stopped the forward motion of the baseball) and f 2 (The baseball was in the glove of Popov) and Popov’s own testimony:

e 1::

Witness Popov says he had full control of the baseball when he was attacked

From Popov’s testimony and the following obvious commonsense rule

\(\mathit{cs}_2{:}\, \) :

x had full control of the baseball when he was attacked \(\leadsto\)

 :

x would have had complete control of the baseball when momentum of x and baseball ceased if x had not been attacked

an argument from witness testimony can be constructed, which is undercut by McCarthy with the following words and implicit commonsense rule:

e 2::

Mr. Popov’s testimony on this question is inconsistent on several points, ambiguous on others and, on the whole, unconvincing

\(\mathit{cs}_3{:}\, \) :

\(e_2\,\leadsto\) Popov is not credible

Next, from f 1 and f 2 an argument can be constructed if the following commonsense generalisation is assumed to be valid:

\(\mathit{cs}_4{:}\, \) :

\(f_1 \land f_2 \, \leadsto\) Popov would have had complete control of the baseball when momentum of Popov and baseball ceased if Popov had not been attacked

Then McCarthy’s statement that Popov’s claim cannot be proven implies an argument against the validity of this rule, which can arguably be based on facts f 3 (Popov was in motion when he was attacked) and f 6 (Popov lost contact with the baseball during the violent attack).

\(\mathit{cs}_5{:}\, \) :

\(f_3\,\land\,f_6\,\leadsto\,{cs}_4\) is not valid

Now that both of Popov’s arguments that he would have retained full control of the baseball if he had not been attacked are defeated, their accrual is also defeated (see Fig. 3).

Fig. 3
figure 3

An overruled argument why Popov has possession of the baseball

3.2.2 (Qualified) right to possession

After having decided that Popov never achieved full possession of the ball (Fig. 3), McCarty turns to the second disjunct of the first condition of r 1, namely, whether Popov at least achieved a right to possession. He observes that in cases like this one, where the court “sits in equity”, it has the authority to state its own rules “to achieve fundamental fairness”:

v 4::

rule r concerns legal action \(x\,\land\,x\) is an action in equity \(\land\) rule r promotes fundamental fairness as regards \(x\,\land\) court c adopts rule r to decide \(x\,\leadsto\) rule r is valid

v 4::

rule r concerns legal action \(x\,\land\,x\) is an action in equity \(\land\) rule r does not promote fundamental fairness as regards \(x\,\leadsto\) rule r is not valid

This validity rule, of which McCarthy leaves the validity implicit, seems a key element in the decision, since without it McCarthy may not have had any legal ground for the decision he considers fair.

McCarthy then proposes the following rule:

MC 1::

x undertakes significant but incomplete steps to achieve possession of \(y\,\land\)

 :

y is abandoned property \(\land\)

 :

the effort of x is interrupted by unlawful acts of others \(\leadsto\)

 :

x has a legally cognizable pre-possessory interest in y

and states that it is “consistent” with the principle formalised above as v 4. Thus he in fact states that all conditions of v 4 are satisfied for this rule. From McCarthy’s statements it seems obvious that he also assumes the following rules:

MC 2::

x has a legally cognizable pre-possessory interest in \(y\,\leadsto\)

 :

x has a qualified right to possession of y

MC 3::

x has a qualified right to possession of \(y\,\leadsto\,x\) has no right to possession of y

\(\mathit{cs}_6: \) :

x has a qualified right to possession of \(y\,\leadsto\,z\) has a legitimate claim to y

Figure 4 displays the argument why Popov has a legitimate claim to the baseball. Note the word “qualified” in the consequent of MC 2, which makes it insufficient to fulfill the second disjunct of the first condition of r 1, so that further rules are needed if Popov’s interest is to be satisfied.

Fig. 4
figure 4

Justified arguments that both Popov and Hayashi have a legitimate claim

McCarthy now first says that “The court is required to balance the interests of all parties”. This can be read as saying that a rule that does not balance the interests of all parties is unfair:

MC 4::

rule r does not balance the interests of all parties \(\leadsto\)

 :

rule r does not promote fundamental fairness as regards x

McCarthy leaves the validity of this rule implicit. He then observes that Hayashi also has legitimate interests in the baseball (see also the second argument in Fig. 4).

MC 5a ::

z retrieved \(y\,\land\,z\) did nothing wrong when achieving control over \(y \, \leadsto \, z\) has a legitimate claim to y

MC 5b ::

x has possession of y ∨ x has the right to possession of \(y \, \leadsto \)

 :

MC 5a is not applicable

McCarthy then first rejects the following two candidate decision rules as not taking the interests of both parties into account (see the arguments in Fig. 5):

r p ::

x has a qualified right to possession of \(y\,\land\,z\) holds \(y\,\land\)

 :

x demands z to return \(y\,\leadsto\,z\) must return y to x

r h ::

z has a legitimate claim to \(y\,\leadsto\,z\) does not have to return y to x

Then McCarthy arguably states the following two rules (again leaving their validity implicit), which together and combined with MC 4 say that if both parties have a legitimate claim to a property, deciding for one of them is unfair:

MC 6::

x has a legitimate claim to \(y\,\land\,x\) is defendant in a conversion case \(\land\)

 :

rule r finds for plaintiff \(\leadsto\)

 :

rule r does not balance the interest of all parties

MC 7::

x has a legitimate claim to \(y\,\land\,x\) is plaintiff in a conversion case \(\land\)

 :

rule r finds for defendant \(\leadsto\)

 :

rule r does not balance the interest of all parties

It is interesting to see what happens if, in McCarthy’s words, a psychic can tell whether Popov would have achieved full control had he not been attacked by the mob. If Popov would have had complete control of the baseball without the violent attack, then the conditions of MC 5b can be derived via G 1a' so MC 5a cannot be used to derive that Hayashi has a legitimate interest in the ball. So in that case Popov wins by r 3. If, on the other hand, Popov would not have had complete control of the baseball without the violent attack, then arguably a commonsense rule applies that his steps to achieve possession of the baseball were not significant so that the first condition of MC 1 is not satisfied and it cannot be derived that Popov has a legitimate interest in the baseball. Moreover, in that case G 1b' makes that Popov did not achieve possession, so in that case Hayashi wins by r 4. Thus the present reconstruction captures that McCarthy’s decision crucially depends on the fact that it cannot be proven what would have happened had Popov not been attacked.

Fig. 5
figure 5

Overruled arguments that Popov or Hayashi should win

Finally, after rejecting the rules that find for either Popov or Hayashi, McCarthy justifies his decision to equally divide the profit over both (his deciding argument is visualised in Fig. 6). Here his arguments must be interpreted with care. On the one hand, McCarthy has just observed that he has the authority to formulate rules that promotes fairness. On the other hand, he now justifies his choice for the deciding rule by referring to one legal scholar who proposed it and two precedents in which it was used. My interpretation is that McCarthy only uses v 4 to fill the gap between the facts and the rules that he can ‘find’ in the law. On this account, the rule found by him is the one suggested by legal expert Helmholz and the two precedents, while the other rules (MC 1MC 7) are ‘fashioned’ by McCarthy and then combined with Helmholz’ rule to establish that Popov and Hayashi have equal rights to the baseball (although MC 4 instead seems a legal commonsense rule on what it means to promote fundamental fairness).

Fig. 6
figure 6

The deciding argument

This interpretation can be formalised as follows. To start with, it should be noted that the “fashioning” of rules to achieve fundamental fairness in the case is not a problem of finding the valid and applicable legal rule but a problem of choice, that is, a problem of practical reasoning: faced with the goal to adopt a rule that promotes fundamental fairness in the case, McCarthy must ideally consider all available ways to achieve that goal, that is, all rules that achieve fairness in this case, and then make a rational choice given his values and preferences. Ideally, this reasoning process would be captured by applications of practical reasoning argument schemes like those proposed by Atkinson and Bench-Capon (2007) and Bench-Capon and Prakken (2006b). However, since McCarthy does not consider any alternative rules that promote fundamental fairness, his decision will simply be stated as a fact. It may be that McCarthy’s internal deliberations were much more involved and contained rejections of alternative ways to promote fundamental fairness in the case, but we cannot know this from the text.

The rule that in the present interpretation is ‘found’ by McCarthy is

MC 8::

x has a legitimate claim to property \(y\,\land\,x\) is plaintiff in a conversion case \(\land\)

 :

z has a legitimate claim to property \(y\,\land\,z\) is defendant in a conversion case \(\land\) the claims of x and z are equally legitimate \(\land\)

 :

nobody else has a legitimate claim to \(y\,\leadsto\)

 :

x and z must equally share the value of property y

It seems that McCarthy derives the validity of this rule by three accruing validity arguments, grounding the rule, respectively, in legal expert Helmholz’s writings and in two precedents. To formalise this, the following three relevant ‘legal’ facts must be stated and combined with, respectively, validity rules v 2 and v 1.

l 5::

rule H was proposed by legal expert Helmholz

l 6::

rule H was was used in case Arnold v. Producers Fuit Company

l 7::

rule H was was used in case Keron v. Cashman

(Here it is for simplicity ignored that Arnold v. Producers Fuit Company in fact used a slightly more general rule, referring to the proportionality of the competing legal claims.) The reason that these legal facts use the rule name H instead of MC 8 is that there is a complication here: expert and precedents proposed and used the rule for a different type of case than conversion, namely, to decide competing claims between finders of lost or mislaid property and the owners of the land on which the property was found. McCarthy thus arguably uses the following argument scheme for analogical reasoning with legal rules:

Analogical rule validity scheme:

  • r is valid,

  • r is of the form

    r::

    x has a legitimate claim to property \(y\,\land\)

     :

    x is plaintiff in a case of type \(c\,\land\)

     :

    z has a legitimate claim to property \(y\,\land\)

     :

    z is defendant in a case of type \(c\,\land\)

     :

    the claims of x and z are equally legitimate \(\land\)

     :

    nobody else has a legitimate claim to \(y\,\leadsto\)

     :

    x and z must equally share the value of property y

  • r′ is of the same form but with c replaced with c′, 

  • c and c′ are relevantly similar

  • r′ is valid

More general versions of this scheme might be used in which the other conditions and consequents of the rules are similar instead of equal, but this is not supported by McCarthy’s statements. An obvious question is why the cases of conversion and finding of property are similar. If McCarthy had made his reasoning at this point explicit, then sophisticated AI (& Law) models of analogical reasoning might have been needed, but this is beyond the text of his decision and therefore beyond the scope of this paper. The main point of the present reconstruction is to show that at least some forms of legal analogical reasoning can be formalised in the framework of Prakken (2010) as defeasible inference rules.

4 The arguments and their evaluation

4.1 The arguments

The most important arguments resulting from the above analysis are displayed in Figs. 2, 3, 4, 5, 6. As for notation, the type of premise is indicated with (p) for ordinary premises and (a) for assumptions. Some but not all of these facts were listed above in Sect. 3.1. Of those that were not listed above the instantiation of the variables with constants should be obvious. The inference links are labelled with the name of the argument scheme that is applied. In the figures that contain counterarguments, the defeated parts of arguments, that is, the defeated subarguments, are shown in grey colour.

Some specific comments:

  • In Fig. 2 the statement that Popov and Hayashi cannot both have possession of the baseball is formalised as an assumption premise since, although it is presumably based on general property law, it cannot be traced back to explicit statements in the decision and instead seems to be implicitly assumed. Furthermore, the commonsense rule \(\mathit{cs}_{7}\) has the following form:

    \(\mathit{cs}_{7}{:}\, \) :

    x loses control of item y during a violent attack \(\leadsto\)

     :

    x involuntarily loses control of y

  • In Fig. 3 the application of the accrual argument scheme is for reasons of space displayed in a compressed way; Fig. 6 shows its application in another argument in more detail.

  • In Figs. 4 in the premises of ‘MC 2’ is valid and in Fig. 5 in the premises of “r p is not valid’, some conditions of rule v 4 are summarised into one node; in the latter figure the application of v 4 is shown in full detail in the premises of ‘MC 6 is valid’.

In Fig. 4 the commonsense rules \(\mathit{cs}_{8},\,\mathit{cs}_{9}\) and \(\mathit{cs}_{10}\) have the following form:

\(\mathit{cs}_{8}{:}\, \) :

\(f_1\,\land\,f_2\,\land\,f_8\,\leadsto\) Popov undertook significant but incomplete steps to achieve possession of the baseball

\(\mathit{cs}_{9}{:}\, \) :

\(f_4\,\land\,f_6\,\leadsto\) Popov’s efforts were interrupted by unlawful acts of others

\(\mathit{cs}_{10}{:}\, \) :

\(f_5\,\land\,f_6\,\leadsto\) Hayashi did nothing wrong when achieving control of the baseball

  • In Fig. 5 the proposition ‘r p is valid’ is formalised as an assumption since it was presumably suggested by Popov. Judge McCarthy’s rejection of this rule thus becomes a defeater by way of premise attack. Alternatively, McCarthy’s arguments could be regarded as a simple explanation that one premise to build the argument for ‘Hayashi must return the baseball to Popov’ is as a matter of fact false so that this argument cannot be constructed.

  • In Fig. 5 the argument that Hayashi does not have to return the baseball and its refutation are similar to the argument for ‘Hayashi must return the baseball to Popov’ and are therefore not visualised.

  • In Figs. 5 and 6 the rebuttal relations are based on the choice to make the three top-level propositions contraries of each other in \(\mathcal{L}\).

  • To keep Figs. 5 and 6 readable, they do not display that the argument for ‘Hayashi does not have to return the baseball’ from Fig. 2 also rebuts and is rebutted by the arguments for the conclusions ‘Hayashi must return the baseball’ and ‘Popov and Hayashi must equally share the value of the baseball’. For the same reason the figures do not visualise the accrual of the arguments for the conclusion ‘Hayashi does not have to return the baseball’ from Figs. 2 and 5. Note also that similar remarks apply to the extension of the argument in Fig. 3 to an argument for ‘Hayashi must return the baseball’.

4.2 Evaluating the arguments

As remarked above, the framework of Prakken (2010) defines a set of arguments with a binary defeat relation (called an argumentation theory), so that the acceptability of arguments can be defined in terms of Dung’s (1995) abstract semantics for argument acceptability (Dung called an argumentation theory an “argumentation framework”). Without going into details, all of Dung’s semantics can be regarded as classifying arguments into three acceptability classes: the justified arguments (those that directly or indirectly survive all attacks), the overruled arguments (those that are defeated by a justified argument) and the defensible arguments (involved in conflicts that cannot be resolved). The main feature of Dung’s semantics that is relevant for the present analysis is that they all capture the phenomenon of reinstatement: an argument with defeaters is still justified if all its defeaters are in turn defeated by a justified argument.

At first sight, it might be thought that the argumentation theory consists of the eleven arguments and their defeat relations shown in Figs. 2, 3, 4, 5, 6, that is, the elevent trees shown in these figures. However, this is not the case, since of each argument also all its subarguments are part of the theory. In graphical terms, all subtrees of the trees shown in the figures are also in the argumentation theory. Moreover, the theory also contains more defeat relations than displayed, since when an argument defeats another argument, it also defeats all its continuations with further inferences. So, for instance, in Fig. 2 the argument with conclusion ‘Popov is incredible’ not only defeats the argument with conclusion ‘Popov had full control of the baseball when he was attacked’ but also, for instance, the arguments with conclusion ‘Popov would have had complete control … if he had not been attacked’ and ‘Popov had possession of the baseball’.

Having said this, determining the acceptability status of McCarthy’s deciding argument is still straightforward, whichever of Dung’s (1995) semantics is used. As shown in Fig. 6, this argument is rebutted on its final conclusion by the two arguments that Popov, respectively, Hayashi should win. Judge McCarthy did not resolve these conflicts with preferences, so that both rebuttals result in symmetric defeat relations. Instead, McCarthy defeated the two rebuttals by premise-attacking them on their assumptions that rule r p , respectively, rule r h , are valid (the argument for ‘r p is not valid’ shown in Fig. 5 and the similar argument for ‘r h is not valid’ that is left implicit in this figure). Since an attack on an assumption always results in asymmetric premise-defeat and since the two defeaters themselves have no defeaters, they are both justified.

However, this is not enough to reinstate McCarthy’s deciding argument, since it has two further defeaters. Firstly, as remarked above, the argument in Fig. 3 can be extended to another argument for the conclusion that Hayashi must return the baseball, based on the subargument for the conclusion that Popov has possession of the baseball. This argument also rebuts (and is rebutted by) the arguments for the conclusions that Hayashi does not have to return the baseball and that Popov and Hayashi must equally share its value. This rebuttal is overruled in a similar way as just explained, by an argument that strictly defeats one of its subarguments and thereby reinstates McCarthy’s deciding argument. More specifically, this holds for the subargument for ‘Popov had full control of the baseball when he was attacked’. In addition, the argument in Fig. 3 is also defeated by an attack on its assumption premise that cs 4 is valid. A similar analysis applies to the argument in Fig. 2 for the conclusion that Hayashi does not have to return the baseball to Popov. This argument rebuts (and is rebutted by) the arguments for the conclusions that Hayashi must return the baseball and that Popov and Hayashi must equally share its value. However, it is still overruled since its subargument for the conclusion that Hayashi has possession of the baseball (in turn based on the fact that he retrieved it) is undercut since the fact that Popov has a qualified right to possession of the baseball is an exception to rule r 4.

A final complication is that, as also remarked above, the two arguments for the conclusion that Hayashi must return the baseball accrue since they are both defeasible, and that the same holds for the two arguments that Hayashi does not have to return the baseball. However, this does not change the above outcome, since for both accrual arguments their accruing subarguments are defeated by justified arguments, so that the same holds for the accruals.

Finally, how does this analysis account for the burden of proof? In the present case it was uncontroversial that Popov had the burden of persuasion for his main claim, with as proof standard ‘preponderance of evidence’ and that there were no shifts in the burden of persuasion for subissues. As suggested by Prakken and Sartor (2009) this can be modelled in the present framework by requiring that Popov wins if and only if a justified argument for his main claim can be constructed. Proof standards are in Prakken and Sartor’s (2009) approach applied in two ways. The first is when deciding whether one of two rebutting arguments strictly defeats the other, that is, in defining the preference relations with which the conflict can be resolved: the lower the proof standard, the easier the preference will be defined such that one argument strictly defeats the other. In the present case this way of applying proof standards did not occur. The second way to apply a proof standard is in deciding whether an undercutter or invalidity argument must be moved against an evidential argument: the lower the proof standard, the easier such a counterargument will be moved. In the present reconstruction this way of applying proof standards occurred in deciding whether to formulate the rules \(\mathit{cs}_3\) and \(\mathit{cs}_5\).

5 Discussion

The aim of this paper was to test the suitability of the abstract framework of Prakken (2010) for modelling the argumentation structure of judge McCarthey’s decision. Although a definite conclusion can only be drawn after comparing the present reconstruction with the other ones submitted to this issue, it can be tentatively concluded that the formalism is suitable. The abstract rule-based nature of the formalism turned out to be suitable for modelling the argument schemes used in the case, while the tree-structure of the arguments captured the dependencies between the elements in the judge’s decision. Also, the three classes of conflict between arguments were sufficient to capture all conflicts in the case and Dung’s (1995) acceptability semantics sufficed for capturing the outcome of the case, including a proper handling of the burden of proof. The two main technical devices that were needed were rule-exception structures and arguments about the validity of rules. The latter proved sufficient to capture value-based reasoning, since the only value that was at stake in this case, fairness, was reconstructed as a condition of a validity rule. For this reason more sophisticated models of value-based reasoning, such as those of Prakken (2002) or Bench-Capon (2003), in which values are used to resolve conflicts between arguments, were not needed. Arguments about validity also proved useful in modelling the use of cases, since cases were by McCarthy mostly used as sources of rules (see also Verheij 2008). There was no need to use more sophisticated models of legal case-based reasoning such as those discussed by Ashley (2006) but there was a need for an argument scheme for analogical rule application. Finally, the present reconstruction contains some examples of argument accruals but they were not relevant for the outcome of the case.

It is interesting to compare the present reconstruction with the only reconstruction available in the literature, the one of Wyner et al. (2007). Being a conference paper, their reconstruction is less detailed than the present one. Wyner et al. also make use of domain-specific defeasible rules and argument schemes; like the present paper they use defeasible modus ponens and schemes for witness testimony and video tape, while in addition they use a ‘scheme from purpose’ to model the teleological aspects of the case. The legal analysis is mostly reconstructed with rule-exception structures; no rule validity rules are used. One main difference with the present reconstruction is in the treatment of socio-legal values. While in the present reconstruction the inclusion of the fairness condition in the validity rule v 4 made McCarthy’s considerations on fairness a matter of rule validity, in Wyner et al.’s reconstruction values are used to resolve conflicts between arguments (based on Bench-Capon 2003). In my opinion, the present account is closer to the actual wording of McCarthy’s decision. In fact, besides fairness Wyner et al. include a second value of public order in their analysis. In the present reconstruction this value was not included since it cannot easily be based on statements of McCarthy.

A second main difference is that Wyner et al. use a different conception of arguments, in which each applicable rule gives rise to a separate argument in an abstract argumentation framework in the sense of Dung (1995) and in which there are no subargument relations between arguments. Instead, an argument applying a rule ‘if q then p’ can be attacked by an argument ‘not p?’, which can then be counterattacked with an argument applying a rule ‘if r then q’. Applying any of Dung’s (1995) then yields that the argument for p is reinstated by the argument for q, so that the arguments for p and q are justified and the argument ‘not q?’ is overruled. In the present approach instead a two-steps argument is created with premise r, intermediate conclusion q and final conclusion p, and no counterargument is needed. In my opinion, this better captures the dependency relations between the elements of arguments, since complex arguments are usually not expressed with the attack-counterattack relations of 2007’s method. It seems better to regard ‘not q’? not as stating a counterargument but as a separate kind of speech act, namely a challenge ‘why q?’ of the premises of an argument, which can be replied to with a further argument for that premise. Such speech acts can be modelled by embedding an argumentation logic in a dialogue system for argumentation, as in e.g. Gordon (1994); Prakken (2005c). For a case study that applies this idea see Prakken (2008).

An important question is to what extent the results of the present case study can be generalised. Strictly speaking this requires an experiment with a substantial number of cases but this goes beyond the scope of the present issue. A comparison can be made, however, with Prakken (2008), in which a similar formalism was applied to a reconstruction of a Dutch civil ownerships dispute. In that case study not just the final decision was reconstructed but also the preceding stages of the proceeding, for which reason in Prakken (2008) I had to embed a model of argument-based inference in a dialogue model of legal argument. As for the inferential part, the earlier case study also used rule-exception structures and the argument scheme from witness testimony, while in addition it used priority arguments; on the other hand, no validity arguments or arguments about values were used. The case contained an explicit shift in the burden of persuasion, which required the use of Prakken’s (2001) adapted definition of justification of arguments.

As I observed earlier in Prakken (2008), case files often leave a considerable part of the relevant law implicit, since the law is assumed known to all parties. In Prakken (2008) this was not a serious problem, since the case was from a jurisdiction with which I am familiar. However, in the present case this is different. In experiments like the present one, where researchers are invited to perform case studies in jurisdictions with which they may not be familiar, this is an inevitable methodological problem. Nevertheless, when interpreted with care, case studies are a necessary complement to designs of formalisms or architectures, to reduce the risk that these designs are based on too simple accounts of legal reasoning.