
1 Introduction

Recent advances in (often ML-based) artificial intelligence have led to a proliferation of algorithmic decision making (ADM) agents. The risk that these agents may cause harm – and the many demonstrated examples of them already doing so, ranging across numerous domains [3, 8, 19, 30] – has led to a significant demand for technologies to enable their responsible use. In this work, we present \(\textsf{soid}\), a tool based on Judson et al.’s method [16] to account for software systems using computational tools from the fields of formal methods and automated reasoning. The \(\textsf{soid}\) tool is primarily oriented towards supporting legal reasoning and analysis, in order to better understand the ultimate purpose of an agent’s decision making – as is often relied upon by various bodies of law.

Fig. 1. Architecture of the \(\textsf{soid}\) tool.

In particular, rather than traditional verification methods which aim towards proving a specific program property, \(\textsf{soid}\) instead aims to ‘put the agent on the stand’. The design of \(\textsf{soid}\) enables factual and counterfactual querying – underlying a finding of fact – in support of human-centered assessment of the ‘why’ of the agent’s decision making. Such an assessment can then in turn justify holding responsible an answerable owner or operator, like a person or company. We describe the functioning of the \(\textsf{soid}\) tool itself as well as a pair of examples of its use on simulated harms. We also describe the \(\textsf{soid}\hbox {-}\!\textsf{gui}\), a domain-specific interface for \(\textsf{soid}\) applied to autonomous vehicles, allowing for adaptive and interpretable analysis of driving decisions without requiring extensive programming skills or familiarity with formal logical reasoning.

The basic flow of \(\textsf{soid}\), depicted in Fig. 1, is adaptive and requires a human in the loop. The human investigator – likely a practitioner such as a lawyer or regulator, supported as necessary by engineers – uses \(\textsf{soid}\) to better understand the decision making of an agent program A. They do so by finding critical decision moments in the logs of A that transpired in the lead-up to a harm, and then relaxing or perturbing the program inputs to specify a (family of) counterfactual scenario(s). The investigator then formulates a query asking what the behavior of A ‘might’ or ‘would’ have been [20] under that (family of) counterfactual(s). As we show in the design of our \(\textsf{soid}\hbox {-}\!\textsf{gui}\), such questions can even be formulated in user-friendly interfaces that abstract away all of the formal logic and reasoning of \(\textsf{soid}\) for non-technical practitioners. Once a query is posed, a verification oracle using SMT-based automated reasoning – including constrained symbolic execution – provides the investigator with a prompt answer. They can then continue to ‘interrogate the witness’ until they are satisfied that they have a sufficient understanding of the purpose of A’s decisions, and terminate the loop.

Contribution. In summary, we developed a command line tool and Python library \(\textsf{soid}\), which uses symbolic execution (through KLEE) and SMT solving (through Z3) to enable rigorous interpretation of the decision-making logic of an autonomous agent. We demonstrate \(\textsf{soid}\) on a pair of instructive examples involving machine-learned agents. In both cases, we find \(\textsf{soid}\) able to resolve counterfactual queries with reasonable efficiency, even when adaptively posed through the interpretable \(\textsf{soid}\hbox {-}\!\textsf{gui}\) aimed at non-technical practitioners.

A Motivating Example. Consider a program A which evaluates a decision tree in order to classify the diabetes health risk status of an individual, a classic example in automated counterfactuals with legal implications due to [31]. The decision tree and code of A are shown in Fig. 2. However, the software system surrounding A contains an implicit unit conversion bug: A computes the body-mass-index (BMI) input to the decision tree using height and weight parameters from its input. But A expects metric inputs in kg and m, and so computes the BMI without the necessary unit conversion, while the program inputs are instead provided in imperial in and lb. Notably, A is ‘correct’ with respect to natural specifications – as is the decision tree in isolation. The flaw arises from a mistake in the composition of the software system as a whole. Nonetheless, the system misclassifies many inputs, as \((kg/m^2) \gg (lb/in^2)\) for the same quantities.
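To make the failure mode concrete, a minimal sketch of this class of bug follows. The paper’s A is a C program (Fig. 2), so the Python below and all of its names are purely illustrative.

```python
# Illustrative only: a Python analogue of the unit-conversion bug described
# above (the actual agent A is implemented in C, per Fig. 2).

def bmi_metric(weight_kg: float, height_m: float) -> float:
    return weight_kg / (height_m ** 2)

def preprocess_buggy(weight_lb: float, height_in: float) -> float:
    # Bug: imperial inputs are fed straight into the metric formula. The
    # correct computation is roughly 0.4536 * lb / (0.0254 * in)**2, i.e.
    # about 703 * lb / in**2.
    return bmi_metric(weight_lb, height_in)

# For a 150 lb, 65 in individual the true BMI is ~25, but the buggy value is
# 150 / 65**2 ~= 0.036 -- far below every BMI threshold in the decision tree,
# so the classifier sees an implausibly low BMI and misclassifies the input.
```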

Fig. 2. An incorrect decision tree classification. At left, the decision subtree with the incorrect path in bold red and the missed ‘correct’ branch in dashed blue. At right, the decision tree inference logic as implemented in C. (Color figure online)

The goal of \(\textsf{soid}\) is to enable a legal practitioner to understand the presence of and conditions underlying a potential misclassification. Unlike statistical methods for counterfactual analysis which only analyze the (correct) decision model [31], the minimal assumptions underlying \(\textsf{soid}\) – namely, the lack of an assumption that the broader software system correctly uses the decision model – make it a more capable framework for analyzing this type of ‘implicit conversion’ failure. In §2.1 we run a small empirical analysis on A, showing how \(\textsf{soid}\) enables a user to specify concrete factual and counterfactual queries to understand the conditions under which the failure can occur and their implications.

1.1 Related Work

The explainable AI (XAI) and fairness, accountability, and transparency (FAccT) communities have developed numerous methods and tools for enabling accountability of ADMs, machine-learned or otherwise, for which [1, 10, 13] are recent surveys. Many of these tools and techniques focus on counterfactual reasoning in particular [7, 14, 15, 24, 31]. The closest tool to \(\textsf{soid}\) of which we are aware is the VerifAI project [9, 11]. In comparison to the prevailing lines of this research, \(\textsf{soid}\) emphasizes i) after-the-fact (or ex post) analysis for algorithmic accountability in the style of legal reasoning; ii) the use of SMT-based verification technologies capable of resolving counterfactual questions about whole families of scenarios; and iii) analysis of the ‘code as run’, rather than evaluation of a specific component like a particular decision model, or reliance on an abstracted program representation or a formal model of the (often complex social and/or physical) environment the agent operates within.

2 \(\textsf{soid}\) Tool Architecture and Usage

Figure 1 illustrates the architecture of \(\textsf{soid}\). The tool is implemented in Python, and invokes the Z3 SMT solver [26] for resolving queries.

Before working with \(\textsf{soid}\), the investigator must use their domain expertise to find and extract the critical moment they care about from the factual trace within the logging infrastructure of A. We assume some mechanism guarantees the authenticity of the trace, such as an accountable logging protocol, as has been previously proposed for cyberphysical systems [33]. After extracting the trace the investigator must specify i) the (counter)factual query defining the factual, counterfactual, or family of counterfactual scenarios the query concerns; as well as ii) some possible agent behavior. In the remainder of this section, we explain how the user does so using \(\textsf{soid}\) and a Python library interface it exposes called \(\textsf{soidlib}\). Constraints are specified through an API similar to Z3Py (see Fig. 3), while queries can be written as independent Python scripts or generated dynamically within a Python codebase.

Upon invocation, \(\textsf{soid}\) symbolically executes A to generate a set of feasible program paths as constrained by the (counter)factual query. The constraints in that query must be provided directly to the symbolic execution engine – an integration API exposes the query to the symbolic execution engine in order to enable this communication, or the user can do so directly outside \(\textsf{soid}\) itself. After the symbolic execution completes, \(\textsf{soid}\) constructs the query formula and invokes Z3 to resolve it. It then outputs to the user the finding, as well as any model – which exists in the event of a failed ‘would’ or successful ‘might’ query.
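Reading the description above, the two query types have roughly the following shape, where the \(\pi_i\) are the path conditions produced by the constrained symbolic execution, \(\varphi\) is the (counter)factual constraint, and \(\beta\) is the queried behavior; this is our simplified sketch rather than the formal definitions of [16]:

\[ \textit{would}:\ \text{check the validity of}\ \Big(\big(\textstyle\bigvee_i \pi_i\big) \wedge \varphi\Big) \rightarrow \beta, \qquad \textit{might}:\ \text{check the satisfiability of}\ \big(\textstyle\bigvee_i \pi_i\big) \wedge \varphi \wedge \beta. \]

Under this reading, a failed ‘would’ check yields a counterexample model and a successful ‘might’ check yields a witness model, matching the output behavior described above.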

Fig. 3. A counterfactual specified using \(\textsf{soidlib}\) for a simplified grid-based car crash implementation (also available within our codebase alongside our \(\textsf{soid}\hbox {-}\!\textsf{gui}\)). This query leaves the turn signal of the ‘other’ car at (2, 1) unconstrained, defining a counterfactual family. The objects , , and are user-specified in an omitted function, including datatype.

Query API. The query API of \(\textsf{soid}\) is exposed as a Python library called \(\textsf{soidlib}\). A query specified using \(\textsf{soidlib}\) is composed of a name and query type, as well as a set of functions. These functions return either \(\textsf{soidlib}\) variable declarations or constraints, which are in either case automatically encoded into a set of corresponding Z3Py constraints for use during SMT solving to establish the satisfiability or validity of the query. An example query is shown in Fig. 3. The main API functions the user must define in order to encode their query are as follows (an illustrative sketch is given after the list):

  •   : A function that must return three dictionaries of \(\textsf{soidlib}\) variable declarations, enumerating the set of environmental inputs ( ) and internal state inputs ( ) over which the factual or (family of) counterfactual scenario(s) are defined, as well as the set of decision ( ) variables over which the behavior is defined. In order to do this \(\textsf{soidlib}\) exposes a variety of variable types, which it then converts into Z3 statements with the appropriate logical sorts as required by the underlying SMT logic (e.g., encoding an object of integer type as an object of the 32-bit bitvector sort).

  •   : A function that must return a \(\textsf{soidlib}\) constraint over E describing the environmental program inputs.

  •   : A function that must return a \(\textsf{soidlib}\) constraint over S describing the internal state program inputs.

  •   : An optional function that returns a \(\textsf{soidlib}\) constraint encoding a concrete factual to be negated from the query formula, and therefore excluded from the set of possible output models.

  •   : A function that must return a \(\textsf{soidlib}\) constraint over D describing the behavior being queried.
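The following is a hedged sketch of what such a query can look like. Because the exact \(\textsf{soidlib}\) identifiers are elided above, it is written directly against Z3Py (which the \(\textsf{soidlib}\) API resembles); every function and variable name here is hypothetical.

```python
# Hypothetical query sketch in plain Z3Py; soidlib's own types and function
# names differ and are not reproduced here.
import z3

def declare():
    # Environmental inputs E, internal-state inputs S, and decision variables D.
    E = {'other_signal': z3.BitVec('other_signal', 32)}     # other car's turn signal
    S = {'ego_speed':    z3.FP('ego_speed', z3.Float32())}
    D = {'ego_moves':    z3.Bool('ego_moves')}
    return E, S, D

def environmental(E):
    # Leaving the turn signal unconstrained defines a counterfactual family,
    # in the spirit of the Fig. 3 example.
    return z3.BoolVal(True)

def state(S):
    # Constrain the internal state: the ego car is currently moving.
    return z3.fpGT(S['ego_speed'], z3.FPVal(0.0, z3.Float32()))

def behavior(D):
    # The behavior being queried: does the ego car decide to move?
    return D['ego_moves']
```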

Language Support. Through a modular API, \(\textsf{soid}\) supports any symbolic execution engine that produces output in the SMT-LIB format [4]. An integrator needs only to write a Python class implementing an interface between \(\textsf{soid}\) and the engine. As such, \(\textsf{soid}\) supports agents written in any programming language for which a suitable symbolic execution engine is available. We use the KLEE family of symbolic execution engines throughout our benchmarks. At present, support is integrated into \(\textsf{soid}\) for C language programs with floating-point instructions using KLEE-Float [21], working over the SMT logic \(\texttt {QF\_FPBV}\), the quantifier-free theory of floating-point and bitvectors. Support is also integrated for C and C++ language programs without floating-point using mainline KLEE [5], producing representations in \(\texttt {QF\_ABV}\), the quantifier-free theory of arrays and bitvectors. KLEE can be further extended to analyze other LLVM-compilable languages such as Rust [22], while other engines exist for compiled binaries [29] and many other languages including Java [2] and JavaScript [23].

Symbolic Execution API. Adding support for a new symbolic execution engine to \(\textsf{soid}\) requires specifying between two and five functions: , , , , and , which are all hooked into the main \(\textsf{soid}\) execution path. Only and are necessary – they must respectively invoke the symbolic execution and then process the output into a list of Z3Py statements capturing the possible path conditions. Optionally, provides a hook for cleaning up temporary or output files generated by the symbolic execution engine, while and are designed to automate additional steps that may be desirable for the symbolic execution – the former is given access to the query, the latter additionally to the set of variables declared along the path conditions. For example, KLEE-Float automatically converts arrays into bitvectors using a technique called Ackermannization [25], and renames any such variables in the process. The KLEE-Float function packaged with \(\textsf{soid}\) i) casts objects as necessary; and ii) constrains them to equal the corresponding input declarations in the function so that they alias those inputs, e.g., adding the constraint where is KLEE-Float’s synthesized, Ackermannized representation of .
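As a hypothetical illustration of the adapter shape described above: the real hook names are elided in the text, so every identifier and the engine command below are assumptions rather than \(\textsf{soid}\)’s actual interface.

```python
# Hypothetical adapter sketch; soid's actual hook names and signatures are
# not reproduced here.
import pathlib
import shutil
import subprocess
import z3

class SymbexAdapter:
    def run(self, program_path, query):
        # Required hook #1: invoke the symbolic execution engine on A,
        # ideally constrained by the (counter)factual query.
        subprocess.run(['some-symex-engine', program_path], check=True)

    def parse(self, output_dir):
        # Required hook #2: convert the engine's SMT-LIB output into Z3Py
        # statements capturing the feasible path conditions.
        return [z3.parse_smt2_file(str(p))
                for p in sorted(pathlib.Path(output_dir).glob('*.smt2'))]

    def cleanup(self, output_dir):
        # Optional hook: remove temporary files produced by the engine.
        shutil.rmtree(output_dir, ignore_errors=True)
```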

Query to Symbolic Execution. One of the major benefits of the ex post method of \(\textsf{soid}\) is that the (counter)factual query specified by the user can be used to constrain which parts of the program A are relevant to the scenarios in question and therefore must be included in the formula being checked. To do so, however, the query must also be exposed to the symbolic execution engine, so as to limit the symbolic execution to just the (ideally small) set of program paths feasible under the (counter)factual scenario conditions. This can either be done independently of \(\textsf{soid}\), e.g., by the code invoking \(\textsf{soid}\) when it is used as a library, or by using the hook in the symbolic execution framework. At present, our codebase exclusively uses the external method.

Invocation. There are two ways to use \(\textsf{soid}\): through a command line script (the soidcli) or directly as a Python library. If the latter, the user calling the code must declare a object and configure it with i) a ; ii) the path to A; and iii) the identity of the symbolic execution engine. If using the soidcli, the CLI script declares the oracle object for the user, who must specify the path to where (a collection of) objects can be found declared in independent Python scripts (as well as the same path to A and symbolic execution engine identity). In case multiple variants of A are required in order to specify different symbolic execution preconditions for different counterfactual families, \(\textsf{soid}\) passes an identifier corresponding to an index that the user can specify through the CLI interface. In the examples present in the \(\textsf{soid}\) codebase this identifier is passed to a Makefile, which is then used to invoke KLEE(-Float) on the correct variant.

2.1 Example #1: Decision Tree Inference

Using \(\textsf{soid}\), we analyzed our decision tree misclassification motivating example. The results are summarized in Table 1, and were gathered on an Intel Xeon CPU E5-2650 v3 @ 2.30GHz workstation with 64 GB of RAM. We used scikit-learn [27] to train a decision tree over the Pima Indians dataset as used in [31]. We then implemented A as a C program that preprocesses the data – triggering the software system bug, as it does so without the necessary unit conversion – and then infers a binary classification using the decision tree. In order to create the factual basis for an investigation, we then invoked A on an example input where the unit conversion bug leads to the misclassification of the input as low risk instead of high risk.
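A hedged sketch of the training step described above follows; the CSV path, column names, and hyperparameters are assumptions rather than the paper’s exact setup.

```python
# Assumed setup: a local CSV copy of the Pima Indians dataset with an
# 'Outcome' label column; hyperparameters are illustrative only.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

df = pd.read_csv('pima-indians-diabetes.csv')
X, y = df.drop(columns=['Outcome']), df['Outcome']
clf = DecisionTreeClassifier(max_depth=5, random_state=0).fit(X, y)

# The learned thresholds would then be transcribed into the C program A, whose
# preprocessing step computes BMI without the required unit conversion.
```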

Table 1. Benchmark results for our incorrect statistical inference example.

We posed two queries:

  1. Did the classification happen as described?

  2. Does there exist a weight input parameter for which the instance is instead classified as high risk?

The former query provides a baseline for how much the counterfactual possibility of the latter query increases the cost of solving. It also fulfills the natural goal of many accountability processes to formally confirm apparent events and create an end-to-end chain of analysis, so that there is the highest possible societal confidence in any policy changes or punishments derived from it. Both of these queries were resolved by \(\textsf{soid}\) in the positive, requiring at most a few seconds, even over a program structure in A that includes recursive invocations of floating-point comparison operations. Together, they demonstrate that the weight input to A was causal for the classification, and establish its lack of unit conversion as contributory to the (harmful) misclassification decision.

Working with A, \(\textsf{soid}\) provides an adaptive oracle allowing the investigator to query its behavior and receive prompt and useful answers. The output of the program is also simple and interpretable. Without an intermediating GUI or developer tools, \(\textsf{soid}\) does require comfort with its API and the logical framework of expressing (counter)factuals and program outputs, but we do not expect a usable interface would be meaningfully difficult to integrate for this example.

3 \(\textsf{soid}\hbox {-}\!\textsf{gui}\) Architecture and Usage

The \(\textsf{soid}\hbox {-}\!\textsf{gui}\) is a web-based interactive interface for \(\textsf{soid}\) applied to the domain of autonomous vehicle accountability. It demonstrates that the use of \(\textsf{soid}\) can be managed by a high-level abstraction that exposes to non-technical practitioners the expressiveness and capacity of the tool, but none of its logical or technical complexity. We demonstrate the design and use of the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) in Fig. 4.

Architecturally, the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) is composed of three main components: i) a frontend written in React; ii) a backend server written in Python that operates a vehicle simulation using the Duckietown simulator for the OpenAI Gym (henceforth Gym-Duckietown [6]) and also interfaces with \(\textsf{soid}\); and iii) a proxy server that manages communication between the browser frontend and the server backend. The Duckietown simulation is used as a stand-in for the real vehicle logs and instrumentation on which \(\textsf{soid}\) would be deployed in practice. We designed the crossroads intersection simulation interface to mimic the real-time driving context interface generated by contemporary autonomous vehicles, like those produced by Tesla. We stress that Gym-Duckietown is not exposed to \(\textsf{soid}\), which operates exclusively over the program (and decision model) A. Gym-Duckietown is used only to simulate crashes and generate logfiles as the basis for \(\textsf{soid}\) queries.

Outside of the \(\textsf{soid}\) investigatory loop, the user can first use the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) to design a car crash scenario by manipulating the location, destination, and other properties of the simulated car through menus and a drag and drop interface (see Fig. 4). The \(\textsf{soid}\hbox {-}\!\textsf{gui}\) also allows the user to select from among five different decision logics for the ego car: a directly programmed ‘ideal’ car, and four reinforcement-learned (specifically, Q-learned [32]) agents, colloquially the ‘defensive’, ‘standard’, ‘reckless’ and ‘pathological’ decision models. They are so named on the basis of the reward profiles used to train them.
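For reference, Q-learning [32] trains an agent with the standard tabular update shown below; the paper does not detail the four cars’ training setups beyond their differing reward profiles, so only the generic rule is given here.

\[ Q(s,a) \leftarrow Q(s,a) + \alpha\big[\, r + \gamma \max_{a'} Q(s',a') - Q(s,a) \,\big] \]

The reward signal \(r\) encodes the differing profiles that give the ‘defensive’, ‘standard’, ‘reckless’, and ‘pathological’ models their names.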

Fig. 4. After the (simulated) execution, the investigator (1) selects a critical moment; (2) poses a counterfactual query; (3) invokes the SMT solver; and (4) is presented with the response from the oracle.

After an iteration of the simulation (usually, after a crash occurs), the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) allows the user to operate the \(\textsf{soid}\) investigatory loop. Using a slider, the user can pick out a moment from the logs of the agent and, supported by detailed logging information about the inputs to A at each timestep, select the critical moment (see Step 1 in Fig. 4). They can then use car-specific dropdown menus to specify counterfactuals about any of the agents in the system in a user-friendly manner, which fully abstracts away the underlying logical formalism (Step 2 in Fig. 4). Finally, they can invoke \(\textsf{soid}\) on the query they have specified by asking whether the ego car ‘might’ or ‘would’ move or stop under the (family of) counterfactual scenario(s) they have defined (Step 3 in Fig. 4). After solving, the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) presents an interpretable answer, including a valuation for any variables the counterfactual was stated over when one is available (Step 4 in Fig. 4). The user can then clear or adjust their counterfactual statement and ask further queries, until satisfied they have reached an understanding of the car’s decision making under the selected decision model.

To use \(\textsf{soid}\), the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) first writes out a C language file with the necessary constraints for the KLEE-Float symbolic execution. It then creates the and objects, allowing it to invoke \(\textsf{soid}\) through the Python library interface. Once \(\textsf{soid}\) has invoked KLEE-Float and Z3 to determine the answer to the query, the output is processed, including model parsing when applicable. The result is then passed back to the browser frontend to be shown to the user.

3.1 Example #2: Three Cars on the Stand

We use the \(\textsf{soid}\hbox {-}\!\textsf{gui}\) to investigate the crash shown in Fig. 4. It is a simple intersection scenario, where the blue ‘ego’ car under investigation strikes the broadside of the red ‘other’ car, which has indicated a right turn but proceeded straight nonetheless. As the red car possesses the right of way, the fault lies with the blue car. We investigate ‘to what purpose’ the blue car entered the intersection, in order to grade the severity of its misconduct in conjunction with legal norms that frequently apply the greatest possible penalties to purposeful action [16]. Notably, this crash occurs for all three of the ‘standard’, ‘reckless’, and ‘pathological’ decision models (but not the ‘defensive’ model).

Table 2. Benchmark results for our car crash example. For the final query, we phrased it as both a ‘would’ and a ‘might’ counterfactual for comparison.

We pose three queries about the blue car’s decision making at the moment when it releases the brakes and enters the intersection (Step 1 in Fig. 4):

  1. Did the blue car actually decide to move, as it appeared to?

  2. Could a different turn signal have led the blue car to remain stationary?

  3. If the blue car had arrived before the red car and the red car was not signaling a turn, might the blue car have waited to ‘bait’ the red car into entering the intersection and creating the opportunity for a crash?

Intuitively, the second question should distinguish the ‘standard’ car from the ‘reckless’ and ‘pathological’, which should continue to move into the intersection no matter what. The third question should then distinguish between the ‘reckless’ and ‘pathological’ cars, with the former taking the opportunity for a clean path through the intersection, while the latter lies in wait.

There are natural explanations for the behavior of the other decision models: the ‘standard’ car is undertaking common human driving behavior given the perception of an unobstructed path through the intersection, the ‘reckless’ car demonstrates a prioritization of individual speed over collective safe driving, while the ‘pathological’ car might be attempting to trigger a crash for insurance fraud. Notably, in the case of the ‘reckless’ car, we do not want to inherently describe that behavior as incorrect as verification methods might, such as any implementing [28]. It could be that exigent circumstances necessitate reckless behavior, and that the blue car not entering the intersection as fast as possible would trigger a greater harm than a minor crash.

The results of our benchmarks are summarized in Table 2. As before, all of the statistics were gathered on an Intel Xeon CPU E5-2650 v3 @ 2.30GHz workstation with 64 GB of RAM. Each heading in Table 2 describes a family of (counter)factual scenarios and behavior, as well as whether the query is a verification (‘would...?’) or counterfactual generation (‘might...?’) one. The rows list the decision model invoked within A, the answer as determined by the verification oracle, timings, and the total number of feasible paths.

We find that \(\textsf{soid}\) provides an interpretable and adaptive oracle allowing the investigator to query a sequence of counterfactuals without directly interacting with A or the machine-learned model underlying it. Most of our queries resolved in under 20s, providing effective usability. The results of the queries demonstrate the distinctive behaviors expected of the three conflicting purposes, allowing a capable investigator to distinguish them as desired.

4 Conclusion

We briefly conclude by considering some future directions for extensions to \(\textsf{soid}\).

Supporting DNNs. Many modern machine-learned agents rely on models built out of deep neural network (DNN) architectures. Extending \(\textsf{soid}\) to support such agents – most likely by relying on recent innovations in symbolic execution for neural networks [12] and SMT-based neural network verifiers [17, 18] – is a possible direction for increasing the utility of \(\textsf{soid}\).

Programming Counterfactuals. Although \(\textsf{soid}\) is adaptive, that does not necessarily mean it needs to be interactive. A further possible direction would be to design a counterfactual calculus as the basis for a programming language that would invoke \(\textsf{soid}\) as part of its semantics. Such a language could potentially be the basis for formalizing legal regimes for which counterfactual analysis forms a critical component. A related direction would be to integrate with a scenario specification language like SCENIC from the VerifAI project [9, 11] to add another layer of capability onto the specification of families of counterfactuals.