On the Claim that a Table-Lookup Program Could Pass the Turing Test

Published in Minds and Machines.

Abstract

The claim has often been made that passing the Turing Test would not be sufficient to prove that a computer program was intelligent because a trivial program could do it, namely, the “Humongous-Table (HT) Program”, which simply looks up in a table what to say next. This claim is examined in detail. Three ground rules are argued for: (1) That the HT program must be exhaustive, and not be based on some vaguely imagined set of tricks. (2) That the HT program must not be created by some set of sentient beings enacting responses to all possible inputs. (3) That in the current state of cognitive science it must be an open possibility that a computational model of the human mind will be developed that accounts for at least its nonphenomenological properties. Given ground rule 3, the HT program could simply be an “optimized” version of some computational model of a mind, created via the automatic application of program-transformation rules [thus satisfying ground rule 2]. Therefore, whatever mental states one would be willing to impute to an ordinary computational model of the human psyche one should be willing to grant to the optimized version as well. Hence no one could dismiss out of hand the possibility that the HT program was intelligent. This conclusion is important because the Humongous-Table Program Argument is the only argument ever marshalled against the sufficiency of the Turing Test, if we exclude arguments that cognitive science is simply not possible.


Notes

  1. I will capitalize the word “test” when referring to the Turing Test as a concept, and use lower case when referring to particular test occurrences.

  2. I realize that the use of this slang term makes the paper sound a bit frivolous. I take this risk because the size of the required table will easily be seen to be beyond comprehension, and it’s important to keep this in mind. I don’t think words like “vast”, “stupendous”, “gigantic” really do the job. In (Dennett 1995, Ch. 1) the word “Vast” with a capital “v” is used for numbers in the range I discuss in this paper, numbers of magnitude \(10^{100}\) and up.
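To see how quickly numbers in this range arise, here is a bit of illustrative arithmetic (the figures are assumptions for the sketch, not the paper’s exact parameters):

```python
import math

# Illustrative only: with an alphabet of roughly 95 printable characters
# and a single utterance of 225 characters, the count of possible
# utterances alone is 95**225, a number of about 445 digits --
# squarely in Dennett's "Vast" range.
possible_utterances = 95 ** 225
digits = len(str(possible_utterances))      # about 445 digits
magnitude = 225 * math.log10(95)            # log10 of the count, about 445
```

A table indexed by whole conversations, rather than single utterances, is Vaster still.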

  3. Or some other arbitrary time limit fixed in advance; but I’ll use an hour as the limit throughout this paper. The importance of this rule will be seen below.

  4. Further clerical details: Turns end when the person enters two newlines in a row, or exceeds time or character limits (including further constraints imposed later). As explained in section “The Argument and Its Role”, the judge gets a chance to edit their entries before any part of them is sent to the interlocutor. (I will use third-person plural pronouns to refer to a singular person of unimportant, unknown, or generic gender, to avoid having to say “him or her” repeatedly). Judge inputs that violate constraints such as character limits must be edited until the constraints are satisfied. The two newlines between turns don’t count as part of the utterance on either side. We’ll always let the judge go first, but they can type the empty string to force the interlocutor to be the first to “speak”. The interview ends after an hour or if the judge and interlocutor successively type the empty string (in either order). Note that I’ll sometimes use words like “speak” or “say” when I mean “type”, only because the latter sounds awkward in some contexts.
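The stopping rule in this note can be sketched as a small helper (an illustrative encoding, not the paper’s code; turns are strings in order of entry):

```python
# The interview ends when the hour is up, or when the judge and
# interlocutor successively type the empty string (in either order),
# i.e. when the two most recent turns are both empty.
def interview_over(turns, elapsed_seconds, limit_seconds=3600):
    if elapsed_seconds >= limit_seconds:
        return True
    return len(turns) >= 2 and turns[-1] == "" and turns[-2] == ""
```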

  5. Block’s term for what I am calling the “judge”.

  6. It’s obviously necessary to insert something like the ## marks because otherwise there would be many possible interchanges that could begin ABC. It’s not clear whose turn it is to speak after a conversation beginning “Veni … Vidi … Vici. Ave Caesar! A fellow Latin scholar. Great!” Block probably just assumed some such marker would end A, B, and C. I’m making it explicit.

  7. Actually, just to get the chronology right, it’s important to note that Block described a slightly different version of the program in Block (1978, p. 281) in order to make a somewhat different point. Very confusingly, an anthology published 2 years later included a slightly condensed version of the paper under the same title (Block 1980), a version that lacks any mention of the Humongous-Table Program.

  8. Shannon and McCarthy (1956) require that a definition of “thinking”, in the case of an allegedly intelligent machine, “must involve something relating to the manner in which the machine arrives at its responses”.

  9. Block talks as though the “programmers” might emulate his Aunt Bertha. Actually, they can be somewhat more creative if they want to. On different branches of the tree, different “personalities” might emerge. But it will be much simpler, and sacrifice no generality, to speak as though each tree emulated one personality, and we’ll go along with calling her “Aunt Bertha” or “AB”. I have my doubts that we will ever be able to simulate a particular person in enough detail to fool their close friends. But that’s not necessary. If someone creates a program to compete in a Turing test and bases it on their aunt, it doesn’t have to mimic her that closely. If it sounds to the judges like it might be someone’s aunt, that’s good enough.

  10. Equivalently, odd-length lists of strings.

  11. No time can be greater than the number of milliseconds in an hour, but at “run time” the actual time left determines whether the interview comes to an end before the judge and examinee give the signal.

  12. If we want to allow interlocutors to edit lines before they are seen by the judge, then times should be associated with completed lines, not individual characters. If we really want to avoid reaction times completely, then we can introduce random delays (as we do for the judge; see below) or we could have two sets of judges, one to conduct the interviews and another to review the transcripts and decide who’s human. But that’s a rather drastic change to the rules.

  13. One more restriction: timed strings can’t have times so short that the typing speed exceeds the rate at which a plausible human can type. Of course, if the examinee types at blinding speed it will be easy for the judge to identify, but if we’re considering the set of all possible examinees, as we will in section “Argument Two: Why the Possibility of HTPLs Proves Nothing”, it’s necessary to set bounds on their abilities to keep the set finite.

  14. We could do the same with the interlocutor’s output, but it’s traditional to put the burden of replicating human timing and error patterns on the examinees.

  15. For now, I will be casual about the distinction between a strategy tree—a mathematical object—and the incarnation of a strategy tree in a physical medium. How the latter might work is discussed in section “Argument One: Why the Possibility of HTPSs Proves Nothing”.

  16. Braddon-Mitchell and Jackson seem oddly oblivious to the fact that real people grow and then wither over their lifespans. Perhaps “behavior” for them includes changes in body shape. For our purposes the robot’s lifespan need merely be an hour.

  17. This test is a blend of what Harnad calls T3 and T4 in (Harnad 2000), depending on whether the automaton has to be able to do things like blush or not.

  18. If we opt instead for all mathematically possible input sequences, then for all but a vanishingly small fraction scientific induction does not work; the universe is mostly white noise. In the ones where scientific induction does work, all but a vanishingly small fraction have different laws of nature from those in the real world. At this point I no longer believe that the game tree has been specified precisely enough for me to conceive of it.

  19. Of course, a truly intelligent examinee would have to have delusional beliefs about its physical appearance, so as to be able to answer questions such as “How tall are you, Bertha?”, and “Are you left- or right-handed?” (And about its surroundings; see “If We Neglect Phenomenology, Computational Models of People are Possible”.) It will also have to have delusional memories of, say, having eaten strawberries and cream, or having ridden a snowmobile, or having done some real-world thing, or the judges will get suspicious. Whether we can attribute even delusional beliefs to the HT program is an issue we take up in section “If We Neglect Phenomenology, Computational Models of People are Possible”.

    Strategically optimal or not, is it ethical to create a model of a person, run it for an hour so it can take a test, reset it to its original state, run it again a few times, then junk it?

  20. Jorge Luis Borges’s vision (Borges 2000) of a library of all possible books of a certain size conveys the idea.

  21. For the exact rules, see Appendix A in Supplementary Material.

  22. See Appendix A in Supplementary Material.

  23. And perhaps a cognitive psychologist.

  24. I allude once again to “The Library of Babel”.

  25. Cf. (Culbertson 1956), although Culbertson was talking about a somewhat different set of robot-control mechanisms. He pointed out that they were “uneconomical”, which must be the greatest understatement of all time.

  26. In (Block 1978), Block points out that “… If it [the strategy tree] is to ‘keep up’ with current events, the job [of rebuilding it] would have to be done often” (p. 295). How such a huge thing is to be rebuilt “often” is not clear.

  27. There might be issues of wide vs. narrow content here (Botterill and Carruthers 1999), but they probably take a back seat to problems raised by the fact that x and her world are fictional.

  28. It’s odd that no one has, as far as I know, raised this issue before. If the surroundings of the participants are not made uniform the judge might be able to figure out who’s who by asking the participants to describe the location where they’re sitting.

  29. When a leaf state is reached, the FSM halts.

  30. This is related to the function TS described in section “If We Neglect Phenomenology, Computational Models Of People Are Possible”, but that one ignored O, and took a series of inputs as argument.

  31. It is, of course, just a coincidence that Turing’s name is on both the Turing Test and the Turing machine; he never linked the two, if you don’t count vague allusions.

  32. Using multiple tapes is a convenient device that doesn’t change the computational power of Turing machines (Homer and Selman 2011, Ch. 2).

  33. Another example is Searle’s (1980) “Chinese Room” argument. One reason it is so easy to fall into this trap is that the inventors of the first computers resorted so often to words such as “memory” to describe pieces of these new things, and we’ve been stuck with them ever since. But I confess that in teaching intro programming I get students into the right mindset by pretending the computer is a “literal-minded assistant” or some such thing, that variables are “boxes” this assistant “puts numbers into”, and so on.

  34. This may or may not be the “real” machine, depending on whether machine language is executed by a microcode interpreter. And if the computer has several “cores”, should we think of it as a committee?

  35. Recall that in section “The Argument and Its Role” we “optimized” keys by removing the examinee’s contributions to the dialogue.

  36. Of course, some people contend that it is absurd to deny a creature phenomenal consciousness if it doesn’t seem to believe it lacks anything (Dennett 1978; McDermott 2001).

  37. For the syntax of the programming language used in what follows, see Appendix 1.

  38. A set of deterministic processors acting asynchronously in parallel would be nondeterministic, and this nondeterminism would be eliminated when we switch to a single processor. But I argued above (section “If We Neglect Phenomenology, Computational Models of People are Possible”) that a judge would be unable to tell the difference between a deterministic and nondeterministic program.

  39. It may seem unusual to compute a new knowledge base rather than make changes to the old one, but it’s a standard move made for technical reasons; the compiler is supposed to eliminate any inefficiencies that result from this device. I will take this opportunity to insert the standard disclaimer about the term “knowledge base”: It should really be called the “belief base”, but for some reason that term hasn’t caught on.

  40. One might object that a person sentenced to capital punishment could always get a last-minute reprieve from the governor; their hopes and dreams are never necessarily futile. So imagine someone poisoned by an irreversible infusion of nanobots that snip out pieces of brain one by one until after an hour the victim is dead.

  41. Of course, she can discuss them, and probably will if the judge brings them up.

  42. So the state of remembering the name of the judge is mediated by the disjunctive state consisting of all string sequences in which the judge tells AB their name and AB is able to recite it correctly later.

  43. If we supply a special input channel from which random numbers are read, analogous to a tape containing random bits for a Turing machine (section “The Sensible-String Table Must Not Have Been Built by Enacting All Possible Conversations”), then we can treat randomness elimination as a special case of input anticipation.

  44. In this appendix I use the word “branch” to mean something different from the meaning explained in section “The Argument and Its Role”. Here it means a decision point in a program: an “if” statement, conditional jump, or the like.

  45. Although it’s hard to be completely sure of what happens in \(10^{445}\) branches.

  46. How come I haven’t had to treat \({\tt KB}_{{\tt new}}\) and \({\tt T}_C\) the same way I handled R ? I could have, but it’s not necessary, because the name reuse doesn’t actually cause any confusion.

  47. If you really, really want the program to be isomorphic to the HTPL, you could transform it once again by converting it to a loop with an iteration-counting variable, adding a test for the appropriate value of this variable to every test of the if, and replacing the semicolons with elses. A transformation to accomplish this (“loop imposition”?) is left as an exercise for the reader.

  48. See “The Argument and Its Role” for why the length of a key string = the number of judge inputs so far.

References

  • Allen, R., & Kennedy, K. (2001). Optimizing compilers for modern architectures: A dependence-based approach. San Francisco: Morgan Kaufmann.

  • Bertsekas, D. P. (1987). Dynamic programming: Deterministic and stochastic models. Englewood Cliffs, NJ: Prentice-Hall.

  • Binmore, K. (2007). Playing for real: A text on game theory. Oxford: Oxford University Press.

  • Block, N. (1978). Troubles with functionalism. In C. W. Savage (Ed.), Perception and cognition: Issues in the foundations of psychology, Minnesota studies in the philosophy of science (pp. 261–325). Minneapolis: University of Minnesota Press.

  • Block, N. (Ed.) (1980). Readings in the philosophy of psychology (Vol. 2). Cambridge, MA: Harvard University Press.

  • Block, N. (1981). Psychologism and behaviorism. The Philosophical Review, 90(1), 5–43.

  • Borges, J. L. (2000). The library of Babel. In The total library: Non-fiction, 1922–1986 (pp. 214–216) (trans: Weinberger, E.).

  • Botterill, G., & Carruthers, P. (1999). The philosophy of psychology. Cambridge: Cambridge University Press.

  • Braddon-Mitchell, D. (2009). Behaviourism. In J. Symons & P. Calvo (Eds.), The Routledge companion to philosophy of psychology (pp. 90–98). London: Routledge.

  • Braddon-Mitchell, D., & Jackson, F. (2007). Philosophy of mind and cognition (2nd ed.). Oxford: Blackwell Publishing.

  • Braithwaite, R., Jefferson, G., Newman, M., & Turing, A. (1952). Can automatic machines be said to think? BBC radio broadcast. Also in Copeland (2004).

  • Chisholm, R. (1957). Perceiving. Ithaca: Cornell University Press.

  • Christian, B. (2011). The most human human: What talking with computers teaches us about what it means to be alive. New York: Doubleday.

  • Copeland, B. J., & Proudfoot, D. (2009). Turing’s test: A philosophical and historical guide. In Epstein et al. 2008 (pp. 119–138).

  • Culbertson, J. T. (1956). Some uneconomical robots. In Shannon and McCarthy 1956 (pp. 99–116).

  • Davidson, D. (1987). Knowing one’s own mind. Proceedings and Addresses of the American Philosophical Association, 60, 441–458. (Also in Davidson, D. (2001). Subjective, intersubjective, objective (pp. 15–38). Oxford: Clarendon Press.)

  • Dennett, D. C. (1978). Toward a cognitive theory of consciousness. In D. C. Dennett, Brainstorms (pp. 149–173). Cambridge, MA: Bradford Books/MIT Press. (Originally in Savage 1978.)

  • Dennett, D. C. (1985). Can machines think? In M. Shafto (Ed.), How we know (pp. 121–145). San Francisco: Harper and Row.

  • Dennett, D. C. (1995). Darwin’s dangerous idea: Evolution and the meanings of life. New York: Simon and Schuster.

  • Dowe, D. L., & Hájek, A. R. (1997). A computational extension to the Turing test. Technical Report 97/322, Department of Computer Science, Monash University.

  • Dowe, D. L., & Hájek, A. R. (1998). A non-behavioural, computational extension to the Turing Test. In Proceedings of the international conference on computational intelligence and multimedia applications (pp. 101–106). Gippsland, Australia.

  • Epstein, R., Roberts, G., & Beber, G. (Eds.) (2008). Parsing the Turing test: Philosophical and methodological issues in the quest for the thinking computer. New York: Springer.

  • Fodor, J. (1975). The language of thought. New York: Thomas Y. Crowell.

  • French, R. M. (1990). Subcognition and the limits of the Turing Test. Mind, 99(393), 53–65. [Reprinted in Shieber 2004 (pp. 183–197).]

  • Furht, B., & Escalante, A. (Eds.) (2010). Handbook of cloud computing. New York: Springer.

  • Geach, P. (1957). Mental acts. London: Routledge and Kegan Paul.

  • Harnad, S. (1990). The symbol grounding problem. Physica D: Nonlinear Phenomena, 42, 335–346.

  • Harnad, S. (1991). Other bodies, other minds: A machine incarnation of an old philosophical problem. Minds and Machines, 1(1), 43–54.

  • Harnad, S. (2000). Minds, machines, and Turing. Journal of Logic, Language and Information, 9(4), 425–445.

  • Hayes, P., & Ford, K. (1995). Turing Test considered harmful. In Proceedings of IJCAI (Vol. 14, pp. 972–977).

  • Hodges, A. (1983). Alan Turing: The enigma. New York: Simon and Schuster.

  • Homer, S., & Selman, A. L. (2011). Computability and complexity theory. New York: Springer.

  • Humphrys, M. (2008). How my program passed the Turing Test. In Epstein et al. 2008 (pp. 237–260).

  • Jones, N., Gomard, C., & Sestoft, P. (1993). Partial evaluation and automatic program generation. Hemel Hempstead: Prentice Hall International.

  • Kam, T. (1997). Synthesis of finite state machines: Functional optimization. Boston: Kluwer Academic.

  • Kirk, R. (1995). How is consciousness possible? In T. Metzinger (Ed.), Conscious experience (pp. 391–408). Paderborn: Ferdinand Schöningh. (English edition published by Imprint Academic.)

  • Knuth, D. E. (1998). The art of computer programming: Seminumerical algorithms (3rd ed.). Reading, MA: Addison-Wesley.

  • Leber, J. (2013). The immortal life of the Enron e-mails. Technology Review, 116(5), 15–16. http://www.technologyreview.com/news/515801/the-immortal-life-of-the-enron-e-mails/

  • Leigh, J. (2006). Applied digital control: Theory, design and implementation (2nd ed.). New York: Dover.

  • Lenat, D. B. (2009). Building a machine smart enough to pass the Turing Test: Could we, should we, will we? In Epstein et al. 2008 (pp. 261–282).

  • Lindholm, T., Yellin, F., Bracha, G., & Buckley, A. (2012). The Java virtual machine specification: Java SE 7 edition. http://docs.oracle.com/javase/specs/jvms/se7/html/index.html. Accessed 01 July 2012.

  • McDermott, D. (2001). Mind and mechanism. Cambridge, MA: MIT Press.

  • Millican, P., & Clark, A. (1996). The legacy of Alan Turing. Oxford: Clarendon Press.

  • Perlis, D. (2005). Hawkins on intelligence: Fascination and frustration. Artificial Intelligence, 169, 184–191.

  • Purtill, R. (1971). Beating the imitation game. Mind, 80(318), 290–294. [Reprinted in Shieber 2004 (pp. 165–171).]

  • Rothschild, L. (1986). The distribution of English dictionary word lengths. Journal of Statistical Planning and Inference, 14(2), 311–322.

  • Russell, S., & Norvig, P. (2010). Artificial intelligence: A modern approach (3rd ed.). Englewood Cliffs, NJ: Prentice Hall.

  • Searle, J. R. (1980). Minds, brains, and programs. The Behavioral and Brain Sciences, 3, 417–424.

  • Shannon, C. (1950a). A chess-playing machine. Scientific American, 182(2), 48–51. (Reprinted in Newman, J. R. (1956). The world of mathematics (Vol. 4, pp. 2124–2133). New York: Simon and Schuster.)

  • Shannon, C. (1950b). Programming a computer for playing chess. Philosophical Magazine, Series 7, 41(314), 256–275. (Reprinted in Levy, D. N. L. (Ed.) (1988). Computer chess compendium. New York: Springer.)

  • Shannon, C. E., & McCarthy, J. (Eds.) (1956). Automata studies. Annals of Mathematics Studies (Vol. 34). Princeton: Princeton University Press.

  • Sloman, A., & Chrisley, R. (2003). Virtual machines and consciousness. Journal of Consciousness Studies, 10(4–5), 6–45. [Reprinted in Holland 2003 (pp. 133–172).]

  • Smith, S., & Di, J. (2009). Designing asynchronous circuits using NULL convention logic (NCL). San Rafael: Morgan and Claypool Publishers.

  • Styler, W. (2011). The EnronSent corpus. Technical Report 01-2011, University of Colorado at Boulder Institute of Cognitive Science. http://verbs.colorado.edu/enronsent/

  • Turing, A. (1950). Computing machinery and intelligence. Mind, 59(236), 433–460.

  • Turing, A., & Brooker, R. (1952). Programmers’ handbook for the Manchester Electronic Computer Mark II (2nd ed.). http://www.computer50.org/kgill/mark1/progman.html

  • Wegener, I. (1991). The complexity of boolean functions. London: Wiley.

  • Weizenbaum, J. (1976). Computer power and human reason: From judgment to calculation. San Francisco: W. H. Freeman.


Acknowledgments

Thanks to Dana Angluin, Ned Block, Matt Ginsberg and especially the anonymous referees for useful suggestions. I take responsibility for the flaws that remain.


Corresponding author

Correspondence to Drew McDermott.

Electronic supplementary material

Appendix A (the exact interview rules) is provided as supplementary material (PDF 107 kb).

Appendices

Appendix 1: A Simple Programming Language

The algorithms used in Appendix A of supplementary material and Appendix 2 are expressed in a simple programming language. There is no distinction between commands and expressions; commands are just expressions with side effects. Assignments are written \(var \leftarrow val\). Sequences of statements are written as

$$ \begin{aligned} & \left\{ \right. \\ & \quad s_1; s_2; \ldots; s_n \\ & \left. \right\} \end{aligned} $$

Each \(s_i\) is an expression. Braces are used to eliminate ambiguity in grouping.

Conditionals are of the form if \(e_1\) then \(e_2\) [else \(e_3\)]. (The else is optional.)

Functions are defined thus:

$$ \begin{aligned} & {\tt define}\, name (\text{-\!-\!-}parameters\text{-\!-\!-}) \\ &\quad\quad e \\ \end{aligned} $$

where e is an expression, often a sequence.

Function parameters become locally bound variables when the function is applied to arguments. The other way to bind variables is

$$ \begin{aligned} {\tt let} \quad & v_1 = e_1 \\ & v_2 = e_2 \\ & \ldots \\ & v_k = e_k \\ & {\tt in} \\ &\quad e_0 \end{aligned} $$

which evaluates \(e_1,\ldots,e_k\), then binds \(v_1,\ldots,v_k\) to the resulting values during the evaluation of \(e_0\).

See “Argument Two: Why the Possibility of HTPLs Proves Nothing” for discussion of the constructs defined using in-parallel-with and exit.

Pseudo-code is written with italics.

Appendix 2: Proof of Theorem

I will use the term partial evaluation for the behavior-preserving transformations used to prove Theorem 1. The term refers to performing some of a procedure’s operations in advance when some of its arguments are known (Jones et al. 1993).

One of the transformations covered by the term “partial evaluation” is constant folding (Allen and Kennedy 2001), the process of substituting a known (usually numerical) value of a variable for all its occurrences, and then simplifying. Another is function unfolding, in which the definition of a function is substituted for one of its occurrences (with the resulting simplifications again propagated).
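The two steps can be illustrated on a toy encoding of expressions as nested tuples (a sketch under assumed representations, not the paper’s transformation machinery):

```python
# Expressions are nested tuples: ("+", a, b) for sums, ("call", name)
# for a call to a named zero-argument function; anything else is atomic.
def constant_fold(expr):
    """Replace ('+', a, b) by its value when both operands are numbers."""
    if isinstance(expr, tuple) and expr[0] == "+":
        a, b = constant_fold(expr[1]), constant_fold(expr[2])
        if isinstance(a, int) and isinstance(b, int):
            return a + b
        return ("+", a, b)
    return expr

def unfold_call(expr, definitions):
    """Substitute a function's body for a call to it, then simplify again."""
    if isinstance(expr, tuple) and expr[0] == "call":
        return constant_fold(definitions[expr[1]])
    return constant_fold(expr)
```

For example, unfolding `("call", "f")` with `f` defined as `("+", 2, 3)` folds all the way down to the constant 5.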

The simplifications to be propagated are straightforward, except for random-number generation. At every point in the program where a random number is needed, we perform randomness elimination. This is one of two transformations, depending on how the computer is designed:

  1. If the program actually relies on pseudo-random numbers (Knuth 1998, Ch. 3), the random-number generator is run at compile time (which changes the stored “seed” needed to produce the next pseudo-random number).

  2. If the computer has an actual “generate random number” instruction, then we generate one. By definition, the number depends on nothing in the program, so running it in advance is equivalent to running it in real time. (I doubt there are any computers in existence today that actually have such an instruction, but the Manchester Alpha machine, for which Turing co-authored a manual (Turing and Brooker 1952), had an instruction of this kind.)
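Case 1 can be sketched in a few lines (illustrative; the paper does not commit to a particular generator): a pseudo-random generator is deterministic given its seed, so “running it at compile time” yields exactly the numbers the program would have drawn at run time.

```python
import random

# Draw n pseudo-random numbers from a generator with a known seed.
# Because the generator is deterministic, the "compiled" program can
# use these values as constants in place of run-time calls.
def precompute_draws(seed, n):
    rng = random.Random(seed)
    return [rng.random() for _ in range(n)]

compile_time_draws = precompute_draws(seed=17, n=3)
run_time_draws = precompute_draws(seed=17, n=3)   # identical sequence
```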

Partial evaluation will also include three new transformations. The first is called input anticipation. It consists of transforming any piece of code with the form

[Code figure d (not reproduced): a fragment that reads a value r and then executes code A.]

in a situation where the possible values read are a finite set \(\{v_1, \ldots, v_n\}\) into

[Code figure e (not reproduced): an if-then-else branching on the value read, with A[\(v_i\)] in the branch for \(v_i\).]

where A[v] is A with v substituted for all free occurrences of r (see note 43).
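The transformation can be sketched on a toy representation (names are illustrative assumptions): the continuation A becomes a function of the value read, and anticipation precomputes one instance per possible input.

```python
# Input anticipation: replace "read, then run A" by a table with one
# precomputed continuation A[v] for each possible input value v.
def anticipate(code_A, possible_values):
    return {v: code_A(v) for v in possible_values}

# At "run time", the read reduces to a single table lookup.
def run_anticipated(table, value_read):
    return table[value_read]
```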

The second new transformation is let-elimination. Any expression of the form

[Code figure f (not reproduced): a let expression binding \(v_1,\ldots,v_k\) in a body e.]

may be transformed into

[Code figure g (not reproduced): the same code with the fresh variables \(u_1,\ldots,u_k\) assigned at global scope.]

where each \(u_i\) is a variable that occurs nowhere else in the program and \(e[u_1,\ldots,u_k]\) is e with every free occurrence of \(v_i\) replaced by \(u_i\). (The new variables have global scope.)
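A toy sketch of the renaming step (the tuple encoding and helper names are illustrative assumptions): each let-bound variable gets a fresh name used nowhere else, so the bindings can be hoisted to global assignments.

```python
import itertools

_fresh = itertools.count(1)   # source of names that occur nowhere else

def eliminate_let(bindings, body):
    """bindings: list of (var, expr) pairs; body: a nested-tuple
    expression. Returns (global assignments, body with vars renamed)."""
    renaming = {v: "u%d" % next(_fresh) for v, _ in bindings}
    assignments = [(renaming[v], e) for v, e in bindings]

    def rename(expr):
        if isinstance(expr, str):
            return renaming.get(expr, expr)
        if isinstance(expr, tuple):
            return tuple(rename(sub) for sub in expr)
        return expr

    return assignments, rename(body)
```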

The third new transformation is if-raising. Code of the form

[Code figure h (not reproduced): an if-then-else chain each of whose clauses ends in another if-then-else.]

may be transformed into

[Code figure i (not reproduced): the flattened if-then-else chain with conjoined conditions.]

The idea is that if every clause of an if-then-else statement ends in an if-then-else, then those terminal if-then-elses can be raised to the level of the original if-then-else, provided we add an extra condition to every if-test. For example, the last line mimics the last else-if clause of the original schema by adding the gating condition \(P_k\) that used to be there implicitly because of the nested control.

Be sure to note the (easy-to-miss) semicolon between the first k if clauses and the rest of the program. It means that after those first tests are run, control passes to the remaining ones without returning to the first group.
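The raising step can be sketched on a toy encoding of nested conditionals, with guards kept symbolic (an illustration under assumed representations, not the paper’s transformation):

```python
# A decision tree is ("if", [(guard, branch), ...]) where each branch is
# either another "if" tree or a leaf action. if-raising flattens the
# nesting into one level whose guards are conjunctions (tuples of the
# guards on the path from the root).
def raise_ifs(tree, path=()):
    if isinstance(tree, tuple) and tree[0] == "if":
        flat = []
        for guard, branch in tree[1]:
            flat.extend(raise_ifs(branch, path + (guard,)))
        return flat
    return [(path, tree)]   # leaf: (conjoined guards, action)
```

For instance, a two-level tree with outer guards P1, P2 and inner guards Q1, Q2 under P1 flattens into clauses guarded by (P1 and Q1), (P1 and Q2), and (P2).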

Theorem 1

The call to aunt_bertha_loop(\({\tt KB}_0\)) in context T may be transformed via partial evaluation into a list of if statements that is isomorphic to the HTPL.

Proof

The first step in transforming the program is to add an argument that records an upper bound on the amount of time the program has left.

[Code figure j (not reproduced): aunt_bertha_loop_t, the loop augmented with a max_time_left parameter and an if max_time_left > 0 test.]

We call this version aunt_bertha_loop_t. The “react” pseudocode has been altered to return the total time \({\tt T}_C\) it took to type the output, a timed string; and \({\tt T}_{MJ}{\tt (R)}\) is the minimum time it would take the judge to type the string R. Adding the if statement doesn’t affect correctness because, assuming the initial value of max_time_left is ≥ the actual amount of time remaining on the clock, it obviously remains an upper bound in the recursive call to aunt_bertha_loop.
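A hedged sketch of the loop’s structure as just described (the paper’s actual listing is not reproduced above; react, the timing values, and the input list are illustrative stand-ins):

```python
# Each judge input is a pair (string, minimum typing time); react returns
# (reply, reply's typing time, new knowledge base). The loop recurses
# until the time bound is exhausted or the inputs run out.
def aunt_bertha_loop_t(kb, max_time_left, judge_inputs, react, transcript):
    if max_time_left > 0 and judge_inputs:
        (r, t_mj), rest = judge_inputs[0], judge_inputs[1:]
        s, t_c, kb_new = react(kb, r)
        transcript.append(s)
        aunt_bertha_loop_t(kb_new, max_time_left - t_mj - t_c,
                           rest, react, transcript)
    # else: exit -- the alarm clock fired or the interview ended
```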

We ensure that this is indeed the case by replacing the original call to aunt_bertha_loop with

[Code figure k (not reproduced): the original call rewritten as a call to aunt_bertha_loop_t with max_time_left initialized to one hour.]

which is equivalent to

[Code figure l (not reproduced): the same call after one function unfold and one constant fold.]

(We do one function unfold, then one constant fold; because \({\tt 1}\,\hbox{h} {\tt > 0}\) evaluates to true , we can replace the if statement with its then clause.)

The statement binding R is the bit that reads what the judge types. There are \(10^{445}\) possible values for R. Because we are back in Unbollywood, where space and time cost essentially nothing, we can use input anticipation to introduce a \(10^{445}\)-way branch (see note 44) after the input statement. The program has become the version shown in Fig. 4.

Fig. 4: Program after branch expansion

Within each branch in Fig. 4, the values of both R and \({\tt KB}_0\) are known. Several consequences follow from this fact. The intricate structure of code I’ve summarized as “react to …” can be simplified. Everywhere a test is performed that depends on the value of R , we can eliminate the dependency, discarding all paths through the code that are incompatible with what we know the value of R to be. When a random number is needed, we apply randomness elimination, changing the call to the random-number generator into a constant. (The output from a random-number generator is a number chosen from a uniform distribution. Often the outputs are squeezed into a different distribution using parameters available at run time; the values of these parameters are available during partial evaluation as well.)

There are only three results of this process we care about:

  1. The variable \({\tt T}_{MJ}{\tt (R)}\) in each branch of the if-then-else has become constant, so we can compute immediately the minimum time it would have taken to read R.

  2. The characters that are typed by the “react to” code, and their times, can be collected as a (timed) string S. The net behavior can be written as print S.

  3. The variables \({\tt KB}_{\tt new}\) and \({\tt T}_C\) can be computed.

Hence, in each branch, we can replace the code written “react to …” with print S, and the call to aunt_bertha_loop_t with the definition of aunt_bertha_loop_t , with constants substituted for its arguments. This expanded definition begins

$$ {\tt if}\,T > 0 \, {\tt then }\, C \ldots $$

where T is a constant, the value of max_time_left in the call being expanded. In this first round of expansions, T is likely to be greater than 0 in virtually every call to aunt_bertha_loop_t, because a single exchange between the judge and the simulation is unlikely to take more than an hour (see note 45). So we can replace the if with C, which looks like

[code figure m]

T and KB are constants, different in each branch. The result looks like Fig. 5, where these constants have been given subscripts.

Fig. 5 Program after further branch expansion

When we’re done with all that, we start our series of transformations all over again on each new instantiation of aunt_bertha_loop_t , unfolding it, adding an if statement to branch on each possible value of the input, and partially evaluating each branch.
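The unfolding step itself can be mimicked in miniature (a sketch, with hypothetical names; the real transformation operates on program text, not strings): because max_time_left is a constant known during transformation, the guard of the expanded definition is decided then and there, and the residual code contains only the live branch.

```python
# Toy residual-code construction for "if T > 0 then C else exit",
# where T is a constant known during transformation, not at run time.
# The body string stands in for C; its exact contents are hypothetical.
def unfold_guard(T, body_src="print S; recur with smaller max_time_left"):
    # The test on the constant folds away; only one branch survives
    # into the residual program.
    return body_src if T > 0 else "exit"
```

In the first round of expansions the result is almost always the body; only as max_time_left shrinks do branches start collapsing to exit.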

What we would like to do is apply if-raising repeatedly. But the lets are in our way. This is where let-elimination comes in (not too surprising). In each branch we create a new variable, R_i for the i’th branch; so that branch \(10^{445}\) will have the variable \(R_{10^{445}}\). The result is as shown in Fig. 6 (Footnote 46).

Fig. 6 Program after applying let-elimination in every branch

Each read can be subjected to input anticipation, and further expansion ensues. After the next round each clause of the outer if will be of this form:

[code figure n]

That means we can use if-raising, transforming the program into the form schematized in Fig. 7. In this figure, the “first if” has \(10^{445}\) branches; the second, \((10^{445})^2\).

Fig. 7 Sketch of program after applying if-raising to the outer loop
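What if-raising does to the branch structure can be shown on toy data (the tree representation and the sample strings here are invented): a nested tree of tests on successive inputs flattens into one list of guarded outputs, each guard a conjunction over the whole input prefix, exactly the “first if” / “second if” shape of Fig. 7.

```python
# A sketch of if-raising: hoist inner if statements outward, so that a
# nested tree of tests on R, R_2, ... becomes a flat list of
# (guard, output) pairs, each guard a tuple of required input values.
def raise_ifs(tree, prefix=()):
    """tree maps an input value either to an output string (a leaf)
    or to a deeper tree of tests; returns flat (guard, output) pairs."""
    flat = []
    for value, sub in tree.items():
        guard = prefix + (value,)
        if isinstance(sub, dict):
            flat.extend(raise_ifs(sub, guard))   # hoist the inner if outward
        else:
            flat.append((guard, sub))
    return flat
```

On a tree with branching factor \(10^{445}\), the flat list at depth k has \((10^{445})^k\) entries, which is where the counts in Fig. 7 come from.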

The program will gradually evolve into a gigantic list of if statements, which occasionally emits some characters to be sent to the judge, and along the way builds and preserves data structures (the KBs) for future use. Although rather bulky, the list is finite, because, even though aunt_bertha_loop_t is written as a potentially infinite recursion (which will be terminated by the alarm clock if necessary), the argument max_time_left is smaller for each recursive call. In every branch it eventually becomes the case that the if statement if max_time_left > 0 then … else exit expands into exit.
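The finiteness argument can be made concrete with toy numbers (the five-second minimum exchange time is hypothetical; only the strict decrease matters): since max_time_left shrinks by at least the minimum exchange time on every recursive call, every path of the unrolling reaches exit after finitely many rounds.

```python
# Toy account of why full unrolling terminates: the time budget
# strictly decreases, so the nesting depth on any path is bounded.
MIN_EXCHANGE_SECONDS = 5        # hypothetical: no exchange is instantaneous

def unroll_depth(max_time_left):
    depth = 0
    while max_time_left > 0:    # the guard that eventually fails
        max_time_left -= MIN_EXCHANGE_SECONDS
        depth += 1
    return depth                # number of nested expansions on this path
```

An hour-long test with five-second exchanges bottoms out after 720 expansions; the real bound is whatever the maximal number of exchanges in an hour is.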

Now, this list of ifs is isomorphic to the HTPL. Each test is of the form

$$ {\tt if\,R} = \ldots {\tt and}\,{\tt R}_{j_{2}}\, {\tt = }\ldots {\tt and} \,\ldots{\tt R}_{j_{k}}\, {\tt =}\, \ldots\, {\tt then} $$

(Treat R as if it were \({\tt R}_0\) and let \(j_0 = 0\).) In this list of tests, k starts at 1 and there are \(10^{445}\) branches of that length; then it becomes 2 and there are \((10^{445})^2\) branches of length 2; and so forth up to k = the maximal number of exchanges between judge and examinee that can occur in one hour.

This might as well be a (rather laborious) table lookup for the string corresponding to the conversation so far. At first we check for strings of length 1, then strings of length 2, and so forth (Footnote 47). These strings correspond exactly to the key strings used in the HTPL (Footnote 48). QED
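The final identification can be made concrete (the table entries below are invented for illustration; the “Herb” example is the one discussed next): operationally, the flat chain of prefix tests is nothing but a table keyed on the conversation so far.

```python
# The flattened if-chain, viewed as what it is: a lookup table whose
# keys are conversation prefixes (the judge's inputs so far) and whose
# values are the canned replies. Entries are hypothetical.
TABLE = {
    ("Hello",): "Hi, I'm Aunt Bertha.",
    ("Hello", "What's your husband's name?"): "Herb",
}

def ht_program(inputs_so_far):
    # One probe replaces the entire cascade of k-way conjunction tests.
    return TABLE.get(tuple(inputs_so_far), "exit")
```

The cascade of tests of increasing length k and the single table probe compute the same function; they differ only in how laboriously the key is matched.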

Please note the fate of the knowledge base as these transformations are made. Each version of KB_new reflects the acquisition of a few new beliefs as a result of one interchange with the judge, and of course the retention of old ones. Initially the facts married(me, husb) and name(husb, “Herb”) might be stored in the cloud, so that if anyone asks for AB’s husband’s name, AB can respond “Herb” . A new fact like name(judge, “Pradeep”) might be added later. At some point the response “Herb” to the query “What’s your husband’s name?” or variants thereof occurs in a huge number of branches of the tree, and similarly for the query “What’s my name, do you remember?” . But as branches are closed off because they exhaust all the available time, these versions of the KB are discarded. If the transformation process runs to completion, eventually every possible way that any piece of information recorded in the KB might be reflected in AB’s behavior is so reflected, and there is no longer any need for the knowledge base. We are in behaviorist heaven, where it really is the case that any fact about what the program believes can be expressed as an (incredibly large but finite) disjunction of dispositions to behave in certain ways.

McDermott, D. On the Claim that a Table-Lookup Program Could Pass the Turing Test. Minds & Machines 24, 143–188 (2014). https://doi.org/10.1007/s11023-013-9333-3
