1 Introduction

1.1 Inevitability of Cognitive Augmentation

There exists an “ecosystem” that will serve as a significant catalyst of change in the human-computer experience [12]. The impending change may be comparable to the impact of the World Wide Web during the tech boom of the 1990s. Consumers are adopting these systems now, and companies will soon follow suit.

  • 11 million Alexa devices had been sold as of January 2017 [1].

  • 1.5 billion smartphones have been sold with cognitive augmentation apps (Siri, Google, Cortana) [2].

  • Investment in AI technology was approximately $600 million in 2016 and is expected to reach $37.8 billion by 2025 [4].

  • SAP Ariba to use Watson AI with procurement data to produce “predictive insights” for supply chains [3].

The influx of AI will have organizational behavioral implications with regard to cognitive systems, particularly cognitive augmentation of human operators. Such implications can be measured with metrics that have yet to be established. These metrics should evaluate behavioral characteristics of human-cog and cog-cog interactions. Consequently, there is potential to apply those metrics to situations where effective personal cognitive augmentation is required.

1.2 Related Problems

“Brittleness” - When vocally interacting with a personal cognitive agent:

  • The device does not understand your phrasing.

  • The device misunderstands your intent.

  • The device cannot find an answer, when you know one exists.

  • The device offers an answer that is technically correct but lacks sufficient detail.

“Directive Contention” - When more than one device (on the same or different platforms) is in use:

  • Devices may answer differently or begin researching at different times.

  • The human operator cannot delegate question priority among the devices.

  • A device cannot extract the directives from a multi-part question that align with its responsibility domain.

Developers of personal cognitive augmentation agents (PCAs) and platforms such as IBM Watson internally measure the quality of interaction and of information transmitted from human operator to cog. One approach is to measure brittleness, an anomalous result caused by comprehension gaps between what the operator says and what the cog understands. Cog platform application programming interfaces (APIs) provide supervised machine learning mechanisms that establish continuity between what is spoken (the utterance) and what is intended (the intent), producing an utterance/intent relationship. However, the evaluation methodologies applied by these APIs are proprietary and differ between platforms. As such, there should exist publicly available, standardized evaluation practices that assess cognitive augmentation interactivity. This paper explores tools that can provide a foundation for such standards.
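
To make the utterance/intent relationship concrete, the following is a minimal sketch that scores an utterance against each intent’s training utterances using bag-of-words cosine similarity. This is a hypothetical stand-in for the proprietary scoring inside commercial cog platforms, not any vendor’s actual algorithm; all names and data are illustrative.

```python
import math
from collections import Counter

def bow(text):
    """Bag-of-words vector for a lowercase, whitespace-tokenized utterance."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def score_intents(utterance, intents):
    """Return each intent's best similarity to the utterance (a toy UIR score)."""
    return {name: max(cosine(bow(utterance), bow(u)) for u in train)
            for name, train in intents.items()}

# Hypothetical training data for illustration only.
intents = {
    "getRequiredMaterials": ["Do I need a textbook for this course?",
                             "What materials does this class require?"],
    "getSchedule": ["When does this class meet?"],
}
print(score_intents("What book do I need for this class?", intents))
```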

1.3 Practical Contribution

With the emergence of big data analytics, it will be necessary to discriminate among numerous and varied potential answers to business questions. Cognitive augmentation would be a mechanism used to process the volumes of results offered by big data and similar platforms. Moreover, a business entity will need the ability to evaluate communication between its stakeholders, specifically when some stakeholders exist in the form of a cognitive system or agent. With a standardized set of metrics, managers may be able to evaluate communication between stakeholders in the enterprise as well as the efficacy of human/cog augmentation. Measuring utterance/intent relationships is a step toward realizing communication assessment in this domain.

2 Literature

This study assesses interrelationships between information theory, information science, representational information theory and human-robot interaction. Efforts are already under way in the field of human-robot interaction (HRI) [23]. Researchers continue to explore a practical symbiotic relationship between humans and computers.

2.1 Humans and Computers

The idea of artificial intelligence and human task support has been explored for decades. The early works of Newell, Engelbart and Licklider in the 1960s reveal a desire for human-computer symbiosis [20] and frameworks [10] that improve the efficiency of tasks performed by humans. Almost 60 years later, advancements in technology have strengthened the relationship between humans and computers, specifically by shifting mental processing capacity and physical tasks to machines. Weizenbaum’s ELIZA was able to converse in English on any topic [32]. Ted Shortliffe developed an expert system for medical diagnosis [27]. Hans Moravec developed an autonomous vehicle with collision avoidance in 1979 [22]. That same year, BKG, a backgammon program, defeated the world champion [6]. The Chinook program beat checkers world champion Tinsley in 1994 [5]. Google introduced a self-driving car in 2009 [29]. IBM’s Watson AI agent defeated Ken Jennings to become Jeopardy! champion [16].

2.2 A Cognitive Era Emerges

The work continues as a confluence of technologies enables the cog ecosystem. Dr. Ron Fulbright identifies six classes of technology working together to provide a backbone for large-scale interconnected cognitive entities [12].

  • Deep Learning: Multi-layered supervised machine learning algorithms utilizing convolutional neural networks [17]. Cognitive systems tap into deep learning algorithms to develop systems with human-expert-like performance [12].

  • Big Data: Almost limitless datasets of granular data derived from multiple sources [14].

  • Internet of Things: A global network of machines [18] producing ambient data without human input, evaluated by deep learning algorithms [12].

  • Open Source AI: ROS, an open-source Robot Operating System [21], is one of many open source projects that allow many developers to work on the same project from the comfort of their garages, basements, attics and pajamas.

  • NLI: Natural language interfaces are application libraries used to facilitate person-machine communication [15].

  • Connected Age: The adoption of smartphones, tablets, wearable technologies, Internet, Cloud services by millions of users globally provide a market for cog-enabled applications like Siri, Alexa and Cortana [12].

Tapping into this ecosystem are companies like IBM, Amazon, Google, Apple and Facebook. They are investing billions of dollars into artificial intelligence architectures [22].

2.3 Brittleness

Brittleness is an unstable system behavior brought on by data validation failures or degradation in some other foundational process [7]. The term was used to describe software subject to disruption during the transition to the year 2000 in the Y2K crisis of the late 1990s. Brittle system behavior is also a term applied to expert systems architecture [19]. To avoid brittleness in cognitive systems, I look to understand and measure utterance/intent relationships as a root cause of this phenomenon.

2.4 Utterance Intent Relationships

There is a relationship between an utterance and an intent. In the literature, phrasing is covered under a metric called situation-specific vocal register [9], more explicitly defined here as an utterance (Uh): an articulated utterance originating from a human operator (h). After accepting an utterance Uh, the PCA evaluates its phraseology with one or more cognitive system platform APIs (Cogx) for utterance/intent relationship (UIRPCA) quality. UIRPCA quality is defined by the degree to which Uh matches predefined intents (IPCA), and it is scored differently in each platform. Cogx engines typically apply natural language interface (NLI) logic to Uh, evaluating it against predefined IPCA linked to predefined training utterances (Utrain). Increased UIRPCA scores result in a better outcome for the operator/cog interaction. The IBM Watson API (CogIBM) applies a metric called weighted evidence scores (WES) to evaluate a confidence relationship between utterance and intent; in this scenario, the WES confidence score derived from CogIBM equates to UIRPCA. New Utrain examples are introduced to a set of objects that systematically train CogIBM. Machine learning categorizes and ranks the clauses/words in each phrase when applied to Natural Language Understanding (NLU) in CogIBM. Figure 1 describes an utterance path to response effectiveness. Methods in this paper will reveal a connection between the quality of an utterance and its influence on the UIR as it follows the path.

Fig. 1. Utterance path to response effectiveness. This paper addresses everything but response effectiveness (Rh); Rh will be part of a larger experiment in a future paper.
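
As a concrete illustration of the training mechanism described above, the sketch below registers training utterances against an intent and retrieves a confidence score. It assumes the ibm-watson Python SDK’s AssistantV1 interface, which postdates this work; the API key, service URL and workspace ID are placeholders, and the actual CogIBM calls used in this study may differ.

```python
# Minimal sketch, assuming the ibm-watson Python SDK (AssistantV1).
# API key, service URL and workspace ID are hypothetical placeholders.
from ibm_watson import AssistantV1
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

assistant = AssistantV1(version="2021-06-14",
                        authenticator=IAMAuthenticator("YOUR_API_KEY"))
assistant.set_service_url("https://api.us-south.assistant.watson.cloud.ibm.com")

WORKSPACE = "YOUR_WORKSPACE_ID"

# Register training utterances (Utrain) under one intent (IPCA).
assistant.create_intent(
    workspace_id=WORKSPACE,
    intent="getRequiredMaterials",
    examples=[{"text": "Do I need a textbook for this course?"},
              {"text": "What materials does this class require?"}])

# Submit an articulated utterance (Uh) and read the WES-style confidence,
# which stands in for UIRPCA in this paper.
result = assistant.message(
    workspace_id=WORKSPACE,
    input={"text": "What book do I need for this class?"}).get_result()
for intent in result["intents"]:
    print(intent["intent"], intent["confidence"])
```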

Figure 2 illustrates an architectural view of utterance/intent modeling. The model allows for interoperability between heterogeneous Cogx workspaces. Any PCAx may tie its skill, action or bot to any combination of Cogx platforms. A platform typically uses JavaScript Object Notation (JSON) to manage the data structures that make up an interaction model (UIR model) for evaluating incoming Uh. The agent parses Uh and compares the result with a specific intent domain’s Utrain. Intent domains build context around entities, or slots (E). Each E can have synonyms (S) applied to it. Synonyms aid in fine-tuning UIRs so they stand apart from other very similar UIRs. Consider the following example.

Fig. 2. UIR modeling architecture.

$$ U_{h} = \text{``What book do I need for this class?''} $$
(1)
$$ U_{train} = \text{``Do I need a \{material\} for this \{course\}?''} $$
(2)
$$ I_{PCA} = \{\text{getRequiredMaterials}\} $$
(3)
$$ I_{PCA}.E.\{\text{type}\} = \{\text{material, course}\} $$
(4)
$$ I_{PCA}.E.\{\text{material}\} = \{\text{headphones, textbook, notebook, tablet}\} $$
(5)
$$ I_{PCA}.S.\{\text{textbook}\} = \{\text{book, publication, ISBN number}\} $$
(6)

The path to the intent is as follows:

$$ U_{h}.\{\text{book}\} \to S.\{\text{textbook}\} \to E.\{\text{material}\} \to U_{train}.\{\text{material}\} \to I_{PCA}.\{\text{getRequiredMaterials}\} $$
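
A minimal sketch of this resolution path follows; the dictionary contents and helper names are my own invention for illustration, not taken from any particular Cogx platform.

```python
# Hypothetical interaction model mirroring Eqs. (1)-(6).
SYNONYMS = {"book": "textbook", "publication": "textbook"}
ENTITIES = {"textbook": "material", "headphones": "material",
            "notebook": "material", "tablet": "material"}
INTENTS = {"material": "getRequiredMaterials"}  # entity type -> intent

def resolve_intent(utterance):
    """Walk Uh -> S -> E -> Utrain slot -> IPCA for each keyword."""
    for word in utterance.rstrip("?").lower().split():
        lemma = SYNONYMS.get(word, word)   # Uh.{book} -> S.{textbook}
        entity = ENTITIES.get(lemma)       # S.{textbook} -> E.{material}
        if entity in INTENTS:              # slot match -> IPCA
            return INTENTS[entity]
    return None

print(resolve_intent("What book do I need for this class?"))
# -> getRequiredMaterials
```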

3 Methods

3.1 Research Question

The following research question establishes a two-part goal:

  • Determine a set of measures (potentially metrics) to evaluate brittleness quantitatively.

  • Evaluate the brittleness effect based on application of the measures from goal 1. A strong relationship between operator utterances and training utterances implies a strong utterance/intent relationship (UIR). Strong utterance/intent relationships should lead to an improved response from the PCA, thereby reducing the brittleness effect. Future research will address phrasing quality and response quality from a human operator’s perspective.

RQ1: How can brittleness be measured and reduced in personal cognitive agents?

3.2 Hypothesis

I evaluate three hypotheses in this paper, linking the quality of training utterances (QUtrain) to an improved UIR score while applying a static set of articulated utterances (Uh). Furthermore, I assess training utterance quality by calculating cognitive value with an assessment algorithm called CogMetrix.

  • H1: As the number of unqualified training utterances \( |\vec{U}_{train}| \) in a set increases, UIRPCA will also increase, thereby improving the confidence scores. H0: adj. R2 < .8 and p-value > .05.

  • H2: As the quality of training utterances (QUtrain|Utrain) in a set increases, UIRPCA will also increase, thereby improving the confidence scores. H0: \( \Upsilon(\mathrm{UIR}(QU_{train}) \mid \mathrm{UIR}(U_{train})) < \tau_{UIR} = -.2 \).

  • H3: As the number of qualified training utterances \( |\overrightarrow{QU}_{train}| \) in a set increases, UIRPCA will also increase, thereby improving the confidence scores. H0: adj. R2 < .8 and p-value > .05.

Variables used in the preceding hypotheses are included in Table 1.

Table 1. Variables

3.3 Cognitive Value

Cognitive value, or cognitive gain, is an emergent measure developed by Fulbright that utilizes representational information theory [13]. He builds on Vigo’s theory quantifying the structural complexity of information [32]. Structural complexity in turn serves as the foundation for a key component of cognitive value (\( \hbar \)). \( \hbar \) identifies the amount of informative value an object offers to its representational concept. As it relates to this paper, the representational concept is the intent (IPCA). As training utterances are collected, the relative effect on conceptual understanding trends in either a positive or negative direction. Any utterance that positively complements concept understanding is included in a subset of qualified utterances. As such, an optimization effect will emerge, offering a set of well-defined utterances that yields a best case for an efficient rule-based machine learning process. It is necessary to evaluate a master set of unqualified utterance candidates because multiple intents may exist in a Cogx application. This set is defined as a universe of unqualified utterances.

I apply cognitive value (\( \hbar \)) as a quality measure compared against a discrimination threshold τUtrain used to determine qualified utterances (QUtrain). The value of τUtrain is arbitrary and set to 1. When cognitive value is less than τUtrain, the training utterance is included in a new set of qualified training utterances. Cognitive value assesses change in structural complexity between attribute values in a set of objects called categorical stimuli. While there are many potential attributes one could use to evaluate speech in natural language understanding, I chose three for this exemplar: parts-of-speech model (POSModel), dominant entity and statement type. See an example JSON object for a set of utterances in Fig. 3.

Fig. 3. JSON object with training utterances.
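
Figure 3 is not reproduced here; the following is a hypothetical reconstruction of such a JSON object, expressed as Python data, based on the three attributes named above. The field names (including UtterancePOSModel) follow the text, but the exact schema is an assumption.

```python
# Hypothetical reconstruction of the Fig. 3 training-utterance object.
# Attribute names follow the text; the exact schema is an assumption.
training_utterances = [
    {
        "text": "Do you require a charger for this class?",
        "UtterancePOSModel": "VBP PRP VB DT NN IN DT NN .",  # approximate tags
        "dominantEntity": "material",
        "statementType": 3,  # interrogative
    },
    {
        "text": "Do I need a textbook for this course?",
        "UtterancePOSModel": "VBP PRP VB DT NN IN DT NN .",
        "dominantEntity": "material",
        "statementType": 3,
    },
]
```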

The POSModel is a string of tags produced by the Stanford University POS tagging utility [33]. See the example attribute called UtterancePOSModel in Fig. 3.
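
The paper uses the Stanford tagger; as a stand-in, the sketch below derives an equivalent POSModel string with NLTK’s default tagger, which emits the same Penn Treebank tag set.

```python
# Sketch: derive a POSModel string from an utterance.
# Uses NLTK's default tagger as a stand-in for the Stanford POS tagger;
# both emit Penn Treebank tags. Requires nltk.download("punkt") and
# nltk.download("averaged_perceptron_tagger").
import nltk

def pos_model(utterance):
    tokens = nltk.word_tokenize(utterance)
    return " ".join(tag for _, tag in nltk.pos_tag(tokens))

print(pos_model("Do I need a textbook for this course?"))
# e.g. "VBP PRP VB DT NN IN DT NN ."
```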

Furthermore, I extract a dominant entity based on keywords in the phrase. Dominant entities ultimately lead to intent resolution. The application compares keywords against an entity dictionary, part of the interaction model common to Cogx applications. If a keyword is present in the dictionary, its entity lemma is returned and assessed for fitness to be assigned the dominant entity attribute. Assessing lemma fitness as a dominant entity goes beyond the scope of this paper and will be included in future research. A sample dictionary can be found in Appendix 1.

Statement type is one of four possible values: declarative (1), imperative (2), interrogative (3) or exclamatory (4).

Next, I calculate structural complexity using Vigo’s Generalized Invariance Structure Theory (GIST) algorithm. GIST is an invariance extraction mechanism applied to a set of categorical stimuli in a concept [32]. Invariance is a measure of similarity among attribute values of categorical stimuli. Structural complexity is established by determining the amount of invariance in a set of objects. In this exemplar, the categorical stimuli are the training utterances. The dimensions within the categorical stimuli are the POSModel, statement type and dominant entity. Examples of attribute values can be found in Fig. 3. The GIST algorithm itself goes beyond the scope of this paper, but I will include a generalized abstraction. The structural complexity equation is given in Eq. (7), where \( p \) is the number of objects/utterances in the set and \( v \) is the amount of similarity/invariance of values in an object’s dimension.

$$ \psi(\vec{F}) = p\,e^{-\sqrt{\left(\frac{v_{1}}{p}\right)^{2} + \left(\frac{v_{2}}{p}\right)^{2} + \cdots + \left(\frac{v_{D}}{p}\right)^{2}}} $$
(7)

GIST calculates a Euclidean distance between the values of the free dimensions of an object after removing one bound dimension. Similar objects are adopted by comparing the distances to a discrimination threshold \( \tau_{d} = 0 \), where \( d \) is the bound (removed) dimension. Object dimension value distances are measured with a similarity function \( e^{-\Delta^{r}_{[d]}(\vec{obj_{i}}, \vec{obj_{j}})} \). A distance of 0 returns 1 when applied to the similarity function: \( e^{-1 \cdot 0} = 1 \). I sum the 1s and divide the result by the total number of objects \( (|\vec{F}| = p) \). The process yields an invariance measure per dimension, whose values are plugged into the structural complexity equation in Eq. (7).
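
A minimal sketch of this abstraction follows. It counts, per dimension, the objects whose remaining attribute values exactly match another object when that dimension is suppressed; the exact-equality matching rule is my simplifying reading of the abstraction above, not Vigo’s full GIST algorithm.

```python
import math

def invariance(objects, d):
    """v_d: count of objects that match some other object on all
    dimensions except the bound (removed) dimension d."""
    count = 0
    for i, a in enumerate(objects):
        reduced_a = a[:d] + a[d + 1:]
        if any(reduced_a == b[:d] + b[d + 1:]
               for j, b in enumerate(objects) if j != i):
            count += 1  # zero distance -> e^0 = 1
    return count

def structural_complexity(objects):
    """psi(F) = p * exp(-sqrt(sum((v_d / p)^2))), per Eq. (7)."""
    p = len(objects)
    dims = len(objects[0])
    return p * math.exp(-math.sqrt(sum((invariance(objects, d) / p) ** 2
                                       for d in range(dims))))

# Each utterance as (POSModel, statementType, dominantEntity); toy values.
F = [("VBP PRP VB DT NN IN DT NN .", 3, "material"),
     ("VBP PRP VB DT NN IN DT NN .", 3, "material"),
     ("WP NN VBP PRP VB IN DT NN .", 3, "material")]
print(structural_complexity(F))
```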

Consider the following concepts \( \vec{F} \) and \( \vec{G} \). \( \mathbf{R} \) is an element of set \( \vec{F} \), and \( \vec{G} \) is the subset of \( \vec{F} \) without \( \mathbf{R} \). I use Eq. (7) to calculate structural complexity for both sets.

$$ \vec{F} = \vec{U}_{train} = \{\text{Master set of unqualified training utterances listed in Appendix 2}\} $$
(8)
$$ \mathbf{R} = \vec{U}_{train}(1) = \{\text{``Do you require a charger for this class?''}\} $$
(9)
$$ \vec{G} = \vec{F} - \mathbf{R} $$
(10)
$$ \psi(\vec{F}) = 1.558 $$
(11)
$$ \psi(\vec{G}) = 1.773 $$
(12)
$$ \tau_{Utrain} = 1 $$
(13)

Next, I calculate the structural complexity of \( \vec{G} \) as it relates to \( \vec{F} \) and assess the outcome of \( \mathbf{R} \) for its fitness as a qualified training utterance.

$$ \overrightarrow{QU}_{train}(\mathbf{R}) = \hbar(\mathbf{R} \mid \vec{F}) < \tau_{Utrain} $$
(14)
$$ \hbar(\mathbf{R} \mid \vec{F}) = \frac{\psi(\vec{G}) - \psi(\vec{F})}{\psi(\vec{F})} $$
(15)
$$ \frac{\psi(\vec{G}) - \psi(\vec{F})}{\psi(\vec{F})} = \frac{1.773 - 1.558}{1.558} $$
(16)
$$ \hbar_{1} = 0.138 $$
(17)
$$ 0.138 < 1 $$
(18)
$$ \mathbf{R}\ \text{is adopted and added to}\ \overrightarrow{QU}_{train} $$
(19)
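
The worked example in Eqs. (15)-(18) is easy to verify numerically; a small check (names mine) applying the same test:

```python
# Numeric check of Eqs. (15)-(18); psi values taken from Eqs. (11)-(12).
psi_F, psi_G = 1.558, 1.773
hbar = (psi_G - psi_F) / psi_F   # Eq. (15): relative change in complexity
print(round(hbar, 3))            # 0.138, matching Eq. (17)
print(hbar < 1)                  # True -> R is adopted into QU_train
```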

A selection table is found in Table 3.

3.4 Applications

I wrote two applications to evaluate UIR:

  • WatsonAskSirDexConversationAPI – connector between CogMetrix and CogIBM

  • CogMetrix – application of the Cognitive Agreement algorithm

3.5 Procedure

I capture the change in UIRPCA with respect to both unqualified and qualified training utterances by applying a static set of articulated utterances, as text, to CogIBM.

First, I define a set of twenty (|Uh|) random articulated utterances \( (\vec{U}_{h}) \), found in Table 3, followed by a random set of thirty-eight (|Utrain|) unqualified training utterances \( (\vec{U}_{train}) \), found in Appendix 2. I add Utrain examples to CogIBM in stepwise fashion until I reach |Utrain|. At each step, I apply all \( \vec{U}_{h} \) to CogIBM and record the results.

Next, I build a subset of \( \vec{U}_{train} \) called \( \overrightarrow{QU}_{train} \) by calculating \( \hbar \) with CogMetrix for each Utrain. CogMetrix tests each candidate for \( \hbar \); an element is discarded when \( \hbar \geq \tau_{Utrain} = 1 \), leaving the final set of qualified training utterances found in Appendix 3.
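
In CogMetrix terms, the selection step amounts to a leave-one-out filter over the master set; the sketch below is my rendering of that logic, not CogMetrix source code. The `psi` argument is any structural-complexity function, such as the Eq. (7) sketch earlier.

```python
def qualify(utterances, psi, tau=1.0):
    """Keep each utterance R whose cognitive value hbar(R | F) is below tau."""
    psi_F = psi(utterances)                        # complexity of master set F
    qualified = []
    for r in utterances:
        G = [u for u in utterances if u is not r]  # G = F - R, Eq. (10)
        hbar = (psi(G) - psi_F) / psi_F            # Eq. (15)
        if hbar < tau:                             # Eq. (14)
            qualified.append(r)
    return qualified
```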

Having created the set of qualified training utterances, I can assess the quality impact on UIRPCA by first replacing all Utrain with QUtrain in CogIBM. I then apply all \( \vec{U}_{h} \) to CogIBM in stepwise fashion and record the UIRPCA results for each step.

Finally, I compare the results of applying \( \vec{U}_{h} \) to both \( \vec{U}_{train} \) and \( \overrightarrow{QU}_{train} \) and assess the direction of change in UIRPCA with respect to each set, to satisfy H1 and H3 respectively. The desired ANOVA R2 ≥ .8 and F-test p-value < .05 should indicate a relative degree of confidence in UIRPCA trends. A rejection of H0: \( \Upsilon(\mathrm{UIR}_{PCA}(QU_{train}) \mid \mathrm{UIR}(U_{train})) < \tau_{UIR} = -.2 \) should show a positive quality outcome for UIRPCA.
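
As a sketch of the H1/H3 trend test, a simple linear fit over the recorded steps is shown below. The data values are placeholders, the fit uses plain R2 rather than the adjusted R2 the paper reports, and the paper’s actual analysis used ANOVA fit plots.

```python
# Sketch of the H1/H3 trend test: regress average UIR/WES change on the
# number of training utterances. Data values are placeholders.
from scipy import stats

n_utterances = [5, 10, 15, 20, 25, 30, 35, 38]           # |U_train| per step
avg_uir = [.42, .48, .51, .55, .61, .63, .68, .71]        # placeholder scores

fit = stats.linregress(n_utterances, avg_uir)
r_squared = fit.rvalue ** 2

# Reject H0 (adj. R^2 < .8 and p > .05) only if both criteria are met.
print(f"R^2 = {r_squared:.3f}, p = {fit.pvalue:.4f}")
print("H0 rejected" if r_squared >= .8 and fit.pvalue < .05 else "inconclusive")
```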

4 Results and Discussion

Results are inconclusive for the test of H1. There is a low degree of confidence in a positive direction of UIRPCA with respect to Utrain, despite a p-value < .05. Increasing the number of unqualified random training utterances does not seem to fully explain the change in UIRPCA. Figure 4 shows the average change in WES/UIR scores; Table 2 presents the data.

Fig. 4. Fit plot for average change in UIR/WES as it relates to \( |\vec{U}_{train}| \).

Table 2. Data for average change in UIR/WES as it relates to \( |\vec{U}_{train}| \).

Conversely, results are better for H2. When testing the change in UIRPCA for each Uh, more intent resolution instances occur with fewer, targeted training utterances. Table 3 shows this behavior.

Table 3. Data for average change in UIR/WES as it relates to \( |\overrightarrow{QU}_{train}| \). The tolerance level for the change is .2.

Finally, I satisfy H3, indicating an upward trend in UIRPCA with an R2 value of .93 and a p-value < .05, concluding that adding more targeted, quality training utterances does explain the change in UIRPCA. Figure 5 and Table 4 illustrate the result.

Fig. 5. Fit plot for average change in UIR/WES as it relates to \( |\overrightarrow{QU}_{train}| \).

Table 4. Data for average change in UIR/WES as it relates to \( |\overrightarrow{QU}_{train}| \).

5 Final Thoughts and Future Research

There are clear opportunities to improve outcomes using representational information theory (RIT) as a mechanism to assess fitness between training utterances and intent resolution. A more rigorous process for selecting utterance attributes should bolster results for all three hypotheses. Expanding the study to include human participants using mixed-method instruments will add a favorable degree of randomness missing from the method applied in this exercise. Finally, I would assess an entire Cogx application that employs more than two intents.

Measuring utterance-intent relationships should improve rule-based machine learning algorithms used to prepare Cogx applications. As such, employing UIR discrimination should mitigate interaction brittleness in personal cognitive augmentation.