Abstract
Many task-oriented chatbots and virtual assistants like Siri, Alexa, and Google Assistant are built as Natural Language (command) Interfaces (NLIs) that allow users to issue natural language (NL) commands to be mapped to some actions for execution in the underlying application in order to accomplish some tasks intended by the users. A fundamental feature of such systems is the ability to understand users’ language and ground them to intended actions (often in symbolic form). Due to their diverse and wide-spread real-world applications, such NLI systems have driven research in language understanding, grounding and human-robot interactions over the years. This chapter discusses the scope for continual and interactive language learning in the context of NLIs and introduces some of the representative works along this direction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J. Andreas, D. Klein, Alignment-based compositional semantics for instruction following, in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1165–1174 (2015)
B.D. Argall, S. Chernova, M. Veloso, B. Browning, A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–483 (2009)
Y. Artzi, L. Zettlemoyer, Weakly supervised learning of semantic parsers for mapping instructions to actions. Trans. Assoc. Comput. Linguist. 1, 49–62 (2013)
C. Baik, H.V. Jagadish, Y. Li, Bridging the semantic gap with SQL query logs in natural language interfaces to databases, in 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 374–385. IEEE (2019)
J. Berant, A. Chou, R. Frostig, P. Liang, Semantic parsing on freebase from question-answer pairs, in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1533–1544 (2013)
Y. Bisk, K. Shih, Y. Choi, D. Marcu, Learning interpretable spatial operations in a rich 3d blocks world, in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018)
S.R.K. Branavan, H. Chen, L. Zettlemoyer, R. Barzilay, Reinforcement learning for mapping instructions to actions, in Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 82–90 (2009)
S.R.K. Branavan, L. Zettlemoyer, R. Barzilay, Reading between the lines: learning to map high-level instructions to commands, in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1268–1277 (2010)
K. Burns, C.D. Manning, L. Fei-Fei, Neural abstructions: abstractions that support construction for grounded language learning (2021). arXiv:2107.09285
J.Y. Chai, Q. Gao, L. She, S. Yang, S. Saba-Sadiya, G. Xu, Language to action: towards interactive task learning with physical agents, in Proceedings of the 27th International Joint Conference on Artificial Intelligence, pp. 2–9 (2018)
S. Chaurasia, R. Mooney, Dialog for language to code, in Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp. 175–180 (2017)
J. Clarke, D. Goldwasser, M.-W. Chang, D. Roth, Driving semantic parsing from the world’s response, in Proceedings of the Fourteenth Conference on Computational Natural Language Learning, pp. 18–27. Association for Computational Linguistics (2010)
G. Csibra, G. Gergely, Social learning and social cognition: the case for pedagogy. Processes of change in brain and cognitive development. Atten. Perform. XXI 21, 249–274 (2006)
H. Daumé, J. Langford, D. Marcu, Search-based structured prediction. Mach. Learn. 75(3), 297–325 (2009)
H. De Vries, F. Strub, S. Chandar, O. Pietquin, H. Larochelle, A. Courville, Guesswhat?! visual object discovery through multi-modal dialogue, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5503–5512 (2017)
L. Dong, C. Quirk, M. Lapata, Confidence modeling for neural semantic parsing, in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 743–753 (2018)
J. Duchi, E. Hazan, Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12(7), (2011)
A. Effenberger, E. Yan, R. Singh, A. Suhr, Y. Artzi, Analysis of language change in collaborative instruction following. Proc. Soc. Comput. Linguist. 5(1), 194–202 (2022)
A. Elgohary, A.H. Awadallah et al., Speak to your parser: interactive text-to-SQL with natural language feedback, in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 2065–2077 (2020)
S. Ferré, Sparklis: an expressive query builder for SPARQL endpoints with guidance in natural language. Semant. Web 8(3), 405–418 (2017)
D. Fried, J. Andreas, D. Klein, Unified pragmatic models for generating and following instructions, in Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 1951–1963 (2018)
D. Fried, R. Hu, V. Cirik, A. Rohrbach, J. Andreas, L.-P. Morency, T. Berg-Kirkpatrick, K. Saenko, D. Klein, T. Darrell, Speaker-follower models for vision-and-language navigation, in Advances in Neural Information Processing Systems, pp. 3314–3325 (2018)
T. Gao, M. Dontcheva, E. Adar, Z. Liu, K.G. Karahalios, Datatone: managing ambiguity in natural language interfaces for data visualization, in Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology, pp. 489–500 (2015)
K.A. Gluck, J.E. Laird, Interactive Task Learning: Agents, Robots, and Humans Acquiring new Tasks Through Natural Interactions. MIT Press (2018)
J. Gray, K. Srinet, Y. Jernite, H. Yu, Z. Chen, D. Guo, S. Goyal, C.L. Zitnick, A. Szlam, Craftassist: a framework for dialogue-enabled interactive agents (2019). arXiv:1907.08584
I. Gur, S. Yavuz, Y. Su, X. Yan, Dialsql: dialogue based structured query generation, in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1339–1349 (2018)
K. Guu, P. Pasupat, E. Liu, P. Liang, From language to programs: bridging reinforcement learning and maximum marginal likelihood, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1051–1062 (2017)
L. He, J. Michael, M. Lewis, L. Zettlemoyer, Human-in-the-loop parsing, in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2337–2342 (2016)
W. Hwang, J. Yim, S. Park, M. Seo, A comprehensive exploration on WikiSQL with table-aware word contextualization (2019). arXiv:1902.01069
S. Iyer, I. Konstas, A. Cheung, J. Krishnamurthy, L. Zettlemoyer, Learning a neural semantic parser from user feedback, in 55th Annual Meeting of the Association for Computational Linguistics (2017)
C. Lawrence, S. Riezler, Nlmaps: a natural language interface to query openstreetmap, in Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations, pp. 6–10 (2016)
Y. Li, D. Rafiei, Natural language data management and interfaces. Synth. Lect. Data Manag. 10(2), 1–156 (2018)
J. Li, W. Wang, W.-S. Ku, Y. Tian, H. Wang, Spatialnli: a spatial domain natural language interface to databases using spatial comprehension, in Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 339–348 (2019)
P. Liang, Learning executable semantic parsers for natural language understanding. Commun. ACM 59(9), 68–76 (2016)
F. Li, H.V. Jagadish, Constructing an interactive natural language interface for relational databases. Proc. VLDB Endow. 8(1), 73–84 (2014)
X.V. Lin, C. Wang, L. Zettlemoyer, M.D. Ernst, Nl2bash: a corpus and semantic parser for natural language interface to the linux operating system, in Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018)
C. Liu, S. Yang, S. Saba-Sadiya, N. Shukla, Y. He, S.-C. Zhu, J. Chai, Jointly learning grounded task structures from language instruction and visual demonstration, in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1482–1492 (2016)
B. Liu, S. Mazumder, Lifelong and continual learning dialogue systems: learning during conversation. Proc. AAAI Conf. Artif. Intell. 35, 15058–15063 (2021)
S. Machines, J. Andreas, J. Bufe, D. Burkett, C. Chen, J. Clausman, J. Crawford, K. Crim, J. DeLoach, L. Dorner, J. Eisner, H. Fang, A. Guo, D. Hall, K. Hayes, K. Hill, D. Ho, W. Iwaszuk, S. Jha, D. Klein, J. Krishnamurthy, T. Lanman, P. Liang, C.H. Lin, I. Lintsbakh, A. McGovern, A. Nisnevich, A. Pauls, D. Petters, B. Read, D. Roth, S. Roy, J. Rusak, B. Short, D. Slomin, B. Snyder, Y.S. Stephon Striplin, Z. Tellman, S. Thomson, A. Vorobev, I. Witoszko, J. Wolfe, A. Wray, Y. Zhang, A. Zotov, Task-oriented dialogue as dataflow synthesis. Trans. Assoc. Comput. Linguist. 8, 556–571 (2020). https://doi.org/10.1162/tacl_a_00333. (September)
M. MacMahon, B. Stankiewicz, B. Kuipers, Walk the talk: Connecting language, knowledge, and action in route instructions. Def 2(6), 4 (2006)
S. Mazumder, On-the-job continual and interactive learning of factual knowledge and language grounding. Ph.D. thesis, University of Illinois at Chicago, 2021
S. Mazumder, B. Liu, S. Wang, S. Esmaeilpour, An application-independent approach to building task-oriented chatbots with interactive continual learning, in NeurIPS-2020 Workshop on Human in the Loop Dialogue Systems (2020)
S. Mazumder, O. Riva, Flin: a flexible natural language interface for web navigation, in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2777–2788 (2021)
W.P. McCarthy, R. Hawkins, H. Wang, C. Holdaway, J.E. Fan, Learning to communicate about shared procedural abstractions, in Proceedings of the Annual Meeting of the Cognitive Science Society, vol. 43 (2021)
D. Misra, J. Langford, Y. Artzi, Mapping instructions and visual observations to actions with reinforcement learning, in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 1004–1015 (2017)
S. Mohan, J. Laird, Learning goal-oriented hierarchical tasks from situated interactive instruction, in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28 (2014)
A. Mohseni-Kabir, C. Li, W. Victoria, D. Miller, B. Hylak, S. Chernova, D. Berenson, C. Sidner, C. Rich, Simultaneous learning of hierarchy and primitives for complex robot tasks. Auton. Robot. 43, 859–874 (2019)
A. Narayan-Chen, P. Jayannavar, J. Hockenmaier, Collaborative dialogue in minecraft. in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5405–5415 (2019)
A. Neelakantan, Q.V. Le, M. Abadi, A. McCallum, D. Amodei, Learning a natural language interface with neural programmer, in International Conference on Learning Representations (2017)
P. Pasupat, T.-S. Jiang, E. Liu, K. Guu, P. Liang, Mapping natural language commands to web elements, in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4970–4976 (2018)
P. Pasupat, P. Liang, Compositional semantic parsing on semi-structured tables, in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1470–1480 (2015)
Q. Ping, F. Niu, G. Thattai, J. Chengottusseriyil, Q. Gao, A. Reganti, P. Rajagopal, G. Tur, D. Hakkani-Tur, P. Nataraja, Interactive teaching for conversational AI (2020). arXiv:2012.00958
S. Ross, D. Bagnell, Efficient reductions for imitation learning, in Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 661–668 (2010)
S. Ross, G. Gordon, D. Bagnell, A reduction of imitation learning and structured prediction to no-regret online learning, in Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp. 627–635 (2011)
R. Rubavicius, A. Lascarides, Interactive symbol grounding with complex referential expressions, in Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 4863–4874 (2022)
P.E. Rybski, K. Yoon, J. Stolarz, M.M. Veloso, Interactive robot task training through dialog and demonstration, in Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, pp. 49–56 (2007)
M. Scheutz, E. Krause, B. Oosterveld, T. Frasca, R. Platt, Spoken instruction-based one-shot object and action learning in a cognitive robotic architecture, in Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pp. 1378–1386 (2017)
V. Setlur, S.E. Battersby, M. Tory, R. Gossweiler, A.X. Chang, Eviza: a natural language interface for visual analysis, in Proceedings of the 29th Annual Symposium on User Interface Software and Technology, pp. 365–377 (2016)
L. She, J. Chai, Incremental acquisition of verb hypothesis space towards physical world interaction, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 108–117 (2016)
L. She, J. Chai, Interactive learning of grounded verb semantics towards human-robot communication, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1634–1644 (2017)
L. She, S. Yang, Y. Cheng, Y. Jia, J. Chai, N. Xi, Back to the blocks world: Learning new actions through situated human-robot dialogue, in Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), pp. 89–97 (2014)
R. Shekhar, A. Venkatesh, T. Baumgärtner, E. Bruni, B. Plank, R. Bernardi, R. Fernández, Beyond task success: a closer look at jointly learning to see, ask, and guesswhat, in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 2578–2587 (2019)
K. Soh, TagUI: RPA/CLI tool for automating user interactions (2017), https://github.com/ kelaberetiv/TagUI
K. Srinet, Y. Jernite, J. Gray, A. Szlam, Craftassist instruction parsing: Semantic parsing for a voxel-world assistant, in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 4693–4714 (2020)
Y. Su, A.H. Awadallah, M. Khabsa, P. Pantel, M. Gamon, M. Encarnacion, Building natural language interfaces to web APIS, in Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 177–186 (2017)
Y. Su, A.H. Awadallah, M. Wang, R.W. White, Natural language interfaces with fine-grained user interaction: a case study on web APIS, in The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 855–864 (2018)
A. Suglia, A. Vergari, I. Konstas, Y. Bisk, E. Bastianelli, A. Vanzo, O. Lemon, Imagining grounded conceptual representations from perceptual information in situated guessing games, in The 28th International Conference on Computational Linguistics, pp. 1090–1102. International Committee on Computational Linguistics (2020)
R.S. Sutton, A.G. Barto, Reinforcement Learning: an Introduction. MIT press (2018)
J.J. Tehrani, F. Riede, Towards an archaeology of pedagogy: learning, teaching and the generation of material culture traditions. World Archaeol. 40(3), 316–331 (2008)
S. Tellex, T. Kollar, S. Dickerson, M. Walter, A. Banerjee, S. Teller, N. Roy, Understanding natural language commands for robotic navigation and mobile manipulation. Proc. AAAI Conf. Artif. Intell. 25, 1507–1514 (2011)
S. Tellex, N. Gopalan, H. Kress-Gazit, C. Matuszek, Robots that use language. Annu. Rev. Control. Robot. Auton. Syst. 3, 25–55 (2020)
J. Thomason, S. Zhang, R. Mooney, P. Stone, Learning to interpret natural language commands through human-robot dialog, in Proceedings of the 24th International Conference on Artificial Intelligence, pp. 1923–1929 (2015)
A.L. Thomaz, M. Cakmak, Learning about objects with human teachers, in Proceedings of the 4th ACM/IEEE International Conference on Human Robot Interaction, pp. 15–22 (2009)
G. Tolkachev, S. Mell, S. Zdancewic, O. Bastani, Counterfactual explanations for natural language interfaces, in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 113–118 (2022)
P. Utama, N. Weir, F. Basik, C. Binnig, U. Cetintemel, B. Hättasch, A. Ilkhechi, S. Ramaswamy, A. Usta, An end-to-end neural natural language interface for databases (2018). arXiv:1804.00401
A. Vogel, D. Jurafsky, Learning to follow navigational directions, in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 806–814 (2010)
S.I. Wang, S. Ginn, P. Liang, C.D. Manning, Naturalizing a programming language via interactive learning, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 929–938 (2017)
J.X. Wang, Z. Kurth-Nelson, D. Tirumala, H. Soyer, J.Z. Leibo, R. Munos, C. Blundell, D. Kumaran, M. Botvinick, Learning to reinforcement learn (2016). arXiv:1611.05763
T. Winograd, Understanding natural language (Cogn, Psychol, 1972)
L. Wittgenstein, Philosophical Investigations. John Wiley & Sons (2010)
H. Xiong, R. Sun, Transferable natural language interface to structured queries aided by adversarial generation, in 2019 IEEE 13th International Conference on Semantic Computing (ICSC), pp. 255–262. IEEE (2019)
Z. Yao, X. Li, J. Gao, B. Sadler, H. Sun, Interactive semantic parsing for if-then recipes via hierarchical reinforcement learning. Proc. AAAI Conf. Artif. Intell. 33, 2547–2554 (2019)
Z. Yao, Y. Su, H. Sun, W.-T. Yih, Model-based interactive semantic parsing: a unified framework and a text-to-SQL case study, in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5450–5461 (2019b)
Z. Yao, Y. Tang, W.-T. Yih, H. Sun, Y. Su, An imitation game for learning semantic parsers from user interaction, in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6883–6902 (2020)
W.-T. Yih, M.-W. Chang, X. He, J. Gao, Semantic parsing via staged query graph generation: question answering with knowledge base, in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1321–1331 (2015)
Y. Yu, A. Eshghi, G. Mills, O. Lemon, The burchak corpus: a challenge data set for interactive learning of visually grounded word meanings, in The 6th Workshop on Vision and Language, p. 1 (2017)
T. Yu, R. Zhang, K. Yang, M. Yasunaga, D. Wang, Z. Li, J. Ma, I. Li, Q. Yao, S. Roman et al., Spider: a large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task, in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 3911–3921 (2018)
J.M. Zelle, R.J. Mooney, Learning to parse database queries using inductive logic programing, in Proceedings of the AAAI Conference on Artificial Intelligence, pp. 1050–1055 (1996)
L.S. Zettlemoyer, M. Collins, Learning to map sentences to logical form: structured classification with probabilistic categorial grammars, in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence, pp. 658–666 (2005)
L. Zettlemoyer, M. Collins, Online learning of relaxed CCG grammars for parsing to logical form, in Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 678–687 (2007)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mazumder, S., Liu, B. (2024). Continuous and Interactive Language Learning and Grounding. In: Lifelong and Continual Learning Dialogue Systems. Synthesis Lectures on Human Language Technologies. Springer, Cham. https://doi.org/10.1007/978-3-031-48189-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-031-48189-5_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48188-8
Online ISBN: 978-3-031-48189-5
eBook Packages: Synthesis Collection of Technology (R0)