Continuous and Interactive Language Learning and Grounding

Mazumder, Sahisnu; Liu, Bing

doi:10.1007/978-3-031-48189-5_4

Part of the book series: Synthesis Lectures on Human Language Technologies (SLHLT)

Abstract

Many task-oriented chatbots and virtual assistants, such as Siri, Alexa, and Google Assistant, are built as Natural Language (command) Interfaces (NLIs): users issue natural language (NL) commands, and the system maps each command to actions that the underlying application executes to accomplish the user’s intended task. A fundamental capability of such systems is understanding the user’s language and grounding it in the intended actions (often represented in symbolic form). Because of their diverse and widespread real-world applications, NLI systems have driven research in language understanding, grounding, and human-robot interaction over the years. This chapter discusses the scope for continual and interactive language learning in the context of NLIs and introduces some representative works in this direction.

Author information

Authors and Affiliations

Intel Labs, Santa Clara, CA, USA
Sahisnu Mazumder
University of Illinois Chicago, Chicago, IL, USA
Bing Liu

Corresponding author

Correspondence to Sahisnu Mazumder.

About this chapter

Cite this chapter

Mazumder, S., Liu, B. (2024). Continuous and Interactive Language Learning and Grounding. In: Lifelong and Continual Learning Dialogue Systems. Synthesis Lectures on Human Language Technologies. Springer, Cham. https://doi.org/10.1007/978-3-031-48189-5_4

DOI: https://doi.org/10.1007/978-3-031-48189-5_4
Published: 09 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48188-8
Online ISBN: 978-3-031-48189-5
eBook Packages: Synthesis Collection of Technology (R0)
