Abstract
In Chapter 3, we explored the architecture of neural conversational systems. In this chapter, we explore Large Language Models (LLMs) which are used in this architecture to process the user’s inputs and generate responses by the system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For more discussion, see the paper Choosing the right language model for your NLP use case,https://towardsdatascience.com/choosing-the-right-language-model-for-your-nlp-use-case-1288ef3c4929
- 2.
- 3.
Based on Julien Simon, Large Language Models: A New Moore’s Law? https://huggingface.co/blog/large-language-models
- 4.
For further details, see the paper Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond, https://arxiv.org/abs/2304.13712
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
McTear, M., Ashurkina, M. (2024). Large Language Models. In: Transforming Conversational AI. Apress, Berkeley, CA. https://doi.org/10.1007/979-8-8688-0110-5_4
Download citation
DOI: https://doi.org/10.1007/979-8-8688-0110-5_4
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 979-8-8688-0109-9
Online ISBN: 979-8-8688-0110-5
eBook Packages: Business and ManagementApress Access BooksBusiness and Management (R0)