Abstract
In this chapter, we move to a new area of machine learning: processing text data and applying an algorithm to it. This area is known as natural language processing (NLP), and it has many business applications, including speech recognition, chatbots, language translation, and email spam detection (classifying messages as ham or spam).
Copyright information
© 2023 The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Testas, A. (2023). Natural Language Processing with Pandas, Scikit-Learn, and PySpark. In: Distributed Machine Learning with PySpark. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-9751-3_14
DOI: https://doi.org/10.1007/978-1-4842-9751-3_14
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-9750-6
Online ISBN: 978-1-4842-9751-3
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)