Abstract
This chapter explores support vector machines (SVMs), widely employed supervised learning algorithms recognized for their effectiveness in binary classification tasks. SVMs aim to find an optimal hyperplane (a decision plane that separates objects with different class memberships) that maximizes the margin between data points of different classes. The hyperplane acts as a decision boundary, with one class on each side. The margin represents the perpendicular distance between the hyperplane and the closest points of each class. A larger margin indicates a better separation, while a smaller margin suggests a less optimal decision boundary.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Testas, A. (2023). Support Vector Machine Classification with Pandas, Scikit-Learn, and PySpark. In: Distributed Machine Learning with PySpark. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-9751-3_10
Download citation
DOI: https://doi.org/10.1007/978-1-4842-9751-3_10
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-9750-6
Online ISBN: 978-1-4842-9751-3
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)