Abstract
So far, we have queried data inside our SQL Server Big Data Cluster using external tables and T-SQL code. There is, however, another way to query data stored in the HDFS filesystem of the Big Data Cluster. As you read in Chapter 2, Big Data Clusters also include Spark in their architecture, which means we can leverage the power of Spark to query data stored inside the cluster.
Copyright information
© 2020 Benjamin Weissman and Enrico van de Laar
About this chapter
Cite this chapter
Weissman, B., van de Laar, E. (2020). Working with Spark in Big Data Clusters. In: SQL Server Big Data Clusters. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-5985-6_6
DOI: https://doi.org/10.1007/978-1-4842-5985-6_6
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-5984-9
Online ISBN: 978-1-4842-5985-6
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)