Skip to main content

Bridging Spark SQL with JDBC

  • Chapter
  • First Online:
Modern Data Engineering with Apache Spark
  • 2221 Accesses

Abstract

In the last chapter, we looked at common patterns and techniques for harnessing the powerful core functionality available to us when transforming data using Spark SQL and the DataFrame APIs. While we certainly covered a lot of ground, we purposefully skipped over some of the more exciting capabilities available to us under the Spark SQL umbrella. Along that line, wouldn’t it seem to only make sense that we should be capable of connecting to and working directly with remote databases from the comfort of Apache Spark SQL? Additionally, wouldn’t it also be advantageous to use SQL's strongly typed semantics when reading data into Spark? Couldn’t we somehow also marry these rich type systems (inherent to Java/Scala) with both SQL and the strong internal typing mechanics of Apache Spark itself? Luckily, that is exactly what you will learn to do in this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+
from €37.37 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

eBook
EUR 17.99
Price includes VAT (Netherlands)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 65.39
Price includes VAT (Netherlands)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Haines, S. (2022). Bridging Spark SQL with JDBC. In: Modern Data Engineering with Apache Spark. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-7452-1_5

Download citation

Publish with us

Policies and ethics