Abstract
In this chapter, we describe the structure and use of the Apache Lucene and Solr third-party search engine components, how to use them with Hadoop, and how to develop advanced search capability customized for an analytical application. We will also investigate some newer Lucene-based search frameworks, primarily Elasticsearch, a premier search tool particularly well-suited towards building distributed analytic data pipelines. We will also discuss the extended Lucene/Solr ecosystem and some real-world programming examples of how to use Lucene and Solr in distributed big data analytics applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 Kerry Koitzsch
About this chapter
Cite this chapter
Koitzsch, K. (2017). Advanced Search Techniques with Hadoop, Lucene, and Solr. In: Pro Hadoop Data Analytics . Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1910-2_6
Download citation
DOI: https://doi.org/10.1007/978-1-4842-1910-2_6
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-1909-6
Online ISBN: 978-1-4842-1910-2
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)