Skip to main content

Query Optimization: Issues and Challenges in Mining of Distributed Data

  • Conference paper
  • First Online:
Big Data Analytics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 654))

Abstract

The technique of finding the optimal processing method to answer a query is called Query optimization, whereas a collection of various sites, distributed over a computer network is called Distributed Database. In Distributed Database, the site communicates with each other through networks. There are various issues arise during evaluation of query cost, among which the processing cost and a transmission cost are important. There are several algorithms developed to find the best possible solution for a particular query, but they all have their certain limitations. The optimizer is mainly concern on search space, search strategy, and the cost model. It primarily focuses on these three factors. The mining cost of a query depends on the order of evaluation of the operators, for the same query we can have different cost if the order is changed. Hence, to find the optimal cost for a particular query is emerging as an open challenge for many researchers. Therefore, the cost-based query optimization technique has emerged as an important concept for dealing with the query optimization. This paper explores the issues and challenges of query optimization in mining of distributed data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Kossmann, D.: The state of the art in distributed query processing. ACM Comput. Surv. 32(4):422–469 (2000)

    Google Scholar 

  2. Lew, M.S., Sebe, N., Djeraba, C., Jain, R.: Content-based multimedia information retrieval: state of the art and challenges. ACM Trans. Multimedia Comput. Commun. Appl. 2(1):1–19 (2006)

    Google Scholar 

  3. Kossmann, D., Stocker, K.: Iterative dynamic programming: a new class of query optimization algorithms. ACM Trans. Database Syst. 25(1):43–82 (2000)

    Google Scholar 

  4. Kumar, T.V.V., Singh, V., Verma, A.K.: Distributed query processing plans generation using genetic algorithm. Int. J. Comput. Theor. Eng. 3(1):1793–8201 (2011)

    Google Scholar 

  5. Chaudhuri, S., Shim, K.: Optimization of queries with user defined predicates. ACM Trans. Database Syst. 24(2):177–228 (1999)

    Google Scholar 

  6. Drenick, P.E., Smith, E.J.: Stochastic query optimization in distributed database. ACM Trans. Database Syst. 18(2):262–288 (1993)

    Google Scholar 

  7. Chen, B.C., Ramakrishnan, R.: Bellwether analysis: searching for cost-effective query-defined predictors in large databases. ACM Trans. Knowl. Discov. Data. 3(1), Article 5 (2009)

    Google Scholar 

  8. Patil1, R., Chen, Z., Shi, Y.: Database keyword search: a perspective from optimization. In: 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, pp 30–33

    Google Scholar 

  9. Kosala, R., Blockeel, H.: Web mining research: a survey. ACM SIGKDD 2(1):1–15 (2000)

    Google Scholar 

  10. Pentaris, F., Ioannidis, Y.: Query optimization in distributed networks of autonomous database systems. ACM Trans. Database Syst. 31(2):537–583 (2006)

    Google Scholar 

  11. Liao, S.H., Chu, P.H., Hsiao, P.Y.: Data mining techniques and applications—a decade review from 2000 to 2011. Expert Syst. Appl. 39:11303–11311 (2012)

    Google Scholar 

  12. Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12) (2000)

    Google Scholar 

  13. Barrena, M., Jurado, E., Márquez-Neila, P., Pachón, C.: A flexible framework to ease nearest neighbor search in multidimensional data spaces. J. Data Knowl. Eng. 69:116–136 (2010)

    Google Scholar 

  14. Ozsoyoglu, G., Altingovde, I.S., Al-hamdani, A., Ozel, S.A., Ulusoy, O., Ozsoyoglu, Z.M.: Querying web metadata: native score management and text support in databases. ACM Trans. Database Syst. 29(4):581–634 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pramod Kumar Yadav .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Yadav, P.K., Rizvi, S. (2018). Query Optimization: Issues and Challenges in Mining of Distributed Data. In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore. https://doi.org/10.1007/978-981-10-6620-7_67

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6620-7_67

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6619-1

  • Online ISBN: 978-981-10-6620-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics