A Study on Competent Crawling Algorithm (CCA) for Web Search to Enhance Efficiency of Information Retrieval

Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 325)

Abstract

Today’s Web is vast and evolves continually. Search engines are the interface for retrieving information from this huge repository. Because accessing information in so large a store is difficult, search engines depend on crawlers to locate and retrieve relevant Web pages. A Web crawler is a software system that systematically finds and retrieves pages from the Web. Crawlers use a variety of Web search algorithms to retrieve pages. This paper studies and examines how crawlers and crawling algorithms work in search engines, and proposes a competent Web search crawling algorithm (CCA), derived from the PageRank and breadth-first search (BFS) Web search algorithms, to make the search for relevant information more efficient.
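The abstract does not spell out how CCA combines its two ingredients, so the following is only a minimal sketch of one plausible combination, not the authors' algorithm. It assumes a small in-memory link graph in place of live HTTP fetching, computes PageRank by power iteration, and then walks the graph breadth-first while visiting higher-ranked pages first within each level. The names `TOY_WEB`, `pagerank`, and `bfs_crawl_order`, and the damping factor 0.85, are illustrative assumptions.

```python
# Hypothetical toy link graph: page -> list of pages it links to.
TOY_WEB = {
    "a": ["b", "c"],
    "b": ["d"],
    "c": ["d", "b"],
    "d": ["a"],
}


def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [outlinks]}."""
    pages = list(graph)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives the same baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in graph.items():
            targets = [t for t in outlinks if t in graph]
            if targets:
                # Split this page's rank evenly among its outlinks.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


def bfs_crawl_order(graph, seed, rank):
    """Breadth-first visit order from seed; ties within a level are
    broken by descending rank, then by page name."""
    visited = {seed}
    order = []
    frontier = [seed]
    while frontier:
        frontier.sort(key=lambda p: (-rank[p], p))
        order.extend(frontier)
        next_frontier = []
        for page in frontier:
            for link in graph.get(page, []):
                if link in graph and link not in visited:
                    visited.add(link)
                    next_frontier.append(link)
        frontier = next_frontier
    return order


ranks = pagerank(TOY_WEB)
crawl_order = bfs_crawl_order(TOY_WEB, "a", ranks)
```

In a real crawler the link graph is discovered incrementally, so ranks would have to be estimated from the pages fetched so far; the sketch sidesteps that by ranking a graph that is already known.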

Keywords

Web search crawling algorithm; BFS Web search algorithm; Web crawler; URL address; CCA

Copyright information

© Springer India 2015

Authors and Affiliations

  1. Department of Computer Science and Engineering, Bharathidasan University, Trichy, India
  2. Teknuance Info Solutions, Chennai, India
