Web-Document Filtering Using Concept Graph

  • Malrey Lee
  • Eun-Kwan Kang
  • Thomas M. Gatton
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3983)


This paper introduces a retrieval method based on conceptual graph. A hyperlink information is essential to construct conceptual graph. The information is very useful as it provides summary and further linkage to construct conceptual graph that has been provided by human. It also has a property which shows review, relation, hierarchy, generality, and visibility. Using this property, we extracted the keywords of web documents and made up of the conceptual graph among the keywords sampled from web pages. This paper extracts the keywords of web pages using anchor text one out of hyperlink information and makes hyperlink of web pages abstract as the link relation between keywords of each web page. I suggest this useful retrieval method providing querying word extension or domain knowledge by conceptual graph of keywords.


Query Processing Retrieval Method Concept Graph Index File Query Processor 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Berry, M.J.A., Linoff, G.: link analysis. In: Data Mining Techniques: For marketing, Sales, and Customer Support, pp. 216–242. Wiley Computer Publishing, Chichester (1998)Google Scholar
  2. 2.
    Brin, S., Page, L.: The Anatomy of a Large-Scale Hyper textual Web Search Engine. In: Proceeding of the seventh International World Wide Web Conference (2002)Google Scholar
  3. 3.
    Callan, J.P., Bruce Croft, W., Harding, S.M.: The INQUERY Retrieval System. Database and Expert Systems Applications, 78–83 (1992)Google Scholar
  4. 4.
    Carrie, J., Kazman, R.: WebQuery: Searching and Visualizing the Web through Connectivity. In: Proceeding of the Sixth International World Wide Web Conference (1997)Google Scholar
  5. 5.
    Chen, H., Schuffels, C., Orwig, R.: Internet Categorization and search: A Self Organizing Approach. Journal of Visual Communication and Image Representation 7, 88–102 (1996)CrossRefGoogle Scholar
  6. 6.
    Klienberg, J.M.: Authoritative Sources in a Hyperlinked Environment. In: Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, pp. 668–677 (1998)Google Scholar
  7. 7.
    Koster, M.: ALIWEB: Archie-like indexing in the web. Computer Networks and ISDN Systems 27, 175–182 (1994)CrossRefGoogle Scholar
  8. 8.
    Marchiori, M.: The Quest for Correct Information on the Web: Hyper Search Engines. In: Proceeding of the Sixth International World Wide Web Conference (1997)Google Scholar
  9. 9.
    Mauldin, L.: Web-agent related research at the CMT. In: Proceedings of the ACM Special nterest Group on Networked Information Discovery and retrival (1994)Google Scholar
  10. 10.
    Neusess, C., Kent, R.E.: Conceptual Analysis of Resource Meta-information. Computer Networks and ISDN Systems 27, 973–984 (1995)CrossRefGoogle Scholar
  11. 11.
    Salton, G.: Developments in automatic text retrieval. Science 253, 974–979 (1991)CrossRefMathSciNetGoogle Scholar
  12. 12.
    Weiss, R., Velesz, B., Sheldon, M.A.: HyPursuit: A Hierarchical Network Search Engine that Exploits Content-Link Hypertext Clustering. In: ACM Conference on Hypertext, pp. 180–193 (2001)Google Scholar
  13. 13.
    Yuwono, B., Lee, K.L.: Search and Ranking Algorithms for Locating Resources on the World Wide Web. In: International Conference on Data Engineering, pp. 164–171 (2000)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Malrey Lee
    • 1
  • Eun-Kwan Kang
    • 2
  • Thomas M. Gatton
    • 3
  1. 1.School of Electronics & Information EngineeringChonbuk National UniversityChonBukKorea
  2. 2.Depart of Multimedia EngineeringJeonJu UniversityKorea
  3. 3.School of Engineering and TechnologyNational UniversityLa JollaUSA

Personalised recommendations