Flash Webpage Segmentation Based on Image Perception Using DWT and Morphological Operations

  • A. Krishna Murthy
  • K. S. Raghunandan
  • S. Suresha
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 259)

Abstract

Web page segmentation is an important step for many applications such as Information Retrieval, Noise Removal, Full Text Search, Information Extraction, and Automatic page adaptation and so on can benefit from this structure. Many segmentation methods have been proposed on HTML Web page segmentation whereas Flash Web pages have been omitted because of their less availability. But in recent days, we can see many Flash Web pages taking their appearance. In this paper, we are proposing segmentation method by using image processing techniques after processing Web pages as images, because of their unavailability of semantic structure. We perform the experimental analysis based on ground truth analysis (actual blocks in Web page as per human perception) and obtained the better performance level. We also measure the usefulness of Flash Web page blocks.

Keywords

Web page segmentation Flash web image segmentation Web blocks Haar wavelet 

References

  1. 1.
    Krishna Murthy, A., Suresha, S.: Comparative study on browsing on small screen devices. Int. J. Mach. Intell. 3(4), 354–358 (2011). ISSN: 0975–2927 and E-ISSN: 0975–9166Google Scholar
  2. 2.
    Book: Ed Tittel Complete Coverage of XML. Tata McGraw-Hill EditionGoogle Scholar
  3. 3.
  4. 4.
    Cai, D., Yu, S., Wen, J.-R., Ma, W.-Y.: VIPS: a vision based page segmentation algorithm. Technical Report MSR-TR-2003-79 (2003)Google Scholar
  5. 5.
    Xiang, P.F., Yang, X., Shi, Y.C.: Web page segmentation based on gestalt theory. In: Conference on Multimedia and Expo, pp. 2253–2256. IEEE (2007)Google Scholar
  6. 6.
    Chakrabarti, D., Kumar, R., Punera, K.: Small a graph-theoretic approach to webpage segmentation. In: 17th International Conference on WWW (2008)Google Scholar
  7. 7.
    Liu, X., Zhang, X., Tian, Y.: Webpage segmentation based on Gomory-Hu tree clustering in undirected planar graph. NSFC (2010)Google Scholar
  8. 8.
    Krishna Murthy, A., Suresha, S., Anil Kumar, K.M.: Analysis of issues in adapting web contents on mobile devices. In: International Conference on Data Mining and Warehousing. Elsevier Publications (2013)Google Scholar
  9. 9.
    Audithan, S., Chandrasekaran, R.M.: Document text extraction from document images using haar discrete wavelet transform. Eur. J. Sci. Res. 36(4), 502–512 (2009). ISSN: 1450-216XGoogle Scholar
  10. 10.
    Kovacevic, M., Dilligenti, M.: Recognition of common areas in a web page using visual information: a possible application in a page classification. In: 2nd IEEE International Conference on Data Mining (ICDM’02), p. 250 (2002)Google Scholar

Copyright information

© Springer India 2014

Authors and Affiliations

  • A. Krishna Murthy
    • 1
  • K. S. Raghunandan
    • 1
  • S. Suresha
    • 1
  1. 1.Department of Studies in Computer ScienceUniversity of MysoreMysoreIndia

Personalised recommendations