Web Pages Reordering and Clustering Based on Web Patterns

  • Miloš Kudělka
  • Václav Snášel
  • Ondřej Lehečka
  • Eyas El-Qawasmeh
  • Jaroslav Pokorný
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4910)


This paper proposes a method for describing web pages using web patterns. We explain what we mean by the term "web pattern", present a taxonomy of web patterns, and describe some of their types. In describing web patterns, we focus on the properties that are useful for detecting them automatically on web pages. The result of the detection is a description of a web page in terms of the web patterns found on it. This description can be used to reorder and cluster a set of web pages.
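The abstract does not specify how the pattern-based description is turned into an ordering or a clustering, so the following is only a minimal sketch under stated assumptions: each page is represented as a vector of pattern detection scores, pages are compared by cosine similarity, and clusters are formed by a simple greedy pass. The pattern names, scores, page names, and the functions `cosine`, `reorder`, and `cluster` are all hypothetical and are not taken from the paper.

```python
from math import sqrt

# Hypothetical pattern names; each page vector holds one score per pattern.
PATTERNS = ["price_info", "purchase_possibility", "discussion", "technical_features"]

# Hypothetical detection scores in [0, 1], one vector per page (illustrative data only).
pages = {
    "shop-a": [0.9, 0.8, 0.1, 0.6],
    "shop-b": [0.8, 0.9, 0.0, 0.4],
    "forum-a": [0.0, 0.1, 0.9, 0.2],
    "review-a": [0.3, 0.1, 0.7, 0.8],
}


def cosine(u, v):
    """Cosine similarity of two equal-length score vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def reorder(pages, profile):
    """Sort page names by similarity to a desired pattern profile, best match first."""
    return sorted(pages, key=lambda name: cosine(pages[name], profile), reverse=True)


def cluster(pages, threshold=0.75):
    """Greedy single-pass clustering: a page joins the first cluster whose
    seed (first member) is at least `threshold` similar; otherwise it starts a new one."""
    clusters = []
    for name, vec in pages.items():
        for members in clusters:
            if cosine(vec, pages[members[0]]) >= threshold:
                members.append(name)
                break
        else:
            clusters.append([name])
    return clusters


# Prefer pages that show price information and a purchase possibility.
ranked = reorder(pages, [1, 1, 0, 0])
groups = cluster(pages)
```

With this toy data, `ranked` lists the two shop pages first, and `groups` separates the shop pages from the discussion-oriented pages. Greedy clustering is used here only because it is short; any standard clustering method could be applied to the same vectors.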





Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Miloš Kudělka (1)
  • Václav Snášel (1)
  • Ondřej Lehečka (1)
  • Eyas El-Qawasmeh (2)
  • Jaroslav Pokorný (3)

  1. Department of Computer Science, VŠB-Technical University of Ostrava, Ostrava-Poruba, Czech Republic
  2. Computer Science Dept., Jordan University of Science and Technology, Irbid, Jordan
  3. Department of Software Engineering, Charles University of Prague, Czech Republic
