BRWM: A relevance feedback mechanism for web page clustering

  • Ioannis Anagnostopoulos
  • Christos Anagnostopoulos
  • Dimitrios D. Vergados
  • Ilias Maglogiannis
Part of the IFIP International Federation for Information Processing book series (IFIPAICT, volume 204)

Abstract

This paper describes an information system that classifies web pages into specific categories according to a proposed relevance feedback mechanism. The mechanism, called the Balanced Relevance Weighting Mechanism (BRWM), uses the proportion of information already categorized as relevant for feature classification. Experimental measurements over an e-commerce framework, which describes the fundamental phases of web commercial transactions, verified the robustness of the mechanism on real data. Besides revealing the sequences accomplished in a web commerce transaction, the system can be used as an assistant and consultation tool for classification purposes. In addition, BRWM was compared with a similar relevance feedback mechanism from the literature over the established Reuters-21578 text categorization test collection, with promising results.

Copyright information

© International Federation for Information Processing 2006

Authors and Affiliations

  • Ioannis Anagnostopoulos (1)
  • Christos Anagnostopoulos (2)
  • Dimitrios D. Vergados (1)
  • Ilias Maglogiannis (1)

  1. Department of Information and Communication Systems Engineering, University of the Aegean, Karlovassi, Samos, Greece
  2. Department of Cultural Technology and Communication, Mytiline, Lesvos, Greece
