Selective Dissemination of XML Documents Using GAs and SVM
XML has emerged as a medium for interoperability over the Internet. As the number of documents published in the form of XML is increasing there is a need for selective dissemination of XML documents based on user interests. In the proposed technique, a combination of Self Adaptive Migration Model Genetic Algorithm (SAMGA) and multi class Support Vector Machine (SVM) are used to learn a user model. Based on the feedback from the users the system automatically adapts to the user’s preference and interests. The user model and a similarity metric are used for selective dissemination of a continuous stream of XML documents. Experimental evaluations performed over a wide range of XML documents indicate that the proposed approach significantly improves the performance of the selective dissemination task, with respect to accuracy and efficiency.
Unable to display preview. Download preview PDF.
- 1.Altinel, M., Franklin, M.: Efficient Filtering of XML Documents for Selective Dissemination of Information. In: Intl. Conf. on Very Large Databases (VLDB 2000), Cairo, Egypt, pp. 53–64 (2000)Google Scholar
- 2.Stanoi, I., Mihaila, G., Padmanabhan, S.: A Framework for Selective Dissemination of XML documents based on inferred user profiles. In: Proc. of the nineteenth International Conference on Data Engineering, ICDE 2003 (2003)Google Scholar
- 3.Yang, J., Korfhage, R.R.: Effects of Query Term Weights Modification in Annual Document Retrieval: A Study Based on a Genetic Algorithm. In: Proc. of the Second Symposium on Document Analysis and Information Retrieval, pp. 271–185 (1993)Google Scholar
- 4.Joachims, T.: Transductive inference for text classification using support vector machines. In: Machine Learning - Proc. Sixteenth Int’l. Conf. (ICML 1999), San Francisco, CA, USA, pp. 200–209 (1999)Google Scholar