A fuzzy extension of the XPath query language
Today the current state of the art in querying XML data is represented by XPath and XQuery, both of which rely on Boolean conditions for node selection. Boolean selection is too restrictive when users do not use or even know the data structure precisely, e.g. when queries are written based on a summary rather than on a schema. In this paper we describe a XML querying framework, called FuzzyXPath, based on Fuzzy Set Theory, which relies on fuzzy conditions for the definition of flexible constraints on stored data. A function called “deep-similar” is introduced to replace XPath’s typical “deep-equal” function. The main goal is to provide a degree of similarity between two XML trees, assessing whether they are similar both structure-wise and content-wise. Several query examples are discussed in the field of XML based metadata for e-learning.
KeywordsApproximate querying XPath Fuzzy set theory Flexible constraints
- Amer-Yahia, S., Cho, S., & Srivastava, D. (2002). Tree pattern relaxation. In C. S. Jensen, K. G. Jeffery, J. Pokorńy S. Saltenis, E. Bertino, K. Böhm, et al. (Eds.), Proceedings of the 8th international conference on extending database technology: Advances in database technology (25 – 27 March 2002). Extending database technology (Vol. 2287, pp. 496–513). London: Springer-Verlag.Google Scholar
- Bosc, P., & Pivert, O. (1992). Fuzzy querying in conventional databases. In L. A. Zadeh & J. Kacprzyk (Eds.), Fuzzy logic for the management of uncertainty (pp. 645–671). New York: Wiley.Google Scholar
- Bosc, P., Lietard, L., & Pivert, O. (1994). Soft querying, a new feature for database management systems. In D. Karagiannis (Ed.), Proceedings of the 5th international conference on database and expert systems applications (07 – 09 September 1994). Lecture notes in computer science, (Vol. 856, pp. 631–640). London: Springer-Verlag.Google Scholar
- Buche, P., Dibie-Barthèlemy, J., & Wattez, F. (2006). Approximate querying of XML fuzzy data. In Springer (Ed.), Proceedings of the 7th international conference FQAS 2006, (Vol. 4027/2006). Milan, Italy.Google Scholar
- Ciaccia, P., & Penzo, W. (2003). The collection index to support complex approximate queries. In Verlag, S. (Ed.), Proceedings of XSym 2003 (Vol. 2824, pp. 164–179).Google Scholar
- Damiani, E., & Tanca, L. (2000). Blind queries to XML data. In MT. Ibrahim, J. Küng, & N. Revell, (Eds.), Proceedings of the 11th international conference on database and expert systems applications (04 – 08 September 2000). Lecture notes in computer science (Vol. 1873, pp. 345–356). London: Springer-Verlag.Google Scholar
- Damiani, E., Lavarini, N., Oliboni, B., & Tanca, L. (2004). An approximate query environment for XML data. In V. Loia, M. Nikravesh, & L. Zadeh (Eds.), Fuzzy logic and the internet, studies in fuzziness and soft computing (Vol. 137, pp. 71–94). Berlin: Springer.Google Scholar
- Damiani, E., Oliboni, B., & Tanca, L. (2001). Fuzzy techniques for XML data smushing. In Proceedings of the international conference, 7th fuzzy days on computational intelligence, theory and applications (pp. 637–652). London: Springer.Google Scholar
- Damiani, E., Tanca, L., & Arcelli-Fontana, F. (2000). Fuzzy XML queries via context-based choice of aggregations. Kybernetika, 16(3).Google Scholar
- Galindo, J., Medina, J., Pons, O., & Cubero, J. (1998). A server for fuzzy SQL queries. In T. Andreasen, H. Christiansen, & H.L. Larsen (Eds.), Proceedings of the third international conference on flexible query answering systems (13 – 15 May 1998). Lecture notes in computer science, (Vol. 1495, 164–175). London: Springer-Verlag.Google Scholar
- Li, H. G., Aghili, S. A., Agrawal, D., & Abbadi, A. E. (2006). FLUX: Fuzzy content and structure matching of XML range queries. In Proceedings of WWW 2006, May 23-26, 2006. Edinburgh, Scotland.Google Scholar
- Mandreoli, F., Martoglia, R., & Tiberio, P. (2004). Approximate query answering for a heterogeneous XML document base. In Springer (Ed.), Proceedings of the 5th int. conf on web information systems engineering. Brisbane, Australia, November 22–24.Google Scholar
- Mouchaweh, M. S. (2004). Diagnosis in real time for evolutionary processes in using pattern recognition and possibility theory (invited paper). International Journal of Computational Cognition, 2(1), 79–112. ISSN 1542–5908.Google Scholar
- Nierman, A., & Jagadish, H. V. (2002). Evaluating structural similarity in XML documents. In Int’l workshop on the web and databases (WebDB). Madison,WI, 2002 June. http://citeseer.ist.psu.edu/nierman02evaluating.html.
- Schlieder, T. (2002). Schema-driven evaluation of approximate tree-pattern queries. In Proceedings EDBT (pp. 514–532). Prague, Czech Republic. http://citeseer.ist.psu.edu/article/schlieder02schemadriven.html
- W3C (1999). XML Path Language (XPath) Version 1.0. http://www.w3.org/TR/xpath.
- Yu, C., & Jagadish, H. V. (2006). XML schema summarization. In U. Dayal, K. Whang, D. Lomet, G. Alonso, G. Lohman, M. Kersten, S. K. Cha, & Y. Kim, (Eds.), Proceedings of the 32nd international Conference on very large data bases (Seoul, Korea, September 12 – 15, 2006). Very large data bases. VLDB endowment (pp. 319–330).Google Scholar