Overview of the INEX 2009 XML Mining Track: Clustering and Classification of XML Documents

Nayak, Richi; De Vries, Christopher M.; Kutty, Sangeetha; Geva, Shlomo; Denoyer, Ludovic; Gallinari, Patrick

doi:10.1007/978-3-642-14556-8_36

Richi Nayak¹⁹,
Christopher M. De Vries¹⁹,
Sangeetha Kutty¹⁹,
Shlomo Geva¹⁹,
Ludovic Denoyer²⁰ &
…
Patrick Gallinari²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6203))

Included in the following conference series:

International Workshop of the Initiative for the Evaluation of XML Retrieval

563 Accesses
10 Citations

Abstract

This report explains the objectives, datasets and evaluation criteria of both the clustering and classification tasks set in the INEX 2009 XML Mining track. The report also describes the approaches and results obtained by the different participants.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Altingovde, I., Atilgan, D., Ulusoy, O.: Exploiting Index Pruning Methods for Clustering XML Collections. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 379–386. Springer, Heidelberg (2010)
Google Scholar
Denoyer, L., Gallinari, P.: Report on the XML Mining Track at Inex 2005 and Inex 2006. Categorization and Clustering of XML Documents 41(1), 79–90 (2007)
Google Scholar
Denoyer, L., Gallinari, P.: Report on the XML Mining Track at Inex 2007. Categorization and Clustering of XML Documents 42(1), 22–28 (2008)
Google Scholar
Denoyer, L., Gallinari, P.: Overview of the inex 2008 xml mining track. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2008. LNCS, vol. 5631, pp. 401–411. Springer, Heidelberg (2009)
Chapter Google Scholar
De Vries, C., Geva, S., De Vine, L.: Clustering with Random Indexing K-tree and XML Structure. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 407–415. Springer, Heidelberg (2010)
Google Scholar
Chidlovskii, B.: Multi-label Wikipedia classification with textual and graph features. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 387–396. Springer, Heidelberg (2010)
Google Scholar
Hagenbuchner, M., Zhang, S., Scarselli, F., Chung Tsoi, A.: Supervised Encoding of Graph-of-Graphs for Classification and Regression Problems. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 449–461. Springer, Heidelberg (2010)
Google Scholar
Jardine, N., van Rijsbergen, C.J.: The Use of Hierarchic Clustering in Information Retrieval. Inform. Stor. Retr. 7, 217–240 (1971)
Article Google Scholar
Kutty, S., Nayak, R., Li, Y.: HCX: An Efficient Hybrid Clustering Approach for XML Documents. In: Proceedings of the ACM Document Engineering Symposium, Munich, Germany, pp. 94–97 (2009)
Google Scholar
Kutty, S., Nayak, R., Li, Y.: Clustering XML documents using Multi-feature Model. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 416–425. Springer, Heidelberg (2010)
Google Scholar
Largeron, C., Moulin, C., Gery, M.: UJM at INEX 2009 XML Mining Track. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 426–433. Springer, Heidelberg (2010)
Google Scholar
Nayak, R.: XML Data Mining: Process and Applications. In: Song, M., Wu, Y.-F. (eds.) Hand-book of Research on Text and Web Mining Technologies, ch.15, pp. 249–272. Idea Group Inc., USA
Google Scholar
Pinto, D., Tovar, M., Vilariño, D., Beltran, B., Salazar, H.: BUAP: Performance of K-Star at the INEX 2009 Clustering Task. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 434–440. Springer, Heidelberg (2010)
Google Scholar
Romero, A.E., de Campos, M.L., Fernandez-Luna, J.M., Huete, J.F., Mase-gosa, A.R.: Link-based text calssification using Bayesian networks. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 397–406. Springer, Heidelberg (2010)
Google Scholar
Yang, J., Wang, S.: Extended VSM for XML Document Classification using Frequent Subtrees. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 441–448. Springer, Heidelberg (2010)
Google Scholar
Suchanek, F., Kasneci, G., Weikum, G.: YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia. In: WWW 2007 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, Brisbane, Qld, 4001, Australia
Richi Nayak, Christopher M. De Vries, Sangeetha Kutty & Shlomo Geva
University Pierre et Marie Curie, LIP6 – 104 avenue du président Kennedy, 75016, Paris, France
Ludovic Denoyer & Patrick Gallinari

Authors

Richi Nayak
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. De Vries
View author publications
You can also search for this author in PubMed Google Scholar
Sangeetha Kutty
View author publications
You can also search for this author in PubMed Google Scholar
Shlomo Geva
View author publications
You can also search for this author in PubMed Google Scholar
Ludovic Denoyer
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Gallinari
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, 4001, Brisbane, Qld, Australia
Shlomo Geva
Archives and Information Studies/Humanities, University of Amsterdam, Turfdraagsterpad 9, 1012 XT, Amsterdam, The Netherlands
Jaap Kamps
Department of Computer Science, University of Otago, P.O. Box 56,, 9054, Dunedin, New Zealand
Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nayak, R., De Vries, C.M., Kutty, S., Geva, S., Denoyer, L., Gallinari, P. (2010). Overview of the INEX 2009 XML Mining Track: Clustering and Classification of XML Documents. In: Geva, S., Kamps, J., Trotman, A. (eds) Focused Retrieval and Evaluation. INEX 2009. Lecture Notes in Computer Science, vol 6203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14556-8_36

Download citation

DOI: https://doi.org/10.1007/978-3-642-14556-8_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14555-1
Online ISBN: 978-3-642-14556-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics