Abstract
XML (eXtensible Markup Language) is fast becoming the de facto standard for information exchange over the Internet. As more and more sensitive information is stored as XML, sophisticated indexing schemes are required to speed up document storage and retrieval. An XML document can be represented as a hierarchy of elements. This paper describes a Lattice-map semantic indexing technique for clustering XML documents. Documents are indexed with the Lattice-map technique to improve retrieval performance, the index provides Similarity and Popularity operations, and a clustering algorithm is used to mine the XML documents.
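The abstract does not define the Lattice-map structure or its operations, so the following is only a minimal sketch of the general idea of bitmap-style indexing over the element hierarchy, not the authors' method. It assumes that each document is reduced to the set of its root-to-element tag paths, that one bit is assigned per distinct path, that Similarity is the Jaccard ratio of two bitmaps, and that Popularity is the number of documents containing a given path. The function names `element_paths`, `build_index`, `similarity`, and `popularity` are hypothetical.

```python
import xml.etree.ElementTree as ET


def element_paths(xml_text):
    """Return the set of root-to-element tag paths in one XML document."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        path = f"{prefix}/{node.tag}"
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def build_index(docs):
    """Map each document name to an int bitmask over a shared path vocabulary.

    Assumption: one bit per distinct element path; the paper's Lattice-map
    may be organised differently.
    """
    doc_paths = {name: element_paths(text) for name, text in docs.items()}
    vocab = sorted(set().union(*doc_paths.values()))
    bit_of = {path: i for i, path in enumerate(vocab)}
    bitmaps = {
        name: sum(1 << bit_of[p] for p in paths)
        for name, paths in doc_paths.items()
    }
    return vocab, bitmaps


def similarity(a, b):
    """Jaccard similarity of two bitmasks (an assumed measure, not the paper's)."""
    union = bin(a | b).count("1")
    return bin(a & b).count("1") / union if union else 1.0


def popularity(bitmaps, bit):
    """Count documents whose bitmask has the given bit set (assumed definition)."""
    return sum((bm >> bit) & 1 for bm in bitmaps.values())
```

Under these assumptions, a clustering step could group documents whose pairwise `similarity` exceeds a chosen threshold, and `popularity` could be used to ignore paths that occur in almost every document.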
Copyright information
© 2007 Springer
About this paper
Cite this paper
Natarajan, D.A., Premalatha, K., Kogilavani, A. (2007). Lattice Cube Semantic Index Based Mining on XML Documents. In: Sobh, T. (eds) Innovations and Advanced Techniques in Computer and Information Sciences and Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6268-1_47
DOI: https://doi.org/10.1007/978-1-4020-6268-1_47
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-6267-4
Online ISBN: 978-1-4020-6268-1
eBook Packages: Engineering, Engineering (R0)
