Emergent XML Mining: Discovering an Efficient Mapping from XML Instances to Relational Schemas

  • Chapter
Emergent Web Intelligence: Advanced Semantic Technologies

Part of the book series: Advanced Information and Knowledge Processing (AI&KP)

Acknowledgements

This work was partially supported by the Ministry of Education, Culture, Sports, Science and Technology, Japan, under Grants-in-Aid for Scientific Research (16 300 030, 19 300 026). We thank Mr. Takeyoshi Maku for his great efforts in implementing and evaluating the general ideas described in this chapter.

Author information

Corresponding author

Correspondence to Hiroshi Ishikawa.

Copyright information

© 2010 Springer-Verlag London

About this chapter

Cite this chapter

Ishikawa, H. (2010). Emergent XML Mining: Discovering an Efficient Mapping from XML Instances to Relational Schemas. In: Badr, Y., Chbeir, R., Abraham, A., Hassanien, AE. (eds) Emergent Web Intelligence: Advanced Semantic Technologies. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84996-077-9_12

  • DOI: https://doi.org/10.1007/978-1-84996-077-9_12

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84996-076-2

  • Online ISBN: 978-1-84996-077-9

  • eBook Packages: Computer Science, Computer Science (R0)
