Skip to main content

Achieving Scalability with Schema-Less Databases

  • Chapter
Advances in Data Management

Part of the book series: Studies in Computational Intelligence ((SCI,volume 223))

  • 607 Accesses

Abstract

Large enterprises continue to struggle with information and critical decision-making data being widely distributed, stored in a number of proprietary and heterogeneous formats, and remaining inaccessible for mining of critical information that spans the collected knowledge of the organization. NETMARK is an easy to use, scalable system for storing, decomposing, and indexing enterprise-wide information developed for NASA enterprise applications. Information is managed in a contextualized form, but one that is schema-less for immediate storage and retrieval without the need for a schema manager or database administrator. NETMARK is accessed via the WebDAV (HTTP) standard protocol for remote document management and a simple HTTP query algebra for immediate retrieval of information in an XML structured format for processing by applications such as Web 2.0 (AJAX) systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Halevy, A.Y., Rajaraman, A., Ordille, J.: Data Integration: The Teenage Years. In: Proc of VLDB (2006)

    Google Scholar 

  2. Litwin, W., Mark, L., Roussopoulos, N.: Interoperability of Multiple Autonomous Databases. ACM Computing Surveys 22(3), 267–293 (1990)

    Article  Google Scholar 

  3. Halevy, A.Y., Ashish, N., Bitton, D., Carey, M.J., Draper, D., Pollock, J., Rosenthal, A., Sikka, V.: Enterprise information integration: successes, challenges and controversies. In: SIGMOD Conference 2005, pp. 778–787 (2005)

    Google Scholar 

  4. Halevy, A.Y.: Data Integration: A Status Report. In: BTW 2003, pp. 24–29 (2003)

    Google Scholar 

  5. Draper, D., Halevy, A.Y., Weld, D.S.: The Nimble XML Data Integration System. In: ICDE 2001, pp. 155–160 (2001)

    Google Scholar 

  6. Papakonstantinou, Y., Borkar, V.R., Orgiyan, M., Stathatos, K., Suta, L., Vassalos, V., Velikhov, P.: XML queries and algebra in the Enosys integration platform. Data Knowl. Eng. 44(3), 299–322 (2003)

    Article  Google Scholar 

  7. Berners-Lee, T., Hendler, J., Lasilla, O.: The Semantic-Web. Scientific American (May 2001)

    Google Scholar 

  8. Neches, R., Fikes, R., Finin, T., Gruber, T., Patil, R., Senator, T., Swartout, W.R.: Enabling Technology for Knowledge Sharing. AI Magazine 12(3), 36–55 (1991)

    Google Scholar 

  9. MacGregor, R.M.: Inside the LOOM Description Classifier. SIGART Bulletin 2(3), 88–92 (1991)

    Article  MathSciNet  Google Scholar 

  10. Brachman, R.J., McGuinness, D.L., Patel-Schneider, P.F., Borgida, A.: "Reducing" CLASSIC to Practice: Knowledge Representation Theory Meets Reality. Artif. Intell. 114(1-2), 203–237 (1999)

    Article  MATH  Google Scholar 

  11. Gruber, T.R.: The Role of Common Ontology in Achieving Sharable, Reusable Knowledge Bases. In: KR 1991 (1991)

    Google Scholar 

  12. Collet, C., Huhns, M., Shen, W.: Resource Integration Using a Large Knowledge Base in Carnot. IEEE Computer 12(24) (December 1991)

    Google Scholar 

  13. Maluf, D.A., Tran, P.: Articulation Management for Intelligent Integration of Information. IEEE Systems Man and Cybernetics (2001)

    Google Scholar 

  14. Guha, R.V.: Context: A Formalization and Some Applications, Doctoral Dissertation, Stanford University (1991)

    Google Scholar 

  15. Lenat, D., Guha, R.: The Evolution of CycL, The Cyc Representation language; Special Issue on Implemented Knowledge Representation System. ACM SIGART 2(3), 84–87 (1991)

    Article  Google Scholar 

  16. McCarthy, J.: Notes on Formalizing Context. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (1993)

    Google Scholar 

  17. Maluf, D.A., Wiederhold, G.: Abstraction of Representation for Interoperation. In: Tenth International Symposium on Methodologies for Intelligent Systems. LNCS, pp. 441–455. Springer, Heidelberg (1997)

    Google Scholar 

  18. Mitra, P., Wiederhold, G.: An Ontology-Composition Algebra. In: Handbook on Ontologies 2004, pp. 93–116 (2004)

    Google Scholar 

  19. Maluf, D.A., Bell, D.G., Ashish, N., Knight, C., Tran, P.B.: Semi-Structured Data Management in the Enterprise: A Nimble, High-Throughput, and Scalable Approach. In: IDEAS 2005, pp. 115–124 (2005)

    Google Scholar 

  20. Gawdiak, Y., La, T., Lin, Y., Maluf, D., Tran, P.: US Patent 6,968,338, Extensible database framework for management of unstructured and semi-structured documents, Awarded November 22 (2005)

    Google Scholar 

  21. Maluf, D., Tran, P.: NETMARK: A Schema-Less Extension for Relational Databases for Managing Semi-structured Data Dynamically. In: Zhong, N., Raś, Z.W., Tsumoto, S., Suzuki, E. (eds.) ISMIS 2003. LNCS, vol. 2871, pp. 231–241. Springer, Heidelberg (2003)

    Google Scholar 

  22. Schmidt, A.R., Waas, F., Ketersen, M.L., Florescu, D., Manolescu, I., Carey, M.J., Busse, R.: The XML Benchmark Project. In: CWI (2001)

    Google Scholar 

  23. Paparizos, S., Al-Khalifa, S., Chapman, A., Jagadish, H.V., Lakshmanan, L.V.S., Nierman, A., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native System for Querying XML. In: SIGMOD Conference 2003, p. 672 (2003)

    Google Scholar 

  24. Maluf, D.A., Bell, D.G., Ashish, N., Putz, P., Gawdiak, Y.: Business Intelligence in Large Organizations: Integrating Which Data? In: Esposito, F., Raś, Z.W., Malerba, D., Semeraro, G. (eds.) ISMIS 2006. LNCS, vol. 4203, pp. 248–257. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  25. Maluf, D., Tran, P.: Managing Unstructured Data With Structured Legacy Systems. In: IEEE Aerospace Conference, Montana (2008)

    Google Scholar 

  26. Maluf, D.: Searching Across the International Space Station. In: IEEE Aerospace Conference, Montana (2007)

    Google Scholar 

  27. Maluf, D.: Knowledge Mining Application in a IVHM Testbed. In: IEEE Aerospace Conference, Montana (2006)

    Google Scholar 

  28. NETMARK XDB guide

    Google Scholar 

  29. NETMARK API

    Google Scholar 

  30. Jagadish, H.V., Khalifa, S., Chapman, A., Lakshmanan, L., Nierman, A., Paparizos, S., Patel, J., Srivastava, D., Wiwatwattanna, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB Journal 11, 274–291 (2002)

    Article  MATH  Google Scholar 

  31. Ives, Z., Halevy, A., Weld, D.: An XML query engine for network-bound data. VLDB Journal 11, 380–402 (2002)

    Article  MATH  Google Scholar 

  32. Funderbunk, J.E., Kiernan, G., Shanmugasundaram, J., Shekita, E., Wei, C.: XTABLES: Bridging relational technology and XML. IBM Systems Journal 41, 616–641 (2002)

    Article  Google Scholar 

  33. Li, Y., Yu, C., Jagadish, H.V.: Enabling Schema-Free XQuery with meaningful query focus. VLDB Journal (2008)

    Google Scholar 

  34. Botev, C., Shanmugasundaram, J.: Context-Sensitive Keyword Search and Ranking for XML. In: WebDB 2005, pp. 115–120 (2005)

    Google Scholar 

  35. Grust, T., Rittinger, J., Teubner, J.: Why off-the-shelf RDBMSs are better at XPath than you might expect. In: ACM SIGMOD Conference, pp. 949–958 (2007)

    Google Scholar 

  36. Georgiadis, H., Vassalos, V.: Xpath on Steroids: Exploiting Relational Engines for Xpath Performance. In: ACM SIGMOD Conference, pp. 317–328 (2007)

    Google Scholar 

  37. Boncz, P.A., Grust, T., van Keulen, M., Manegold, S., Rittinger, J., Teubner, J.: MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In: ACM SIGMOD, pp. 479–490 (2006)

    Google Scholar 

  38. Vagena, Z., Moro, M., Tsotras, V.: Twig Query Processing over Graph Structured XML Data. In: Workshop on Web and Databases WebDB 2004, Paris, France (2004)

    Google Scholar 

  39. Xu, Y., Papakonstantinou, Y.: Efficient LCA based keyword search in XML data. In: EDBT 2008, pp. 535–546 (2008)

    Google Scholar 

  40. Madhavan, J., Cohen, S., Dong, X.L., Halevy, A.Y., Jeffery, S.R., Ko, D., Yu, C.: Web-Scale Data Integration: You can afford to Pay as You Go. In: CIDR 2007, pp. 342–350 (2007)

    Google Scholar 

  41. Anderson, N., Lee, E., Brockenbrough, J.S., Minie, M., Fuller, S., Brinkley, J., Tarczy-Hornoch, P.: Issues in Biomedical Research Data Management and Analysis: Needs and Barriers. Journal of the American Medical Informatics Association 14(4) (August 2007)

    Google Scholar 

  42. Xalan, http://xml.apache.org/xalan-j/

  43. XML, http://www.w3.org/XML/

  44. Docushare, http://docushare.xerox.com/ds/

  45. Oracle

    Google Scholar 

  46. MySQL

    Google Scholar 

  47. Apache

    Google Scholar 

  48. WebDAV

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Maluf, D.A., Knight, C.D. (2009). Achieving Scalability with Schema-Less Databases. In: Ras, Z.W., Dardzinska, A. (eds) Advances in Data Management. Studies in Computational Intelligence, vol 223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02190-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02190-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02189-3

  • Online ISBN: 978-3-642-02190-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics