Abstract
Large enterprises continue to struggle with information and critical decision-making data being widely distributed, stored in a number of proprietary and heterogeneous formats, and remaining inaccessible for mining of critical information that spans the collected knowledge of the organization. NETMARK is an easy to use, scalable system for storing, decomposing, and indexing enterprise-wide information developed for NASA enterprise applications. Information is managed in a contextualized form, but one that is schema-less for immediate storage and retrieval without the need for a schema manager or database administrator. NETMARK is accessed via the WebDAV (HTTP) standard protocol for remote document management and a simple HTTP query algebra for immediate retrieval of information in an XML structured format for processing by applications such as Web 2.0 (AJAX) systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Halevy, A.Y., Rajaraman, A., Ordille, J.: Data Integration: The Teenage Years. In: Proc of VLDB (2006)
Litwin, W., Mark, L., Roussopoulos, N.: Interoperability of Multiple Autonomous Databases. ACM Computing Surveys 22(3), 267–293 (1990)
Halevy, A.Y., Ashish, N., Bitton, D., Carey, M.J., Draper, D., Pollock, J., Rosenthal, A., Sikka, V.: Enterprise information integration: successes, challenges and controversies. In: SIGMOD Conference 2005, pp. 778–787 (2005)
Halevy, A.Y.: Data Integration: A Status Report. In: BTW 2003, pp. 24–29 (2003)
Draper, D., Halevy, A.Y., Weld, D.S.: The Nimble XML Data Integration System. In: ICDE 2001, pp. 155–160 (2001)
Papakonstantinou, Y., Borkar, V.R., Orgiyan, M., Stathatos, K., Suta, L., Vassalos, V., Velikhov, P.: XML queries and algebra in the Enosys integration platform. Data Knowl. Eng. 44(3), 299–322 (2003)
Berners-Lee, T., Hendler, J., Lasilla, O.: The Semantic-Web. Scientific American (May 2001)
Neches, R., Fikes, R., Finin, T., Gruber, T., Patil, R., Senator, T., Swartout, W.R.: Enabling Technology for Knowledge Sharing. AI Magazine 12(3), 36–55 (1991)
MacGregor, R.M.: Inside the LOOM Description Classifier. SIGART Bulletin 2(3), 88–92 (1991)
Brachman, R.J., McGuinness, D.L., Patel-Schneider, P.F., Borgida, A.: "Reducing" CLASSIC to Practice: Knowledge Representation Theory Meets Reality. Artif. Intell. 114(1-2), 203–237 (1999)
Gruber, T.R.: The Role of Common Ontology in Achieving Sharable, Reusable Knowledge Bases. In: KR 1991 (1991)
Collet, C., Huhns, M., Shen, W.: Resource Integration Using a Large Knowledge Base in Carnot. IEEE Computer 12(24) (December 1991)
Maluf, D.A., Tran, P.: Articulation Management for Intelligent Integration of Information. IEEE Systems Man and Cybernetics (2001)
Guha, R.V.: Context: A Formalization and Some Applications, Doctoral Dissertation, Stanford University (1991)
Lenat, D., Guha, R.: The Evolution of CycL, The Cyc Representation language; Special Issue on Implemented Knowledge Representation System. ACM SIGART 2(3), 84–87 (1991)
McCarthy, J.: Notes on Formalizing Context. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (1993)
Maluf, D.A., Wiederhold, G.: Abstraction of Representation for Interoperation. In: Tenth International Symposium on Methodologies for Intelligent Systems. LNCS, pp. 441–455. Springer, Heidelberg (1997)
Mitra, P., Wiederhold, G.: An Ontology-Composition Algebra. In: Handbook on Ontologies 2004, pp. 93–116 (2004)
Maluf, D.A., Bell, D.G., Ashish, N., Knight, C., Tran, P.B.: Semi-Structured Data Management in the Enterprise: A Nimble, High-Throughput, and Scalable Approach. In: IDEAS 2005, pp. 115–124 (2005)
Gawdiak, Y., La, T., Lin, Y., Maluf, D., Tran, P.: US Patent 6,968,338, Extensible database framework for management of unstructured and semi-structured documents, Awarded November 22 (2005)
Maluf, D., Tran, P.: NETMARK: A Schema-Less Extension for Relational Databases for Managing Semi-structured Data Dynamically. In: Zhong, N., Raś, Z.W., Tsumoto, S., Suzuki, E. (eds.) ISMIS 2003. LNCS, vol. 2871, pp. 231–241. Springer, Heidelberg (2003)
Schmidt, A.R., Waas, F., Ketersen, M.L., Florescu, D., Manolescu, I., Carey, M.J., Busse, R.: The XML Benchmark Project. In: CWI (2001)
Paparizos, S., Al-Khalifa, S., Chapman, A., Jagadish, H.V., Lakshmanan, L.V.S., Nierman, A., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native System for Querying XML. In: SIGMOD Conference 2003, p. 672 (2003)
Maluf, D.A., Bell, D.G., Ashish, N., Putz, P., Gawdiak, Y.: Business Intelligence in Large Organizations: Integrating Which Data? In: Esposito, F., Raś, Z.W., Malerba, D., Semeraro, G. (eds.) ISMIS 2006. LNCS, vol. 4203, pp. 248–257. Springer, Heidelberg (2006)
Maluf, D., Tran, P.: Managing Unstructured Data With Structured Legacy Systems. In: IEEE Aerospace Conference, Montana (2008)
Maluf, D.: Searching Across the International Space Station. In: IEEE Aerospace Conference, Montana (2007)
Maluf, D.: Knowledge Mining Application in a IVHM Testbed. In: IEEE Aerospace Conference, Montana (2006)
NETMARK XDB guide
NETMARK API
Jagadish, H.V., Khalifa, S., Chapman, A., Lakshmanan, L., Nierman, A., Paparizos, S., Patel, J., Srivastava, D., Wiwatwattanna, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB Journal 11, 274–291 (2002)
Ives, Z., Halevy, A., Weld, D.: An XML query engine for network-bound data. VLDB Journal 11, 380–402 (2002)
Funderbunk, J.E., Kiernan, G., Shanmugasundaram, J., Shekita, E., Wei, C.: XTABLES: Bridging relational technology and XML. IBM Systems Journal 41, 616–641 (2002)
Li, Y., Yu, C., Jagadish, H.V.: Enabling Schema-Free XQuery with meaningful query focus. VLDB Journal (2008)
Botev, C., Shanmugasundaram, J.: Context-Sensitive Keyword Search and Ranking for XML. In: WebDB 2005, pp. 115–120 (2005)
Grust, T., Rittinger, J., Teubner, J.: Why off-the-shelf RDBMSs are better at XPath than you might expect. In: ACM SIGMOD Conference, pp. 949–958 (2007)
Georgiadis, H., Vassalos, V.: Xpath on Steroids: Exploiting Relational Engines for Xpath Performance. In: ACM SIGMOD Conference, pp. 317–328 (2007)
Boncz, P.A., Grust, T., van Keulen, M., Manegold, S., Rittinger, J., Teubner, J.: MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In: ACM SIGMOD, pp. 479–490 (2006)
Vagena, Z., Moro, M., Tsotras, V.: Twig Query Processing over Graph Structured XML Data. In: Workshop on Web and Databases WebDB 2004, Paris, France (2004)
Xu, Y., Papakonstantinou, Y.: Efficient LCA based keyword search in XML data. In: EDBT 2008, pp. 535–546 (2008)
Madhavan, J., Cohen, S., Dong, X.L., Halevy, A.Y., Jeffery, S.R., Ko, D., Yu, C.: Web-Scale Data Integration: You can afford to Pay as You Go. In: CIDR 2007, pp. 342–350 (2007)
Anderson, N., Lee, E., Brockenbrough, J.S., Minie, M., Fuller, S., Brinkley, J., Tarczy-Hornoch, P.: Issues in Biomedical Research Data Management and Analysis: Needs and Barriers. Journal of the American Medical Informatics Association 14(4) (August 2007)
Docushare, http://docushare.xerox.com/ds/
Oracle
MySQL
Apache
WebDAV
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Maluf, D.A., Knight, C.D. (2009). Achieving Scalability with Schema-Less Databases. In: Ras, Z.W., Dardzinska, A. (eds) Advances in Data Management. Studies in Computational Intelligence, vol 223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02190-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-02190-9_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02189-3
Online ISBN: 978-3-642-02190-9
eBook Packages: EngineeringEngineering (R0)