Supporting High-Level Abstractions through XML Technology

  • Conference paper
Languages and Compilers for Parallel Computing (LCPC 2003)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2958)

Abstract

Development of applications that process large scientific datasets is often complicated by complex and specialized data storage formats. In this paper, we describe the use of XML technologies to support high-level programming methodologies for processing scientific datasets. We show how XML Schemas can give an application developer a high-level abstraction of a dataset. A corresponding low-level Schema describes the actual layout of the data and is used by the compiler for code generation. The compiler needs a systematic way to translate the high-level code into low-level code, and it must then transform the generated low-level code to achieve high locality and efficient execution. This paper describes our approach to these two problems. By using the Active Data Repository as the underlying runtime system, we offer an XML-based front end for storing, retrieving, and processing flat-file-based scientific datasets in a cluster environment.
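The split between a high-level view and a low-level layout can be illustrated with a small sketch. It is not the paper's compiler or the Active Data Repository API. The record type `FIELDS`, the column-wise flat layout, and the helper names `pack_columns`, `read_records`, and `total_value_in_box` are all assumptions made up for illustration. The application code works on logical records, while a separate layout description says where each field sits in the flat byte buffer.

```python
import struct

# Hypothetical high-level record type: field name and struct format code.
FIELDS = [("x", "i"), ("y", "i"), ("value", "d")]


def pack_columns(records):
    """Store records column by column in one flat buffer.

    Returns the buffer and a layout mapping each field name to
    (byte offset of its column, struct format code).
    """
    buf = bytearray()
    layout = {}
    for name, fmt in FIELDS:
        layout[name] = (len(buf), fmt)
        for rec in records:
            buf += struct.pack("<" + fmt, rec[name])
    return bytes(buf), layout


def read_records(buf, layout, count):
    """Rebuild the high-level records from the flat buffer using the layout."""
    for i in range(count):
        rec = {}
        for name, (offset, fmt) in layout.items():
            size = struct.calcsize("<" + fmt)
            (rec[name],) = struct.unpack_from("<" + fmt, buf, offset + i * size)
        yield rec


def total_value_in_box(buf, layout, count, x_max, y_max):
    """High-level query written against records, not byte offsets."""
    return sum(
        rec["value"]
        for rec in read_records(buf, layout, count)
        if rec["x"] <= x_max and rec["y"] <= y_max
    )


if __name__ == "__main__":
    data = [
        {"x": 0, "y": 0, "value": 1.5},
        {"x": 1, "y": 2, "value": 2.0},
        {"x": 5, "y": 5, "value": 4.0},
    ]
    flat, layout = pack_columns(data)
    print(total_value_in_box(flat, layout, len(data), x_max=1, y_max=2))
```

In this sketch the query function never mentions byte offsets; changing the storage format would only require a different layout and reader, which is the kind of separation the abstract attributes to the high-level and low-level Schemas.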

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, X., Agrawal, G. (2004). Supporting High-Level Abstractions through XML Technology. In: Rauchwerger, L. (eds) Languages and Compilers for Parallel Computing. LCPC 2003. Lecture Notes in Computer Science, vol 2958. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24644-2_9

  • DOI: https://doi.org/10.1007/978-3-540-24644-2_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21199-0

  • Online ISBN: 978-3-540-24644-2

  • eBook Packages: Springer Book Archive
