Skip to main content

Flexible and Customizable NL Representation of Requirements for ETL processes

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4592))

Abstract

The design of an Extract – Transform – Load (ETL) workflow for the population of a Data Warehouse is a complex and challenging procedure. In previous work, we have presented an ontology-based approach to facilitate the conceptual design of an ETL scenario. In this paper, we elaborate on this work, by investigating the application of Natural Language (NL) techniques to the ETL environment and we present a flexible and customizable template-based mechanism for generating natural language representations for the ETL process requirements and operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Bontcheva, K.: Generating Tailored Textual Summaries from Ontologies. In: Gómez-Pérez, A., Euzenat, J. (eds.) The Semantic Web: Research and Applications. LNCS, vol. 3532, Springer, Heidelberg (2005)

    Google Scholar 

  • Bontcheva, K., Wilks, Y.: Automatic Report Generation from Ontologies: The MIAKT Approach. In: Meziane, F., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3136, Springer, Heidelberg (2004)

    Google Scholar 

  • Buchholz, E., Cyriaks, H., Düsterhöft, A., Mehlan, H., Thalheim, B.: Acquiring Complex Information from Natural Language for EER Database Design. In: NLDB (1995)

    Google Scholar 

  • Dalianis, H., Hovy, E.H.: Aggregation in Natural Language Generation. In: EWNLG (1993)

    Google Scholar 

  • Du, S., Metzler, D.P.: An Automated Multi-component Approach to Extracting Entity Relationships from Database Requirement Specification Documents. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds.) NLDB 2006. LNCS, vol. 3999, Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  • IBM. IBM WebSphere DataStage

    Google Scholar 

  • Ilieva, M.G., Ormandjieva, O.: Automatic Transition of Natural Language Software Requirements Specification into Formal Presentation. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3513, Springer, Heidelberg (2005)

    Google Scholar 

  • Informatica. PowerCenter

    Google Scholar 

  • Kedad, Z., Métais, E.: Dealing with Semantic Heterogeneity During Data Integration. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds.) ER 1999. LNCS, vol. 1728, Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  • Kedad, Z., Métais, E.: Ontology-Based Data Cleaning. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 2553, Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  • Kof, L.: Natural Language Processing: Mature Enough for Requirements Documents Analysis? In: Montoyo, A., Muńoz, R., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3513, Springer, Heidelberg (2005)

    Google Scholar 

  • Luján-Mora, S., Vassiliadis, P., Trujillo, J.: Data Mapping Diagrams for Data Warehouse Design with UML. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) Conceptual Modeling – ER 2004. LNCS, vol. 3288, Springer, Heidelberg (2004)

    Google Scholar 

  • Metais, E., Meunier, J., Levreau, G.: Database Schema Design: A Perspective from Natural Language Techniques to Validation and View Integration. In: ER (1993)

    Google Scholar 

  • Microsoft. Data Transformation Services

    Google Scholar 

  • Oracle. Oracle Warehouse Builder Product Page

    Google Scholar 

  • Reape, M., Mellish, C.: Just what is aggregation anyway. In: ENLG (1999)

    Google Scholar 

  • Reiter, E., Mellish, C., Levine, J.: Automatic generation of technical documentation. In: Applied Artificial Intelligence 9(3) (1995)

    Google Scholar 

  • Rolland, C., Proix, C.: A Natural Language Approach for Requirements Engineering. In: Loucopoulos, P. (ed.) CAiSE 1992. LNCS, vol. 593, Springer, Heidelberg (1992)

    Chapter  Google Scholar 

  • Simitsis, A.: Mapping Conceptual to Logical Models for ETL Processes. In: DOLAP (2005)

    Google Scholar 

  • Skoutas, D., Simitsis, A.: Ontology-based Conceptual Design of ETL Processes for both Structured and Semi-structured Data. In: IJSWIS (to appear, 2007)

    Google Scholar 

  • Skoutas, D., Simitsis, A.: Flexible and Customizable NL Representation of Requirements for ETL Processes. Technical Report, http://www.dblab.ntua.gr/~asimi/publications/SkSi07b.pdf

  • Storey, V.C., Goldstein, R.C., Ullrich, H.: Naive Semantics to Support Automated Database Design. In: IEEE TKDE, vol. 14(1) (2002)

    Google Scholar 

  • Min, T.A., Berger, L.: Transformation of Requirement Specifications Expressed in Natural Language into an EER Model. In: ER (1993)

    Google Scholar 

  • Trujillo, J., Lujan-Mora, S.: A UML Based Approach for Modeling ETL Processes in Data Warehouses. In: Song, I.-Y., Liddle, S.W., Ling, T.-W., Scheuermann, P. (eds.) ER 2003. LNCS, vol. 2813, Springer, Heidelberg (2003)

    Google Scholar 

  • Tsen, F.S.C., Chen, A.L.P., Yang, W.-P.: On mapping natural language constructs into relational algebra through E-R representation. In: DKE (9) (1992)

    Google Scholar 

  • Vassiliadis, P., Simitsis, A., Skiadopoulos, S.: Conceptual Modeling for ETL Processes. In: DOLAP (2002)

    Google Scholar 

  • Wilcock, G., Jokinen, K.: Generating Responses and Explanations from RDF/XML and DAML+OIL. In: IJCAI (2003)

    Google Scholar 

  • Wilcock, G.: Talking OWLs: Towards an Ontology Verbalizer. In: Fensel, D., Sycara, K.P., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, Springer, Heidelberg (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zoubida Kedad Nadira Lammari Elisabeth Métais Farid Meziane Yacine Rezgui

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Skoutas, D., Simitsis, A. (2007). Flexible and Customizable NL Representation of Requirements for ETL processes. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73351-5_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73350-8

  • Online ISBN: 978-3-540-73351-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics