Abstract
The design of an Extract – Transform – Load (ETL) workflow for the population of a Data Warehouse is a complex and challenging procedure. In previous work, we have presented an ontology-based approach to facilitate the conceptual design of an ETL scenario. In this paper, we elaborate on this work, by investigating the application of Natural Language (NL) techniques to the ETL environment and we present a flexible and customizable template-based mechanism for generating natural language representations for the ETL process requirements and operations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bontcheva, K.: Generating Tailored Textual Summaries from Ontologies. In: Gómez-Pérez, A., Euzenat, J. (eds.) The Semantic Web: Research and Applications. LNCS, vol. 3532, Springer, Heidelberg (2005)
Bontcheva, K., Wilks, Y.: Automatic Report Generation from Ontologies: The MIAKT Approach. In: Meziane, F., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3136, Springer, Heidelberg (2004)
Buchholz, E., Cyriaks, H., Düsterhöft, A., Mehlan, H., Thalheim, B.: Acquiring Complex Information from Natural Language for EER Database Design. In: NLDB (1995)
Dalianis, H., Hovy, E.H.: Aggregation in Natural Language Generation. In: EWNLG (1993)
Du, S., Metzler, D.P.: An Automated Multi-component Approach to Extracting Entity Relationships from Database Requirement Specification Documents. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds.) NLDB 2006. LNCS, vol. 3999, Springer, Heidelberg (2006)
IBM. IBM WebSphere DataStage
Ilieva, M.G., Ormandjieva, O.: Automatic Transition of Natural Language Software Requirements Specification into Formal Presentation. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3513, Springer, Heidelberg (2005)
Informatica. PowerCenter
Kedad, Z., Métais, E.: Dealing with Semantic Heterogeneity During Data Integration. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds.) ER 1999. LNCS, vol. 1728, Springer, Heidelberg (1999)
Kedad, Z., Métais, E.: Ontology-Based Data Cleaning. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 2553, Springer, Heidelberg (2002)
Kof, L.: Natural Language Processing: Mature Enough for Requirements Documents Analysis? In: Montoyo, A., Muńoz, R., Métais, E. (eds.) Natural Language Processing and Information Systems. LNCS, vol. 3513, Springer, Heidelberg (2005)
Luján-Mora, S., Vassiliadis, P., Trujillo, J.: Data Mapping Diagrams for Data Warehouse Design with UML. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) Conceptual Modeling – ER 2004. LNCS, vol. 3288, Springer, Heidelberg (2004)
Metais, E., Meunier, J., Levreau, G.: Database Schema Design: A Perspective from Natural Language Techniques to Validation and View Integration. In: ER (1993)
Microsoft. Data Transformation Services
Oracle. Oracle Warehouse Builder Product Page
Reape, M., Mellish, C.: Just what is aggregation anyway. In: ENLG (1999)
Reiter, E., Mellish, C., Levine, J.: Automatic generation of technical documentation. In: Applied Artificial Intelligence 9(3) (1995)
Rolland, C., Proix, C.: A Natural Language Approach for Requirements Engineering. In: Loucopoulos, P. (ed.) CAiSE 1992. LNCS, vol. 593, Springer, Heidelberg (1992)
Simitsis, A.: Mapping Conceptual to Logical Models for ETL Processes. In: DOLAP (2005)
Skoutas, D., Simitsis, A.: Ontology-based Conceptual Design of ETL Processes for both Structured and Semi-structured Data. In: IJSWIS (to appear, 2007)
Skoutas, D., Simitsis, A.: Flexible and Customizable NL Representation of Requirements for ETL Processes. Technical Report, http://www.dblab.ntua.gr/~asimi/publications/SkSi07b.pdf
Storey, V.C., Goldstein, R.C., Ullrich, H.: Naive Semantics to Support Automated Database Design. In: IEEE TKDE, vol. 14(1) (2002)
Min, T.A., Berger, L.: Transformation of Requirement Specifications Expressed in Natural Language into an EER Model. In: ER (1993)
Trujillo, J., Lujan-Mora, S.: A UML Based Approach for Modeling ETL Processes in Data Warehouses. In: Song, I.-Y., Liddle, S.W., Ling, T.-W., Scheuermann, P. (eds.) ER 2003. LNCS, vol. 2813, Springer, Heidelberg (2003)
Tsen, F.S.C., Chen, A.L.P., Yang, W.-P.: On mapping natural language constructs into relational algebra through E-R representation. In: DKE (9) (1992)
Vassiliadis, P., Simitsis, A., Skiadopoulos, S.: Conceptual Modeling for ETL Processes. In: DOLAP (2002)
Wilcock, G., Jokinen, K.: Generating Responses and Explanations from RDF/XML and DAML+OIL. In: IJCAI (2003)
Wilcock, G.: Talking OWLs: Towards an Ontology Verbalizer. In: Fensel, D., Sycara, K.P., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Skoutas, D., Simitsis, A. (2007). Flexible and Customizable NL Representation of Requirements for ETL processes. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-73351-5_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73350-8
Online ISBN: 978-3-540-73351-5
eBook Packages: Computer ScienceComputer Science (R0)