Dialogue Manager for a NLIDB for Solving the Semantic Ellipsis Problem in Query Formulation

  • Rodolfo A. Pazos R.
  • Juan C. Rojas P.
  • René Santaolaya S.
  • José A. Martínez F.
  • Juan J. Gonzalez B.
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6277)


A query written in natural language (NL) may involve several linguistic problems that prevent it from being interpreted or translated correctly into SQL. One of these problems is implicit information, or semantic ellipsis, which can be understood as the omission of important words in the wording of a query written in NL. An exhaustive survey of work on natural language interfaces to databases (NLIDBs) has revealed that most of it has not dealt with semantic ellipsis systematically. In experiments conducted on commercial NLIDBs, very poor results were obtained (7% to 16.9%) on query corpora that involve semantic ellipsis. In this paper we propose a dialogue manager (DM) for an NLIDB for solving semantic ellipsis problems. The operation of this DM is based on a typification of the elliptical problems found in queries, which makes it possible to deal with this problem systematically. Additionally, the typification has two important characteristics: domain independence, which permits the typification to be applied to queries on different databases, and generality, which means that it holds for different languages such as English, French, Italian, and Spanish. These characteristics are inherited by the dialogue processes implemented in the DM, since they are based on this typification. In experiments conducted with this DM and an NLIDB on a corpus of elliptical queries, an increase of 30-35% in correctly answered queries was attained.
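To make the idea of semantic ellipsis concrete, the following is a minimal sketch, not the authors' DM or their typification. It covers only one kind of omission: a query that gives a value but leaves out the column the value belongs to. The toy schema, the function names, and the wording of the clarification question are all hypothetical and are assumed purely for illustration.

```python
from dataclasses import dataclass

# Hypothetical toy schema (not from the paper): column name -> known values.
SCHEMA = {
    "departure_city": {"boston", "denver"},
    "arrival_city": {"boston", "atlanta"},
    "airline": {"delta", "united"},
}


@dataclass
class Clarification:
    value: str        # the word whose column was omitted in the query
    candidates: list  # columns the value could refer to


def find_elliptical_values(query: str) -> list:
    """Return values that match several columns when the query names none of them."""
    text = query.lower().replace("?", "")
    words = text.split()
    # Columns the user named explicitly, e.g. "arrival city" for arrival_city.
    mentioned = {col for col in SCHEMA if col.replace("_", " ") in text}
    pending = []
    for word in words:
        cols = sorted(col for col, values in SCHEMA.items() if word in values)
        # Ambiguous only if several columns match and none was named explicitly.
        if len(cols) > 1 and not mentioned.intersection(cols):
            pending.append(Clarification(word, cols))
    return pending


def clarification_question(item: Clarification) -> str:
    """Build the question a dialogue manager might put to the user."""
    return f"Does '{item.value}' refer to {' or '.join(item.candidates)}?"


if __name__ == "__main__":
    for item in find_elliptical_values("Show flights Boston"):
        print(clarification_question(item))
```

In this sketch, the query "Show flights Boston" triggers a question because "boston" appears in two columns, whereas "Show flights with arrival city Boston" does not, since the column is stated. A real dialogue manager would need linguistic analysis well beyond this word matching.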


Query Processing; Dialogue Process; Query Formulation; Dialogue Manager; Dialog Process
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Rodolfo A. Pazos R. (1)
  • Juan C. Rojas P. (2)
  • René Santaolaya S. (2)
  • José A. Martínez F. (1)
  • Juan J. Gonzalez B. (1)

  1. Inst. Tecnológico de Cd. Madero, Cd. Madero, Mexico
  2. Centro Nacional de Investigación y Desarrollo Tecnológico, Cuernavaca, Mexico
