Schema-Based Natural Language Semantic Mapping

  • Niculae Stratica
  • Bipin C. Desai
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3136)

Abstract

This paper addresses the problem of mapping Natural Language to SQL queries. It assumes that the input is in English language and details a methodology to build a SQL query based on the input sentence, a dictionary and a set of production rules. The dictionary consists of semantic sets and index files. A semantic set is created for each table or attribute name and contains synonyms, hyponyms and hypernyms as retrieved by WordNet and complemented manually. The index files contain pointers to records in the database, ordered by value and by data type. The dictionary and the production rules form a context-free grammar for producing the SQL queries. The context ambiguities are addressed through the use of the derivationally related forms based on WordNet. Building the run time semantic sets of the input tokens helps solving the ambiguities related to the database schema. The proposed method introduces two functional entities: a pre-processor and a runtime engine. The pre-processor reads the database schema and uses WordNet to create the semantic sets and the set of production rules. It also reads the database records and creates the index files. The run time engine matches the input tokens to the dictionary and uses the rules to create the corresponding SQL query.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Miller, G.: WordNet: A Lexical Database for English. Communications of the ACM 38(1), 39–41 (1995)CrossRefGoogle Scholar
  2. 2.
    Stratica, N., Kosseim, L., Desai, B.C.: NLIDB Templates for Semantic Parsing. In: Proceedings of Applications of Natural Language to Data Bases, NLDB 2003, Burg, Germany, June 2003, pp. 235–241 (2003)Google Scholar
  3. 3.
    Minker, J.: Information storage and retrieval - a survey and functional description. In: SIGIR, vol. 12, pp. 1–108 (1997)Google Scholar
  4. 4.
    Rubin, S.H., Chen, S.-C., Shyu, M.-L.: Field-Effect Natural Language Semantic Mapping. In: Proceedings of the 2003 IEEE International Conference on Systems, Man & Cybernetics, Washington, D.C., USA, October 5-8, pp. 2483–2487 (2003)Google Scholar
  5. 5.
    Mazlack, L.J., Feinauer, R.A.: Establishing a Basis for Mapping Natural- Language Statements Onto a Database Query Language. In: SIGIR, pp. 192–202 (1980)Google Scholar
  6. 6.
    Allen, J.: Natural Language Understanding, University of Rochester, The Benjamin Cummings Publishing Company, Inc. (1995) ISBN: 0-8053-0334-0Google Scholar
  7. 7.
    Baker, K., Franz, A., Jordan, P.: Coping with Ambiguity in Knowledge- based Natural Language Analysis. In: Florida AI Research Symposium, Pensacola, Florida, pp. 155–159 (1994)Google Scholar
  8. 8.
    Hirst, G.: Semantic Interpretation and the Resolution of Ambiguity. Cambridge University Press, Cambridge (1986)Google Scholar
  9. 9.
    Latent Semantic Analysis Laboratory at the Colorado University site visited in (March 2004), http://lsa.colorado.edu/
  10. 10.
    Sleator, D., Temperley, D.D.: Parsing English with A Link Grammar .In:Proceedings of the Third Annual Workshop on Parsing Technologies (1993)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Niculae Stratica
    • 1
  • Bipin C. Desai
    • 1
  1. 1.Department of Computer ScienceConcordia UniversityWest MontrealCanada

Personalised recommendations