An Advanced Web-Based Hindi Language Interface to Database Using Machine Learning Approach

  • Zorawar Singh VirkEmail author
  • Mohit Dua
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9734)


NLIDB (Natural Language Interface to Database) system is a key step to many areas like Database Mining, Search Engines, Medical Science, Artificial Intelligence etc. Existing literature reveals that Natural Language Interfaces were extensively studied from the 1970’s to 1990’s. The paper proposes a NLIDB system namely Advanced WB-HLIDB (Web based Hindi Language Interface to Database using Machine Learning Approach) that uses the natural language as Hindi language and is based on the Machine Learning technique. The main features that have been introduced are Web Based Graphical User Interface to the system along with the use of Clustering and Similarity functions. This paper discusses the use of Similarity approach instead of the ‘Like’ approach which had been in practice till recently. Various other components such as multiword selection, recognizing misspelled words, storing of successful queries, auto complete function, built in keywords etc. have also been implemented.


NLIDB WB-HLIDB HLIDB Machine learning Data mining Similarity functions KNN algorithm Hindi shallow parser 


  1. 1.
    Woods, W.A., Kaplan, R.M., Webber, B.N.: The lunar sciences natural language information system: Final Report. In: BBN Report 2378, Bolt Beranek and Newman Inc., Cambridge, Massachusetts (1972)Google Scholar
  2. 2.
    Shingala, P.V.: Enhancing the relevance of information retrieval by querying the database in natural form. In: International Conference on Intelligent Systems and Signal Processing (ISSP), pp. 408–412. IEEE (2013)Google Scholar
  3. 3.
    Khalid, M.A., Jijkoun, V., de Rijke, M.: Machine learning for question answering from tabular data. In: 18th International Workshop on Database and Expert Systems Applications, pp. 392–396, DEXA (2007)Google Scholar
  4. 4.
    Dua, M., Kumar, S., Virk, Z.S.: Hindi language graphical user interface to database management system. In: 12th International Conference on Machine Learning and Applications, pp. 549–554. IEEE, Florida (2013)Google Scholar
  5. 5.
    Kumar, R., Dua, M., Jindal, S.: Domain-independent Hindi language interface to relational database. In: International Conference on Computation of Power, Energy, Information and Communication, pp. 81–86. IEEE (2014)Google Scholar
  6. 6.
  7. 7.
  8. 8.
    Guo, G., Wang, H., Bell, D.J., Bi, Y., Greer, K.: KNN model-based approach in classification. In: Meersman, R., Schmidt, D.C. (eds.) CoopIS/DOA/ODBASE 2003. LNCS, vol. 2888, pp. 986–996. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  9. 9.
    Marukatat, S., Methasate, I.: Fast nearest neighbor retrieval using randomized binary codes and approximate euclidean distance. J. Pattern Recognit. Lett. 34, 1101–1107 (2013). ElsevierCrossRefGoogle Scholar
  10. 10.
    Sarawagi, S., Kirpal, A.: Efficient set joins on Similarity Predicates. In: International Conference on Management of Data (SIGMOD 2004), pp. 743–754. ACM (2004)Google Scholar
  11. 11.
    Ye, J.: Multicriteria Decision-Making method using the Dice Similarity Measure based on the Reduct Intuitionistic Fuzzy sets of Interval-Valued Intuitionistic Fuzzy Sets. J. Appl. Math. Modell. 36, 4466–4472 (2012). ElsevierMathSciNetCrossRefzbMATHGoogle Scholar
  12. 12.
    Sahami, M., Heilman, T.: A web-based kernel function for measuring the similarity of short text snippets. In: 15th International Conference on World Wide Web, pp. 377–386. ACM (2006)Google Scholar
  13. 13.
    Mott, R.: Maximum likelihood estimation of the statistical distribution of smith waterman local sequence similarity scores. Bull. Math. Biol. 54, 59–75 (1992). SpringerCrossRefzbMATHGoogle Scholar
  14. 14.
    Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for matching names and records. In: American Association for Artificial Intelligence (2003)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.Department of Computer EngineeringNational Institute of TechnologyKurukshetraIndia

Personalised recommendations