Idioms Modeling in a Computer Ontology as a Morphosyntactic Disambiguation Strategy

The Case of Tibetan Corpus of Grammar Treatises
  • Alexei Dobrov
  • Anastasia Dobrova
  • Pavel Grokhovskiy
  • Maria Smirnova
  • Nikolay Soms
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11107)


The article describes the development of a computer ontology as one of the tools for processing Tibetan idioms. A computer ontology that contains a consistent specification of the meanings of lexical units, together with the various relations between them, serves as a model of lexical semantics and of both syntactic and semantic valencies, reflecting the Tibetan linguistic picture of the world. The article presents an attempt to classify Tibetan idioms, including compounds, which are idiomatized clips of syntactic groups that have frozen inner syntactic relations and are often characterized by the omission of grammatical morphemes. It then applies this classification to idiom processing in the computer ontology. The article also proposes methods of using the computer ontology to avoid ambiguity in idiom processing.


Tibetan language; Idioms; Compounds; Computer ontology; Tibetan corpus; Natural language processing; Corpus linguistics; Immediate constituents



This work was supported by the Russian Foundation for Basic Research, Grant No. 16-06-00578, “Morphosyntactic analyser of texts in the Tibetan language”.



Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Alexei Dobrov (1)
  • Anastasia Dobrova (2)
  • Pavel Grokhovskiy (1)
  • Maria Smirnova (1), corresponding author
  • Nikolay Soms (2)
  1. Saint-Petersburg State University, Saint-Petersburg, Russia
  2. LLC “AIIRE”, Saint-Petersburg, Russia
