Abstract
The article presents the experience of developing computer ontology as one of the tools for Tibetan idioms processing. A computer ontology that contains a consistent specification of meanings of lexical units with different relations between them represents a model of lexical semantics and both syntactic and semantic valencies, reflecting the Tibetan linguistic picture of the world. The article presents an attempt to classify Tibetan idioms, including compounds, which are idiomatized clips of syntactic groups that have frozen inner syntactic relations and are often characterized by omission of grammatical morphemes; and the application of this classification for idioms processing in computer ontology. The article also proposes methods of using computer ontology for avoiding idioms processing ambiguity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Following the definition of Krippendorf, combinatorial explosion is understood here as a situation “when a huge number of possible combinations are created by increasing the number of entities which can be combined” [4]. As applied to parsing, these are cases of exponential growth in the number of parsing versions as the length of the parsed text and, thus, the amount of its parsed ambiguous fragments increase.
- 2.
These 3 interpretations represent 3 groups of versions that arose only because of incorrect combinations of adjuncts, each group consisting of 18 versions (2 for ring-lugs, multiplied by 3 for dpal-yon and by 3 for mdzes-chos rig-pa). Thus, the total amount was 54.
- 3.
Example (9) has 32 versions of parsing. These 3 interpretations represent groups of versions that arose only because of incorrect designating of subject for the verb bsu ‘to invite’.
References
Grokhovskii, P.L., Zakharov, V.P., Smirnova, M.O., Khokhlova, M.V.: The corpus of tibetan grammatical works. In: Automatic documentation and mathematical linguistics, vol. 49, no. 5, pp. 182–191 (2015). https://doi.org/10.3103/S0005105515050064
Gruber, T.R.: A translation approach to portable ontology specifications (PDF). Knowl. Acquis. 5(2), 199–220 (1993). https://doi.org/10.1006/knac.1993.1008
Aho, A.V., Corasick, M.J.: Efficient string matching: an aid to bibliographic search. Commun. ACM 18(6), 333–340 (1975)
Krippendorff, K.: Combinatorial Explosion. Web Dictionary of Cybernetics and Systems. http://pespmc1.vub.ac.be/ASC/Combin_explo.html. PRINCIPIA CYBERNETICA WEB
Dobrov, A.V.: Semantic and ontological relations in AIIRE natural language processor. Comput. Model. Bus. Eng. Domains. Rzeszow-Sofia: ITHEA, 147–157 (2014)
Miller, G.A., Beckwith, R., Fellbaum, C.D., Gross, D., Miller, K.: WordNet: an online lexical database. Int. J. Lexicograph. 3(4), 235–244 (1990)
Melcuk, I.: Phrasemes in language and phraseology in linguistics. In: Everaert, M., Van der Linden, E.J., Schenk, A., Schreuder, R. (eds.) Idioms: Structural and Psychological Perspectives, pp. 167–232. Lawrence Erlbaum, New Jersey (1995)
Pelletier, F.J.: The principle of semantic compositionality. Topoi 13, 11 (1994)
Beyer, S.: The Classical Tibetan Language. State University of New York, New York (1992)
Acknowledgment
This work was supported by the Russian Foundation for Basic Research, Grant No. 16-06-00578 Morphosyntactycal analyser of texts in the Tibetan language.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Dobrov, A., Dobrova, A., Grokhovskiy, P., Smirnova, M., Soms, N. (2018). Idioms Modeling in a Computer Ontology as a Morphosyntactic Disambiguation Strategy. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2018. Lecture Notes in Computer Science(), vol 11107. Springer, Cham. https://doi.org/10.1007/978-3-030-00794-2_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-00794-2_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00793-5
Online ISBN: 978-3-030-00794-2
eBook Packages: Computer ScienceComputer Science (R0)