Abstract
Lojban, a constructed interlingua, has a small number of predicates–totaling 1342. In our prior independent work, involving lojban for machine translation in parallel, when converting English into lojbanic English (our extensions), the number of test cases was too small, because of the lack of a rich lojban vocabulary. There is a procedure for creating new, composite, lojban predicates in terms of existing predicates. Although workable, it lacks an architectural framework that is consistent with English word-sense usage, and enables efficient production of thousands of new predicates.
To address lojban’s insufficient vocabulary, we developed an interactive algorithm which will recast WordNet synonym sets’ (synset) definitions into existing lojban primitive predicates. The output is in terms of our lojbanic English. The final output is a new dictionary of synset definitions, with lojban semantics, but using lojbanic English. The linguistic engineer, interactively, lojbanizes a synset’s definition. Thus, the synsets are well defined, unambiguous, and there are no circular definitions–no confusion–because lojban primitives have these properties.
If a relevant subset, e.g., 1 / 10, of the unique synset definitions–totaling 116718, are converted into lojban predicates, then 1.945 man years would be required for this effort. This algorithm can be run in parallel on distinct subsets, employing many users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Aside: j is pronounced as zh as in measure. Lojban is not case sensitive including lojban itself.
- 2.
Order is descending by number of speakers.
- 3.
There are 117, 659 total definitions.
- 4.
In this context, a linguistic engineer is someone who is sufficiently skilled in computer programming as well as linguistics, and knowledgeable of lojban.
References
Cowan, J.: The Complete Lojban Language. Logical Language Group Inc., Fairfax (1997)
Hintz, G.: Semantic parsing using Lojban - on the middle ground between semantic ontology and language (2014). http://www.inf.uni-hamburg.de/en/inst/ab/lt/teaching/theses/completed-theses/2014-ma-hinz.pdf
Maziarz, M., Piasecki, M., Szpakowicz, S.: The chicken and egg problem in wordnet design: synonymy, synsets and constitutive relations. Lang. Resour. Eval. 47 (2013)
McCandless, M.: Searching relational content with Lucene’s BlockJoinQuery. http://blog.mikemccandless.com/2012/01/searching-relational-content-with.html
Nicholas, N.: Lojban as a machine translation inter-language. In: Fourth Pacific Rim International Conference on AI: Workshop ‘Future Issues for Multilingual Text Processing’, Department of Linguistics and Applied Linguistics, University of Melbourne (2006)
Sapir-Whorf Hypothesis. http://www.google.com/search?q=apir+worf+hypothesis&ie=utf-8&oe=utf-8&client=ubuntu&channel=fs
Swan, M.: Practical English Usage, 3rd edn. Oxford University Press, Oxford (2015)
WordNet. http://wordnet.princeton.edu/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Immes, L., Levkowitz, H. (2018). An Interactive Knowledge Maintenance Algorithm for Recasting WordNet Synonym-Set Definitions into Lojbanic Primitives, then into Lojbanic English. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety. BDAS 2018. Communications in Computer and Information Science, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-319-99987-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-99987-6_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99986-9
Online ISBN: 978-3-319-99987-6
eBook Packages: Computer ScienceComputer Science (R0)