An Interactive Knowledge Maintenance Algorithm for Recasting WordNet Synonym-Set Definitions into Lojbanic Primitives, then into Lojbanic English

  • Conference paper
Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety (BDAS 2018)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 928)

Abstract

Lojban, a constructed interlingua, has a small number of predicates: 1342 in total. In our prior, independent work on using lojban for machine translation in parallel, the number of test cases for converting English into lojbanic English (our extension) was too small, because lojban lacks a rich vocabulary. A procedure exists for creating new, composite lojban predicates in terms of existing predicates. Although workable, it lacks an architectural framework that is consistent with English word-sense usage and that enables efficient production of thousands of new predicates.

To address lojban’s insufficient vocabulary, we developed an interactive algorithm that recasts the definitions of WordNet synonym sets (synsets) into existing lojban primitive predicates. The output is expressed in our lojbanic English: the final product is a new dictionary of synset definitions with lojban semantics, written in lojbanic English. A linguistic engineer lojbanizes each synset’s definition interactively. As a result, the synsets are well defined and unambiguous, with no circular definitions and no confusion, because lojban primitives have these properties.
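The interactive recasting described above can be pictured with a minimal sketch. It is not the authors' algorithm: the function names, the tiny primitive set, the synset ids, and the sample definitions are all hypothetical, and a scripted stand-in replaces the human linguistic engineer. The one idea it illustrates is that a definition accepted only when every predicate in it is an existing primitive cannot refer back to another synset, so no circular definitions can arise.

```python
from typing import Callable, Dict, List, Set, Tuple

# A handful of real lojban primitives, for illustration only:
# gerku = dog, mlatu = cat, danlu = animal, prenu = person, klama = come/go.
PRIMITIVES: Set[str] = {"gerku", "mlatu", "danlu", "prenu", "klama"}


def unknown_predicates(proposal: List[str], primitives: Set[str]) -> List[str]:
    """Return the predicates in a proposal that are not known primitives."""
    return [p for p in proposal if p not in primitives]


def lojbanize(
    synsets: Dict[str, str],
    engineer: Callable[[str, str], List[str]],
    primitives: Set[str] = PRIMITIVES,
) -> Tuple[Dict[str, List[str]], Dict[str, List[str]]]:
    """Ask the engineer to recast each definition; accept only primitive-only proposals.

    Returns (accepted, deferred). Deferred entries map a synset id to the
    unknown predicates that blocked acceptance, for a later interactive pass.
    """
    accepted: Dict[str, List[str]] = {}
    deferred: Dict[str, List[str]] = {}
    for synset_id, definition in synsets.items():
        proposal = engineer(synset_id, definition)
        unknown = unknown_predicates(proposal, primitives)
        if unknown:
            deferred[synset_id] = unknown
        else:
            accepted[synset_id] = proposal
    return accepted, deferred


# Hypothetical ids and definitions, made up for this sketch (not quoted from WordNet).
SYNSETS = {
    "dog": "an animal kept by a person",
    "gadget": "a small mechanical device",
}


def scripted_engineer(synset_id: str, definition: str) -> List[str]:
    """Stand-in for the human engineer; a real session would prompt the user."""
    answers = {
        "dog": ["gerku", "danlu", "prenu"],
        "gadget": ["zzz_not_a_primitive"],
    }
    return answers[synset_id]


recast, deferred = lojbanize(SYNSETS, scripted_engineer)
```

Here `recast` holds the accepted entry for the hypothetical `dog` synset, while `gadget` lands in `deferred` because its proposal uses a predicate outside the primitive set.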

If a relevant subset, e.g., 1/10, of the 116718 unique synset definitions is converted into lojban predicates, the effort would require 1.945 man-years. The algorithm can be run in parallel on distinct subsets, employing many users.
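The effort estimate can be checked with a short calculation. The abstract gives only the definition count, the 1/10 fraction, and the 1.945 man-year total; the per-definition time (20 minutes) and the length of a man-year (2000 working hours) used below are our assumptions, chosen because together they reproduce the stated figure. The paper may have used different inputs.

```python
UNIQUE_DEFINITIONS = 116718    # from the abstract
SUBSET_FRACTION = 1 / 10       # from the abstract
MINUTES_PER_DEFINITION = 20    # ASSUMPTION: not stated in the abstract
HOURS_PER_MAN_YEAR = 2000      # ASSUMPTION: not stated in the abstract


def man_years(definitions: float,
              minutes_each: float = MINUTES_PER_DEFINITION,
              hours_per_year: float = HOURS_PER_MAN_YEAR) -> float:
    """Total effort in man-years for a given number of definitions."""
    return definitions * minutes_each / 60 / hours_per_year


def wall_clock_years(total_man_years: float, users: int) -> float:
    """Elapsed time if the work is split evenly across independent users."""
    return total_man_years / users


subset = UNIQUE_DEFINITIONS * SUBSET_FRACTION   # about 11672 definitions
effort = man_years(subset)                      # about 1.945 man-years
with_ten_users = wall_clock_years(effort, 10)   # about 0.19 years elapsed
```

Under these assumptions, running the algorithm on distinct subsets divides the elapsed time roughly by the number of users, since the subsets do not depend on one another.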

Notes

  1. Aside: j is pronounced zh, like the s in measure. Lojban is not case sensitive; this includes the name lojban itself.

  2. Order is descending by number of speakers.

  3. There are 117,659 total definitions.

  4. In this context, a linguistic engineer is someone who is sufficiently skilled in both computer programming and linguistics, and who is knowledgeable about lojban.

Corresponding author

Correspondence to Luke Immes.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Immes, L., Levkowitz, H. (2018). An Interactive Knowledge Maintenance Algorithm for Recasting WordNet Synonym-Set Definitions into Lojbanic Primitives, then into Lojbanic English. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety. BDAS 2018. Communications in Computer and Information Science, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-319-99987-6_19

  • DOI: https://doi.org/10.1007/978-3-319-99987-6_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99986-9

  • Online ISBN: 978-3-319-99987-6

  • eBook Packages: Computer Science, Computer Science (R0)
