Skip to main content

COLLEAP – COntextual Language LEArning Pipeline

  • Conference paper
Advances in Web-Based Learning – ICWL 2013 (ICWL 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8167))

Included in the following conference series:

  • 1795 Accesses

Abstract

In this paper we present a concept as well as a prototype of a tool pipeline to utilize the abundant information available on the World Wide Web for contextual, user driven creation and display of language learning material. The approach is to capture Wikipedia articles of the user’s choice by crawling, to analyze the linguistic aspects of the text via natural language processing and to compile the gathered information into a visually appealing presentation of enriched language information. The tool is designed to address the Japanese language, with a focus on kanji, the pictographic characters used in Japanese scripture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wentland, W., et al.: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008). European Language Resources Association (ELRA), Marrakech (2008)

    Google Scholar 

  2. Judea, A., Nastase, V., Strube, M.: Concept-based selectional preferences and distributional representations from wikipedia articles. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  3. Fujii, A., Fujii, Y., Tokunaga, T.: Effects of document clustering in modeling wikipedia-style term descriptions. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  4. Lin, C.Y., Hovy, E.: The automated acquisition of topic signatures for text summarization. In: Proceedings of the 18th Conference on Computational Linguistics, COLING 2000, vol. 1, pp. 495–501. Association for Computational Linguistics, Stroudsburg (2000)

    Chapter  Google Scholar 

  5. Bond, F., et al.: Enhancing the Japanese WordNet. In: Proceedings of the 7th Workshop on Asian Language Resources, ALR7, pp. 1–8. Association for Computational Linguistics, Stroudsburg (2009)

    Chapter  Google Scholar 

  6. Strube, M., Ponzetto, S.P.: Wikirelate! computing semantic relatedness using wikipedia. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 1419–1424. AAAI Press (2006)

    Google Scholar 

  7. McClain, Y.: Handbook of Modern Japanese Grammar. Hokuseido Press (1981)

    Google Scholar 

  8. Kudo, T.: Mecab: Yet another part-of-speech and morphological analyzer, http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html (last accessed: April 28, 2013)

  9. Denis, A., et al.: Representation of linguistic and domain knowledge for second language learning in virtual worlds. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  10. Moneglia, M., et al.: The IMAGACT cross-linguistic ontology of action. a new infrastructure for natural language disambiguation. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  11. Lefever, E., Hoste, V., Cock, M.D.: Discovering missing wikipedia inter-language links by means of cross-lingual word sense disambiguation. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  12. Breen, J.: Multiple indexing in an electronic kanji dictionary. In: Enhancing and Using Electronic Dictionaries, COLING, Geneva, Switzerland, pp. 1–7 (2004)

    Google Scholar 

  13. Saravanan, K., et al.: An empirical study of the occurrence and co-occurrence of named entities in natural language corpora. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)

    Google Scholar 

  14. Sweller, J., van Merrienboer, J.J., Paas, F.G.: Cognitive architecture and instructional design. Educational Psychology Review 10(3), 251–296 (1998)

    Article  Google Scholar 

  15. Suchanek, F.M., et al.: Yago2s: Modular high-quality information extraction with an application to flight planning. In: BTW, pp. 515–518 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wloka, B., Winiwarter, W. (2013). COLLEAP – COntextual Language LEArning Pipeline. In: Wang, JF., Lau, R. (eds) Advances in Web-Based Learning – ICWL 2013. ICWL 2013. Lecture Notes in Computer Science, vol 8167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41175-5_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41175-5_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41174-8

  • Online ISBN: 978-3-642-41175-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics