A New Annotation Tool for Aligned Bilingual Corpora

  • Georgios Petasis
  • Mara Tsoumari
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7499)

Abstract

This paper presents a new annotation tool for aligned bilingual corpora. The tool supports a wide range of annotations, from word-level information (such as part-of-speech tags or named entities) to complex annotation schemas that link aligned segments, such as co-reference or translation equivalence between segments in the two languages. It is implemented as a component of the Ellogon language engineering platform and exploits the platform's extensive annotation engine, its cross-platform abilities and, where needed, its linguistic processing components. The tool is distributed under an open-source license (LGPL) as part of the Ellogon language engineering platform.

Keywords

Annotation tools, collaborative annotation, adaptable annotation schemas

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Georgios Petasis (1)
  • Mara Tsoumari (2)
  1. Software and Knowledge Engineering Laboratory, Institute of Informatics and Telecommunications, National Centre for Scientific Research (N.C.S.R.) “Demokritos”, Athens, Greece
  2. School of English, Faculty of Philosophy, Aristotle University of Thessaloniki, Thessaloniki, Greece
