Skip to main content

Part of the book series: SpringerBriefs in Statistics ((BRIEFSSTATIST))

  • 502 Accesses

Abstract

This chapter introduces the general design characteristics of PRESEMT and provides a detailed description of all resources required as well as all pre-processing steps needed, such as corpora processing and model creation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Ganchev K, Gillenwater J, Taskar B (2009) Dependency grammar induction via bitext projection constraints. In: Proceedings of the 47th Annual Meeting of the ACL, Singapore, 2–7 August, pp 369–377

    Google Scholar 

  • Koehn P (2005) Europarl: a parallel corpus for statistical machine translation. MT Summit 2005, Phuket, Thailand

    Google Scholar 

  • Lafferty J, McCallum A, Pereira F (2001) Conditional random fields: probabilistic models for segmenting and labelling sequence data. In: 28th International Conference on Machine Learning, ICML 2011, Bellevue, Washington, USA, pp 282–289

    Google Scholar 

  • Markantonatou S, Sofianopoulos S, Giannoutsou O, Vassiliou M (2009) Hybrid machine translation for low- and middle-density languages. In: Nirenburg S (ed) Language engineering for lesser-studied languages. IOS Press, pp 243–274

    Google Scholar 

  • McCallum AK (2002) MALLET: a machine learning for language toolkit. http://mallet.cs.umass.edu

  • Munteanu DS, Marcu D (2005) Improving machine translation performance by exploiting non-parallel corpora. Comput Linguist 31(4):477–504

    Google Scholar 

  • Och FJ, Ney H (2004) The alignment template approach to statistical machine translation. Comput Linguist 30(4):417–449

    Article  MATH  Google Scholar 

  • Pomikálek J, Rychlý P (2008) Detecting co-derivative documents in large text collections. In: Proceedings of LREC2008, Marrakech, Morrocco, pp 1884–1887

    Google Scholar 

  • Prokopidis P, Georgantopoulos B, Papageorgiou H (2011) A suite of NLP tools for Greek. In: Proceedings of the 10th ICGL Conference, Komotini, Greece pp 373–383

    Google Scholar 

  • Schmid H (1994) Probabilistic part-of-speech tagging using decision trees. In: Proceedings of International Conference on New Methods in Language Processing, Manchester, UK, pp 44–49

    Google Scholar 

  • Sha F, Pereira FCN (2003) Shallow parsing with conditional random fields. In: Proceedings of HLT-NAACL Conference, pp 213–220

    Google Scholar 

  • Tambouratzis G, Simistira F, Sofianopoulos S, Tsimboukakis N, Vassiliou M (2011) A resource-light phrase scheme for language-portable MT. In: Proceedings of the 15th International Conference of the European Association for Machine Translation, 30–31 May, Leuven, Belgium, pp 185–192

    Google Scholar 

  • Tambouratzis G, Troullinos M, Sofianopoulos S, Vassiliou M (2012) Accurate phrase alignment in a bilingual corpus for EBMT systems. In: Proceedings of the 5th BUCC Workshop, held within the LREC-2012 Conference, May 26, Istanbul, Turkey, pp 104–111

    Google Scholar 

  • Tambouratzis G, Sofianopoulos S, Vassiliou M (2013) Language-independent hybrid MT with PRESEMT. In: Proceedings of HYTRA-2013 Workshop, held within the ACL-2013 Conference, Sofia, Bulgaria, 8 August, pp 123–130

    Google Scholar 

  • Tsuruoka Y, Tsujii J, Ananiadou S (2009) Fast full parsing by linear-chain conditional random fields. In: Proceedings of the 12th EACL Conference, 30 March–3 April, Athens, Greece, pp 790–798

    Google Scholar 

  • Wallach HM (2004) Conditional random fields: an introduction. University of Pennsylvania CIS Technical Report, MS-CIS-04-21, February 24

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to George Tambouratzis .

Rights and permissions

Reprints and permissions

Copyright information

© 2017 The Author(s)

About this chapter

Cite this chapter

Tambouratzis, G., Vassiliou, M., Sofianopoulos, S. (2017). Implementation. In: Machine Translation with Minimal Reliance on Parallel Resources. SpringerBriefs in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-63107-3_2

Download citation

Publish with us

Policies and ethics