Statistical Machine Translation Using the Self-Organizing Map

  • V. F. López
  • J. M. Corchado
  • J. F. De Paz
  • S. Rodríguez
  • J. Bajo
Conference paper
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 79)


The paper describes a contextual environment using the Self-Organizing Map, which can model a semantic agent (SOMAgent) that learns the correct meaning of a word used in context in order to deal with specific phenomena such as ambiguity, and to generate more precise alignments that can improve the first choice of the Statistical Machine Translation system giving linguistic knowledge.


Machine Translation Statistical Machine Translation Parallel Corpus Word Alignment Precise Alignment 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Brown, P.F., Della Pietra, V.J., Della Pietra, S.A., Mercer, R.: The mathematics of statistical machine translation: parameter estimation. Comput. Linguist. 19(2), 263–311 (1993)Google Scholar
  2. 2.
    Casacuberta, F., Vidal, E., Vilar, J.M.: Architectures for speech-to-speech translation using finite-state models. In: Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems, pp. 39–44 (2002)Google Scholar
  3. 3.
    Chappelier, C., Rajman, M.: A generalized CYK algorithm for parsing stochastic CFG. In: First Workshop on Tabulation in Parsing and Deduction (TAPD 1998), Paris, pp. 133–137 (1998)Google Scholar
  4. 4.
    Charniak, E.: A maximum entropyinspired parser. In: Proceedings of NAACL 2000, pp. 132–139 (2000)Google Scholar
  5. 5.
    Charniak, J.: Learning non-isomorphic tree mappings for machine translation. In: Proceedings of ACL 2003, (Compain Volume) pp. 205–208 (2003)Google Scholar
  6. 6.
    Chiang, D.: A hierarchical phrasebased model for statistical machine translation. In: Proceedings of ACL 2005, pp. 263–270 (2005)Google Scholar
  7. 7.
    Chiang, D.: Hierarchical phrase based translation. Computational Linguistics (2007)Google Scholar
  8. 8.
    Doddington, G.: Automatic evaluation of machine translation quality using n-gram cooccurrence statistics. In: Proceedings ARPA Workshop on Human Language Technology (2002)Google Scholar
  9. 9.
    Honkela, T.: Philosophical Aspects of Neural, Probabilistic and Fuzzy Modeling of Language Use and Translation. In: International Joint Conference on Neural Networks, IJCNN 2007, pp. 2881–2886 (2007)Google Scholar
  10. 10.
    Koehn, P., Och, F.J., Marcu, D.: Statistical phrase-based translation. In: NAACL 2003: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, pp. 48–54. Association for Computational Linguistics, Morristown (2003)CrossRefGoogle Scholar
  11. 11.
    Kohonen, T.: Self-organized Formation of Topologically Correct Feature Maps. In: Neurocomputing, pp. 511–522. The MIT Press, Cambridge (1990)Google Scholar
  12. 12.
    Kohonen, T.: Self-organized Maps. Proceedings of the IEEE 78(9), 1464–1480 (1990)CrossRefGoogle Scholar
  13. 13.
    López, V., Alonso, L., Moreno, M.: A SOMAgent for Identification of Semantic Classes and Word Disambiguation. In: 7th International Conference on Practical Applications of Agents and Multi-Agent Systems (PAAMS 2009). Advances in Intelligent and Soft Computing, vol. 55, pp. 207–215 (2009) ISBN: 978-3-642-00486-5Google Scholar
  14. 14.
    Marcu, D., Wong, W.: A Phrase-Based, Joint Probability Model for Statistical Machine Translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, pp. 133–139 (2002)Google Scholar
  15. 15.
    Mariño, J.B., Banchs, R.E., Crego, J.M., Gispert, A., de Lambert, F.P., Costa-jussá, M.R.: N-gram based machine translation. Computational Linguistics 32(4), 527–549 (2006)CrossRefMathSciNetGoogle Scholar
  16. 16.
    Melamed, I.D.: Statistical machine translation by parsing. In: Proceedings of ACL 2004, pp. 111–114 (2004)Google Scholar
  17. 17.
    Och, F., Ney, H.: A systematic comparison of various statistical alignment models. Computational Linguistics 29(1), 19–52 (2003)CrossRefGoogle Scholar
  18. 18.
    Papineni, K., Roukos, S., Ward, T., Zhu, W.-J.: BLUE: a method for automatic evaluation of machine translation. In: Proceedings of the Annual Meeting of the Association for Compuational Linguistics, ACL (2002)Google Scholar
  19. 19.
    Picó, D.: Combining Statistical and Finite-State Methods for Machine Translation. Thesis for the degree of doctor.Universitat Politécnica de Valéncia. Departament de Sistemes Informátics I Computació. Spain (2005)Google Scholar
  20. 20.
    Strube, V.L., Carneiro, P.R., Filho, I.: Distributing linguistic knowledge in a multiagent natural language processing system: re-modelling the dictionary. Procesamiento del lenguaje natura 23, 104–109 (1998)Google Scholar
  21. 21.
    Venugopal, A., Zollmann, A., y Vogel, S.: An Efficient Two-Pass Approach to Synchronous-CFG Driven Statistical MT. In: Proceedings of HLT/NAACL 2007, pp. 500–507 (2007)Google Scholar
  22. 22.
    Zollmann, A., Venugopal, A.: Syntax augmented machine translation via chart parsing. In: Proceedings of NAACL 2006 (2006)Google Scholar
  23. 23.
    Yamada, K., Knight, K.: A decoder for syntax-based statistical MT.In: Annual Meeting of the ACL. Proceedings of the 40th Annual Meeting on Association for Computational (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • V. F. López
    • 1
  • J. M. Corchado
    • 1
  • J. F. De Paz
    • 1
  • S. Rodríguez
    • 1
  • J. Bajo
    • 1
  1. 1.Dept. Informática y AutomáticaUniversity of SalamancaSalamancaSpain

Personalised recommendations