Advertisement

Machine Translation

, Volume 15, Issue 4, pp 267–294 | Cite as

Unit Completion for a Computer-aided Translation Typing System

  • Philippe Langlais
  • George Foster
  • Guy Lapalme
Article

Abstract

This work is in the context of, a system thatwatches over the users as they type a translation andrepeatedly suggests completions for the text already entered.The users may either accept, modify, or ignore these suggestions. Wedescribe the design, implementation, and performance of aprototype which suggests completions of units of texts that arelonger than one word.

interactive machine translation machine-aided human translation statistical language models statistical translation models target-text mediation word completion 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berger, Adam L., Stephen A. Della Pietra, and Vincent J. Della Pietra: 1996,‘A Maximum Entropy Approach to Natural Language Processing’, Computational Linguistics 22, 39–71.Google Scholar
  2. Blanchon, Hervé: 1991, ‘Problè mes de désambiguï sation interactive et TAO personnelle’ [Problems with Interactive Disambiguation and Personal MT], L'environnement traductionnel, journées scientifiques du réseau thématique de recherche “Lexicologie, terminologie, traduction”, Mons, pp. 31–48.Google Scholar
  3. Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra and Robert L. Mercer: 1993, ‘The Mathematics of Machine Translation: Parameter Estimation’, Computational Linguistics 19, 263–312.Google Scholar
  4. Brown, Ralf D. and Sergei Nirenburg: 1990, ‘Human-Computer Interaction for Semantic Disambiguation’, COLING 90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 3, pp. 42–47.Google Scholar
  5. Chen, Kuang Hua and Hsin-Hsi Chen: 1994, ‘Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and its Automatic Evaluation’, 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp. 234–241.Google Scholar
  6. Derouault, A.-M. and B. Merialdo: 1986, ‘Natural Language Modeling for Phoneme-to-Text Transcription’, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 8, 742–749.Google Scholar
  7. Dunning, Ted: 1993, ‘Accurate Methods for the Statistics of Surprise and Coincidence’, Computational Linguistics 19, 61–74.Google Scholar
  8. Evans, David A. and Chengxiang Zhai: 1996, ‘Noun-Phrase Analysis in Unrestricted Text for Information Retrieval’, 34th Annual Meeting of the Association for Computational Linguistics, Santa Cruz, CA, pp. 17–24.Google Scholar
  9. Foster, George, Pierre Isabelle and Pierre Plamondon: 1997, ‘Target-Text Mediated Interactive Machine Translation’, Machine Translation 12, 175–194.Google Scholar
  10. Fung, Pascale: 1995, ‘A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora’, 33rd Annual Meeting of the Association for Computational Linguistics, Cambridge, MA, pp. 236–243.Google Scholar
  11. Furuse, Osamu and Hitoshi Iida: 1996, ‘Incremental Translation Utilizing Constituent Boundary Patterns’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 412–417.Google Scholar
  12. Gaussier, Éric: 1995, Modè les statistiques et patrons morphosyntaxiques pour l'extraction de lexiques bilingues [Statistical Models and Morphosyntactic Templates for the Extraction of Bilingual Lexicons], Thè se de doctorat, Université de Paris 7.Google Scholar
  13. Haruno, Masahiko, Satoru Ikehara and Takefumi Yamazaki: 1996, ‘Learning Bilingual Collocations by Word-Level Sorting’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 525–530.Google Scholar
  14. Ikehara, Satorn, Satoshi Shirai and Hajime Uchino: 1996, ‘A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora’, COLING-96: The 16 th International Conference on Computational Linguistics, Copenhagen, pp. 574–579.Google Scholar
  15. Jelinek, Frederick: 1990, ‘Self-Organized Language Modeling for Speech Recognition’, in A. Waibel and K. Lee (eds), Readings in Speech Recognition, Morgan Kaufmann, San Mateo, CA, pp. 450–506.Google Scholar
  16. Kaji, Hiroyuki and Toshiko Aizono: 1996, ‘Extracting Word Correspondances from Bilingual Corpora Based on Word Co-Occurence Information’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 23–28.Google Scholar
  17. Kay, Martin: 1973, The MIND System', in R. Rustin (ed.) Natural Language Processing, Algorithmics Press, New York, pp. 155–188.Google Scholar
  18. Kuhn, Roland and Renato De Mon: 1990, ‘A Cache-Based Natural Language Model for Speech Recognition’, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 12, 570–583.Google Scholar
  19. Kupiec, Julian: 1993, ‘An Algorithm for Finding Noun Phrase Correspondences in Bilingual Corpora’, 31st Annual Meeting of the Association for Computational Linguistics, Columbus, OH, pp. 17–22.Google Scholar
  20. Langlais, Philippe and George Foster: 2000, ‘Using Context-Dependent Interpolation to Combine Statistical Language and Translation Models for Interactive Machine Translation’, Proceedings of the 6th International Conference on Computer-Assisted Information Retrieval (RIAO), Paris, pp. 507–519.Google Scholar
  21. Langlais, Philippe, Sébastien Sauvé, George Foster, Elliott Macklovitch, and Guy Lapalme: 2000, ‘Evaluation of TransType, a Computer-aided Translation Typing System: A Comparison of a Theoretical-and a User-Oriented Evaluation Procedures’, LREC 2000 Second International Conference on Language Resources and Evaluation, Athens, pp. 641–648.Google Scholar
  22. Lin, Dekang: 1999, ‘Automatic Identification of Non-Compositional Phrases’, 37th Annual Meeting of the Association for Computational Linguistics, College Park, MD, pp. 317–324.Google Scholar
  23. Maruyama, Hiroshi, Hideo Watanabe and Shiho Ogino: 1990, ‘An Interactive Japanese Parser for Machine Translation’, COLING-90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 2, pp. 257–262.Google Scholar
  24. Melamed, I. Dan: 1997, ‘Automatic Discovery of Non-Compositional Compounds in Parallel Data’, Second Conference on Empirical Methods in Natural Language Processing, Providence, RI, pp. 97–108.Google Scholar
  25. Nagao, Makoto and Shinsuke Mon: 1994, ‘A New Method of n-Gram Statistics for Large Number of n and Automatic Extraction of Words and Phrases from Large Text Data of Japanese’, COLING 94: The 15th International Conference on Computational Linguistics, Kyoto, pp. 611–615.Google Scholar
  26. Nieß en, S., S. Vogel, H. Ney and C. Tillman: 1998, ‘A DP Based Search Algorithm for Statistical Machine Translation’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 960–966.Google Scholar
  27. Och, Franz Josef, Christoph Tillman and Herman Ney: 1999, ‘Improved Alignment Models for Statistical Machine Translation’, Proceedings of the 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, College Park, MD, pp. 20–28.Google Scholar
  28. Och, Franz Josef and Hans Weber: 1998, ‘Improving Statistical Natural Language Translation with Categories and Rules’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 985–989.Google Scholar
  29. Ohomori, Kumiko and Masanobu Higashida: 1999, ‘Extracting Bilingual Collocations from Non-Aligned Parallel Corpora’, Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 99), Chester, pp. 88–97.Google Scholar
  30. Rapp, Reinhard: 1999, ‘Automatic Identification of Word Translations from Unrelated English and German Corpora’, 37th Annual Meeting of the Association for Computational Linguistics, College Park, MD, pp. 519–526.Google Scholar
  31. Russell, Graham: 1998, Identification of Salient Token Sequences, Internal report, RALI, University of Montreal.Google Scholar
  32. Shimohata, Sayori, Toshiyuki Sugio and Junji Nagata: 1997, ‘Retrieving Collocations by Cooccurrences and Word Order Constraints’, 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 476–481.Google Scholar
  33. Su, Keb-Yih, Ming-Wen Wu and Jing-Shin Chang: 1994, ‘A Corpus-Based Approach to Automatic Compound Extraction’, 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp. 242–247.Google Scholar
  34. Tanaka, Kumiko and Hideya Iwasaki: 1996, ‘Extraction of Lexical Translations from Non-Aligned Corpora’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 580–585.Google Scholar
  35. Tanaka, Takaaki and Yoshihiro Matsuo: 1999, ‘Extraction of Translation Equivalents from Non-Parallel Corpora’, Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 99), Chester, pp. 109–119.Google Scholar
  36. Tillman, C., S. Vogel, H. Ney and A. Zubiaga: 1997, ‘A DP Based Search Using Monotone Alignments in Statistical Translation’, 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 289–296.Google Scholar
  37. Wang, Ye-Yi and Alex Waibel: 1997, ‘Decoding Algorithm in Statistical Machine Translation’, 35 th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 366–372.Google Scholar
  38. Wang, Ye-Yi and Alex Waibel: 1998a, ‘Fast Decoding for Statistical Machine Translation’, 5th International Conference on Spoken Language Processing, ICSLP'98 Proceedings, Sydney, pp. 2775–2779.Google Scholar
  39. Wang, Ye-Yi and Alex Waibel: 1998, ‘Modeling with Structures in Statistical Machine Translation’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 1357–1363.Google Scholar
  40. Whitelock, P. J., M. McGee Wood, B. J. Chandler, N. Holden and H. J. Horsfall: 1986, ‘Strategies for Interactive Machine Translation: The Experience and Implications of the UMIST Japanese Project’, 11th International Conference on Computational Linguistics: Proceedings of Coling '86, Bonn, pp. 329–334.Google Scholar
  41. Wu, Dekai: 1995, ‘Stochastic Inversion Transduction Grammars, with Application to Segmentation, Bracketing, and Alignment of Parallel Corpora’, Fourteenth International Joint Conference on Artificial Intelligence, Montreal, pp. 1328–1335.Google Scholar
  42. Wu, Dekai and Hongsing Wong: 1998, ‘Machine Translation with a Stochastic Grammatical Channel’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 1408–1414.Google Scholar

Copyright information

© Kluwer Academic Publishers 2000

Authors and Affiliations

  • Philippe Langlais
    • 1
  • George Foster
    • 2
  • Guy Lapalme
    • 3
  1. 1.RALI/DIROUniversité de Montré alMontréCanada
  2. 2.RALI/DIROUniversité de Montré alMontréCanada
  3. 3.RALI/DIROUniversité de Montré alMontréCanada

Personalised recommendations