Abstract
This paper sets forth some ideas for the evolution of translation tools in the near future. The proposed improvements consist in a closer integration of terminology and sentence databases. In particular, we suggest that bilingual sentence databases (translation memories) could be refined by splitting sentences into large “bricks” of text. We also propose a mechanism through which bilingual sentence databases could be generalized by replacing the known terms by variable placeholders, thus yielding technical sentence “skeletons”.
This latter idea is the most original and, in our view, the most promising for future implementations. Although these ideas are not yet supported by experiments, we believe that they can be implemented using simple techniques, following the general philosophy that such tools should go as far as possible while remaining robust and useful for the human translator.
Similar content being viewed by others
References
Brown, P., Della Pietra, S., Della Pietra, V., and Mercer, R.: 1993, 'The Mathematics of Machine Translation: Parameter Estimation', Computational Linguistics 19, 263–311.
Dagan, I. and Church, K.: 1997, 'Termight: Coordinating Man and Machine in Bilingual Terminology Acquisition', this issue, pp. 89–107.
Daille, B., Gaussier, E., and Langé, J.-M.: 1994, 'Towards Automatic Extraction of Monolingual and Bilingual Terminology', in Proceedings of the 15th International Conference on Computational Linguistics, COLING-94, Kyoto, Japan, pp. 515–521.
Gale, W. and Church, K.: 1991, 'Identifying Word Correspondences in Parallel Texts', in Proceedings of the Fourth DARPA Speech and Natural Language Workshop, Pacific Grove, California.
Gaussier, E. and Langé, J.-M.: 1994, 'Some Methods for the Extraction of Bilingual Terminology', in International Conference on New Methods in Language Processing (NeMLaP), Manchester, UK, pp. 224–228.
Gaussier, E. and Langé, J.-M.: 1995, 'Modèles statistiques pour l'extraction de lexiques bilingues', T.A.L. (Traitement automatique des langues) 36, 133–155.
Gaussier, E., Langé, J.-M., and Meunier, F.: 1992, 'Towards Bilingual Terminology', in Proceedings of the ALLC/ACH Conference, Oxford, England, pp. 121–124.
Isabelle, P., Dymetman, M., Foster, G., Jutras, J.-M., Macklovitch, E., Perrault, F., Ren, X., and Simard, M.: 1993, 'Translation Analysis and Translation Automation', in Proceedings of the Fifth International Conference on Theoretical and Methodological Issues in Machine Translation TMI'93, Kyoto, Japan, pp. 201–217.
Langé, J.-M. and Bonnet, E.: 1994, 'The Multiple Uses of Parallel Corpora', in Proceedings of Teaching and Language Corpora (TALC' 94), Lancaster, U.K.
Macklovitch, E.: 1994, 'Using Bi-textual Alignment for Translation Validation: the TransCheck System', in Technology Partnerships for Crossing the Language Barrier: Proceedings of the First Conference of the Association for Machine Translation in the Americas, Columbia, Maryland, pp. 157–168.
Macklovitch, E.: 1995, 'Qu'est-ce que c'est au juste que la cohérence terminologique', in Actes des IVèmes Journées scientifiques de l'AUPELF-UREF: Lexicomatiques et dictionnairiques, Lyon, France.
Meunier, F.: 1993, Découpage de phrases et alignement de sous-phrases dans un corpus bilingue, Rapport de DEA en Informatique Fondamentale, Université Paris 7.
Sato, S. and Nagao, M.: 1990, 'Towards Memory-Based Translation', in COLING-90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 3, 247–252.
Simard, M., Foster, G., and Isabelle, P.: 1992, 'Using Cognates to Align Sentences in Bilingual Corpora', in Fourth International Conference on Theoretical and Methodological Issues in Machine Translation: Empiricist vs. Rationalist Methods in MT. TMI-92, Montréal, Canada, pp. 67–82.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Langé, JM., Gaussier, É. & Daille, B. Bricks and Skeletons: Some Ideas for the Near Future of MAHT. Machine Translation 12, 39–51 (1997). https://doi.org/10.1023/A:1007918521219
Issue Date:
DOI: https://doi.org/10.1023/A:1007918521219