Abstract
When computers are confronted with text (C.A.I., lexicography, machine translation, information retrieval, office automation, etc.) spelling-error bottlenecks greatly decrease the efficiency of systems. In this paper, we demonstrate how spelling errors can be efficiently handled from an algorithmic, computer science point of view. We first show that the various types of spelling errors must receive different treatments, and we especially focus on phonographic errors, on which relatively little work has been done. We provide some quantitative data on the phonographic structure of language and develop a mathematical framework for modeling the various types of errors, including phonographic ones. Finally, we outline an application to fast retrieval of misspelt words in dictionaries, which can be implemented in natural-language interfaces to make computers interact more gracefully with non-specialists.
Similar content being viewed by others
References
Bellman, R. E. Dynamic Programming. Princeton University Press: Princeton, NJ, 1957.
Blanche-Benveniste, C., Chervel, A. L'orthographe. Paris: Maspéro, 1969.
Blair, Ch. R. A Program for Correcting Spelling Errors. Information and Control, 3 (1960), 60–67.
Catach, N. Que faut-il entendre par système graphique du français? Langue Française, 20 (1973), 30–44.
Catach, N. La phonetisation automatique du françis. Paris: CNRS, 1984.
Damerau, D. N. A Technique for Computer Detection, and Correction of Spelling Errors. Comm. A.C.M., 7, 3 (1964), 171–76.
Davidson, L. Retrieval of Mispelled Names in an Airline's Passenger Record System. Comm. A.C.M., 5, 3 (1962), 169–71.
Durham, I., Lamb, D. A., Saxe, J. B. Spelling Correction in User Interfaces. Comm. A.C.M., 26, 10 (1983), 764–73.
Gak, V. G. Francuzkaya ortografia. Moscow, 1959. Trans. L'orthographe du français. Paris: SELAF, 1976.
Gak, V. G. Ortografia v svete strukturnojo analasia. In Problemi strukturnoi lingvistiki. Moscow, 1962.
Horejsi, V. Analyse structurale de 1'orthographe française. Philologica Praegensia, V (1962), 225–36.
Horejsi, V. Formes parlées, formes écrites et système orthographique des langues. Folia Linguistica, V, 1/2 (1970), 195–203.
Horejsi, V. Les graphonèmes en français et leurs parties composantes. Etudes de Linguistique Appliquée, 8 (1972), 10–17.
Lowrance, R., Wagner, R. A. An Extension of the String-toString Correction Problem. Journal A.C.M., 22, 2 (1975), 177–83.
Morgan, H. L. Spelling Correction in System Programs. Comm. A.C.M., 13, 2 (1970), 90–94.
Odell, M. K., Russell, R.. C. U.S. Patent nos. 1, 261, 167 (1918) and 1, 435, 663 (1922).
Peterson, J. L. Computer Programs for Detecting and Correcting Spelling Errors. Comm. A.C.M., 23, 12 (1980), 676–87.
Pollock, J. J. Spelling Error Detection and Correction by a Computer: Some Notes and a Bibliography. J. Doc., 38, 4 (1982), 282–91.
Pollock, J. J., Zamora, A. Collection and Characterization of Spelling Errors in Scientific and Scholarly Texts. Journal of the American Society for Information Science, 34, 1 (1983), 51–58.
Thimonnier, R. Le système graphique du français. Paris: Plon, 1967.
Thimonnier, R. Code orthographique et grammatical. Paris: Hatier, 1970.
Veronis, J. Etude quantitative sur le système graphique et phonographique du français. Cahiers de Psychologie Cognitive, 6, 5 (1986), 501–31.
Wagner, C. K., Fischer, M. J. The String-to-String Correction Problem. Journal A.C.M., 21, 1 (1974), 168–73.
Author information
Authors and Affiliations
Additional information
Jean Veronis is a researcher in the Groupe Représentation et Traitement des Connaissances at the Centre National de la Recherche Scientifique in Marseille.
Rights and permissions
About this article
Cite this article
Veronis, J. Computerized correction of phonographic errors. Comput Hum 22, 43–56 (1988). https://doi.org/10.1007/BF00056348
Issue Date:
DOI: https://doi.org/10.1007/BF00056348