Abstract
Measuring dialect distances can be based on the comparison of words, and the comparison words should be based on the comparison of sounds. Inthis research we used an adjusted version of an articulation-based system,developed by Almeida and Braun (1986) for findingsound distances, using the IPA system.For comparison of two pronunciations of a word corresponding with twodifferent varieties, we used the Levenshtein algorithm, which finds theeasiest way in which one word can be changed into the other by inserting,deleting or substituting sounds. As operations weights of these threeoperations we used distances as found with the Almeida & Braun system.The dialect distance is now equal to the average of a range of word distances. We applied the technique to 360 Dutch dialects. The transcriptions of 125 words for each dialect are taken from the Reeks Nederlandse Dialectatlassen (Blancquaert and Peé, 1925–1982). We get a division with clear similarities to traditional dialect maps when classifying dialects. Using logarithmic sound distances improves results compared to results based on constant sound distances.
Similar content being viewed by others
References
Almeida A. (1984) ZurMethodik der Datenaufbereitung in der Linguistik: Das Beispiel phonetischer Transkription. In Berger, L. (ed.), Sprechausdruck, Scriptor, Frankfurt am Main, pp. 111–122.
Almeida A., Braun A. (1985) What is Transcription? In Kürschner, W. and Vogt, R. (eds.), Grammatik, Semantik, Textlinguistik. Akten des 19. Linguistischen Kolloquiums Vechta 1984, Vol. 1, Tübingen, pp. 37–48.
Almeida A., Braun A. (1986) “De Richtig” und “falsch” in phonetischer Transkription; Vorschläge zum Vergleich von Transkriptionen mit Beispielen aus deutschen Dialekten. Zeitschrift für Dialektologie und Linguistik, LIII(2), pp. 158–172.
Bürkle (1986) Zur Validität eines Maßes zur Reliabilitätsbestimmung Phonetisch-segmenteller Transkriptionen. Zeitschrift für Dialektologie und Linguistik, pp. 173–181.
Cucchiarini C. (1993) Phonetic Transcription: A Methodological and Emperical Study. Ph.D. thesis, Katholieke Universiteit Nijmegen, Nijmegen.
Daan J., Blok D. P. (1969) Van Randstad tot Landrand; toelichting bij de kaart: Dialecten en Naamkunde. Noord-Hollandsche Uitgevers Maatschappij, Amsterdam.
Goossens J. (1965) Die niederländische Strukturgeographie und die “Reeks Nederlandse Dialectatlassen”. Bijdragen en mededelingen der Dialectencommissie van de Koninklijke Nederlandse Akademie van Wetenschappen te Amsterdam. N.V. Noord-Hollandsche Uitgevers Maatschappij, Amsterdam.
Heeringa W. (2001) De selectie en digitalisatie van dialecten en woorden uit de Reeks Nederlandse Dialectatlassen. TABU: Bulletin voor taalwetenschap, 31(1/2), pp. 61–103.
Hoppenbrouwers C., Hoppenbrouwers G. (2001) De indeling van de Nederlandse streektalen. Dialecten van 156 steden en dorpen geklasseerd volgens de FFM. Koninklijke Van Gorcum B.V., Assen.
IPA (1999) Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet. Cambrige University Press, Cambrige.
Jain A. K., Dubes R. C. (1988) Algorithms for Clustering Data. Englewood Cliffs, Prentice Hall, New Yersey.
Kessler B. (1995) Computational Dialectology in Irish Gaelic. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, EACL, Dublin, pp. 60–67.
Ladefoged P., Maddieson I. (1996) The Sounds of the World's Languages. Blackwell, Oxford etc.
Nerbonne J., Heeringa W. (1997) Measuring Dialect Distance Phonetically. In Coleman, J. (ed.), Workshop on Computational Phonology, Special Interest Group of the Association for Computational Linguistics, Madrid, pp. 11–18.
Nerbonne J., Heeringa W. (1998) Computationele vergelijking en classificatie van dialecten. Taal en Tongval, Tijdschrift voor Dialectologie, 50(2), pp. 164–193.
Nerbonne J., Heeringa W. (2001) Computational Comparison and Classification of Dialects. Dialectologia et Geolinguistica. Journal of the International Society for Dialectology and Geolinguistics, 2001(9), pp. 69–83.
Nerbonne J., Heeringa W., Kleiweg P. (1999a) Comparison and Classifications of Dialects. In Proceedings of the 9th Meeting of the European Chapter of the Association for Computational Linguistics, Bergen, pp. 281–282.
Nerbonne J., Heeringa W., Kleiweg P. (1999b) Edit Distance and Dialect Proximity. In Sankoff, D. and Kruskal, J. (eds.), Time Warps, String Edits, and Macro Molecules; The Theory and Practice of Sequence Comparison, 2nd edition, CSLI, Stanford, pp. v–xv.
Nerbonne J., Heeringa W., van den Hout E., van der Kooi P., Otten S., van de Vis W. (1996) Phonetic Distance between Dutch Dialects. In Durieux, G., Daelemans, W., and Gillis, S. (eds.), CLIN VI, Papers from the Sixth CLIN Meeting, University of Antwerp, Center for Dutch Language and Speech (UIA), Antwerp, pp. 185–202.
Te Winkel J. (1901) Geschiedenis der Nederlandsche taal. Blom & Olivierse, Culemborg, naar de tweede Hoogduitsche uitgave met toestemming van den schrijver vertaald door Dr. F. C.Wieder. Met eene Kaart.
Vieregge W. H., Rietveld A. C. M., Jansen C. I. E. (1984) A Distinctive Feature Based System for the Evaluation of Segmental Transcription in Dutch. In van den Broecke, M. P. R. and Cohen, A. (eds.), Proceedings of the 10th International Congress of Phonetic Sciences, Foris Publications, Dordrecht and Cinnaminson, pp. 654–659.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Heeringa, W., Braun, A. The Use of the Almeida-Braun System in the Measurement of Dutch Dialect Distances. Computers and the Humanities 37, 257–271 (2003). https://doi.org/10.1023/A:1025067132504
Issue Date:
DOI: https://doi.org/10.1023/A:1025067132504