Perception of Dutch vowels by Cypriot Greek listeners: To what extent can listeners’ patterns be predicted by acoustic and perceptual similarity?

Georgiou, Georgios P.; Dimitriou, Dimitra

doi:10.3758/s13414-023-02781-7

Open access
Published: 22 September 2023

Volume 85, pages 2459–2474, (2023)

Attention, Perception, & Psychophysics

Georgios P. Georgiou^1,2 &
Dimitra Dimitriou³

Abstract

There have been numerous studies investigating the perception of non-native sounds by listeners with different first language (L1) backgrounds. However, research needs to expand to under-researched languages and incorporate predictions conducted under the assumptions of new speech models. This study aimed to investigate the perception of Dutch vowels by Cypriot Greek adult listeners and test the predictions of cross-linguistic acoustic and perceptual similarity. The predictions of acoustic similarity were formed using a machine-learning algorithm. Listeners completed a classification test, which served as the baseline for developing the predictions of perceptual similarity by employing the framework of the Universal Perceptual Model (UPM), and an AXB discrimination test; the latter allowed the evaluation of both acoustic and perceptual predictions. The findings indicated that listeners classified each non-native vowel as one or more L1 vowels, while the discrimination accuracy over the non-native contrasts was moderate. In addition, cross-linguistic acoustic similarity predicted to a large extent the classification of non-native sounds in terms of L1 categories and both the acoustic and perceptual similarity predicted the discrimination accuracy of all contrasts. Being in line with prior findings, these findings demonstrate that acoustic and perceptual cues are reliable predictors of non-native contrast discrimination and that the UPM model can make accurate estimations for the discrimination patterns of non-native listeners.

Introduction

Although newborns are able to distinguish phonetic contrasts in a great number of human languages, even if they have never heard them before, this ability declines after the age of 6–12 months (Cheour et al., 1998; Eimas et al., 1987; Kuhl et al., 1992; Werker & Tees, 1983). It has been proposed that this decline from infancy to adulthood is an outcome of continuous exposure to a particular language, which alters the way non-native speech sounds are perceived (Iverson et al., 2003). As a consequence, listeners of a non-native language struggle to distinguish sounds that are not present in their first language (L1) phonological system (Bohn & Munro, 2007; Strange, 1995); this difficulty can be reduced to some extent through phonetic training (e.g., see Georgiou, 2021a, 2022a). For example, Spanish listeners usually fail to discriminate English /i – ɪ/ since both vowels are assimilated to Spanish /i/ (Cebrian, 2019; Morrison, 2008). In contrast, German listeners do not have significant difficulties in mastering this English contrast since the German vowel system contains a vowel contrast that is an articulatorily and acoustically close instance of English /i – ɪ/ (Llompart & Reinisch, 2019).

A previous body of work has demonstrated that the size and complexity of the L1 and L2 phonological systems determine listeners’ perceptual abilities in the L2 sounds (e.g., Escudero et al. 2014; Fox et al., 1995; Georgiou et al., 2020; Iverson & Evans, 2007). Iverson and Evans (2007) found that speakers of German and Norwegian, two languages with a larger and more complex vowel system than English, had better perceptual abilities than speakers of Spanish, a language with a smaller and less complex vowel system than English. Nevertheless, evidence from other studies shows that smaller and less complex L1 phonological systems do not always lead to perceptual difficulties (e.g., Alispahic et al., 2014; Alispahic et al., 2017; Elvin et al., 2014). For instance, Elvin et al. (2014) concluded that there is no advantage for Australian English listeners in discriminating the Brazilian Portuguese vowels in comparison to Spanish speakers, although Australian English has a larger vowel inventory than Brazilian Portuguese and Spanish has a smaller vowel inventory. It seems that the consideration of the size and complexity of the L1-L2 phonological inventories alone cannot always predict perceptual abilities in the non-native language.

Several theoretical models assume either explicitly or implicitly that acoustic-phonetic similarity between native and non-native sounds can predict the perceptual patterns of the latter sounds (Escudero 2009; Flege, 1995; Georgiou, 2021b). Escudero and Boersma (2003) argued that when humans perceive speech, they integrate the various auditory dimensions they hear in a manner that mirrors the way those dimensions are combined during speech production. They pointed out that this integration is governed by the “optimal perception hypothesis,” which posits that a listener will prefer auditory dimensions that effectively distinguish sounds in the production of their L1 (Escudero, 2009). Essentially, this means that listeners use their knowledge of L1 speech sounds to guide their perception of speech, relying on auditory dimensions that are most useful in distinguishing between relevant sounds in their L1. Given that the preference for acoustic cues differs across different varieties, the optimal perception of both L1 and the target language can be achieved through a comprehensive acoustic description of both languages’ acoustic features. This description would allow for an explanation of how listeners perceive the target language sounds. In addition, the so-called “full copying hypothesis” suggests that at the initial stage of L2 acquisition, learners establish a copy of their existing L1 perception grammar to perceive the non-native sounds (Elvin & Escudero, 2019); therefore, they initially rely on their L1 to map unfamiliar sounds. So, acoustic similarity between L1 and non-native sounds, which can be roughly defined as the phonetic distance that separates an L1 sound from a target language sound, can be used as a reference for predicting the non-native speech perception patterns.

Different methodologies have been used to calculate cross-linguistic acoustic similarity, with the goal of predicting the mapping of non-native sounds to the speakers’ L1 vowel system and the discrimination of non-native sound contrasts. For example, many studies have employed Euclidean Distances (e.g., Elvin et al., 2014; Georgiou et al., 2020), reporting that these values provide accurate perceptual predictions. Another method is Linear Discriminant Analysis (LDA) (Klecka, 1980), a machine-learning classifier that aims to identify a linear transformation maximizing the separation between different classes in the reduced-dimensional space, thus improving the accuracy of classification (Park & Park, 2008). The role of LDA in cross-linguistic speech perception is to assess how well target language sounds fit with the center of gravity of the input corpus tokens, providing a predicted estimation of how each sound is mapped to the speakers’ L1 categories (Elvin et al., 2021). The data are split into training and testing subsets, which contain acoustic measures extracted from speech samples, such as formant frequencies and durations. After the L1 model has been trained on the extracted measures, the same measures of non-native sounds from the testing subset are supplied to the model in order to calculate the proportion of their categorization to L1 categories. The resulting confusion matrix can then be used to predict the discrimination accuracy of non-native sound contrasts by employing the theoretical framework of a particular speech model. LDA has been used in the past in speech perception studies for the calculation of cross-linguistic acoustic similarity. For example, Gilichinskaya and Strange (2010) investigated the perceptual assimilation of American English vowels to the L1 categories of inexperienced Russian listeners. The results indicated that the algorithm predicted modal assimilation responses for all but one vowel, demonstrating that acoustic similarity is a good predictor of non-native sound categorization. Similarly, in a more recent study, Georgiou (2023a) used LDA to assess whether acoustic similarity could estimate the classification of L2 English vowels as Cypriot Greek L1 categories. The results verified the model’s predictions, as the majority of L2 vowels classified with the highest proportion were predicted with success. Other studies provide further support for these findings (e.g., Elvin et al., 2021).
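The train-on-L1, test-on-non-native workflow described above can be illustrated with a minimal Python sketch. It is not LDA: to stay dependency-free it swaps in a much simpler nearest-centroid classifier, and it skips the normalization a real analysis would apply. All vowel labels, formant values, and function names below are hypothetical and serve only to show how classification proportions are derived.

```python
from collections import Counter, defaultdict
from math import dist
from statistics import mean


def train_centroids(tokens):
    """Return the mean feature vector (centroid) of each L1 vowel category."""
    grouped = defaultdict(list)
    for label, features in tokens:
        grouped[label].append(features)
    return {
        label: tuple(mean(column) for column in zip(*rows))
        for label, rows in grouped.items()
    }


def classify(centroids, features):
    """Assign one token to the L1 category with the nearest centroid."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


def classification_proportions(centroids, tokens):
    """For each non-native vowel, return the share of tokens per L1 category."""
    counts = defaultdict(Counter)
    for label, features in tokens:
        counts[label][classify(centroids, features)] += 1
    return {
        label: {l1: n / sum(c.values()) for l1, n in c.items()}
        for label, c in counts.items()
    }


# Hypothetical (F1, F2) values in Hz; not taken from the study.
l1_tokens = [
    ("i", (300, 2500)), ("i", (320, 2450)),
    ("e", (500, 2000)), ("e", (520, 1950)),
    ("a", (800, 1400)), ("a", (780, 1450)),
]
non_native_tokens = [
    ("ɪ", (350, 2400)), ("ɪ", (450, 2100)),
    ("æ", (700, 1600)), ("æ", (760, 1500)),
]

centroids = train_centroids(l1_tokens)
proportions = classification_proportions(centroids, non_native_tokens)
print(proportions)
```

Each row of the resulting dictionary plays the role of one row of the confusion matrix: the proportions with which a non-native vowel is assigned to each L1 category.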

Among the most widely used models in cross-linguistic speech perception is the Perceptual Assimilation Model (PAM) (Best, 1995). PAM aims to predict the discrimination of particular non-native sound contrasts by listeners with little or no experience in the target language. The model suggests that phonological and articulatory-phonetic similarity between two non-native sounds can estimate how these sounds will be discriminated. It proposes six types of assimilations of non-native sounds to L1 phonological categories: Two category (TC) assimilation in which both sounds are assimilated to two different L1 categories (excellent discrimination), Single Category (SC) assimilation in which the two sounds are assimilated to the same L1 category (poor discrimination), Category Goodness (CG) assimilation in which one sound is assimilated as a good phonetic exemplar to an L1 category, while the other as a bad exemplar to the same category (moderate to very good discrimination), Uncategorized – Categorized (UC) assimilation in which one sound is assimilated to an L1 category, while the other does not (very good discrimination), Uncategorized-Uncategorized in which both sounds are not perceived as exemplars of an L1 category (poor to very good discrimination), and Non-assimilable (NA) in which both sounds are perceived as nonspeech sounds (good to very good discrimination). L2LP (Escudero, 2009) is another highly cited speech perception model that provides predictions about the discrimination of non-native contrasts by naïve learners. The predictions rely on cross-linguistic acoustic similarity and are developed using measures such as Euclidean Distances and LDA. Rather than assimilation patterns, it proposes three different learning scenarios, which can estimate the discrimination accuracy of non-native sound contrasts. The Similar Scenario (similar to the TC assimilation of PAM) occurs when two sounds are mapped to two different L1 phonological categories. 
The New Scenario takes place when two sounds are mapped to a single L1 category (similar to the SC assimilation of PAM). The Subset Scenario occurs when one or both sounds are equated to two or more L1 categories. Discrimination is predicted to be most accurate in the Similar Scenario, followed by the Subset Scenario when there is no perceptual overlap, then the New Scenario, and finally the Subset Scenario when both non-native sounds are mapped to the same subset of L1 categories.

The Universal Perceptual Model (UPM) (Georgiou, 2021b) has been developed to account for the difficulties of listeners/speakers regarding the discrimination of non-native sound contrasts under the assumption that there is a universal capability to perceive speech sounds during the lifespan. The model holds that speech sounds are perceptual in nature, constraining the perception of phonetic categories extracted from the speech signal. UPM establishes predictions regarding the classification of non-native sounds in terms of L1 categories and the discrimination of two non-native sounds based on cross-linguistic acoustic similarity using machine-learning algorithms such as LDA. The results of the perceptual classification test, which refers to the actual classification of non-native sounds by listeners/learners of the target language, are used to develop the predictions for the discrimination accuracy of particular non-native sound contrasts according to the framework of UPM. Therefore, like other important models, UPM supports a connection between acoustic similarity and speech perception, as it emerges from machine-learning algorithms, and between classification and sound contrast discrimination, as it emerges from perceptual tasks completed by humans. UPM indicates that sound contrast discrimination is determined by the degree of overlap between the two contrast members, which depends on how closely the two members are perceptually associated with each other based on their classification in terms of one or more L1 sounds. Specifically, the degree of overlap is determined by observing the classification proportions of each of the two non-native vowels in terms of one or more L1 categories. Crucially for UPM, only above chance classifications matter, that is, non-native sounds classified with a proportion above a chance score, which is determined by dividing 1.00 (or 100%) by the total number of response options. UPM proposes three types of overlap between two non-native sounds: complete, partial, and no overlap. Completely overlapping contrasts share the same above chance response or the same set of above chance responses, partially overlapping contrasts share at least one above chance response, and non-overlapping contrasts do not share any above chance responses.

For example, if the chance score is 0.20 (or 20%) and if non-native /i/ is classified with a proportion of 0.90 as L1 /i/ and non-native /ɪ/ is classified with a proportion of 0.85 as L1 /i/, then both non-native sounds comprise above chance responses (≥ 0.20) and the contrast is completely overlapping since the two non-native vowels are both classified to a great extent as the same L1 category. However, if non-native /i/ is classified with a proportion of 0.90 as L1 /i/ and non-native /ɪ/ is classified with a proportion of 0.70 as L1 /i/ and with 0.30 as L1 /ε/, then both non-native sounds share only one above chance response (that is, the classification of both non-native vowels as L1 /i/), thereby forming a partially overlapping contrast. Nonoverlapping contrasts are the most discriminable followed by partially and completely overlapping contrasts. However, in completely overlapping contrasts, the discrimination accuracy may be comparable to that of the partially overlapping contrasts if listeners are able to perceive phonetic distance between two non-native sounds, which is usually determined by measuring the difference between the goodness-of-fit ratings of the two responses in the classification test. Figure 1 shows an example of the overlapping degrees of UPM.
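The overlap types can be expressed as a small set comparison. The following Python sketch is an illustration under stated assumptions rather than an official UPM implementation: the function names are hypothetical, the chance score is taken to be 1 divided by the number of L1 response options (which yields the 0.20 used in the example for five options), and a response counts as above chance when its proportion is at least the chance score, following the "≥ 0.20" in the example. The first two calls reuse the proportions from the example above; the third uses invented proportions.

```python
def chance_score(n_options):
    """Chance level for a classification test with n_options response categories."""
    return 1.0 / n_options


def above_chance(proportions, chance):
    """Set of L1 categories whose classification proportion reaches chance."""
    return {category for category, p in proportions.items() if p >= chance}


def overlap_type(props_a, props_b, chance):
    """Classify a non-native contrast as 'complete', 'partial', or 'none'.

    Assumes each vowel has at least one above chance response, which holds
    when the proportions of a vowel sum to 1 and chance is 1 / n_options.
    """
    a = above_chance(props_a, chance)
    b = above_chance(props_b, chance)
    if a == b:
        return "complete"   # identical sets of above chance responses
    if a & b:
        return "partial"    # at least one shared above chance response
    return "none"           # no shared above chance responses


chance = chance_score(5)  # five L1 vowel options

# Proportions from the worked example in the text.
complete = overlap_type({"i": 0.90}, {"i": 0.85}, chance)
partial = overlap_type({"i": 0.90}, {"i": 0.70, "ε": 0.30}, chance)

# Hypothetical proportions for a contrast with no shared response.
no_overlap = overlap_type({"o": 0.80}, {"u": 0.80}, chance)

print(complete, partial, no_overlap)
```

The sketch covers only the categorical overlap decision; the goodness-of-fit ratings that may separate completely overlapping contrasts are not modeled.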

This study aims to investigate the perception of Dutch vowels by Cypriot Greek listeners using the theoretical framework of UPM. The vowel systems of Cypriot Greek and Dutch differ to a great extent. Cypriot Greek has the five vowel qualities /i ε ɐ ɔ u/; note that a more generic representation includes the qualities /i e a o u/. There are not any length distinctions, but stressed vowels tend to be longer than unstressed vowels (Georgiou & Themistocleous, 2021). The Dutch vowel system is more complex than the Cypriot Greek system including 12 monophthongs (without schwa). Moulton (1962) distinguishes between the five lax or short /ɪ ɛ ʏ ɑ ɔ/ and the seven tense or long vowels /i y a u e o ø/. Length distinction is considered as part of the syllable rather than a phonological feature (Booij, 1995). There are only very few studies regarding the perception of non-native vowels by Cypriot Greek speakers. For example, Georgiou (2019) examined the perception of English vowels by Cypriot Greek children with low and high proficiency in English. The results showed that English vowels /iː ɪ/, /e ɜː/, /æ ʌ ɑː/, /ɒ ɔː/, /ʊ uː/ were mostly assimilated to Greek phonological categories /i/, /e/, /a/, /o/ and /u/ respectively for children of both proficiency levels. Also, children struggled to discriminate particular English sound contrasts: /iː – ɪ/ and /e – ɜː/ could be discriminated only in a moderate manner, while /æ – ʌ/ and /ɒ – ɔː/ yielded poor discrimination. Georgiou (2021b) assessed the classification of Italian vowels in terms of Cypriot Greek categories and the ability of Cypriot Greek speakers to discriminate pairs of Italian vowel contrasts. The study employed the theoretical framework and predictions of UPM. It was found that Italian vowels /i/, /e/, /ε/, /a/, /o/, /ɔ/, /u/ were classified as above chance responses in terms of Cypriot Greek cardinal vowels /i/, /i e/, /e/, /a/, /u/, /o/, /u/ respectively. 
The non-overlapping /ɔ – o/ contrast was discriminated well, the partially overlapping /i – e/ and /e – ε/ contrasts were discriminated to a moderate extent and the completely overlapping /o – u/ contrast was discriminated poorly. The results confirmed the predictions of UPM concerning the discriminability of sound contrasts based on their overlapping degree. In another study, Georgiou (2022b) found that both cross-linguistic acoustic similarity and UPM could predict the accuracy of the challenging English /iː – ɪ/ vowel contrast as discriminated by Cypriot Greek speakers.

A second aim of this study is to assess the capacity of the LDA model to predict the classification/discrimination of non-native sounds based on cross-linguistic acoustic similarity and the ability of the UPM model to make accurate empirical predictions about the discrimination accuracy of non-native sound contrasts based on perceptual similarity. To better understand the acquisition of non-native speech, research needs to include under-researched sets of languages such as Cypriot Greek and Dutch. This is among the first studies that examine the perception of Dutch vowel contrasts by speakers of any Greek variety; for another study examining the perception of other Dutch contrasts by Standard Modern Greek and Cypriot Greek listeners, see Georgiou (2023b). Dutch was chosen since it has a larger and more complex vowel system than Cypriot Greek, and speakers of the latter variety are therefore expected to experience difficulties in accurately perceiving particular Dutch vowels. It also contains vowel qualities whose perceptual categorization by Greek speakers has not been investigated in previous studies (e.g., /ʏ/, /ø/, /y/).

The study’s protocol is based on a production and a perception study. In the production study, Cypriot Greek and Dutch speakers produced their L1 vowels and their speech patterns were analyzed using speech processing software. The output of the Cypriot Greek speakers was used to train a machine-learning LDA model, and the output of Dutch speakers was fed into the trained model to generate predictions about the classification of Dutch vowels in terms of listeners’ L1 categories. In the perception study, listeners classified the Dutch vowels in terms of L1 categories and discriminated particular Dutch vowel contrasts using an AXB test. The classification test helped us evaluate the predictions of the LDA model, which provided classification data based on the vowels’ acoustic features (acoustic similarity), and the predictions of UPM, which rely on the overlapping degree of the sound contrast members as reported by the classification test (perceptual similarity).

Production study

Methodology

Participants

A total of 32 speakers participated in the study. Twelve participants aged 20–45 years (M_age = 33, SD = 7.9) were Cypriot Greek speakers. They were born and raised in Cyprus and came from moderate-income families. Their language development was typical and they had never experienced any hearing or cognitive issues. They had no knowledge of Dutch. Twenty participants were Dutch speakers. The output of these speakers was obtained from the database of Van der Harst (2011), which includes acoustic measurements of Dutch vowels produced by 160 Dutch high school teachers. We selected productions from 20 speakers with an age range of 22–40 years, who belonged to the Netherlandic community. All participants were female.

Stimuli

The stimuli of the production test undertaken by the Cypriot Greek speakers consisted of the five Cypriot Greek vowels /i ε ɐ ɔ u/. Vowels were embedded in a /pVs/ context (V = vowel) and were part of the carrier phrase ‘Léne <target word> tóra’ (‘they say <target word> now’). The Dutch stimuli included the Dutch monophthongs /a ɑ i u ɪ ɔ ɛ ʏ o e ø y/. These vowels were embedded in monosyllabic words before coda [s] with the exception of /y/, which was embedded before coda [t] as this vowel does not occur before /s/ except in proper names. The words were part of the carrier phrase “Hoor je <target word>”. While the target words were in a phrase-final position in Dutch and a phrase-medial position in Cypriot Greek, no impact is expected on the classification results, because additional analyses indicated no important differences between productions in the two positions. Specifically, five female adult Cypriot Greek speakers produced their native vowels in both a phrase-medial and a phrase-final position. The analyses were conducted using linear mixed-effects models in R with F1, F2, F3, and DURATION as dependent variables, VOWEL and CONDITION as fixed factors, and PARTICIPANTS as a random factor. The findings showed no significant effect of CONDITION on F1, F2, and F3, while there was a significant effect on DURATION. However, a Tukey post hoc test revealed that the duration differences concerned only two out of five vowels, so the differences were minimal.

Procedure

The Cypriot Greek speakers performed the production test individually in quiet rooms. They were instructed to sit in front of a PC monitor and repeat the carrier phrases presented through Microsoft PowerPoint as if speaking to a friend. They produced a total of 240 items (5 vowels × 4 repetitions × 12 speakers) and the output was recorded using a professional audio recorder at a 44.1 kHz sampling rate. The stimuli were randomized for each participant. The output of the Dutch speakers was retrieved from the database of Van der Harst (2011). These speakers produced a total of 240 items (12 vowels × 20 speakers) and all values were measured at the midpoint.

The target words from the Cypriot Greek speakers’ output were isolated and imported into Praat (Boersma & Weenink, 2023) for speech analysis. The visual inspection of spectrograms and waveforms based on identifiable acoustic landmarks helped us mark the boundaries of each vowel to extract formant frequencies and vocalic duration. To generate all tracks, the window length was set at 0.025 s, the pre-emphasis at 50 Hz, and the spectrogram view range at 5,500 Hz, with a formant ceiling of 5,500 Hz, suitable for average adult female speakers. For the measurement of formant frequencies, the starting point of each vowel’s acoustic analysis was regarded as the end of the burst of the preceding stop consonant /p/ and the onset point of V. The end point was regarded as the end of periodicity of V as shown in the waveform and of the formant structure as shown in the spectrogram (i.e., the acoustic energy concentrated in specific frequency regions), coinciding with the onset of the second consonant /s/. Formants were measured through visual inspection of the spectrogram at their midpoint, where vowels exhibit the least effect from neighboring segments. Vowel durations were extracted through manual labelling of the starting and ending points of each vowel token by the first author; the duration of each vowel was the interval between the starting and ending point of the vocalic part. F1, F2, and F3 were normalized using the vowels package (Kendall & Thomas, 2018) with the Lobanov method. The normalized values were transformed into Hz in R using the formulas proposed by NORM (Thomas & Kendall, 2007). An example of the segmentation process is illustrated in Fig. 2.
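As an illustration of the Lobanov method mentioned above, the following Python sketch z-scores one formant within each speaker, that is, it subtracts the speaker's mean for that formant and divides by the speaker's standard deviation. This is a minimal sketch and not the vowels package itself: the function name and the formant values are hypothetical, the use of the sample standard deviation is an assumption, and the subsequent rescaling of normalized values into Hz is not shown.

```python
from statistics import mean, stdev


def lobanov(values):
    """Z-score one speaker's values for a single formant (Lobanov method).

    Uses the sample standard deviation (n - 1 denominator); this choice is
    an assumption of the sketch.
    """
    centre = mean(values)
    spread = stdev(values)
    return [(value - centre) / spread for value in values]


# Hypothetical F1 values in Hz for two speakers; not taken from the study.
f1_by_speaker = {
    "speaker_1": [300, 500, 700],
    "speaker_2": [350, 600, 850],
}

# Normalize within each speaker so that vocal tract differences are reduced.
normalized = {
    speaker: lobanov(values) for speaker, values in f1_by_speaker.items()
}
print(normalized)
```

In this toy case both speakers end up with the same normalized values even though their raw F1 values differ, which is the purpose of speaker-intrinsic normalization.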

LDA was employed to examine the classification of Dutch vowels in terms of Cypriot Greek listeners’ L1 categories. The analysis was conducted using the MASS package (Ripley et al., 2023) in R (R Core Team, 2023) (for a similar procedure, see Strange et al., 2005 and Gilichinskaya & Strange, 2010). The training and testing sets consisted of two different files that included the normalized acoustic measurements of Cypriot Greek and Dutch vowels respectively. Based on the data of the training set, we trained an L1 LDA model on mean F1, F2, and F3 midpoint values and mean vocalic duration of Cypriot Greek vowels. Cross-validation showed that the trained model achieved 97.9% correct classification. Given this high accuracy, we fed the F1, F2, F3, and vocalic duration values of Dutch vowels from the testing set into the L1 model.

Results

Production

Based on the Euclidean Distance between the vowels (d = √[(x2 – x1)² + (y2 – y1)²]), the results of the production test show that Cypriot Greek /i/ is a very close acoustic instance of Dutch /e/ (d = 66) and then /i/ (d = 157) in terms of F1 and F2. Cypriot Greek /ε/ is very close in the vowel space to Dutch /ɛ/ (d = 97). Cypriot Greek /ɐ/ and Dutch /a/ seem to be acoustically close to each other (d = 101), while Cypriot Greek /ɔ/ is primarily close to Dutch /ɑ/ (d = 188) and then /ɔ/ (d = 216) and /o/ (d = 270). Cypriot Greek /u/ is spectrally very close to Dutch /ɔ/ (d = 48) and then /o/ (d = 99). F1 × F2 of Cypriot Greek and Dutch vowels are illustrated in Figure 3.
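The distance calculation can be sketched in a few lines of Python. The (F1, F2) pairs below are hypothetical placeholders rather than the values from Tables 1 and 2, and the function and variable names are illustrative assumptions.

```python
from math import hypot


def vowel_distance(vowel_a, vowel_b):
    """Euclidean distance between two vowels given as (F1, F2) pairs in Hz."""
    (f1_a, f2_a), (f1_b, f2_b) = vowel_a, vowel_b
    return hypot(f1_b - f1_a, f2_b - f2_a)


# Hypothetical mean formant values; not taken from the study.
native_vowel = (300, 2300)
non_native_vowels = {"e": (340, 2270), "i": (260, 2420)}

# Rank the non-native vowels from acoustically closest to most distant.
ranked = sorted(
    ((label, vowel_distance(native_vowel, formants))
     for label, formants in non_native_vowels.items()),
    key=lambda pair: pair[1],
)
print(ranked)
```

Ranking the distances in this way mirrors the "closest, and then next closest" statements in the paragraph above.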

Among Cypriot Greek vowels, the longest duration was observed for /ɐ/. Vowels /ɔ/ and /ε/ had similar durations, while /i/ and /u/ had the shortest durations. Among Dutch vowels, /a/ had the longest duration. Dutch vowels /e o ø/, which are considered long, also had long durations. By contrast, Dutch long /i y u/ presented with short durations. The duration of Cypriot Greek vowels was closer to the duration of Dutch vowels /ɪ ɛ ʏ ɑ ɔ i y u/. Tables 1 and 2 present the average F1, F2, F3, and duration values of Cypriot Greek and Dutch vowels respectively as produced by L1 speakers of these languages.

Table 1 Average normalized F1, F2, F3, and duration values of Cypriot Greek vowels (scaled)

Table 2 Average normalized F1, F2, F3, and duration values of Dutch vowels (Van der Harst, 2011). Standard deviations are shown in parentheses

Linear discriminant analysis (LDA)

The results of LDA showed that Dutch vowels /a i u ɪ ɛ ʏ o ø/ were optimal responses, that is, each was classified above chance as a single Cypriot Greek category. Moreover, Dutch /a ɑ i u ɪ ɔ ɛ ʏ o e ø y/ were classified with the highest proportion as Cypriot Greek /ɐ ɐ i u i u ε ε ɔ ε ε i/ respectively. Apart from providing predictions about non-native vowel classification, the outcomes of the classifier can be used to develop predictions about the discrimination of non-native contrasts using the UPM framework and specifically the concept of overlapping degrees of non-native contrast members against L1 categories. In turn, these predictions are assessed through the perceptual classification test, in which listeners were asked to classify the non-native vowels as their L1 categories, and the discrimination test, in which they discriminated particular non-native contrasts.

For the discrimination predictions, we chose four Dutch vowel contrasts that we expect Cypriot Greek listeners to find difficult to discriminate, that is, /i – ɪ/, /ø – y/, /ɔ – o/, and /ɛ – ʏ/. We did not focus on easier (i.e., non-overlapping) contrasts, as they do not create difficulties and are usually discriminated in an excellent manner. Under the UPM framework and based on LDA, it is expected that /ɔ – o/ and /ø – y/ will be partially overlapping contrasts with moderate-to-good discrimination, while /i – ɪ/ and /ɛ – ʏ/ will be completely overlapping contrasts with poor discrimination. However, we need to consider an important parameter. Every speech sound consists of acoustic correlates or cues that distinguish it from other sounds (Chodroff & Wilson, 2020). Given that listeners pay attention to the most pertinent acoustic cues of a specific sound (Curtin et al., 2009), we assume that they may employ a single acoustic measure to classify the non-native sounds (i.e., formants or duration). Thus, we conducted a stepwise LDA to examine the power of individual acoustic measures (see Alispahic et al., 2017). We initially ran a Wilks’ lambda (Λ) test using the klaR package (Roever et al., 2022) in R to determine the variables that minimize Λ at an F-value with p < 0.05 and therefore improve the overall performance of the algorithm. The stepwise procedure started with the predictor that differentiated best between the vowels and added further predictors one by one. The first step included F2 (classification accuracy: 90.8%), the second step included F1 + F2 (classification accuracy: 96.7%), the third step included F1 + F2 + duration (classification accuracy: 97.9%), and the fourth step, which is the final model, included F1 + F2 + F3 + duration. The results of the stepwise analysis are presented in Table 3.

Table 3 Classification results of stepwise LDA

According to the stepwise analysis, Dutch /ɔ/ exhibits some F1, F2, F3, and duration similarity to Cypriot Greek /ɔ u/, /u/, /ɔ u/, and /ɔ/, while Dutch /o/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /u/, /u/, /ɔ/, and /ɔ/. Therefore, /ɔ – o/ is expected to be a partially overlapping contrast with moderate-to-good discrimination. Dutch /ø/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /ε/, /ε ɐ/, /ε/, and /ε/, while Dutch /y/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /ε/, /ε ɐ/, /i ε/, and /i/. Dutch /ø – y/ is therefore expected to be partially overlapping with moderate-to-good discrimination. Dutch /i/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /i/, while Dutch /ɪ/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /i ε/, /i ε/, /i/, and /i/. Although the final model showed that both members of /i – ɪ/ will be classified as Cypriot Greek /i/, the only partial similarity between the two contrast members in F1 and F2 may help listeners discriminate the contrast better. This is because, if listeners rely on F1, F2, or both, they may associate the non-native vowels with different categories. In that case, Dutch /i – ɪ/ will be discriminated in a moderate-to-good manner. However, if listeners rely on both formants and duration, the discrimination will be poor. Dutch /ɛ/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /ε/, while Dutch /ʏ/ exhibits F1, F2, F3, and duration similarity to Cypriot Greek /ε/, /ε ɐ/, /ε/, and /ε u/. Therefore, Dutch /ɛ – ʏ/ will likely present with moderate-to-good discrimination because there is some partial overlap between the two contrast members in terms of F2 and duration. Nevertheless, if listeners rely on F1 and F3, discrimination may be poor.