Introduction

Even though learning difficulties (LD) in reading (RD) and math (MD) often co-occur (11–70%; e.g., Moll et al., 2019; Willcutt et al., 2019), the reason for their comorbidity (RDMD) is not yet fully understood. Two main explanatory approaches for the comorbidity of RD and MD—the “multi-deficit model” vs. the “three independent disorders model”—shape the current research debate (e.g., McGrath et al., 2019). The “multi-deficit model” (e.g., McGrath et al., 2019; Pennington, 2006) explains learning disorders by complex interactions between core deficits (specific for a particular disorder) as well as other risk factors that are shared between disorders. Specifically, domain-specific core deficits underlying RD (e.g., phonological processing) and MD (e.g., numerosity and magnitude processing) sum up, resulting in an so-called “additiveprofile in children with RDMD (e.g., Kißler et al., 2020; Peters et al., 2020; Wilson et al., 2015). Additionally, deficits in domain-general cognitive processes (e.g., memory skills, attention; cf. Moll, 2022; Prado, 2018) have been reported for both disorders and seem to be shared between RD and MD. Consequently, deficits in domain-general cognitive processes in RDMD’s profile are reffered to as “under-additive.” “Under-additivity” means that the deficits in the cognitive profile of RDMD are smaller than the sum of isolated RDs’ and MDs’ deficits (e.g., Landerl et al., 2009). In contrast, the three independent disorders model regards isolated RD or MD as well as comorbid RDMD as separate disorders characterized by a unique cognitive profile (e.g., Skeide et al., 2018). In this view, compared to isolated RD or MD, the cognitive profile of RDMD is characterized by qualitatively distinct and/or more deficits in reading and math skills as well as in domain-general cognitive processes. Consequently, in the three independent disorders model, the core and domain-general deficits in RDMD are larger than the sum of the deficits observed in isolated RDs’ or MDs’ profile (“over-additivity”) (e.g., Landerl et al., 2009).

Since it can be assumed that a better understanding of the underlying causes of RDMD could help designing a more targeted support of these doubly disadvantaged children (e.g., McGrath et al., 2019; Moll et al., 2019), the main goal of our meta-analysis was to clarify the actual distribution of domain-specific core deficits and domain-general deficits in RDMD’s cognitive profile. Building on recent research, we define domain-general deficits as proposed potential risk factors under the broad umbrella of executive functions (EF) seen as relevant for developing isolated or combined LD (e.g., Peng & Fuchs, 2014; Schuchardt & Mähler, 2016; Watson, 2016). To specify, core domain-general risk factors of LD (cf. Moll, 2022) like memory skills, inhibition, shifting or updating, processing speed, and attention are included in our used definition of EF.

Finally, we conducted separate analyses for studies’ reading, math, and EF outcomes to detect whether children with RDMD show an additive or an interactive (under- or over-additivity) profile of isolated RD and MD. With respect to recent studies (e.g., Compton et al., 2012; Moll et al., 2019; Peters et al., 2020), a broad range of moderators for isolated and combined RD and MD was analyzed to investigate their role in describing heterogeneity of results. To the best of our knowledge, to date, only one meta-analysis examined whether the multi-deficit or cognitive subtype model for RD was supported (Peng et al., 2022). In this meta-analysis, domain-general skills were analyzed in a slightly more differentiated manner than here. However, Peng et al. (2022) only examined children with RD besides typical achievers (here C), whereas the current study additionally takes MD as well as RDMD into account.

Explanatory Values for Co-occurring Learning Difficulties in Reading and Math

According to diagnostic guidelines (DSM-5; American Psychiatric Association, 2013; ICD 10; World Health Organization, 1992) and recent research on LD, isolated RD appears to be primarily caused by deficits in phonological processing, leading to problems in spelling, word recognition, and reading comprehension. In contrast, isolated MD seems to be mainly based on deficient processing of numbers and magnitude (e.g., Compton et al., 2012; Landerl et al., 2009). Besides these core deficits, both RD and MD have been associated with shared domain-general skills, which are proposed as potential risk factor developing isolated or combined LD. To specify, key domain–general skills in the development of LD are memory skills (phonological and visual working memory), inhibition, shifting or updating (main executive function skills), processing speed, and attention, as well as general language skills (cf., Moll, 2022). The key role of these domain-general skills can be explained by their importance for an adequate development of both, reading and math skills (e.g., Moll, 2022; Peng & Fuchs, 2014; Peng et al., 2020). In the current study, most of these skills were comprised under the term EF, with the exception of general language skills as well as verbal processing speed (RAN), which were placed among the reading-related skills.

Despite high prevalence rates of co-occurring RD and MD ranging from 11 to 70% (e.g., Gross-Tur et al., 1996; Landerl & Moll, 2010; Moll et al., 2019; Willcutt et al., 2019), the composition of the cognitive profile of RDMD and the underlying causes are not yet fully understood. Particularly, the specific role of core deficits associated with RD or MD, as well as the role of domain-general deficits in EF for the cognitive profile of comorbid RDMD, has not been clarified. The latter can be traced back to the lack of available studies with adequate sample size considering children with isolated and combined RD and MD. However, a better understanding of comorbidity can be considered promising for a suitable promotion and prevention of RDMD (e.g., McGrath et al., 2019; Moll et al., 2019).

Based on a comprehensive research review of neuro-cognitive findings, Ashkenazi and colleagues (2013) suggested three possible explanations for the frequent co-occurrence of RD and MD. We describe these below, adding the “cognitive subtype” hypothesis (e.g., Rucklidge & Tannock, 2002) to complete the aforementioned review.

The Under-Additivity Hypothesis (“Shared Risk Factor”)

The under-additivity or “shared risk factor” hypothesis is based on two lines of research. According to this view, co-occurring LD in reading and math are caused by common deficits in domain-general skills, which are necessary to process information adequately in both learning domains (reading and math). In line with this view, a number of studies have found deficits of children with isolated or combined LD in EF like inhibition, shifting, or specific working memory components (e.g., Peng & Fuchs, 2014; van der Sluis et al., 2004). Evidence for the shared risk factor hypothesis has also been reported in a comprehensive research review of neuro-cognitive studies (c.f. Prado, 2018). Furthermore, the shared risk factor hypothesis is in line with the assumptions of recent multi-deficit model approaches (e.g., McGrath et al., 2019; Pennington, 2006), that explain learning disorders by complex interactions between core deficits (disorder specific risk factors) as well as risk factors that are shared between disorders (domain-general risk factors). In addition to shared risk factors in domain-general processes, verbal or phonological processing deficits, which have originally been conceptualized as a domain-specific core deficit of RD, are discussed as cause for the comorbidity between RD and MD. The association between verbal respectively language risk factors and mathematics is based on the idea of a number representation in three different codes (c.f. Triple-Code Model (TCM); Dehaene, 1992), a verbal code (“one, two, three”), a visual-Arabic code (“1, 2, 3”), and a magnitude code (“ •, ••, •••”). Impairments in language skills have been shown to affect the verbal code leading to problems in verbal math tasks, such as counting and fact retrieval (c.f. Prado, 2018). However, with regard to the evidence of an explanatory value of verbal processes for RDMD, results of individual studies differ. Slot and colleagues (2016) detected underlying deficits in phonological processing as shared risk factors for RD and MD, while taking further cognitive skills like number sense and EF into account. Furthermore, Moll and colleagues (2015a) found that cognitive deficits in children with RDMD in performing verbal number processing tasks (e.g., fact retrieval, counting, transcoding) could lead back to problems in both numerosity and phonological processing. Finally, a large neuroscientific study (Viesel-Nordmeyer & Prado, 2023) revealed associations between children’s and adolescents’ arithmetic skills and their grey matter volume (GMV) in brain areas related to verbal processing (cf. Dehaene, 1992).

The Additivity Hypothesis

Several studies suggest that comorbidity of RD and MD is characterized by an additive combination of risk factors (e.g., Kißler et al., 2020; Landerl et al., 2009; Moll et al., 2015a; Schuchardt et al., 2008). To statistically test additivity versus under- or over-additivity for the cognitive profile of RDMD, a statistically significant interaction of RD and MD factors (e.g., in an analysis of variance or ANOVA) must be absent (additivity) or present (under- or over-additivity). Recent studies focusing on this issue have frequently provided evidence for the additivity of the examined skills based on the absence of an RD by MD interaction in ANOVA results. This was reported for math-related skills such as symbolic magnitude comparisons, number line (Landerl et al., 2009) or number processing (Raddatz et al., 2017), for literacy and language skills such as reading fluency or vocabulary (Peters et al., 2020), and for EF skills including all three components of working memory (Moll et al., 2015a; Schuchardt et al., 2008), inhibition (De Weerdt et al., 2013a) and spatial visualization (e.g., Peters et al., 2020).

Furthermore, in a recent study (Kißler et al., 2020) using Bayesian methods for a better interpretation of significant and non-significant interaction effects, the additivity of cognitive profiles in children with RDMD could be confirmed. Specifically, additivity in the cognitive profile of comorbid RDMD could be validated for various children’s math skills (e.g., symbolic magnitude comparisons, number line, counting), for measurements of literacy related skills (i.e., RAN), and for alertness as well as for all three working memory components (cf. Baddeley, 2012). However, some studies found small but significant interactive patterns between RD and MD. For example, Dirks and colleagues (2008) showed an interaction of RD and MD in reading comprehension, and de Weerdt and colleagues (2013b) reported an interaction of RD and MD in the central executive.

The Over-Additivity Hypothesis (“Cognitive Subtype”)

According to the “cognitive subtype” hypothesis (c.f. Rucklidge & Tannock, 2002) or the “three independent disorders model” (Neale & Kendler, 1995), RDMD represents a separate disorder group with a cognitive profile that is distinct from the profile of the isolated RD and MD groups. According to this view, RDMD is characterized by a more impaired cognitive profile compared to the deficit profiles of isolated RD and MD (“over-additivity”). Interestingly, the hypothesis of RDMD as a separate disorder group is also suggested by the ICD-10, where RDMD is listed as a distinct diagnostic category. Moreover, distinct multimodal neural signatures associated with RDMD compared to isolated RD and MD were found using fMRI (Skeide et al., 2018), supporting the assumption of a qualitatively different disorder subtype.

Aims of the Current Review

In the current meta-analytic review, we strive to answer the question whether domain-specific core deficits as well as domain-general cognitive deficits in RD and MD lead to additive effects or interact (under-additivity or over-additivity; e.g., Landerl et al., 2009) in the cognitive profile of RDMD. Specifically, our goal is to investigate whether the multi-deficit model (additivity of the core deficits, under-additivity of EF) or the three independent disorder model (over-additivity of deficits) better explains the cognitive profile in children with RDMD. Power analyses showed that taking adequate statistical power of 0.80 into account, an interaction effect of medium size (Cohen’s f = 0.25 or d = 0.50) can be detected in 2 (RD vs. no RD) × 2 (MD vs. no MD) factorial ANOVAs with a sample size of N = 128 (Faul et al., 2007). A small interaction effect (Cohen’s f = 0.10 or d = 0.20) would require a sample size of N = 787, which in turn would require screening around 10,000 children given the standard prevalence rates. Such an enterprise requires large multicenter studies. Performing a meta-analysis is an appropriate alternative to investigate whether small interaction effects of RD x MD actually occur in the current literature.

In line with the multi-deficits model, we hypothesized an additive profile of RDMD in the domain-specific core deficits of the isolated LD groups (e.g., phonological processing of RD; numerosity processing of MD) as well as an under-additivity in domain-general EF (cf. Dehaene, 1992). Thus, we did not assume any interaction between RD and MD in reading-related skills (e.g., Kißler et al., 2020; Peters et al., 2020) or math-related skills (e.g., De Weerdt et al., 2013a; Kißler et al., 2020; Landerl et al., 2009; Peters et al., 2020; Raddatz et al., 2017) but in EF (e.g., De Weerdt et al., 2013b). To the best of our knowledge, to date, no meta-analysis has been published that examines the multi-deficit model of comorbid RDMD in relation to the cognitive profiles of isolated RD and MD and a control group (C). Specifically, we are unware of any meta-analyses clarifying the cognitive profile of the core deficits in isolated RD and MD in comparison to the cognitive profile of RDMD (“additivity”, “over-additivity” or “under-additivity”) with respect to reading and math skills and domain general skills. In addition, concerning subcomponents of math, reading, and EF, our hypotheses regarding performance differences were as follows.

Math Tasks

Based on the triple-code model by Dehaene (1992), numbers can be processed in three modules pertaining to different number representation forms: analogue magnitudes, verbal code, and Arabic number form. Building on recent research, we assumed that children with MD show deficits in tasks tapping all modules of number processing (e.g., Moll et al., 2015a) as well as in the verbal code (e.g., Viesel-Nordmeyer & Prado,2022). Furthermore, we hypothesized that children with RD exhibit deficits in number processing tasks tapping the verbal code (cf. Prado, 2018) as well as in tasks mapping the verbal and the Arabic code (e.g., von Aster & Shalev, 2007; cf. Prado, 2018).

Reading Tasks

We assumed that children with MD show no deficits in reading-related tasks (e.g., Moll et al., 2015a), whereas children with RD present deficits in all reading-related tasks (Moll et al., 2015a). These assumptions would support the original classification of the isolated RD and MD with regard to the domain-specific core deficits in LD.

Executive Functions Tasks

Based on the reported literature (e.g., Peng & Fuchs, 2014), we assumed that children with MD are impaired in inhibition, shifting, and visuospatial updating as well as in the visuospatial and verbal working memory tasks. Furthermore, we presumed that children with RD are not impaired in the visuospatial sketchpad tasks, but in some attention-related EF tasks (e.g., inhibition, shifting), as they often show subclinical attention problems without fulfilling diagnostic criteria. Furthermore, we assumed children with RD to be impaired in verbal working memory tasks (e.g., Peng et al., 2013), like verbal updating or phonological loop.

Method

Inclusion Criteria

With respect to our research question, the following inclusion criteria (Fig. 1) were mandatory for the literature search described below: Only studies that comprised at least one group of children with isolated RD or MD and a group with comorbid RDMD were taken into account. In the included studies, RD was defined by impaired skills in word reading accuracy (33.8%), word reading fluency (25.4%), both (21.1%), reading comprehension (7.0%), mixed (11.3%; including all four), or spelling (1.4%). MD was defined by impaired skills in arithmetic (59.2%) or mix math skills (40.8%; including word problems and geometry). The following cut-off criteria (percentile ranks: PR) to classify LD groups were used: RD: PR 1–10: 11.9%; PR 11–25: 52.4%; PR > 25: 35.7%; MD: PR 1–10: 10.7%; PR 11–25: 57.1%; PR > 25: 32.1%; RDMD: PR 1–10: 10.3%; PR 11–25: 62.1%; PR > 25: 27.6%. Studies with (8.1%) and without (91.9%) IQ discrepancy criterion were considered. Furthermore, studies had to focus on outcomes of math, reading, or executive functions. Finally, we decided for an age span limit from 6 to 12 years, the age span where LD mostly become visible.

Fig. 1
figure 1

PRISMA diagram showing the selection process of studies for the systematic search

Literature Search

Studies included in the meta-analysis were identified in three steps. First, studies were identified conducting electronic literature search based on the following search term (dyscalcul* OR acalcul* OR “math* disabilit*” OR “math* difficult” OR “arithmetic difficult*” OR “arithmetic disabilit*”) AND (dyslex* OR “read* difficult*” OR “read* disabilit*”) AND (comorbid* OR “learn* disorder” OR “learn* disabilit*” OR “learn* difficult*”) using the platform EBSCOHOST, which resulted in n = 491 hits. To complement the search process, we applied the search term (Mathematics AND Reading AND Learning (disabilities OR difficulties OR disorders)) using the database ERIC, resulting in n = 216 hits. In a second step, the abstracts and if necessary the full-texts were screened to identify whether studies fulfilled the inclusion criteria defined above. Thereby, three additional studies which were not included in the EBSCOHOST search hits could be added from the ERIC search hits. Both step 1 and step 2 were carried out by the two first authors. One of the first authors identified an additional study within the publication time until 2014 (agreement rate: 98.2%).To standardize the meta-analytic sample, papers including different samples (n = 2) and time points (n = 6) had to be split. Finally, 72 studies fulfilling the inclusion criteria could be included. Eventually, grey literature was gathered sending e-mails to scientists in the research field of interest via mailing lists. This procedure helped to expand our sample by two further studies (n = 74). Third, all references of the papers included were screened to find studies not identified before. Another added study completed our sample, resulting in a total of 75 studies. However, one of the identified studies had to be sorted out during the coding procedure, because the outcome measure could not be assigned to one of the outcome variables of interest. Consequently, the final sample represents 74 studies from 15 countries published between 1985 and 2020.

Coding Procedure

The 74 studies were double coded by one of the two first authors and one trained graduate student. This is described in the following subsection. After coding, and interrater reliability of κ= 0.95 was computed averaged over categories (sample specific: κ = 0.88; range 0.03–1.00; task specific: κ = 0.97; range: 0.04–1.00). Eventually, all disagreements could be solved by consulting the original article or in discussion.

Meta-analytic Procedure

The standardized mean difference effect size used was Hedges’ g (Hedges, 1981). Because many effect sizes were clustered within studies, we utilized a three-level meta-analytic approach (Konstantopoulos, 2011). This statistical procedure splits the total variance into sampling variance and two sources of true variance: between-study variance, based on the variability between studies, and within-study variance, based on the variability of effect sizes within studies. By taking the within-study dependencies into account, unbiased estimates of standard errors can be obtained. We used a restricted maximum likelihood estimator (REML) to avoid bias in variance estimates (Viechtbauer, 2005). All analyses were conducted in R (R Core Team, 2021), using the metafor (Viechtbauer, 2010) and dmetar (Harrer et al., 2020) packages.

In order to investigate heterogeneity of effect sizes, we conducted moderator analyses. Moderators were grouped as sample-specific or task-specific moderators. Separate analyses were calculated for each of the possible moderators (cf. Table 1).

Table 1 Coding criteria for moderators and operational definitions

Sample-specific moderators were year of publication and mean age of children in a study, as well as all aspects of heterogeneity of LD between studies (e.g., Fischbach et al., 2013) like fulfillment of IQ discrepancy criterion, IQ difference between the respective LD or control groups, and percentile rank in diagnosis. Moreover, since comorbidity rates between ADHD and LD in reading and math are high (e.g., Czamara et al., 2013), we considered whether studies excluded children with ADHD. Finally, we included differences in socio-economic status (SES) between groups, which is also discussed as a risk factor for diagnosing LD (e.g., Blair and Scott, 2002) .

Task-specific moderators included screening tests of LD and the observed outcome variables (tasks) of included studies. Since the content of the math screening was quite uniform, we only distinguished between arithmetic or mixed tests (e.g., parallel consideration of arithmetic, geometry, and word problems). Given that the reliability of speeded math tests is critically discussed (e.g., Lonnemann & Hasselhorn, 2019), we also coded whether math screening tests were power tests, speed tests, or time-limit tests. For the screening tests of reading, we coded whether they assessed accuracy, fluency, a combination of both, or reading comprehension. In contrast to the math screening tests, we assumed a higher diversity for math outcomes (tasks). Thus, we decided coding math outcome variables classifying with respect to the triple-code model (Dehaene, 1992). We distinguished between non-symbolic approximate magnitude, symbolic verbal-auditory, or symbolic visual processing. The type of reading-related outcome was coded as measuring RAN, phonological processing, language skills (e.g., syntax awareness, articulation speed), or reading skills (accuracy, speed, comprehension). EF were coded considering a common taxonomy for children with LD (Gilmore & Cragg, 2018; Moll, 2022): core executive functions (inhibition, shifting, updating), working memory (phonological loop, visuospatial sketchpad), processing speed, and alertness.

In order to investigate whether the cognitive profile of the comorbid RDMD group resulted additively from the cognitive profile of the isolated LD groups (additivity) or whether the comorbid RDMD group profile deviated from the sum of the isolated LD profiles (under-additivity, over-additivity), the assumption of additivity was translated into a linear hypothesis. For this procedure, three terms or effect sizes were needed: g1 (C – RD), g2 (C – MD), and g3 (C – RDMD). The following linear hypothesis representing additivity was investigated meta-analyticly:

$${\mathrm{g}}_{Add}={\mathrm{g}}_{1}+{\mathrm{g}}_{2}-{\mathrm{g}}_{3}=[(C-RD)+(C-MD)] - (C -RDMD)=0$$

Since the effect size g3 was subtracted from the term g1 + g2, a negative value would represent over-additivity (RDMD group is more impaired than expected based on isolated LD groups), whereas a positive value would represent under-additivity (RDMD group is less impaired than expected based on isolated LD groups). In contrast, additivity corresponds to a sum of 0. To compare effect sizes across categories, we conducted moderator analyses based on Tukey’s HSD test (Hays, 1994). All tests of statistical significance and p-values were Bonferroni-Holm-corrected (Holm, 1979).

Funnel plots using the trim and fill method (e.g., Duval & Tweedie, 2000) and multivariate meta-regressions with standard errors of effect sizes as moderator (Egger-type tests; cf. Rodgers & Pustejovsky, 2020) were used to check for publication bias. Furthermore, we computed Cook’s distance (Cook & Weisberg, 1982) for all group differences in math, reading, and EF outcomes. With this procedure, outliers based on a threshold of 2 standard deviations (SD) were identified and excluded from further analysis (cf. Viechtbauer & Cheung, 2010). This yielded the following exclusion of studies: for analyses of math outcomes, we excluded two studies for C-MD as well as one study each for C-RD and C-RDMD. For reading outcomes, one study each for C-MD and C-RD and two for C-RDMD were removed. For executive function outcomes, one study for C-MD, two for C-RD, and four studies for C-RDMD were excluded.

Results

The 74 studies included in our meta-analysis comprised 11,588 individuals (female = 51.8%) with and without LD. Overall, 74 RDMD groups (mean n = 34.0; female = 48.3%), 52 RD groups (mean n = 30.1, female = 43.5%), 70 MD groups (mean n = 32.4, female = 57.4%), and 66 C groups (mean n = 80.6, female = 52.6%) were considered. Heterogeneity and publication bias analysis will be provided in the following section. Subsequently, findings for overall and moderator effects for the three subcomponents of interest (i.e., math, reading, and EF skills) will be reported. Finally, results of the linear hypothesis for the assumed additivity will be shown under consideration of possible moderators.

Heterogeneity and Publication Bias

As can be seen in Table 2, substantial heterogeneity was found for nearly all effect sizes of interest. Measures of between-study variance (I2b) and within-study variance (I2w) indicated a higher proportion of between-studies variances for math outcomes. However, in reading outcomes, between-study variance was less pronounced than within-study variance. Finally, compared to between-study variance, variability of executive function outcomes was less clearly attributable to within-study variance.

Table 2 Tests of heterogeneity

Egger-type testing (Table 3) showed evidence for publication bias in some of the group comparisons. Specifically, significant effects were found for C-RDMD in math, reading, and EF outcomes. Moreover, evidence for publication bias was also evident comparing MD and RDMD groups in reading and math outcomes. Finally, Egger-type testing indicated publication bias for group comparisons of C-MD in math as well as C-RD in executive function outcomes. Based on funnel plots (Fig. 2), evidence for publication bias could be attributed to the lack of small studies with a lower effect size than the mean. Using the trim and fill method, it became clear that all effects became smaller when taking imputed studies into account.

Table 3 Egger-type tests
Fig. 2
figure 2

Funnel plots under use of the trim and fill method for group comparisons in math, reading, and executive function outcomes. White dots indicate observed effect sizes; black dots indicate imputed missing effect sizes. RD, reading learning difficulties; MD, math learning difficulties; RDMD, combined learning difficulties in math and reading; C, control

Overall and Moderator Effects for Mathematical Outcomes

Overall effect sizes of mathematical outcomes (Table 4) already point to an additive effect for C-RDMD (0.84), comprised of C-RD (0.27) and C-MD (0.64). These three effects differed significantly from each other (all p < 0.01), suggesting that the comorbid group was more strongly impaired than the MD and RD group in mathematical outcomes. Moreover, the MD group displayed larger deficits than the RD group in mathematical outcomes. The same pattern emerged at the level of mathematical outcome categories.

Table 4 Overall and moderator effects for math outcomes

Post hoc testing revealed differences of factor levels only for a few moderators in group comparisons: Results for “type of math outcomes” in C-RDMD as well as MD-RDMD indicated higher differences between groups in math tasks addressing symbolic verbal-auditory and symbolic visual processing compared to math tasks addressing non-symbolic approximate processing. Moreover, for group comparisons of MD-RDMD, the “exclusion of children with ADHD” decreased effect sizes in math outcomes. Furthermore, the consideration of the “IQ discrepancy criterion” to classify children with MD increased group differences of C-MD.

Overall and Moderator Effects for Reading Outcomes

Similar to math outcomes, the overall effect sizes of group comparisons in reading outcomes (cf. Table 5) for C-RD (0.51), C-MD (0.15), and C-RDMD (0.69) pointed to an additive effect for the cognitive profile of children with RDMD. Again, these three effects differed significantly from each other (all p < 0.01), suggesting that the comorbid group was more strongly impaired than the RD and MD group in reading outcomes. Moreover, the RD group showed larger deficits in reading outcomes than the MD group. However, we found that RD and RDMD groups did not differ significantly in RAN (p = 0.69) and reading skills (p = 0.83). The latter result can be explained in terms of the small number of studies on which it is based.

Table 5 Overall and moderator effects for reading outcomes

In comparing children with RD to those without (C-RD, C-RDMD, MD-RDMD), effect sizes increased hierarchically for the order of the following factor levels: RAN < language skills < phonological processing < reading. Deficits of children with RD or RDMD were mostly pronounced in general reading tasks. For group comparisons of C-MD, the sequence of phonological processing and reading was the other way around (RAN < language skills < reading < phonological processing). This finding suggests that children with MD struggle more in phonological processing than in general reading tasks. Besides these findings, post hoc testing also revealed a decrease of group differences in C-RD when children with ADHD were excluded.

Overall and Moderator Effects for Executive Function Outcomes

Finally, overall effect sizes of group comparisons for executive function skills (Table 6) again suggested additivity (C-RD = 0.35; C-MD = 0.40; C-RDMD = 0.71). Two of these three effects differed significantly from each other: the comorbid group was significantly more impaired than both the MD and RD group (both p < 0.01). However, MD and RD groups did not significantly differ from each other (p = 0.66), suggesting similar overall EF deficits in the isolated LD groups. At the level of EF outcome categories, the comorbid group differed from the MD group and the RD group in all EF categories except attention (attention: p = 0.31 and p = 0.43, respectively).

Table 6 Overall and moderator effects for executive function outcomes

Post hoc testing of moderator analyses provided evidence for the specific role of “type of executive function outcome” for C-RDMD. Effect sizes of different factor levels indicated that children with RDMD seem to struggle more in working memory processes and processing speed than in attention, but displayed the strongest impairment in processing speed. Moreover, higher percentile ranks for the classification of groups seemed to increase the differences between RD-RDMD in EF outcomes.

Testing the Linear Hypothesis

Overall results (Table 7) provided evidence for an additive effect for the cognitive profile of RDMD in math, reading, and executive function skills, respectively. However, this result seemed to be less clear-cut for EF (p = 0.06) than for math or reading. This becomes clear when focusing on outcome-specific results for EF: in inhibition, shifting, and updating, profiles of children with comorbid RD and MD were clearly under-additive (gAdd = 0.22, p < 0.01), implying that children with RDMD were less impaired in inhibition, shifting, and updating than expected in the case of additivity (Table 7). Testing effects of potential moderators of additivity did not provide any notable results. Specifically, when taking age into account, a negligible effect of under-additivity in math emerged (gAdd = 0.16, p = 0.04), whereas a small effect of over-additivity in math was detected when taking SES into account (gAdd =  − 0.28, p = 0.05). Furthermore, concerning EF, a negligible effect of under-additivity was found when taking publication year into account (gAdd = 0.11, p = 0.04). All other results at the outcome level support the notion of additivity, albeit at times with substantial uncertainty due to small sample sizes.

Table 7 Linear hypothesis testing of additivity

Furthermore, we investigated whether additivity was given from a practical point of view by checking whether gAdd deviated from zero, using equivalence testing (Lakens, 2017). We determined a small effect (gAdd = 0.20, − gAdd =  − 0.20) as the smallest effect size of interest (SESOI), indicating that practical additivity was present in the case of absolute values were smaller than 0.20 (|gAdd|< 0.20). Statistically, we investigated whether the 90% confidence intervals included or exceeded the SESOI (0.20, − 0.20; Lakens, 2017). We found 90% confidence intervals of gAdd for math [− 0.08, 0.16], reading [− 0.10, 0.13], and EF [0.01, 0.19] all within the acceptable SESOI range for equivalence, supporting the notion that at the overall level, additivity was present in math, reading, and EF.

Discussion

The key goal of the present meta-analysis was to answer the question whether the cognitive profiles of children with comorbid RDMD show additive or rather interactive effects (over- or under-additivity; cf. Landerl et al., 2009) of isolated RD and MD. Three subcomponents of risk factors were examined in order to better understand the cognitive profiles of co-occurring RDMD: reading-related skills, number processing skills (resp. math-related skills), and EF skills. Results suggest that the cognitive profiles regarding the three subcomponents were similar: Deficits in reading-related skills, number processing skills, and EF in the RDMD group represent the sum of the deficits in the isolated groups (additive profile). However, robustness of results with respect to profile additivity differed across domains, and group differences varied depending on characteristics of screening and outcome tasks. These results are discussed in more detail in the following sections.

Additivity in Math-Related Cognitive Skills

In math tasks, all three LD groups showed impairments in comparison to the control group. In line with our hypothesis, children with MD showed deficits in tasks tapping all modules of number processing (e.g., Moll et al., 2015a) including the verbal code (e.g., Viesel-Nordmeyer & Prado, 2023). In contrast, we expected that children with RD would show deficits in number processing tasks tapping the verbal code (e.g., Simmons & Singleton, 2009). However, results show deficits of children with RD in tasks tapping all modules of number processing, although deficits were less pronounced than in the MD groups.

As expected, the RDMD group showed the largest deficits in all math outcome categories. Compared to MD but also to C, the RDMD group displayed larger deficits in tasks tapping the symbolic verbal and visual code than the non-symbolic code. Consequently, our results emphasized broader problems of children with RDMD processing advanced, more demanding math skills. The latter is in line with a study examining group comparisons of isolated RD and MD, RDMD, and a control group (C) in different areas of math cognition in second graders (Hanich et al., 2001). In this study, children with RDMD performed worse than children with MD in math areas that were related to language processing (story problems, advanced arithmetic) or multiple cognitive processing (exact calculation), but not in approximate arithmetic, place value, or retrieval of number facts.

Interestingly, excluding children with ADHD leads to substantially lower math impairments of RDMD compared to MD. This finding could be explained by the higher risk of the comorbid RDMD group for additional comorbidities, such as ADHD (e.g., Czamara et al., 2013) as compared to isolated LD groups. Finally, larger differences between the C and the MD group when considering the IQ discrepancy criterion are likely to be related to the fact that the discrepancy criterion results in more strongly impaired MD children.

Eventually, linear hypothesis testing for math outcomes revealed no interaction between RD and MD. Therefore, additivity of the cognitive profiles of RD and MD for math-related cognitive skills in children with RDMD was supported.

Additivity in Reading-Related Cognitive Skills

As expected, all three LD groups showed impairments in reading-related tasks compared to the control group. In addition to differences in reading tasks, we analyzed group differences in reading-related tasks such RAN, phonological processing, and language skills. Impairments in phonological processing tasks were not only present in the RD groups (isolated RD and RDMD), but also in isolated MD. This result stands in contrast with some findings from individual studies examining phonological processing skills in group comparisons of children with isolated and combined RD and MD. For example, Peters and colleagues (2020) as well as Landerl and colleagues (2009) could not find any impairments of phonological awareness in individuals with MD. While these findings suggest that phonological awareness is a domain-specific risk factor of RD, two meta-analyses of behavioral studies, examining the relations between phonological processing skills (Yang et al., 2021) or language skills (Peng et al., 2020) and math, reported a strong relation between phonological awareness and math skills. Moreover, a meta-analysis of neuroscientific studies emphasizes substantial functional overlap of arithmetic and phonological processing areas in children’s as well as adults’ brains (Pollack & Ashby, 2018). Whereas the aforementioned meta-analyses did not directly target children with LD, our meta-analysis supports the view that small impairments in various reading-related skills might be evident in individuals with MD. The low degree of heterogeneity in group comparisons concerning reading in our meta-analysis, indicating small differences between individual studies, further underscores the statistical power of the current study (e.g., Lifeng, 2020). Consequently, the advantage of a meta-analysis to detect even small effect sizes in the research topic of interest can be considered promising. However, at the same time, a possible role of phonological awareness as proxy for broader language skills (e.g., Moll et al., 2015b) which are consistently shown to be a risk factor of MD should be taken into account.

As already reported for mathematics, we did not find strong support for an interaction of RD and MD testing our linear hypothesis. Thus, our results for the group comparisons in reading-related skills point to an additive RDMD profile. This is in line with the majority of findings from individual studies. In these studies, no interaction of RD and MD was reported for reading speed (Peters et al., 2020) as well as RAN (Kißler et al., 2020). To the best of our knowledge, there is just one individual study reporting an interaction of RD and MD for reading comprehension skills (Dirks et al., 2008). The effect size in this study, however, was small.

Finally, when comparing the screening results in the RD group with the unimpaired control group, we found out that effect sizes depend on the presence or absence of co-occurring ADHD in the RD group. Effect sizes between RD and controls were larger when RD co-occurred with ADHD, which is in line with the high comorbidity rates observed between RD and ADHD as well as between RD, MD, and ADHD (e.g., Czamara et al., 2013).

Additivity in Executive Function Skills

As expected, all three LD groups showed impairments in at least some components of EF skills, indicating that EF underlie adequate processing in reading and math skills (e.g., Watson, 2016). Thereby, MD groups seem to be more impaired in EF skills than RD groups. Based on recent individual studies regarding the cognitive profile of children with MD, MD can be seen as a strongly heterogeneous group (e.g., Salvador et al., 2019) with different manifestations. The different manifestations or symptoms of math problems are likely to be related to different cognitive risk factors (e.g., Dehaene, 1992), including various components of EF skills (e.g., Gilmore & Cragg, 2018).

Against our expectations, the MD group was impaired in all considered EF skills. This finding underscored the aforementioned role of various cognitive skills in mathematical processing (e.g., Gilmore & Cragg, 2018). For the RD group, almost the same results were found, with the exception of attention, for which no differences between RD and C could be detected. On the one hand, this finding contrasts with our hypothesis as well as with the results of a previous meta-analysis addressing EF deficit profiles in RD and RDMD (Peng et al., 2013). In this meta-analysis, children with RD only showed impairments in some of the considered EF skills, namely verbal working memory, inhibition, and processing speed. In visual working memory and updating, however, no deficits were shown in children with RD (Peng et al., 2013). On the other hand, the consideration of ADHD in the individual studies leads to decreased differences in EF between the C and RD group. It could be possible that the problems of children with ADHD in learning to read (e.g., Czamara et al., 2013) are not only based on attention problems, but also could be explained by other ADHD-related accompanying symptoms like impaired working memory processing (e.g., Kofler et al., 2019).

In line with the study by Peng and Fuchs (2014) as well as an additional meta-analysis of Peng and colleagues (2013) on working memory in children with LD, we found that children with RDMD were the most impaired group showing deficits in all EF skills with the strongest deficit in processing speed and comparably smaller deficits in attention. These results are in line with findings from a recent study (Raddatz et al., 2017) where processing speed differences in response times (but not accuracy) in a simple reaction time task between RDMD and C could be revealed. Our mentioned result of comparably lower attention problems in children with RDMD as compared to processing speed is mirrored by a similar result in the RD group. This result might be related to a substantial proportion of children with undiagnosed ADHD or attention problems without further learning problems in RD or RDMD (e.g., Mattison & Mayes, 2010) in some primary studies.

Finally, unlike for reading and math skills, testing the linear hypothesis for EF with respect to additivity revealed less robust and clear-cut results. With respect to inhibition, shifting, and updating, under-additivity was found, implying an impairment of RDMD in these specific EF skills which is smaller than expected. This finding contrasts with the result of De Weerdt and colleagues (2013a) who found additivity for the cognitive profile of RDMD, especially for inhibition. Moreover, the result by De Weerdt and colleagues (2013a) regarding the additivity of EF skills in RDMD has been reported in other studies as well (cf., additivity for spatial visualization: Peters et al., 2020; additivity for all three components of working memory: Moll et al., 2015a; Schuchardt et al., 2008). Only one previous study in which RD and MD interact with respect to the central executive was found, but only with small effect sizes (De Weerdt et al., 2013b). However, inhibition, shifting, and updating are subsumed under the umbrella of central executive tasks (e.g., Miyake et al., 2000), and thus bring our results in line with the aforementioned findings of De Weerdt and colleagues (2013b) regarding under-additivity of the central executive in the cognitive profile of RDMD.

Limitations of the Current Meta-analysis

Meta-analyses often encounter the problem of publication bias associated with missing, small, or insignificant effect sizes of smaller studies (e.g., Viechtbauer, 2010). We addressed this problem as soon as possible by conducting a three-level meta-analytic approach (Konstantopoulos, 2011) in order to take heterogeneity at different levels into account. However, heterogeneity was still present and particularly large in math skills, even after outlier adjustment (e.g., Viechtbauer, 2010). Building on theory (e.g., Triple-Code-Modell; Dehaene, 1992) and recent research (e.g., Salvador et al., 2018), we attribute this finding to the diversity of the mathematical domain and the associated diverse cognitive requirements. Even though we used a meta-analytic approach to address the problem of statistical power (e.g., Faul et al., 2007) in additivity hypothesis testing, the included numbers of studies in some sub-calculations was still small. This is especially the case for the simultaneous consideration of all moderators in linear-hypothesis testing, but even when considering them individually. However, the key results of our meta-analysis could be supported by using statistical techniques based on equivalence testing (e.g., Lakens, 2017).

Since most of the included studies are cross-sectional, a correlation of screening and outcome variables cannot be excluded. Given that EF tasks are a key outcome in this study, even with a meta-analytic approach the problem of psychometric adequacy of testing EF (e.g., Manchester et al., 2004) cannot be solved. Whereas ADHD seems to play an important role when investigating the cognitive profile of RDMD, ADHD could only be included as a covariate in this study. Consequently, despite the power of the used multivariate meta-analytic approach, some limitations remain.

Conclusions

Using a meta-analytical approach, our study helped to detect the composition of deficits in the cognitive profile of combined RDMD with respect to cognitive profiles in isolated RD and MD. With this understanding, the way to a more targeted support and prevention is paved. First of all, our study showed that it is particularly important to consider the relation of language and math skills in fostering scholastic skills in children with isolated LD. First intervention approaches considering the specific role of language in math (e.g., Arizmendi et al., 2021) already exist and should be further implemented. In addition, our study underlines that the diversity of math tasks and the different associated cognitive processes (e.g., Dehaene, 1992) should also be taken into account more explicitly. Individual learning support that considers this diversity appears to be extremely beneficial (e.g., Messer et al., 2018). Moreover, such support would meet the high heterogeneity within the MD group, which was found in our study as well as recent research (e.g., Kißler et al., 2021).

Finally, additional possible weaknesses in children such as ADHD, which can promote learning difficulties in reading and math skills as an epiphenomenon, should be considered as soon as the first problems in one of the two domains arise. Learning difficulties are usually multifactorial in their development, and central executive processing is involved in the rise of difficulties in both—reading and math skills.