Genome-Wide Studies of Specific Language Impairment

Reader, Rose H.; Covill, Laura E.; Nudel, Ron; Newbury, Dianne F.

doi:10.1007/s40473-014-0024-z

Genome-Wide Studies of Specific Language Impairment

Genetics & Neuroscience (P Gejman, Section Editor)
Open access
Published: 26 September 2014

Volume 1, pages 242–250, (2014)
Cite this article

Download PDF

You have full access to this open access article

Current Behavioral Neuroscience Reports Aims and scope Submit manuscript

Genome-Wide Studies of Specific Language Impairment

Download PDF

Rose H. Reader¹,
Laura E. Covill¹,
Ron Nudel¹ &
…
Dianne F. Newbury^1,2

2470 Accesses
23 Citations
4 Altmetric
Explore all metrics

Abstract

Specific language impairment (SLI) is a multifactorial neurodevelopmental disorder which occurs unexpectedly and without an obvious cause. Over a decade of research suggests that SLI is highly heritable. Several genes and loci have already been implicated in SLI through linkage and targeted association methods. Recently, genome-wide association studies (GWAS) of SLI and language traits in the general population have been reported and, consequently, new candidate genes have been identified. This review aims to summarise the literature concerning genome-wide studies of SLI. In addition, this review highlights the methodologies that have been used to research the genetics of SLI to date, and also considers the current, and future, contributions that GWAS can offer.

Embryonic origin of two ASD subtypes of social symptom severity: the larger the brain cortical organoid size, the more severe the social symptoms

Article Open access 25 May 2024

Diagnostic Uncertainty in Childhood Motor Speech Disorders: A Review of Recent Tools and Approaches

Article Open access 07 June 2024

Towards a dynamic, comprehensive conceptualization of dyslexia

Article Open access 13 January 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The term ‘language impairment’ (LI) encompasses a wide range of disorders that can impair, or delay, both verbal and written language abilities. Specific language impairment (SLI) is one such disorder. It occurs in 5–8 % of English-speaking school children and is an example of a verbal language impairment [1, 2]. SLI occurs despite adequate intelligence, access to education and no major neurological deficit [3]. A diagnosis is reliant on exclusion of disorders that might cause the language impairment, such as autism, cerebral palsy, hearing loss or intellectual disability [4]. The aetiology of SLI, like that of many complex neurological disorders, is not well defined, but research over the last decade has provided us with strong evidence to support the likelihood of a polygenic genetic basis [5–13, 14•]. Although it might be expected that factors such as language input would impact a child’s likelihood of developing SLI, studies have shown that most children learn to talk competently even if they are taught by adults with language impairment. This suggests that, in comparison with genes, language input has little influence on a child’s risk of developing SLI [3].

Methodologies

Early genetic studies of SLI have been dominated by quantitative trait loci (QTL) mapping and genome-wide linkage analysis (GWLA) methodologies. Linkage studies rely on family data, aim to identify regions of the genome that are shared between related affected individuals [15] and can be either parametric or non-parametric. Parametric studies require definition of an inheritance model (e.g. dominant or recessive), penetrance and allele frequencies, while non-parametric methods are, for the most part, model free [15]. The specification of a model for SLI is not always straightforward; SLI tends to run in families and is considered to be highly heritable [16], but it only rarely follows a monogenic inheritance pattern [13]. This sporadic form of inheritance suggests that a collection of genes are at least partly responsible for the manifestation and severity of SLI in affected individuals [12]. Linkage studies may be limited in their application to complex genetic disorders because of their low resolution and pedigree limitations. Nonetheless, they can be conducted using small sample sizes and can effectively reduce the number of candidate regions to consider in preparation for targeted association analyses [17]. In contrast to linkage studies, association analyses have a high resolution and directly implicate specific genetic variants by comparing allele frequencies between cases and controls. Significantly associated variants would likely either be in linkage disequilibrium (LD) with the causal variant or be the causative variant themselves. Unlike linkage studies, association analyses assume that the causal variant will be constant across cases and will occur at a reduced frequency in controls. This can be a limiting factor, as complex disorders such as SLI are likely to involve a variable combination of variants (genetic heterogeneity), even within the same gene. In addition to case–control methods, association methods can be applied within family-based cohorts by comparison of allele frequencies between probands and unaffected family members [13].

SLI GWLA Studies

Two initial GWLA studies of SLI identified novel candidate regions that are linked to performance on tests of language-related ability, in families affected by SLI [5–7]. The first study, conducted by the SLI Consortium (SLIC), identified two novel susceptibility loci on chromosomes 16q23–24 (SLI1, OMIM 606711) and 19q13 (SLI2, OMIM 606712) [6]. A subsequent study by Bartlett et al. highlighted 13q21 (SLI3, OMIM 607134) and 2p22 as additional loci predisposing to SLI [7].

SLIC analysed 98 UK families (473 individuals), each with a proband whose language scores fell ≥1.5 standard deviation (SD) below the mean for their age and nonverbal IQ scores that were within the specified normal range (>80). This study considered three quantitative language measures derived from the Clinical Evaluation of Language Fundamentals—Revised (CELF-R) and non-word repetition (NWR) [6]. CELF-R is a commonly used test battery designed to assess both receptive and expressive language ability in school-age children [18]. The NWR test measures the ability to retain novel phonological (speech sound) information for short periods of time; this is commonly impaired in people with SLI [5, 11]. Following the language tests, linkage analysis was performed using 400 highly polymorphic microsatellite markers. Significant linkage was detected on chromosome 16 with the NWR trait (LOD score 3.55; P = 0.00003) and on chromosome 19 with the CELF-R expressive language score [ELS] (LOD score 3.55; P = 0.0004) [6].

Bartlett et al. analysed five Canadian families (including 73 individuals) of Celtic ancestry. Individuals were sorted into three categorical groups, labelled ‘language-impaired’, ‘reading-impaired’ and ‘clinically impaired’ on the basis of their performance across six quantitative language tests. Language-impaired individuals were defined by a Spoken Language Quotient (SLQ) score (from the Test of Language Development) [19] of ≤85, reading-impaired individuals had a non-word reading and IQ discrepancy, and clinically impaired individuals had a history of speech and/or reading therapy [7, 8]. Linkage to region 13q21 was detected under a recessive model in the reading-impaired group (LOD score 3.92; P < 0.01), and linkage to 2p22 was detected under a recessive model of the language-impaired group (LOD score 2.86; P < 0.06) [7].

The lack of overlap between the SLI susceptibility regions, implicated by these two independent linkage studies, not only supports the theory of locus heterogeneity but also demonstrates the statistical complexity of replicating genetic loci in different cohorts with variable phenotypes. Susceptibility regions 13q21 and 2p22, highlighted by Bartlett et al., may not have been detected by SLIC, because of alternative allele frequencies within the UK sample set. Although the Canadian families in the study by Bartlett et al. were not considered population isolates, they were selected from a different ethnic background compared with that of the SLIC cohort, and thus the markers carried in these regions may have been elevated to a detectable linkage peak in this group. Furthermore, SLIC and Bartlett utilised slightly different linkage methodologies and diagnostic criteria for determining SLI affection status. SLIC diagnosed probands on the basis of a clinical verbal-language battery [5, 6], whereas Bartlett included a more varied set of phenotypes, including reading ability (designed to reflect the proband’s overall language ability), and then classified all individuals as being affected or unaffected under three alternative definitions [7, 8]. SLIC used non-parametric linkage methods, whereas Bartlett used parametric linkage, assuming 7 % population penetrance, and used both dominant and recessive models of inheritance [5–8].

Despite the inconsistency of loci linked to SLI, both SLIC and Bartlett have since replicated their findings [5, 8]. SLIC conducted a targeted linkage study in 2004, with an additional 86 nuclear families selected and characterised as described for the SLIC samples above. Linkage was detected again on chromosomes 16 (LOD score 2.86; P < 0.02) and 19 (LOD score 2.31; P < 0.02), both to the NWR trait. The two SLIC cohorts were then pooled to total 184 families and 840 individuals. In this pooled dataset, highly significant linkage was detected on chromosome 16 [5].

In 2007, SLIC applied a genome-wide multivariate linkage approach to their pooled cohort, which was able to analyse linkage to multiple quantitative traits simultaneously. In total, they investigated 11 measures of spoken and receptive language ability, reading ability and non-verbal IQ. This study supported the Consortium’s previous evidence linking loci on chromosomes 16q (P = 0.008) and 19q (P = 0.017), and highlighted a novel region of linkage on chromosome 10q26 (P = 0.019) [9]. The linkage on chromosome 16q was specific to NWR in the previous univariate SLIC studies, and in the multivariate study it was linked to NWR and literacy measures (single-word reading and single-word spelling) [5, 6, 9], indicating that variation in this region will likely impact phonological memory. An inability to store short-term verbal information is likely to impair the ability of an individual to acquire and retain language skills, and this has been a growing, aetiological theory surrounding SLI [20, 21]. In contrast, linkage to chromosome 19q had previously been detected with multiple traits, firstly to ELS [6], then to NWR [5], and in the multivariate study it was found to be linked to a variety of expressive and language traits [9]. This suggests that the risk variants within this linkage region may impact upon a variety of language abilities.

One final study further expanded the SLIC cohort, analysing an additional 300 individuals from 93 families affected by SLI, and again replicated linkage of chromosome 16q (P = 0.002) with NWR, and linkage of chromosome 19q (P = 0.007) with ELS [22].

Bartlett et al. also expanded their cohort to include 22 US-based nuclear families with at least one individual per family affected by SLI, in combination with the original families studied [8]. In total, 365 DNA samples were genotyped for microsatellites on chromosomes 2 and 13, enabling replication of the chromosome region 13q21 linkage, using similar parametric modelling procedures. Further analysis revealed that only a small percentage of the families contributed to this linkage, suggesting that the risk factor is not sufficient or necessary for SLI to manifest itself in all families. In this combined sample set, the linkage region on chromosome 2 could not be replicated, suggesting that if SLI risk factors exist on chromosome 2, they may have a small effect size with low penetrance, which would make them difficult to identify using a linkage study [8].

In addition to the SLIC and Bartlett studies, a few other groups have also investigated the genetic basis of SLI, using single families and isolated populations [12, 23, 24, 25•]. These studies provided a unique opportunity to look at individuals with an increased level of shared environmental and genetic influences. When certain phenotypes become more prevalent in isolated populations, it may suggest that a founding genetic influence has been shared amongst the group and may thus be more common and somewhat easier to identify.

A classic linkage study, conducted prior to the two described above, linked language impairment to a region designated SPCH1 on chromosome 7q [26, 27]. This study analysed a single, large, three-generation pedigree known as the KE family, in which approximately half of the individuals were affected with a severe speech and language disorder. GWLA identified a region on chromosome 7q31 (maximum LOD score 6.62) that co-segregated with the language disorder in this particular family [26] and has since been narrowed down to a causative mutation in the FOXP2 (forkhead box P2) gene (OMIM 605317) [24]. It is important to note that not all members of the KE family would meet the selection criteria for studies of specific language impairment, because they had evidence of both intellectual disability and motor impairment. In addition, the severe language impairment exhibited by members of the KE family surpasses the typical SLI phenotype, in the sense that it involves a variety of associated neurological dysfunctions. All of the FOXP2 coding regions, or exons, have since been screened for association with SLI, using 43 SLIC probands, but no associations or mutations were detected in this study [28]. It is likely, then, that the FOXP2 gene remains functional in typical SLI probands. Despite this, the FOXP2 gene is clearly vital for language acquisition, as demonstrated by the KE phenotype [26]. The KE phenotype has since been described as childhood apraxia of speech (CAS), which has been linked to disruptive variants within FOXP2, as demonstrated by similarly affected families [29–32]. Subsequent studies of CAS also identified 16 submicroscopic deletions and duplications (copy number variants [CNVs]) in half of the participants [33]. These fell across ten different chromosomes and have the potential to cause disruptions in speech and language development. Of note, overlapping deletions at chromosome 16p13.2 were found in two of the participants, though the region does not overlap with previously implicated loci on chromosome 16, and the phenotypes associated with CNVs in this region have not been characterised [33]. High-throughput sequencing methods have also been applied to individuals affected by CAS [34]. Although this study sequenced the entire exomes (i.e. all known gene-coding regions) of ten CAS probands, it reported only mutations that affected known candidate genes. The study reported potentially clinically relevant variants (i.e. those variants that were predicted to have a deleterious effect upon protein function and had a reported population frequency of <0.3 %) in eight of the ten individuals investigated. These were distributed across six candidate genes that had previously been associated with CAS (FOXP1 [forkhead box P1], CNTNAP2 [contactin associated protein-like 2]) or overlapping phenotypes (ATP13A4 [ATPase type 13A4], CNTNAP1 [contactin associated protein 1], KIAA0319 and SETX [senataxin]) but did not include any FOXP2 mutations [34]. Although preliminary, the findings of this study suggest that the application of high-throughput methodologies and comprehensive analyses of the arising data may prove fruitful in future studies of speech and language impairments.

In 2010, a linkage study was conducted on a three-generation German family, the NE family, with multiple members affected by variable language and literacy impairments [23]. Psychoacoustic tests demonstrated an auditory processing deficit that co-segregated with these impairments. The investigators hypothesised that this deficit disallowed the affected family members to discriminate between tone durations, putting them at an increased risk of language-related disorders. Linkage analysis suggested that a large 58.5 Mb region, containing 600 genes, on chromosome 12p13.31–12q14.3 may contain a contributory variant, but the specific variant has yet to be identified [23].

Another independent linkage study investigated an isolated Chilean population with an increased prevalence of SLI. A series of language tests indicated that ~35 % of the island’s children met criteria for SLI, and a further 27.5 % had impaired language skills accompanied by other neurological deficits [12]. Parametric and non-parametric linkage analyses found consistent linkage to a 48 Mb stretch of chromosome 7q31–q36 (LOD score 6.73; P = 4.0 × 10⁻¹¹), which included the genes FOXP2 and CNTNAP2 (OMIM 604569) [12], the significance of which is discussed later in this article. No single co-segregating chromosome regions were identified using parametric linkage analyses, which supports the likelihood of a polygenic aetiology of SLI in this population.

A recent linkage study detected a heterozygous 4 kb deletion at chr2q36 (SLI5; OMIM 615432) in 15 Southeast Asian probands with language delay and white-matter hyper-intensities (WMH), which are common markers of aging [25•]. The deletion, which eliminated exon 3 of the TM4SF20 [transmembrane 4 L six family member 20] gene, co-segregated with language delay in the 15 families studied and appeared to represent an ancestral haplotype confined to Southeast Asian populations, notably Vietnamese, Thai and Burmese, with an allele frequency of ~1 % [25•]. The function of the TM4SF20 gene is unknown, and its function has yet to be assessed in other SLI populations.

Another study described a geographically isolated, Russian-speaking population with an increased prevalence of SLI [35]. The settlement involves ~871 people, 20–40 % of whom have language impairment [35]. At present, no genetic studies have been conducted using this population, but it is likely that the increased prevalence of SLI is caused by a founder mutation that is now widespread amongst the settlers, as seen in previous family and isolated-population studies of language impairment [12, 23, 25•].

Populations with an increased prevalence of language impairment can assist with the identification of candidate genes. It is assumed that the causative variant will be found more commonly within the affected population and will thus be easier to associate with the impairment. Future studies would benefit from investigating the role of the candidate genes that have been identified in these populations.

SLI Targeted Association Studies

Several studies have suggested shared genetic aetiology of speech disorders and reading disability (RD)—in particular, Speech Sound Disorder (SSD)—which is distinct from SLI but is rarely treated as such. SSD is considered to be a problem associated with phonological awareness, which leads to a deficit in age-appropriate speech sound production [36].

Targeted linkage studies have identified significant linkage for SSD within RD candidate regions. These include chromosomes 6p22, 15q21 (DYX1; OMIM 127700), 1p36 (DYX8; OMIM 608995) [36] and 3p12–q13 (SSD; OMIM 608445 and DYX5; OMIM 606896) [37]. A more recent study carried out GWLA in a large pedigree, exhibiting familial SSD and a motor sequencing deficit. The study identified linkage peaks on chromosomes 6p21, 7q32, 7q36 (overlapping with SLI4) and 8q24 primarily with the motor sequencing phenotype, and on 7q31.q32–34 with SSD [38].

A recent study also investigated whether autism and SLI share genetic risk factors, by conducting linkage analysis in 79 families affected by SLI and/or autism. Two linkage peaks were found on chromosomes 15q23–26, relating to oral language ability, and 16p12, relating to written language ability, within the subset of families that included individuals with both autism and SLI. However, neither linkage peak was independently associated with one disorder, indicating that each locus influenced both SLI and autism [39•].

Although the numbers of studies in this field are relatively small and consider only targeted regions, the findings indicate a high level of heterogeneity and the involvement of many contributory factors in language disorders. Following the discovery of SLI linkage regions, the subsequent targeted studies in this field aimed to narrow down those broad regions to allow identification of specific risk variants. Targeted studies have the advantage of investigating smaller regions with greater marker density, and therefore resolution, making them more powerful in detecting linkage and association.

A targeted association study, focusing on the chromosome 16q region [5, 6, 9, 22], pinpointed candidate genes for SLI [11]. The SLI1 region was screened using 1906 single nucleotide polymorphisms (SNPs) across 58 genes. Two clusters of variants were significantly associated with NWR and fell within CMIP (c-maf-inducing protein) at rs6564903 (P = 5.5 × 10⁻⁷) and ATP2C2 (calcium-transporting ATPase type 2C) at rs11860694 (P = 2.0 × 10⁻⁵). These findings suggest that CMIP and ATP2C2 are somehow involved in developing, or retaining, phonological short-term memories, which may be important for language acquisition [11]. Note that since the function of the identified variants remains unknown, it is possible that their effects are exerted through transcripts other than ATP2C2 and CMIP.

In addition, a study by Lonardo et al. presented a patient with postural, motor and speech delay, and severe learning and behavioural difficulties [40]. They established that the patient possessed an inverted de novo tandem duplication of 16q22q11.2, although they could not establish a true correlation of genotype to phenotype, because of the size of the duplication [40]. The findings of this study support the implications that a region of chromosome 16 plays an important role in the development of speech and language. Although the duplication at 16q22q11.2 that was observed in this study does not include CMIP (16q23.2) and ATP2C2 (16q24.1), it does suggest that chromosome 16 may include a number of genes that influence the development of language [5, 6, 9, 11, 22, 40].

An additional gene on chromosome 7q35–q36, known as CNTNAP2 (which is down-regulated by FOXP2 [10]), has also been associated with SLI. A functional study identified a CNTNAP2 fragment that was bound by the FOXP2 protein, regulating its expression. A targeted association analysis in the SLIC cohort identified significant associations between NWR and nine intronic SNPs in CNTNAP2 [10]. A subsequent association study supported the link between variants within the same region of CNTNAP2 and language ability in a general population sample [41].

Recently, a targeted association study considered the role of the HLA region on chromosome 6p, which is important for the function of the immune system, in SLI [42]. Among the genes implicated in the study, which included both quantitative and case–control analyses, were HLA-A (major histocompatibility complex, class I, A), HLA-B (major histocompatibility complex, class I, B) and HLA-DRB1 (major histocompatibility complex, class II, DR beta 1), which were previously implicated in autism [43], attention-deficit/hyperactivity disorder (ADHD) [44, 45] and schizophrenia [46].

In summary, five regions have consistently been associated with SLI, using GWLA, and targeted association studies have enabled identification of risk variants and candidate genes within three of these regions [13, 47] (see Table 1).

Table 1 A summary of specific language impairment (SLI) linkage in OMIM

Full size table

More recent publications in the language impairment field have begun to incorporate high-throughput SNP bead/chip assays to conduct genome-wide association studies (GWAS) of SLI and related traits [14•, 48, 49].

GWAS of Language Impairment

GWAS offer a high-resolution scan of a growing number of consistently variable loci within the genome and have been empowered by the development of genetic variation databases, such as the HapMap project [50] and the 1000 Genomes Project [51]. GWAS are conducted using SNP arrays and compare the frequencies of hundreds of thousands common di-allelic SNPs between affected and unaffected individuals. Under the common-disorder, common-variant hypothesis, accumulatively, these common SNPs may predispose individuals to various genetically complex diseases, including SLI. Thus GWAS methods are an ideal, and cost-effective, way of pinpointing possible risk variants.

Compared with other neurodevelopmental disorders, relatively few GWAS have been performed for SLI. Traditional case–control association studies typically require thousands of cases and controls [52] to gather adequate power, but at present no SLI cohort reaches this size. It may be possible to employ family-based association methods, which can use information from parents as well as probands and may provide increased power. For example, Weinberg et al. developed a method that uses case–parents trios and obtained moderate power when testing for child genetic effects using 100 case–parents trios and high power when testing for parent-of-origin effects (with effect sizes ranging between 2 and 3) [53]. In addition, despite being underpowered, smaller cohorts may still detect true associations that contribute across disorder populations [54]. A recent study performed a GWAS using the SLIC cohort [14•]. This study used family subsets (e.g. case–parents trios) in a multinomial association method which compares the likelihoods of null and risk models [55]. This approach allows for greater power when using parents and offers flexibility in the type of analysis performed. This particular study considered parent-of-origin effects, in addition to the more commonly considered child genotype model, and used quantitative language measures to assign SLI affection status. The authors found genome-wide significant paternal parent-of-origin effects with a SNP in NOP9 [nucleolar protein 9] (P = 3.74 × 10⁻⁸) on chromosome 14q12 and suggestive maternal parent-of-origin effects on chromosome 5p13 (P = 1.16 × 10⁻⁷) in a linkage region previously implicated in autism [56] and ADHD [57]. The NOP9 gene codes for an RNA-binding protein [58], which was shown to be dysregulated in schizophrenia patients [59]. The Avon Longitudinal Study of Parents and their Children (ALSPAC) population cohort was used to replicate the maternal association on chromosome 5; paternal genotypes were unavailable (P = 0.001), albeit in the opposite direction (i.e. the risk allele was different).

As discussed above, children with SLI often have other language disorders, or learning disorders, such as RD, SSD, ADHD or autism, which suggests that there may be an overlap of genetic risk factors [13, 60–62]. It may therefore be beneficial to combine cohorts to investigate language disorders together and increase the sample sizes. In a study of educational attainment, Rietveld et al. demonstrated that the size of a cohort can outweigh a strict phenotype selection in defining the underlying genetic aetiology of a disorder [63]. In a similar approach, more recent language impairment studies have utilised GWAS methods to investigate overlapping genetic risk factors between learning disorders.

GWAS Involving Disorders that Co-occur with SLI

Two GWAS of language-related traits in population samples have been reported. The first examined reading and language-related traits across the entire range in two independent population cohorts (the Brisbane Adolescent Twin Sample [BATS] and ALSPAC) [64] and then went on to perform a meta-analysis. The most significant association was with a SNP in ABCC13 [ATP-binding cassette, sub-family C (CFTR/MRP) member 13] (P = 7.34 × 10⁻⁸ in the combined cohort) on chromosome 21q11.2. ABCC13 (OMIM 608835) is thought to be a pseudogene that used to be involved with transporting molecules across intra- and extracellular membranes [64]. A second study also used the ALSPAC cohort but selected individuals for low language and/or reading skills from this sample set and compared allele frequencies within these subsets with the rest of the ALSPAC distribution [48]. The authors reported associations when considering individuals with concurrent RD and language impairment in ZNF385D [zinc finger protein 385D] (P = 5.45 × 10⁻⁷) on chromosome 3p24.3 and COL4A2 (collagen, type IV, alpha 2) [OMIM 120090] (P = 7.59 × 10⁻⁷) on chromosome 13q34. When examining individuals with only language impairment, the strongest association was reported for a SNP on chromosome 4q26, in NDST4 (N-deacetylase/N-sulfotransferase (heparan glucosaminyl) 4) [OMIM 615039] (P = 1.4 × 10⁻⁷).

Very recently, Gialluisi et al. conducted a GWAS meta-analysis involving individuals with both reading and language impairments [65]. This study included 1,862 individuals from families affected by RD or SLI in the UK Reading Disability (UK-RD), SLIC and Colorado Learning Disabilities Research Centre (CLDRC) databases. A family-based GWAS was performed on sibling pairs from each of the three datasets before combining all of the samples and conducting a meta-analysis. Two SNPs, rs59197085 on chromosome 7q32.1 and rs5995177 on chromosome 22q12.3, were found to be associated at a suggestive level (P ≈ 10⁻⁷) with a range of reading and language traits [65]. rs59197085 falls within the CCDC136 (coiled-coil domain containing 136) gene and falls within the SLI4 region of linkage. rs5995177 lies within the RBFOX2 (RNA-binding protein, fox-1 homolog 2) gene, which is heavily expressed in the neurons of the developing foetal brain [66]. RBFOX2 has been implicated in many neurological disorders, including Rolandic epilepsy [67] and autism [68]. In addition, RBFOX2 is both a FOXP2 target [69] and part of a cascade of genes that interact with the DYX1C1 (dyslexia susceptibility 1 candidate 1) gene [70, 71]. This study did not replicate the associations found in ATP2C2, CMIP or CNTNAP2. This is likely due to the inclusion of multiple affection phenotypes—language, reading and learning disability [65]. The combination of all of those phenotypes would likely increase genetic heterogeneity and lead to weaker associations with specific components, although the authors sought to overcome this limitation by implementing a principal component approach.

Conclusion

Linkage and association studies have highlighted a number of genetic regions that may predispose individuals to SLI, as summarised in Table 1. Targeted association studies have been particularly successful in narrowing linkage regions into candidate genes, of which there are now four to consider (Table 1). More recently, GWAS have been used to identify SNPs that might contribute to language ability or predispose individuals to language impairment, successfully highlighting additional genes for consideration. However, there is currently very little overlap in the findings between these various studies (Fig. 1). Future studies concerning SLI would benefit from functional studies, determining the role of candidate genes and risk variants that have been identified to date. These studies will allow us to identify exactly how the risk variants predispose individuals to language impairment and help elucidate the pathways underlying such complex disorders. In addition, it is clear that future studies will benefit from larger sample sizes, clearer definitions of SLI, more consistent selection criteria and investigation of related phenotypes. Neuroimaging studies, for example, have been very successful in identifying brain regions that are important for language and are noticeably altered in language-impaired individuals [72–74]. Such biological markers may prove useful as endophenotypes, representing altered brain activation patterns and structural volumes that correlate with language impairment and RD candidate gene dysfunction [48, 75]. It is also likely that we will start to see data from whole-genome and exome sequencing studies that will complement existing linkage and association studies, as evidenced by preliminary studies in CAS [34]. Sequencing methods can offer greater power to detect rare and common variants, even if they have only small effect sizes, but still require large sample sizes and careful controls for reliable identification of candidate genes [76]. Ultimately, in combination with the other methodologies and criteria described in this article, whole-genome and exome sequencing will assist with the identification of novel candidate genes and variants that may contribute to the risk of developing SLI; this may lead to earlier clinical intervention. The combination of data and expertise across these diverse disciplines will enable better understanding of the biological basis of language impairment.

References

Papers of particular interest, published recently, have been highlighted as: • Of importance

Tomblin JB et al. Prevalence of specific language impairment in kindergarten children. J Speech Lang Hear Res. 1997;40(6):1245–60.
Article CAS PubMed Google Scholar
Law J et al. Screening for primary speech and language delay: a systematic review of the literature. Int J Lang Commun Disord. 1998;33(S1):21–3.
Article PubMed Google Scholar
Bishop DVM. What causes specific language impairment in children? Curr Dir Psychol Sci. 2006;15(5):217–21.
Article PubMed Central PubMed Google Scholar
Tomblin JB, Records NL, Zhang X. A system for the diagnosis of specific language impairment in kindergarten children. J Speech Hear Res. 1996;39(6):1284–94.
Article CAS PubMed Google Scholar
SLI Consortium (SLIC). Highly significant linkage to the SLI1 locus in an expanded sample of individuals affected by specific language impairment. Am J Hum Genet. 2004;74(6):1225–38.
Article Google Scholar
SLI Consortium (SLIC). A genomewide scan identifies two novel loci involved in specific language impairment. Am J Hum Genet. 2002;70(2):384–98.
Article Google Scholar
Bartlett CW et al. A major susceptibility locus for specific language impairment is located on 13q21. Am J Hum Genet. 2002;71(1):45–55.
Article CAS PubMed Central PubMed Google Scholar
Bartlett CW et al. Examination of potential overlap in autism and language loci on chromosomes 2, 7, and 13 in two independent samples ascertained for specific language impairment. Hum Hered. 2004;57(1):10–20.
Article PubMed Central PubMed Google Scholar
Monaco AP. Multivariate linkage analysis of specific language impairment (SLI). Ann Hum Genet. 2007;71(Pt 5):660–73.
Article CAS PubMed Google Scholar
Vernes SC et al. A functional genetic link between distinct developmental language disorders. N Engl J Med. 2008;359(22):2337–45.
Article CAS PubMed Central PubMed Google Scholar
Newbury DF et al. CMIP and ATP2C2 modulate phonological short-term memory in language impairment. Am J Hum Genet. 2009;85(2):264–72.
Article CAS PubMed Central PubMed Google Scholar
Villanueva P et al. Genome-wide analysis of genetic susceptibility to language impairment in an isolated Chilean population. Eur J Hum Genet. 2011;19(6):687–95.
Article CAS PubMed Central PubMed Google Scholar
Newbury DF, Monaco AP. Genetic advances in the study of speech and language disorders. Neuron. 2010;68(2):309–20.
Article CAS PubMed Central PubMed Google Scholar
Nudel R et al. Genome-wide association analyses of child genotype effects and parent-of-origin effects in specific language impairment. Genes Brain Behav. 2014;13(4):418–29. This GWAS publication employs a novel approach to identify a new candidate gene and variants that may increase the risk of developing the disorder. It demonstrates the use of family-based association methods that can model parent-of-origin effects, as well as child genotype effects. Such methods allow for greater power when using small, family-based cohorts.
Article CAS PubMed Google Scholar
Kruglyak L et al. Parametric and nonparametric linkage analysis: a unified multipoint approach. Am J Hum Genet. 1996;58(6):1347–63.
CAS PubMed Central PubMed Google Scholar
Bishop DV, North T, Donlan C. Genetic basis of specific language impairment: evidence from a twin study. Dev Med Child Neurol. 1995;37(1):56–71.
Article CAS PubMed Google Scholar
Carlson CS et al. Mapping complex disease loci in whole-genome association studies. Nature. 2004;429(6990):446–52.
Article CAS PubMed Google Scholar
Semel E, Wiig EH, Secord W. Clinical evaluation of language fundamentals—revised. San Antonio: Psychological Corporation; 1992.
Google Scholar
Newcomer PL, H.D. Test of language development—2, Primary. 1998; Texas: Pro-Ed.
Bavin EL et al. Spatio‐visual memory of children with specific language impairment: evidence for generalized processing problems. Int J Lang Commun Disord. 2005;40(3):319–32.
Article PubMed Google Scholar
Archibald LD, Gathercole S. Nonword repetition in specific language impairment: more than a phonological short-term memory deficit. Psychon Bull Rev. 2007;14(5):919–24.
Article PubMed Google Scholar
Falcaro M et al. Genetic and phenotypic effects of phonological short-term memory and grammatical morphology in specific language impairment. Genes Brain Behav. 2008;7(4):393–402.
Article CAS PubMed Google Scholar
Addis L et al. A locus for an auditory processing deficit and language impairment in an extended pedigree maps to 12p13.31–q14.3. Genes Brain Behav. 2010;9(6):545–61.
Article CAS PubMed Central PubMed Google Scholar
Lai CS et al. A forkhead-domain gene is mutated in a severe speech and language disorder. Nature. 2001;413(6855):519–23.
Article CAS PubMed Google Scholar
Wiszniewski W et al. TM4SF20 ancestral deletion and susceptibility to a pediatric disorder of early language delay and cerebral white matter hyperintensities. Am J Hum Genet. 2013;93(2):197–210. Wiszniewski et al. describe a 4 kb heterozygous deletion on chromosome 2 that co-segregates with language delay in 15 Southeast Asian families. This deletion notably eliminates an entire exon from the gene TM4SF20, the function of which is currently unknown.
Article CAS PubMed Central PubMed Google Scholar
Fisher SE et al. Localisation of a gene implicated in a severe speech and language disorder. Nat Genet. 1998;18(2):168–70.
Article CAS PubMed Google Scholar
Lai CS et al. The SPCH1 region on human 7q31: genomic characterization of the critical interval and localization of translocations associated with speech and language disorder. Am J Hum Genet. 2000;67(2):357–68.
Article CAS PubMed Central PubMed Google Scholar
Newbury DF et al. FOXP2 is not a major susceptibility gene for autism or specific language impairment. Am J Hum Genet. 2002;70(5):1318–27.
Article CAS PubMed Central PubMed Google Scholar
MacDermot KD et al. Identification of FOXP2 truncation as a novel cause of developmental speech and language deficits. Am J Hum Genet. 2005;76(6):1074–80.
Article CAS PubMed Central PubMed Google Scholar
Shriberg LD et al. Speech, prosody, and voice characteristics of a mother and daughter with a 7;13 translocation affecting FOXP2. J Speech Lang Hear Res. 2006;49(3):500–25.
Article PubMed Google Scholar
Palka C et al. Mosaic 7q31 deletion involving FOXP2 gene associated with language impairment. Pediatrics. 2012;129(1):e183–8.
Article PubMed Google Scholar
Raca G et al. Childhood apraxia of speech (CAS) in two patients with 16p11.2 microdeletion syndrome. Eur J Hum Genet. 2013;21(4):455–9.
Article CAS PubMed Central PubMed Google Scholar
Laffin JJ et al. Novel candidate genes and regions for childhood apraxia of speech identified by array comparative genomic hybridization. Genet Med. 2012;14(11):928–36.
Article CAS PubMed Central PubMed Google Scholar
Worthey EA et al. Whole-exome sequencing supports genetic heterogeneity in childhood apraxia of speech. J Neurodev Disord. 2013;5(1):29.
Article PubMed Central PubMed Google Scholar
Rakhlin N et al. The language phenotype of a small geographically isolated Russian-speaking population: implications for genetic and clinical studies of developmental language disorder. Appl Psycholinguist. 2013;34(05):971–1003.
Article Google Scholar
Smith SD et al. Linkage of speech sound disorder to reading disability loci. J Child Psychol Psychiatry. 2005;46(10):1057–66.
Article PubMed Google Scholar
Stein CM et al. Pleiotropic effects of a chromosome 3 locus on speech-sound disorder and reading. Am J Hum Genet. 2004;74(2):283–97.
Article CAS PubMed Central PubMed Google Scholar
Peter B, Matsushita M, Raskind WH. Motor sequencing deficit as an endophenotype of speech sound disorder: a genome-wide linkage analysis in a multigenerational family. Psychiatr Genet. 2012;22(5):226–34.
Article PubMed Central PubMed Google Scholar
Bartlett CW et al. A genome scan for loci shared by autism spectrum disorder and language impairment. Am J Psychiatry. 2014;171(1):72–81. This recent publication investigates the potential overlap of genetic risk factors between SLI and autism. Interestingly, this linkage study identified two peaks in a subset of families presenting with both disorders. Both linkage peaks appeared to influence both SLI and autism, which suggests they share a genetic aetiology.
Article PubMed Google Scholar
Lonardo F et al. Clinical, cytogenetic and molecular-cytogenetic characterization of a patient with a de novo tandem proximal-intermediate duplication of 16q and review of the literature. Am J Med Genet A. 2011;155A(4):769–77.
Article PubMed Google Scholar
Whitehouse AJ et al. CNTNAP2 variants affect early language development in the general population. Genes Brain Behav. 2011;10(4):451–6.
Article CAS PubMed Central PubMed Google Scholar
Nudel R et al. Associations of HLA alleles with specific language impairment. J Neurodev Disord. 2014;6(1):1.
Article PubMed Central PubMed Google Scholar
Torres AR et al. The association and linkage of the HLA-A2 class I allele with autism. Hum Immunol. 2006;67(4–5):346–51.
Article CAS PubMed Google Scholar
Wang Y-P, et al. Study on the association between HLA-DRB1 genes and ADHD in Xi’an. Chin J Child Health Care. 2008;(1):14–15, 18.
Odell JD et al. Association of genes within the major histocompatibility complex with attention deficit hyperactivity disorder. Neuropsychobiology. 1997;35(4):181–6.
Article CAS PubMed Google Scholar
Palmer CG et al. HLA-B maternal–fetal genotype matching increases risk of schizophrenia. Am J Hum Genet. 2006;79(4):710–5.
Article CAS PubMed Central PubMed Google Scholar
Caylak E. Biological/biochemical features and molecular genetics of specific language impairment (SLI). eLS. 2011. doi:10.1002/9780470015902.a0023688.
Google Scholar
Eicher JD et al. Genome-wide association study of shared components of reading disability and language impairment. Genes Brain Behav. 2013;12(8):792–801.
Article CAS PubMed Central PubMed Google Scholar
Becker J et al. Genetic analysis of dyslexia candidate genes in the European cross-linguistic NeuroDys cohort. Eur J Hum Genet. 2014;22(5):675–80.
Article CAS PubMed Google Scholar
The International Hapmap Project. Available online: http://www.hapmap.org/. Accessed 2 Jul 2014.
1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56–65.
Article Google Scholar
McCarthy MI et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet. 2008;9(5):356–69.
Article CAS PubMed Google Scholar
Weinberg CR, Wilcox AJ, Lie RT. A log-linear approach to case-parent-triad data: assessing effects of disease genes that act either directly or through maternal effects and that may be subject to parental imprinting. Am J Hum Genet. 1998;62(4):969–78.
Article CAS PubMed Central PubMed Google Scholar
Scerri TS et al. PCSK6 is associated with handedness in individuals with dyslexia. Hum Mol Genet. 2011;20(3):608–14.
Article CAS PubMed Central PubMed Google Scholar
Ainsworth HF et al. Investigation of maternal effects, maternal–fetal interactions and parent-of-origin effects (imprinting), using mothers and their offspring. Genet Epidemiol. 2011;35(1):19–45.
Article PubMed Central PubMed Google Scholar
Liu J et al. A genomewide screen for autism susceptibility loci. Am J Hum Genet. 2001;69(2):327–40.
Article CAS PubMed Central PubMed Google Scholar
Ogdie MN et al. A genomewide scan for attention-deficit/hyperactivity disorder in an extended sample: suggestive linkage on 17p11. Am J Hum Genet. 2003;72(5):1268–79.
Article CAS PubMed Central PubMed Google Scholar
Thomson E, Rappsilber J, Tollervey D. Nop9 is an RNA binding protein present in pre-40S ribosomes and required for 18S rRNA synthesis in yeast. RNA. 2007;13(12):2165–74.
Article CAS PubMed Central PubMed Google Scholar
Glatt SJ et al. Similarities and differences in peripheral blood gene-expression signatures of individuals with schizophrenia and their first-degree biological relatives. Am J Med Genet B Neuropsychiatr Genet. 2011;156b(8):869–87.
Article PubMed Google Scholar
Bishop DV. Genetic influences on language impairment and literacy problems in children: same or different? J Child Psychol Psychiatry. 2001;42(2):189–98.
Article CAS PubMed Google Scholar
Newbury DF et al. Investigation of dyslexia and SLI risk variants in reading- and language-impaired subjects. Behav Genet. 2011;41(1):90–104.
Article CAS PubMed Central PubMed Google Scholar
Willcutt EG et al. Etiology and neuropsychology of comorbidity between RD and ADHD: the case for multiple-deficit models. Cortex. 2010;46(10):1345–61.
Article PubMed Central PubMed Google Scholar
Rietveld CA et al. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science. 2013;340(6139):1467–71.
Article CAS PubMed Central PubMed Google Scholar
Luciano M et al. A genome-wide association study for reading and language abilities in two population cohorts. Genes Brain Behav. 2013;12(6):645–52.
Article CAS PubMed Central PubMed Google Scholar
Gialluisi A et al. Genome-wide screening for DNA variants associated with reading and language traits. Genes Brain Behav. 2014;13(7):686–701.
Article CAS PubMed Google Scholar
Gehman LT et al. The splicing regulator RBFOX2 is required for both cerebellar development and mature motor function. Genes Dev. 2012;26(5):445–60.
Article CAS PubMed Central PubMed Google Scholar
Lal D et al. RBFOX1 and RBFOX3 mutations in Rolandic epilepsy. PLoS One. 2013;8(9):e73323.
Article CAS PubMed Central PubMed Google Scholar
Voineagu I et al. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature. 2011;474(7351):380–4.
Article CAS PubMed Central PubMed Google Scholar
Ayub Q et al. FOXP2 targets show evidence of positive selection in European populations. Am J Hum Genet. 2013;92(5):696–706.
Article CAS PubMed Central PubMed Google Scholar
Norris JD et al. A negative coregulator for the human ER. Mol Endocrinol. 2002;16(3):459–68.
Article CAS PubMed Google Scholar
Massinen S et al. Functional interaction of DYX1C1 with estrogen receptors suggests involvement of hormonal pathways in dyslexia. Hum Mol Genet. 2009;18(15):2802–12.
Article CAS PubMed Google Scholar
Smit DJ et al. Neuroimaging and genetics: exploring, searching, and finding. Twin Res Hum Genet. 2012;15(3):267–72.
Article PubMed Central PubMed Google Scholar
Shaywitz SE, Shaywitz BA. Paying attention to reading: the neurobiology of reading and dyslexia. Dev Psychopathol. 2008;20(4):1329–49.
Article PubMed Google Scholar
Vandermosten M et al. A tractography study in dyslexia: neuroanatomic correlates of orthographic, phonological and speech processing. Brain. 2012;135(Pt 3):935–48.
Article PubMed Google Scholar
Eicher JD, Gruen JR. Imaging-genetics in dyslexia: connecting risk genetic variants to brain neuroimaging and ultimately to reading impairments. Mol Genet Metab. 2013;110(3):201–12.
Article CAS PubMed Central PubMed Google Scholar
Yu TW et al. Using whole-exome sequencing to identify inherited causes of autism. Neuron. 2013;77(2):259–73.
Article CAS PubMed Central PubMed Google Scholar

Download references

Acknowledgments

We would like to thank all of the families, professionals and individuals who participate in our research.

The work of the Wellcome Trust Centre in Oxford is supported by the Wellcome Trust [grant number 090532/Z/09/Z]. Dianne Newbury is a Medical Research Council (MRC) Career Development Fellow and a Junior Research Fellow at St. John’s College, University of Oxford. The work of the Newbury lab is funded by the Medical Research Council (grant numbers G1000569/1 and MR/J003719/1). Ron Nudel is funded by a University of Oxford Nuffield Department of Medicine Prize Studentship.

Author Contributions

All authors conceptualized and wrote this review.

Compliance with Ethics Guidelines

ᅟ

Conflict of Interest

Dianne Newbury received a grant from the Medical Research Council. Laura Covill, Rose Reader and Ron Nudel have no conflicts of interest.

Human and Animal Rights and Informed Consent

This article does not contain any studies with human or animal subjects performed by the authors.

Author information

Authors and Affiliations

Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Rose H. Reader, Laura E. Covill, Ron Nudel & Dianne F. Newbury
St John’s College, University of Oxford, Oxford, OX1 3JP, UK
Dianne F. Newbury

Authors

Rose H. Reader
View author publications
You can also search for this author in PubMed Google Scholar
Laura E. Covill
View author publications
You can also search for this author in PubMed Google Scholar
Ron Nudel
View author publications
You can also search for this author in PubMed Google Scholar
Dianne F. Newbury
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dianne F. Newbury.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Reader, R.H., Covill, L.E., Nudel, R. et al. Genome-Wide Studies of Specific Language Impairment. Curr Behav Neurosci Rep 1, 242–250 (2014). https://doi.org/10.1007/s40473-014-0024-z

Download citation

Published: 26 September 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s40473-014-0024-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Genome-Wide Studies of Specific Language Impairment

Abstract

Similar content being viewed by others

Embryonic origin of two ASD subtypes of social symptom severity: the larger the brain cortical organoid size, the more severe the social symptoms

Diagnostic Uncertainty in Childhood Motor Speech Disorders: A Review of Recent Tools and Approaches

Towards a dynamic, comprehensive conceptualization of dyslexia

Introduction

Methodologies

SLI GWLA Studies

SLI Targeted Association Studies

GWAS of Language Impairment

GWAS Involving Disorders that Co-occur with SLI

Conclusion

References

Papers of particular interest, published recently, have been highlighted as: • Of importance

Acknowledgments

Author Contributions

Compliance with Ethics Guidelines

Conflict of Interest

Human and Animal Rights and Informed Consent

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Genome-Wide Studies of Specific Language Impairment

Abstract

Similar content being viewed by others

Embryonic origin of two ASD subtypes of social symptom severity: the larger the brain cortical organoid size, the more severe the social symptoms

Diagnostic Uncertainty in Childhood Motor Speech Disorders: A Review of Recent Tools and Approaches

Towards a dynamic, comprehensive conceptualization of dyslexia

Introduction

Methodologies

SLI GWLA Studies

SLI Targeted Association Studies

GWAS of Language Impairment

GWAS Involving Disorders that Co-occur with SLI

Conclusion

References

Papers of particular interest, published recently, have been highlighted as: • Of importance

Acknowledgments

Author Contributions

Compliance with Ethics Guidelines

Conflict of Interest

Human and Animal Rights and Informed Consent

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation