Kiwi genome provides insights into evolution of a nocturnal lifestyle
Kiwi, comprising five species from the genus Apteryx, are endangered, ground-dwelling bird species endemic to New Zealand. They are the smallest and only nocturnal representatives of the ratites. The timing of kiwi adaptation to a nocturnal niche and the genomic innovations, which shaped sensory systems and morphology to allow this adaptation, are not yet fully understood.
We sequenced and assembled the brown kiwi genome to 150-fold coverage and annotated the genome using kiwi transcript data and non-redundant protein information from multiple bird species. We identified evolutionary sequence changes that underlie adaptation to nocturnality and estimated the onset time of these adaptations. Several opsin genes involved in color vision are inactivated in the kiwi. We date this inactivation to the Oligocene epoch, likely after the arrival of the ancestor of modern kiwi in New Zealand. Genome comparisons between kiwi and representatives of ratites, Galloanserae, and Neoaves, including nocturnal and song birds, show diversification of kiwi’s odorant receptors repertoire, which may reflect an increased reliance on olfaction rather than sight during foraging. Further, there is an enrichment of genes influencing mitochondrial function and energy expenditure among genes that are rapidly evolving specifically on the kiwi branch, which may also be linked to its nocturnal lifestyle.
The genomic changes in kiwi vision and olfaction are consistent with changes that are hypothesized to occur during adaptation to nocturnal lifestyle in mammals. The kiwi genome provides a valuable genomic resource for future genome-wide comparative analyses to other extinct and extant diurnal ratites.
KeywordsGene Ontology Olfactory Receptor Zebra Finch Membrane Proteome Foreground Branch
Conserved domain database
Cluster database at high identity with tolerance
Giga base pairs
G protein-coupled receptor
Hiden markov model
kilo base pairs
Likelihood ratio test
Mega base pairs
Polymerase chain reaction
Ultra-conserved non-coding element
New Zealand’s geographic isolation, after the separation from Gondwana around 80 million years ago, provides an unequaled opportunity to study the results of evolutionary processes following geographic isolation. In New Zealand, the ecological niches typically occupied by mammals in most other parts of the world are dominated by birds. Kiwi (genus Apteryx), the national symbol of New Zealand, belong to a group of flightless birds, the ratites. This group is geographically broadly distributed including both extant members, which are the ostrich in Africa, the emu in Australia, the cassowary in New Guinea, and the rhea in South America, and, as extinct members, the moa from New Zealand and the elephant birds from Madagascar. New Zealand is thus the only landmass to have been inhabited by two ratite lineages. Strikingly, the two lineages are highly divergent in size with moa having a body size of up to 3 m  while kiwi, the smallest of the ratites, reaches only the size of a chicken. Moreover, while moa occupied the diurnal niche, kiwi are the only ratites, and one of only a few bird lineages (less than 3 % of the bird species ), that are nocturnal. Although the kiwi eye is unusually small for a nocturnal bird, it has a nocturnal-type retina . This may indicate that the nocturnal adaptation of kiwi is recent, or alternatively, that changes in eye size are not a prerequisite for nocturnality.
We have sequenced and assembled the genome of Apteryx mantelli, the North Island brown kiwi, to improve our understanding of how genomic features evolve during adaptation to nocturnality and the ground-dwelling niche. We have also sequenced the transcriptome from embryonic tissue to provide support for the genome annotation. We identified genomic changes in kiwi that affect physiological functions, including vision and olfaction, which have been predicted to characterize nocturnal adaptation in the early history of mammals .
Genome sequencing, assembly, and annotation
Kiwi genome assembly characteristics and genomic features compared with other avian genomes (see Additional file 1: Table S4)
Size of assembly (Gb)
N50 scaffolds (Mb)
Heterozygous SNP rate per kb
Falco cherrug 
Falco peregrinus 
Taeniopygia guttata 
Ficedula albicolis 
Anas platyrhynchos 
Gallus gallus 
Meleagris gallopavo 
We identified a set of 27,876 genes following de novo gene prediction on the assembled genome (Additional file 1: Note: De novo gene prediction and gene annotation). To refine these gene annotations we used 47.5 Gb of transcript sequence data from kiwi embryonic tissue together with the de novo gene predictions and protein evidence from three well-annotated bird species (G. gallus, T. guttata, M. gallopavo) as input to the MAKER genome annotation pipeline . A validated set of 18,033 genes was selected based on their alignment to orthologous genes in other birds and on supporting evidence provided by kiwi transcript sequences. In total, the gene models spanned 306.62 Mb of the assembly, with exons accounting for 23.96 Mb (approximately 1.6 %) of the total kiwi genome.
Evolution of gene families
Changes of gene-family sizes have been inferred for multiple de novo assembled genomes [17, 18]. However, many of these genomes have rather fragmented assemblies (Table 1); thus, results should be interpreted cautiously, only after manual inspection and ideally independent experimental confirmation.
We therefore manually examined the 130 gene families that had either significant expansion or contraction specifically to the kiwi branch. After excluding expansions that were caused by fragmentation of the assembly  only 85 gene families remained significant (Additional file 1: Table S6). Of these, 63 gene families are expanded in the kiwi. An analysis of gene family functions  showing expansion in kiwi identified enrichment in categories including signal transduction, calcium homeostasis, and motor activity (FDR <0.0001, Additional file 1: Figure S2A). Among the gene families that show contraction on the kiwi branch we found an enrichment of development-related Gene Ontology (GO) categories (FDR <0.0001, Additional file 1: Figure S2B).
Diversification of tetrapods and the colonization of terrestrial habitats are often accompanied by changes of physiological systems specifically in cellular signal transduction . Membrane proteins are involved in cellular signaling, hence we aimed to determine more specifically which classes of membrane-expressed proteins have undergone changes in the number of coding genes. To this end we annotated the membrane proteome in kiwi, human, all birds, and reptiles present in Ensembl 74, two additional ratites (ostrich and tinamou) and two nocturnal birds (chuck-will’s-widow and barn owl) (Additional file 1: Note: Detection and classification of the membrane proteome; Additional file 1: Table S7). We manually inspected the classes which showed expansion in kiwi, to ensure that the higher number of predicted genes is not a result of assembly fragmentation. We found a significant expansion in kiwi of genes coding for adhesion and immune-related proteins (Additional file 1: Table S7). Additionally, we found a significant expansion of the Ephrin kinases class, which are functionally involved in the development of the sensory-motor innervation of the limb  and later on in tendons condensation and developing feather buds .
Patterns of natural selection
To determine whether any branch-specific selection is present in kiwi we estimated branch ω-values (Ka/Ks substitution ratios) for 4,152 orthologous genes in eight bird species: kiwi, ostrich, tinamou, chuck-will’s-widow, barn owl, chicken, zebra finch, and turkey using CODEML . Ortholog assignment was based on the orthology relation among chicken, zebra finch, and turkey defined in Ensembl 73 (Additional file 1: Note: Orthologs and Ka/Ks calculation). The kiwi average ω across all the orthologs is comparable to that in ostrich, and higher than in tinamou and night birds (0.291, 0.313, 0.145, 0.202, and 0.200 for kiwi, ostrich, tinamou, chuck-will’s-widow, and barn owl, respectively). This implies a relatively faster overall rate of functional evolution in kiwi and ostrich.
In addition to gene-family expansions/contractions, we used evidence of branch-specific selection to identify genes and functional pathways that may underlie kiwi-specific adaptations. For the 4,152 orthologous genes in the eight bird species we used the branch models from CODEML to perform likelihood ratio tests , comparing a simple model of one ω for all sites and branches versus a model where kiwi is defined as the foreground branch and the other birds as background. We first considered genes with a significantly higher ω on the kiwi branch than that in all other birds (LRT >3.84, significance at 5 %, 1 degree of freedom). Functional enrichment using GO  categories was tested using a hypergeometric test (Additional file 1: Note: Gene ontology and rapidly evolving genes). The same test was performed on genes evolving significantly slower in kiwi. To assign functional categories as either kiwi-specific, or shared with other ratites or nocturnal birds, a similar procedure was performed for each species of Palaeognathae (ostrich, tinamou) and night birds (chuck-will’s-widow, barn owl) by assigning each in turn as the foreground branch in CODEML.
After multiple testing correction using family-wise error rate none of the categories remained significant. For further analysis we considered only GO categories that had (1) a P value <0.05; (2) at least three significantly changed genes; and (3) the number of significant genes was at least 5 % of the total genes annotated in the GO category. GO categories that were over-represented (P value <0.05) on the kiwi branch, but not present in any of the other considered species, were identified as potentially kiwi-specific changes (Additional file 1: Note: Gene ontology and rapidly evolving genes). Notably, faster-evolving categories present in kiwi, but absent in any of the other species, are related to mitochondrion, feeding behavior and energy reserve metabolic process, visual perception, and eye photoreceptor cell differentiation (Additional file 1: Table S8A). Sensory perception of light stimulus is a faster evolving category shared, surprisingly, with the ostrich (Additional file 1: Table S8B). Among slower evolving categories, the mitochondrial outer membrane was one of the kiwi-specific categories (Additional file 1: Table S9A), while anion channel activity was a shared category with chuck-will’s-widow (Additional file 1: Table S9B). For the potentially biological meaningful categories which could explain kiwi-specific physiology we extracted the genes clustering in the node. GO categories have a high potential to deliver false-positive enrichment, which could be considered biologically meaningful a posteriori . Therefore, future studies need to verify the adaptive functionality of genes belonging to the respective category (Additional file 1: Tables S8C and S9C).
Annotated opsins in the Apteryx mantelli genome
AptMant0 annotation ID
External gene ID
ω Apt. mantelli
No obvious alteration
Partial sequence TM7
Deleterious mutation Glu3.49Lys
Partial sequence, deleterious mutation Glu6.30Gly
No obvious alteration
No obvious alteration
No obvious alteration
No obvious alteration
No obvious alteration
Kiwi sensory adaptations – vision
Nocturnality is accompanied by a number of specific changes, including adaptations in visual processing . In contrast to most nocturnal animals, that have large eyes relative to their body size, kiwi have small eyes and reduced optic lobes in the brain . However, the kiwi retina has a higher proportion of rods than cones which is consistent with adaptation to nocturnality . Besides black/white vision mediated via rhodopsin (RHO), most birds have trichromatic or tetrachromatic vision, for which various additional opsins are responsible: OPN1LW (red), OPN1MW (green, RH2), OPN1SW (blue, subtypes SWS1, SWS2) . We identified these genes in the kiwi assembly. The RHO gene in kiwi shows no interruption and no obvious function-impairing amino acid changes compared to other vertebrates. We were able to assemble only a partial sequence of the red opsin OPN1LW (transmembrane (TM) helix 7) and found no previously described deleterious amino acid changes within this region .
Similarly, at the N-terminal end of TM6 in OPN1SW we identified a highly conserved Glu6.30 which is present in all bird orthologs sequenced so far, except for kiwi OPN1SW where Glu6.30 is substituted by Gly. Previous functional characterization has shown that mutation of Glu6.30 destabilizes the H-bond network resulting in constitutively active opsins and other rhodopsin-like GPCRs [32, 33]. A constitutively active opsin is functionally incapable of light signal transmission  and is therefore non-functional.
Besides these two functionally well-characterized positions, we identified several other amino acids substitutions in kiwi OPN1MW and OPN1SW. Further, tests for branch and branch-site specific ω values for OPN1MW and OPN1SW on the kiwi branch showed no evidence for positively selected sites in kiwi (Additional file 1: Note: Vision analysis), suggesting that the greater ω values for kiwi are likely due to loss of constraint on these genes. Hence these genes are likely to be drifting and, considering the fact that only 8 % of all inactivating mutations in GPCRs are stop codons while almost 65 % are missense mutations [35, 36, 37], the described loss-of-function mutations in OPN1MW and OPN1SW render color vision of kiwi, unlike for other sequenced ratites (Fig. 2), absent – at least for the green and blue spectral ranges.
We tentatively dated the opsin-loss-of-function event as an indicator of the timing of adaptation to the nocturnal niche. Assuming that the loss of constraint happened on the kiwi branch in a short period of time and changed the rate of selection, measured by the ω value, from the average over bird lineages (0.021 for OPN1MW and 0.014 for OPN1SW, Table 2) to the neutral ω value of 1, the loss of function was dated to 30–38 million years ago (Additional file 1: Note: Vision analysis), which places the event shortly after the arrival of kiwi in New Zealand .
Kiwi sensory adaptations – olfaction
Kiwi are unique among birds in having nostrils present at the end of their prominent beaks and have been reported to depend largely on tactile and olfactory senses for foraging . To investigate whether the genome shows signs of olfactory adaptation in kiwi we assessed the numbers of olfactory receptor (OR) genes  and the diversity in the OR sequence .
The only previous approach to molecular characterization of the olfactory system in kiwi was based on PCR amplification of ORs with degenerate primers . This allowed only a rough estimation of the number of ORs of 478 genes (95 % confidence interval 156–1,708 genes). PCR with degenerate primers only produces incomplete fragments of the genes and hence the accurate quantification of gene families with highly similar sequences, as in the case of ORs, is prone to over-estimation . In contrast, de novo genome assembly facilitates a global assessment of the gene repertoire  and can therefore be used to provide a more accurate estimate of the OR repertoire. We thus annotated the OR genes in kiwi, as part of the entire membrane proteome, on the basis of putative functionality and seven transmembrane helices (7TM) (Additional file 1: Note: Olfactory receptor genes identification and annotation). The number of non-OR receptor families was comparable to other avian species, suggesting that the membrane proteome is well annotated in kiwi (Additional file 1: Table S7). This analysis revealed an initial set of 82 OR genes in the kiwi genome. However, ORs are highly duplicated across the genome and such regions could be prone to being overcollapsed during the assembly process. We therefore estimated the copy number of each annotated OR using a correction based on coverage. To obtain the correction factor for each OR, read-coverage in the OR region was divided by the genome-wide average coverage corresponding to its GC bin. Following this correction we estimated that up to 141 OR genes are present in the kiwi genome, of which 86 encode for full-length receptors while the rest are most likely pseudogenes due to frameshifts, premature stop codons, or truncations (Additional file 1: Note: Olfactory receptor genes identification and annotation). The estimated proportion of intact ORs among all OR genes in kiwi (61 %) is lower than previously reported for Apteryx australis  (78.6 %), but much higher than in zebra finch (38 %) .
Comparative analysis of the OR repertoire shows that the kiwi genome has both the α and the γ subgroups of type 1 OR genes, as reported for other bird genomes sequenced so far . Unlike the majority of other birds analyzed so far, kiwi has a higher number of γ subgroup ORs. Gene family size estimates are highly dependent on genome quality  and continuous curation is ongoing even for well-annotated genomes: for example, in the chicken olfactory repertoire the number of annotated ORs changed by a factor of eight in two consecutive Ensembl releases (release 73 – 251 ORs and release 74 – 30 ORs). Further improvement of genome qualities, including kiwi, are therefore required for the identification of a complete set of ORs. Thus, a correlation between olfactory acuity and the number of ORs in different birds could be subject to error.
Phenotypic diversity in olfaction is, in part, attributable to genetic variation with a wider range of odors thought to be detectable given more genetic variation . Since the absolute number of ORs might be a poor predictor of olfactory abilities, we investigated the variation in the γ ORs sequence as a measure of the range of possible detectable odors. The average protein sequence entropy was calculated to check for variation within the γ-c clade in each species (Additional file 1: Note: γ-c clade OR within-species protein sequence entropy).
Previous studies have shown that Shannon entropy (H) analysis is a sensitive tool for estimating the diversity of a system [47, 48]. For protein sequence, H ranges from 0 (only one residue is present at that position in the multiple sequence alignment) to 4.322 (all 20 residues are equally represented in that position). Typically H ≤2 is attributed to high conservation . H values in birds were in the range of 0.34±0.05 (zebra finch) to 1.11±0.12 (chicken). The average entropy in kiwi sequences was 1.23±0.15, significantly higher than all other bird species investigated (P value = 0.003 Wilcoxon Signed-Rank test, Additional file 1: Note: γ-c clade OR within-species protein sequence entropy). We conclude that overall the γ-c clade of ORs are highly similar in sequence, in accordance with previously published data . However, since detection of a wider range of odors is correlated to genetic variation of ORs , the significantly higher H in kiwi ORs is suggestive for a broad odor acuity in this species in comparison to other birds.
The most prominent phenotype of kiwi, lack of wings, has been linked to energy conservation  and to the limited resources in New Zealand in late Oligocene . Like most ratites, kiwi are flightless, but the phylogenetic tree of Palaeognathae implies that this phenotype evolved several times independently in this order . Unlike ostriches and rheas, that possess prominent wings, kiwi show only vestigial invisible wings, while moa lack even vestiges .
To determine whether we can identify the genetic basis for the extremely regressed wings in kiwi we annotated genes in the highly conserved signaling pathways related to limb development (Additional file 1: Note: Kiwi morphology analysis; Additional file 1: Figure S3). These include genes belonging to the FGFs, TBX cluster, HOX cluster (Additional file 1: Figure S4; Additional file 1: Table S11), WNT, SALL, and FIBIN genes, known to be responsible for limb and wing development  (Additional file 1: Table S12). Growth and transcription factors typically influence the development of both upper and lower limbs, while FIBIN is currently the only gene described to be exclusively involved in the development of the upper limb .
For these clusters of genes, we aligned corresponding orthologs and translated multiple alignments, which were then manually inspected. No insertions, deletions, and/or stop codons that would clearly disrupt the open reading frame could be identified in the inspected genes. Additionally, we found all 39 HOX genes expected for the Sauropsid ancestor  and investigation of regulatory sequences within the HOX clusters by phylogenetic footprinting showed no preferential loss of conserved DNA elements in Apteryx mantelli compared to Galliformes (Additional file 1: Figure S4; Additional file 1: Table S11).
To detect signs of different evolution in kiwi wing and tail developmental genes we performed a selective constraint analysis using the CODEML branch test (Additional file 1: Note: Selection analysis on limb development genes; Additional file 1: Table S12). Of these genes FIBIN was the only gene that showed signals of positive selection on the avian tree including chicken, turkey, and zebra finch (Additional file 1: Figure S5). Three sites with signs of positive selection that were 100 % conserved in the other species show a different amino acid in kiwi: exchanges of Ser136Ala, Gln148Arg, and Phe162Cys (positions are relative to the mouse Fibin coding sequence). The functional relevance of these substitutions is unclear and needs to be studied when experimental tests of FIBIN function become available.
Since no obvious alterations could be found in the coding sequences of genes involved in developmental processes, which could explain the regressed-wing morphology of kiwi, we further analyzed ultra-conserved non-coding elements (UCNEs) (Additional file 1: Note: Ultra-conserved non-coding elements analysis). UCNEs are defined as DNA non-coding regions of ≥95 % sequence identity between human and chicken, longer than 200 bp . The majority of UCNEs cluster in genomic regions containing genes coding for transcription factors and developmental regulators  and experimental studies in transgenic animals have shown that some of these sequences can act as tissue-specific enhancers during developmental processes . Of the 4,351 UCNEs annotated in UCNEbase , 19 showed more than the expected 5 % sequence variation as defined in the database  (Additional file 1: Table S13). Among these, four were related to HOXA, TBX2, Sp8, and TFAP2A genes which have been previously described in limb development pathways [53, 58, 59], suggesting that changes in non-coding elements could be involved in kiwi’s loss of wings.
With their small body size, extremely large egg size, nocturnal life style, and prominent nostrils at the end of their beaks, among several other traits, kiwi represent probably the most unusual member of the ratites . A recent mitochondrial DNA phylogeny placed kiwi as the closest relatives of the extinct Madagascan elephant birds . Whether dispersal or vicariance best describe ratite distribution has been debated for over a century . A phylogeny including 169 bird species, built on 32 kb from 19 independent loci, showed ostrich as basal in the Palaeognathae clade . In contrast, our phylogeny, based on 623 1:1 orthologs in 16 species, totaling approximately 700 kb, places the tinamou as basal to Palaeognathae with 100 % bootstrap confidence (Fig. 1; Additional file 1: Figure S6). However, when the phylogeny was constructed for 10 bird species using just UCNEs (totaling >1 Mb) the topology of the tree matches that obtained from fewer loci from a larger number of species which agrees with a previous publication  (Additional file 1: Figure S7). Including more ratites and a larger number of (hand-curated) loci should provide better resolution of the tree topology, and indeed the topology we obtain here is well-supported. However, we note that the topology changes depending on the gene sets that are included (Additional file 1: Figs. S6 and S7) and that when using ultra-conserved sequences the phylogeny differs from that obtained from a larger, more representative set of genes. Hence, future availability of additional genomes and ortholog sets from multiple ratites will allow a better understanding of their origin.
Nevertheless, a previous study has estimated that kiwi diverged from the Madagascan elephant birds about 50 million years ago  (Additional file 1: Figure S8). This estimate post-dates the split of Madagascar and New Zealand from Gondwana, which took place around 100 and 80 million years ago, respectively, and implies that ratites must have dispersed by flight and also that kiwi arrived on New Zealand less than 50 million years ago. This conclusion is supported by the fossil record in New Zealand, which includes a flighted kiwi ancestor . At the time kiwi arrived, moa already inhabited New Zealand and it has been hypothesized that moa were monopolizing the diurnal ground niche, which forced kiwi to adapt to an alternative nocturnal lifestyle . This would suggest that kiwi adapted to the nocturnal niche soon after arriving on the island. The loss of function that we observe in OPN1SW is indicative of adaptation to nocturnality . We dated the loss of function in several color vision opsins to 30–38 million years ago, which is consistent with the arrival of the kiwi in New Zealand less than 50 million years ago, and their subsequent adaptation to a nocturnal niche.
In contrast to birds, which almost certainly have a diurnal origin, the nocturnal bottleneck hypothesis suggests that mammals were nocturnal for about 160 million years in their evolution as they were restricted to nighttime activity to avoid dinosaurs which were the dominant diurnal taxon at this time . According to this hypothesis, several traits typical for mammals, including a well-developed sense of smell, limited color vision, increased eye size, and an energetic metabolism optimized for sun radiation-independent body temperature regulation, have been shaped by the nocturnal environment [65, 66]. Nocturnally adapted Mesozoic mammals also tended to have a small body size, an insectivorous diet, and low energy metabolism . Interestingly, kiwi has the smallest body size among flightless ratites, the lowest metabolic rate among birds [68, 69], and an insectivorous diet, suggesting a pattern of evolution that is similar to the evolution of mammals under nocturnality. Consistent with this hypothesis, our genome-wide scans for patterns of positive selection showed enrichment in GO categories like mitochondrion functions and energy reserve metabolic process (Additional file 1: Table S8A), both related to metabolic rate. Moreover, we found strong evidence for a loss of color vision in kiwi and their retinal structure also clearly supports adaptation to vision under low light levels . Although the small eye size of kiwi  is unusual for a nocturnal species, based on the retinal anatomy Corfield et al. rejected a regressive evolution model for kiwi vision and suggested that kiwi have an acuity in detecting low light levels similar to other nocturnal species . This suggests that molecular mutations and retinal structure changed faster than eye size. In birds, eye size was described to scale to body mass with an exponent similar to brain mass and metabolic rate . Thus, the low metabolic rate of kiwi  could be the constraint for their relatively small eyes. Alternatively, kiwi might serve as an example that adaptations in the retinal structure could be sufficient, and changes in eye size are not absolutely necessary. This conclusion may be supported by the absence of variation in eye shape according to activity pattern observed in lizards and non-primate mammals .
It has long been hypothesized that unlike most bird species kiwi is more similar to mammals in their reliance on olfactory and mechanical cues for foraging, perceived by the nostrils and mechanoreceptors located at the end of its bill, for foraging . We found that the kiwi, unlike other ratites, has an increased diversity in the bird-specific γ-c clade ORs. Since OR diversity is hypothesized to correlate positively with olfactory acuity in vertebrates [42, 73], the significantly higher diversity in kiwi ORs compared to other birds (Additional file 1: Figure S9) suggests that kiwi may be able to distinguish a larger range of odors than other birds.
Steiger et al. formulated two possible scenarios that could explain γ ORs evolution in birds: the first hypotheses that species-specific γ ORs arose from independent expansion events in each species, while the second assumes that the ancient γ OR clade was more diverse and became homogenized by concerted evolution within species . Some γ ORs of kiwi, ostrich, tinamou, and nocturnal birds clustered with their reptilian counterparts, while others clustered basal to the clade containing most bird γ ORs (Fig. 3). This supports a two-fold conclusion: (1) γ ORs in kiwi are more diverse in sequence than in other birds investigated, which was verified by the significantly higher sequence entropy; and (2) since kiwi is basal to the Neognathae (Fig. 1), the ancestral state of γ OR clade is probably diversified compared to other modern birds.
Since its arrival in New Zealand sometime after 50 million years ago, the kiwi adapted to a nocturnal, ground-dwelling niche. The onset of adaptation to nocturnality appears to have been approximately 30–38 million years ago, about one-fifth of the time proposed for the evolution of mammals in a nocturnal environment. The molecular changes present in the kiwi genome are in accordance with the adaptations that are hypothesized to have occurred during early mammalian adaptation to nocturnality. This suggests similar patterns of adaptation to the nocturnal niche both in kiwi and mammals. Further comparative analyses, including other diurnal Palaeognathae, as well as additional nocturnal bird groups and their diurnal sister species, should shed further light on the genomic imprints of adaptation to a nocturnal life style.
Methods and materials
Genome sequence assembly and annotation
We sequenced Apteryx mantelli female individuals, which originate from the far North (kiwi code 73) and central part – Lake Waikaremoana (kiwi code AT5 and kiwi code 16–12) of North Island (Additional file 1: Figure S10). They were sampled in 1986 (kiwi code 73) and 1997 (kiwi code AT5 and 16–12) in ‘operation nest egg’ carried out by Rainbow and Fairy Springs, Rotorua. No animals were killed or captured as a result of this study and genome assembly was performed with iwi approval from the Te Parawhau and Waikaremoana Māori Elders Trust.
We extracted genomic DNA from Apteryx mantelli embryos. Libraries with insert sizes of 240 bp, 420 bp, 800 bp, 2 kb, 3 kb, and 4 kb were obtained from individual kiwi code 73, and mate-paired-end libraries 7 kb, 9 kb, 11 kb, and 13 kb, from individual kiwi code 16–12. DNA from individual AT5 was used to build a 350 bp insert-size library with the purpose of confirming kiwi-specific sequence polymorphisms and was not included in the genome assembly (Additional file 1: Note: Sampling, DNA library preparation and sequencing; Additional file 1: Table S1). Paired-end sequencing was performed on HiScanSQ and HiSeq platforms with read lengths of 101 bp and 96 bp, respectively.
Sequencing errors were corrected using Quake  (Additional file 1: Note: Filtering and read correction; Additional file 1: Figure S1). A total of 52.53 Gb of high-quality sequence was used for de novo assembly with SOAPdenovo . The short-insert-size libraries (240 bp, 420 bp, 800 bp) were used to build contigs. Based on paired-end information scaffolds were generated using all libraries (2 kb, 3 kb, 4 kb, 7 kb, 9 kb, 11 kb, 13 kb). Remaining gaps in the scaffolds were closed using the paired-end information (Additional file 1: Note: Genome assembly). This final assembly (AptMant0) was used for all subsequent analyses.
Gene annotation was performed with the MAKER pipeline , using several sources of evidence: de novo gene predictions, RNA-Seq data, and protein evidence from three species (G. gallus, T. guttata, and M. gallopavo) (Ensembl version 72). Briefly, after repeat masking, gene models were predicted by Augustus version 2.7  using the training dataset for chicken. Apteryx mantelli RNA-Seq data were then aligned to AptMant0 using NCBI BLASTN version 2.2.27+  and BLASTX was used to align protein sequences to identify regions of homology. Finally, using both the ab initio and evidence-informed gene predictions, Maker updated features such as 5’ and 3’ UTRs based on RNA-Seq evidence and a consensus gene set was retrieved (Additional file 1: Note: De novo gene prediction and gene annotation).
Comparative genome analysis
Triplet orthologs between chicken, zebra finch, and turkey were downloaded from Ensembl 73. Kiwi genes were considered orthologs to a triplet if the ortholog assignment from Maker agreed with the orthologous gene assigned in each of the three considered species. The ostrich, tinamou, chuck-will’s-widow, and barn owl orthologs were assigned by orthology to the chicken proteins. After assigning orthology in the eight avian species, coding sequences were aligned and two different sets of alignments were compiled for further analysis:
Set 1: alignments of all eight species that do not contain a single frameshift indel.
Set 2: the longest uninterrupted run of at least 200 aligned bases in each multiple sequence alignment, for which we first ensured that gaps in the alignment were not introduced by unresolved bases in our assembly.
The CODEML program from the package PAML  was run first on four avian lineages: G. gallus, T. guttata, M. gallopavo, and A. mantelli to compare the kiwi genome to high-quality annotated ones. Six pairwise combinations were run to obtain estimates of non-synonymous (Ka) and synonymous (Ks) changes in the four avian lineages. Ka and Ks distributions were compared pairwise between all four avian species on a set of 3,754 orthologous genes which presented no frameshifts or indels (Additional file 1: Figure S11).
We next scanned for differently evolving genes with the CODEML program under a branch model (model = 2, two ωs for foreground and background branches, respectively, vs. model = 0, one ω for all branches, compared via likelihood ratio test)  using the set of orthologs as defined above in the eight bird species (Additional file 1: Note: Orthologs and Ka/Ks calculation).
Branch specific ω values were used to identify GO categories that are evolving significantly different on each of the following bird species: kiwi, ostrich, tinamou, barn owl, and chuck-will’s-widow. GO categories enrichment was tested using the FUNC  package.
A hypergeometric test was run for each species separately on genes having a significantly higher ω. Multiple testing correction was done using family-wise error rate. Categories with P value <0.05 were considered for further analysis if at least three significantly changed genes were present in the GO category, and the number of significant genes was greater or equal to 5 % of the total genes annotated in the respective GO category. The same test was applied on genes with a significantly smaller ω in each of the species. Kiwi-specific categories were considered those which showed no enrichment in any of the other ratites or night birds (Additional file 1: Note: Gene Ontology and rapidly evolving genes).
We used the TreeFam methodology to define gene families  across 16 genomes: Gallus gallus, Anas platyrhynchos, Ficedula albicollis, Meleagris gallopavo, Taeniopygia guttata, Pelodiscus sinensis, Anolis carolinensis, Homo sapiens, Mus musculus, Gasterosteus aculeatus, Ornithorhynchus anatinus, downloaded from Ensembl 73 , Tinamus guttatus, Struthio camelus, Antrostomus carolinensis, Tyto alba, downloaded from GigaDB , and Apteryx mantelli. The longest transcript was chosen for further analysis. For the single-copy orthologous families, genes were aligned against each other. To build a consensus phylogenetic tree (Fig. 1) the resulting alignments were loaded in PAUP*  version 4.0d105 and trees were inferred using maximum likelihood, with default parameters. To measure the confidence for certain subtrees, a series of 100 bootstrap replicates were performed (Additional file 1: Note: Nuclear loci phylogeny).
We determined the branch-specific expansion and contraction of the orthologous protein families among the 16 species using CAFE (computational analysis of gene family evolution) version 3.0  with lambda option of 0.0007 (Additional file 1: Note: Gene families evolution using CAFE). Pfam IDs corresponding to the TreeFam families were assigned to GO categories. We tested whether significant (P <0.05) contraction/expansion events cluster in different GO categories using ClueGO with a hypergeometric test  (Additional file 1: Figure S2).
Membrane proteome annotation
Complete protein sequence sets for the following bird and reptile species were downloaded from Ensembl 74 : Taeniopygia guttata, Meleagris gallopavo, Ficedula albicollis, Anas platyrhynchos, Pelodiscus sinensis, Gallus gallus, and Anolis carolinensis. Homo sapiens from the same Ensembl version was used as outgroup. Protein sequences of ratites (Tinamus guttatus, Struthio camelus) and nocturnal birds (Antrostomus carolinensis, Tyto alba) were downloaded from GigaDB ; although these genomes are more fragmented than the ones from Ensembl, annotation of the membrane proteome in birds adapted, like kiwi, to the nocturnal niche and the ones belonging to the same clade as kiwi, allows to differentiate between events that are clade-specific or shaped by nocturnality. Only the longest protein sequence for each gene was considered for analysis. Membrane proteins and signal peptides were predicted for all species with Phobius . These proteins were classified based on a manually curated human membrane proteome dataset, which describes family relationship and molecular function. The predicted membrane proteins were aligned to the human membrane proteome dataset with the BLASTP program of the BLAST package using default settings (v. 2.2.27+) . Each predicted membrane protein was classified according to its best human hit with an e-value <10−6. Predicted membrane proteins with no hit were deemed unclassified, along with those proteins that hit an unclassified human protein (Additional file 1: Note: Detection and classification of the membrane proteome; Additional file 1: Table S7).
Vision evolutionary analysis
Opsins are G protein-coupled receptors known to play a role in light signal transduction and night-day cycle (Table 2). For these genes ω was estimated by appointing sequentially kiwi, ostrich, tinamou, chuck-will’s-widow, and barn owl as the foreground branch under the CODEML branch model (model = 2)  as described for comparative genome analysis. Inactivating mutations were verified by checking that they were present in reads from both sequenced individuals and in other kiwi species, by Sanger sequencing (OPN1MW) (Fig. 2; Additional file 1: Note: Vision analysis).
Olfaction evolutionary analysis
Olfactory receptors (ORs) in kiwi were annotated using both the Augustus de novo gene prediction and the Maker information after scaffold positions were checked and redundant sequences were removed.
Functional ORs from chicken  were downloaded and aligned against the kiwi transcriptome using TblastN with default parameters. After collecting overall hits for each query (every chicken OR served as query), identical (same) hits from each run were removed to obtain a non-redundant dataset.
A Pfam search against the kiwi proteome with a default e-value cutoff of 1.0 was used to identify sequences that contained 7tm_4 domain (olfactory domain).
The 7tm_4 domain was searched against the kiwi proteome by a CDD search (conserved domain database search).
Separate HMM profiles were built from conserved 7tm regions of functional ORs of chicken, turkey, and zebra finch obtained from previous studies . Using the three HMM profiles, HMM searches were performed against the kiwi proteome and non-redundant hits were retrieved from combined results of all three searches.
A CD-HIT (Cluster Database at High Identity with Tolerance) was performed to remove identical sequences with a cutoff of 100 %. Preliminary phylogenetic analysis was performed using a maximum likelihood approach (Additional file 1: Note: Olfactory receptor genes identification and annotation). Non-ORs were removed if they clustered separately from ORs. We excluded pseudogene candidates if at least one premature stop codon and/or frameshifts could be identified in the kiwi sequence.
OR repertoire estimates were curated based on genomic coverage calculated using samtools mpileup version 0.1.18  on the alignment of the 240 bp, 420 bp, 800 bp insert-size libraries to AptMant0 (Additional file 1: Note: Olfactory receptor genes identification and annotation). The correction factor for each annotated OR was obtained by dividing the read coverage in that region to the GC-content corresponding average coverage over the entire genome. For example, if an OR sequence had a GC content of 50 %, we calculated the average genome-wide coverage corresponding to the GC bin of 50 % to be 35-fold (Additional file 1: Note: Genome coverage and estimation of genome size; Additional file 1: Figure S13). Given a coverage in the respective OR region of 105-fold, we obtained a correction factor of 3 after dividing the OR sequence coverage (that is, 105-fold) by the GC-bin corresponding coverage (that is, 35-fold). The final number of estimated ORs was obtained by multiplying the number of initially annotated genes with their corresponding correction factors.
Using the same annotation procedure, the OR gene repertoire was estimated in all bird and reptile genomes from Ensembl 74, two nocturnal birds (chuck-will’s-widow and barn owl) and two Palaeognathae (ostrich and tinamou) for comparative phylogenetic analysis with the kiwi OR dataset. All obtained OR genes were then aligned using MAFFT  v7, with BLOSUM62 as the scoring matrix and default settings of option E-INS-I. Phylogenetic analyses were run using both maximum likelihood (ML) and neighbor joining (NJ) methods (Additional file 1: Note: Comparative phylogenetic analysis on ORs from kiwi and other bird and reptile genomes). The reliability of the phylogenetic trees was evaluated with 500 bootstrap replicates.
We calculated Shannon entropy (H) using within species multiple sequence alignments of γ ORs for all birds and reptiles genomes separately with a built-in function from BioEdit  (Additional file 1: Note: γ-c clade OR within-species protein sequence entropy).
Previously characterized wing development genes  were assigned orthologs in kiwi, chicken, zebra finch, and turkey (Additional file 1: Figure S3; Additional file 1: Table S12). We aligned the sequences and multiple alignments were translated and manually inspected for sequence differences as well as insertions/deletions and rearrangements. We examined selective pressures under the branch models implemented in CODEML . The one-ratio model (model = 0, NSsites = 0) was used to estimate the same ω ratio for all branches in the phylogeny. Then, the two-ratio model (model = 2, NSsites = 0), with a background ω ratio and a different ω on the kiwi branch, was used to detect selective pressure acting specifically on the kiwi branch. These two models were compared via a LRT (1 degree of freedom), as mentioned above .
Scaffolds and isolated contigs harboring (putative) HOX genes were identified by BLAST and mapped to all 673 sauropsid HOX protein sequences from GenBank. Translated HOX sequences of Apteryx were aligned to the HOX proteins extracted from Genbank and differences were identified by manual inspection. Potential regulatory sequences in the HOX cluster region were identified by phylogenetic footprinting using tracker2  (Additional file 1: Figure S4).
To retrieve the entire coding region of the FIBIN gene in kiwi, we designed primers based on the chicken and ostrich sequence (Additional file 1: Table S14). Using the 276-bp fragment amplified by Sanger sequencing, we blasted transcriptome sequences from kiwi and iteratively assembled the entire coding sequence. Since FIBIN showed signs of positive selection in the preliminary analysis as described above, extended selection analysis was performed using 15 species: human, mouse, bat, whale, dolphin, turtle, lizard, python, flycatcher, chicken, zebra finch, frog, zebrafish, and pufferfish (Additional file 1: Note: Fibin identification and selection analysis; Additional file 1: Figure S5). The branch-site tests were used to detect signals of selective pressure on each branch (NSsites = 2, model = 2, compared to the same model but with omega fixed to 1, via LRT). Amino acid changes with signs of selection and specific for the kiwi were visualized in both sequenced individuals.
Chicken UCNEs annotations were downloaded from the ultra-conserved non-coding element UCNEbase . Orthologous regions in Apteryx mantelli and Struthio camelus, Tinamus guttatus, Tyto alba, Antrostomus carolinensis genomes, downloaded from GigaDB , and birds from Ensembl 74  Ficedula albicollis, Taeniopygia guttata, Anas platyrhynchos, and Meleagris gallopavo were established using Blast 2.2.25  with ‘blastn’ and default parameters. Gallus gallus genome Ensembl 74 was used as control in the orthology assignment. Orthologous regions from each of the species were aligned  to the reference UCNE and the number of mismatches between the UCNE and the target genomes were determined (Additional file 1: Note: Ultra-conserved non-coding elements analysis).
Assembly, raw DNA, and RNA sequencing reads have been deposited in the European Nucleotide Archive under the BioProject with accession number: PRJEB6383.
UCNEs multiple fasta files and analysis have been deposited on .
The kiwi FIBIN sequence was deposited in GenBank under BankIt 1821198 FIBIN KR364000.
This work was supported by grants of the Deutsche Forschungsgemeinschaft and intramural support (Medical Faculty, University of Leipzig), as well as the Australian Research Council, the Swedish Research Council, NSERC (postgraduate fellowship to GR), and the Max Planck Society. BDB was funded by grant no. 2011/12500-2, São Paulo Research Foundation (FAPESP). This research was endorsed by Māori Elders from the Te Parawhau Trust and from Waikaremoana iwi. We are very thankful for technical and methodical support provided by Knut Finstermeier, Anne Butthof, Knut Krohn, Michael Dannemann, Udo Stenzel, Mathias Stiller, and Rigo Schulz. We thank Andreas Reichenbach for helpful discussions on kiwi vision and Petra Korlević for the drawings in Fig. 1 and Additional file 1: Figure S10.
- 2.Iviartin GR. Sensory capacities and the nocturnal habit of owls (Strigiformes). IBIS. 1986;128:266–77.Google Scholar
- 43.Preston GM. Cloning gene family members using PCR with degenerate oligonucleotide primers. In: White BA (ed.) PCR cloning protocols: from molecular cloning to genetic engineering; In series: Methods in molecular biology (Clifton, N.J.) 67; Humana Press: 1997 pg 433-49. ISBN 0896034436Google Scholar
- 47.Margulies DH, Natarajan K, Rossjohn J, McCluskey J. Fundamental Immunology. 7th ed. Philadelphia, PA: Wolters Kluwer Health/Lippincott Williams & Wilkins; 2012. p. 511.Google Scholar
- 52.Grzimek B, Schlager N, Olendorf D, McDade MC. Grzimek’s animal life encyclopedia. Gale: Gale, MI; 2004.Google Scholar
- 59.Gestri G, Osborne RJ, Wyatt AW, Gerrelli D, Gribble S, Stewart H, et al. Reduced TFAP2A function causes variable optic fissure closure and retinal defects and sensitizes eye development to mutations in other morphogenetic regulators. Hum Genet. 2009;126:791–803.PubMedCentralPubMedCrossRefGoogle Scholar
- 63.Worthy TH, Worthy JP, Tennyson AJD, Salisbury SW, Hand SJ, Scofield RP. Miocene fossils show that kiwi (Apteryx, Apterygidae) are probably not phyletic dwarves. In: Göhlich UB, Kroh A, editors. Proceedings of the 8th International Meeting Society of Avian Paleontology and Evolution. Vienna, 2012, Verlag des Naturhistorischen Museums in Wien, Vienna; 2013. p. 63–80.Google Scholar
- 65.Striedter GF. Principles of brain evolution. Sinauer Associates Inc.,U.S. ISBN: 978-0-87893-820-9. 2004/2005Google Scholar
- 69.Sales J. The endangered kiwi: a review. Folia Zoologica Praha. 2005;54:1.Google Scholar
- 82.Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–8.Google Scholar
- 87.Kiwi Genome. Available at: http://www.bioinf.uni-leipzig.de/~studla/KIWI-HOX/.
- 88.Kiwi Annotated HOX Cluster. Available at: https://bioinf.eva.mpg.de/KIWI-HOX/
- 89.Kiwi Annotated UCNEs. Available at: https://bioinf.eva.mpg.de/KIWI-UCNEs/
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.