Abstract
Haberlea rhodopensis, a resurrection species, is the only plant known to be able to survive multiple extreme environments, including desiccation, freezing temperatures, and long-term darkness. However, the molecular mechanisms underlying tolerance to these stresses are poorly studied. Here, we present a high-quality genome of Haberlea and found that ~ 23.55% of the 44,306 genes are orphan. Comparative genomics analysis identified 89 significantly expanded gene families, of which 25 were specific to Haberlea. Moreover, we demonstrated that Haberlea preserves its resurrection potential even in prolonged complete darkness. Transcriptome profiling of plants subjected to desiccation, darkness, and low temperatures revealed both common and specific footprints of these stresses, and their combinations. For example, PROTEIN PHOSPHATASE 2C (PP2C) genes were substantially induced in all stress combinations, while PHYTOCHROME INTERACTING FACTOR 1 (PIF1) and GROWTH RESPONSE FACTOR 4 (GRF4) were induced only in darkness. Additionally, 733 genes with unknown functions and three genes encoding transcription factors specific to Haberlea were specifically induced/repressed upon combination of stresses, rendering them attractive targets for future functional studies. The study provides a comprehensive understanding of the genomic architecture and reports details of the mechanisms of multi-stress tolerance of this resurrection species that will aid in developing strategies that allow crops to survive extreme and multiple abiotic stresses.
Similar content being viewed by others
Introduction
Haberlea rhodopensis Friv. from the Gesneriaceae family belongs to the resurrection species, a small group of plants that can tolerate water loss to air-dried state (i.e., complete desiccation) and regain normal turgor and physiological activities upon rehydration [1, 2]. There are even fewer plant species, confined to a few Gymnosperms, that can tolerate long-term darkness, and no studied land plant except H. rhodopensis can tolerate both stresses simultaneously [3, 4]. This resurrection plant can also tolerate freezing temperatures during the winters in its natural habitat, and high levels of oxidative stress [1, 5, 6]. Yet, despite these unique features, the molecular mechanisms that enable H. rhodopensis to cope with combined stresses remain poorly explored, although they may provide new strategies to improve climate resilience of agriculturally relevant crops.
The genetic repertoire of H. rhodopensis, a diploid species, is unknown due to the unavailability of its genome sequence. However, advances in genome sequencing and assembly techniques have opened the doors for high-quality assemblies. Recently, complementary methodological approaches offered by Pacific Biosciences (PacBio), Oxford Nanopore, BioNano Genomics, and others, as well as chromosome conformation capture sequencing (Hi-C), have gained popularity and have been used to develop high-quality genome assemblies of different plant species [7,8,9,10,11].
To unravel the genetic basis of the unique multi-stress tolerance of H. rhodopensis, we report a high-quality de novo sequencing, assembly, and analysis of its genome. In addition, we performed a comparison with the genomes of 20 other land plant species, including various model and resurrection species, to identify gene families expanded in and specific to H. rhodopensis. To understand the molecular mechanisms of multi-stress tolerance, we performed transcriptional profiling during several extreme stresses and their combinations. Altogether, the comprehensive analysis presented in the study delineated the unique genome and transcriptome features of H. rhodopensis that allow it to withstand individual and combined extreme environmental conditions.
Materials and methods
Plant material, growth, stress treatments, and RNA extraction
The initial H. rhodopensis Friv. plants were collected from the Rhodopi mountain near Assenovgrad, Bulgaria (location: 24° 52′ E, 41° 55′ N; elevation 690 m), and their in vitro culture was established. H. rhodopensis plants were grown and propagated under optimal conditions (16/8 h light/dark photoperiod, 35 µE m−2 s−1 light intensity, 20 °C), as previously described [3]. For darkness experiments, in vitro cultivated plants at the rosette leaf stage were divided into groups and subjected to three stress conditions: desiccation (until the relative water content (RWC) reached 5%), complete darkness for one month, and a combination of the two stress factors (desiccation in darkness). Well-hydrated plants had an RWC of about 85%. Desiccation to an air-dry state was conducted by removing the growth media for seven days at room temperature and 70% relative humidity until plants reached an air-dried state at RWC of 5%. Plants were rehydrated for five days by returning them to well-hydrated media until reaching the original RWC. For dark treatments, plants were subjected to darkness for one month, followed by a recovery period of seven days under normal light conditions. For recovery period, the plants desiccated in darkness were rehydrated in the dark, while those that were desiccated in the light were rehydrated in the light. Rosette leaves were collected in a dark room illuminated with non-actinic green light and immediately frozen in liquid nitrogen. In parallel, groups of control plants were maintained at normal conditions and were taken along the experiment with the respective time points to provide proper developmental controls.
Control plants, as well as plants subjected to desiccation (5% RWC), one-month darkness, the combination of the two stress factors (desiccation in darkness), and plants that have recovered from the stress conditions, were used to isolate and purify RNA for RNA-seq as described previously [1]. Samples were obtained in three biological replicates. The details of samples used for transcriptome sequencing are provided in Table S1. The isolated RNA was subjected to RNA-seq using an Illumina HiSeq X Ten by BGI Tech Solutions, Tai Po, Hong Kong.
For the low temperatures experiments, in vitro cultivated plants at the rosette leaf stage were transferred to Petri dishes with the following soil: Rėkyva Remix Fine peat substrate and agro perlite in 2.5:1 ratio, supplemented with a fertilizer mix containing 1.2 g Ca(H2PO4)2, 0.5 g K2SO4, 0.5 g NH4NO3 and 0.2 g MgSO4 per 1 L of potting mix. Thereafter, plants were cultivated in the same conditions as described above for at least three additional months to allow them to adapt to the new environment. At the onset of the experiment, 120 plants were divided into two populations: one group was left without water for 18 days (until plants reached ~ 10% RWC), while irrigation was retained for the other one. Then, each set was divided further into four subpopulations based on the temperature treatment: standard temperature, 24 h chilling (4 °C), 24 h freezing (− 4 °C), and chilling followed by recovery (also for 24 h) and subsequent freezing. Thus, eight experimental groups were obtained, each consisting of 15 plants used to generate three independent pools (5 plants per pool). Leaves were flash-frozen in liquid nitrogen and used for further physiological and transcriptomic analyses. RNA was extracted by Zymo Research Quick-RNA Miniprep kit, following the manufacturer’s instructions, with a few modifications as follows: inclusion of an incubation step with the lysis buffer at room temperature for 30 min; doubling the number of washing steps with RNA wash buffer (to remove impurities from H. rhodopensis leaves); reduction of the utilized ground leaf tissue to ~ 10–15 mg; and a slight prolongation of the centrifugation times. The isolated RNA was subjected to RNA-seq using the DNBseq™ sequencing technology by BGI Tech Solutions.
Measurements of electrolyte leakage and relative water content
Electrolyte leakage was assessed by measuring the increase in conductivity with an HI 873 conductivity meter (Hanna Instruments, Woonsocket, RI, USA). Haberlea leaves were briefly washed with ultrapure water (conductivity of 1 µS). The leaves were then incubated in ultrapure water (1 µS) for 10 min. The conductivity of the resultant solution was measured and compared with the total conductivity obtained after boiling the leaves. RWC was determined using the formula RWC (%) = [(FM − DM)/(TM − DM)], where FM, DM, and TM are the fresh, dry, and turgid masses of the leaves, respectively. TM was measured after immersing the leaves in H2O for 24 h, and DM was determined after drying the leaves at 80 °C for 48 h. RWC was determined in three biological replicates.
Library preparation and genome sequencing
High-molecular-weight genomic DNA was prepared from leaves using a custom-designed protocol established at the Massey NGS facility, Massey University, New Zealand [12]. Genomic DNA was used to generate a 10–20 kb sequencing library according to the instructions of Pacific Biosciences (PacBio). The library was sequenced with a PacBio Sequel II instrument using 24 SMRT cells to generate ultra-long reads by DNA Link Inc., Seodaemun-gu, Seoul, Korea. Additionally, the genomic DNA was used to generate a Dovetail HiC library as per the manufacturer’s protocol, which was further sequenced on an Illumina platform using a paired-end strategy (Dovetail Genomics, CA, USA) (Data S1).
Genome assembly and assessment
The long reads from PacBio sequencing were assembled using FALCON-Unzip [13] with default parameters and Canu v1.7 [14] with parameters tuned for heterozygous genomes. Due to the high heterozygosity of the H. rhodopensis genome (1.48%; estimated using GenomeScope available at http://qb.cshl.edu/genomescope/; [15]), the following Canu parameters were used “corMhapSensitivity = normal corOutCoverage = 200 correctedErrorRate = 0.105 "batOptions = -dg 3 -db 3 -dr 1 -ca 500 -cp 50"”. The assemblies obtained were polished using one round of Arrow (SMRT Link v5.1, https://www.pacb.com/support/software-downloads/) with default parameters. The Arrow-polished Canu assembly was processed to obtain primary contigs using Purge Haplotigs v1.1.1 [16] with default parameters. The Canu-de-duplicated and FALCON-Unzip assemblies were scaffolded using the HiC reads utilizing the SALSA pipeline v2.2-14-g974589f [17]. The assembly statistics were generated using QUAST v4.1 [18]. Additionally, the completeness of assemblies was evaluated using BUSCO v5.4.4 [19],"viridiplantae_odb10" dataset). Based on the assembly and completeness statistics, Canu + SALSA assembly was deemed the best assembly and was used for downstream analysis (Fig. S1).
Repeat identification
Both de novo and homology-based repeat identification approaches were used to identify and annotate repeats in the H. rhodopensis genome. First, a de novo repeat library was constructed using RepeatModeler version open-1.0.10 with default parameters [20]. The de novo repeat library obtained was combined with the known Viridiplantae-based repeats from RepBase version 20170127 [21] to generate a custom repeat library. This library was then used to screen the genome for repeats using RepeatMasker version open-4.0.7 (“-u -gff -e ncbi -xsmall” [22]).
Gene prediction and annotation
The raw reads from RNA-seq datasets were trimmed to remove low-quality bases and sequencing adaptors using Trimmomatic v0.39 [23]. The trimmed reads were filtered for ribosomal RNA using SortMeRNA v2.1 [24]. The filtered RNA-seq reads were aligned against the genome assembly using HISAT2 v2.1.0 [25] and assembled into individual transcripts using StringTie2 v2.0 [26], which were merged to obtain a consensus assembly. Further, the ab initio gene model predictions were obtained by the BRAKER2 pipeline v2.1.5 [27] using evidence from RNA-seq, de novo transcriptome assembly [3], and homology-based alignments with known plant proteins from SwissProt. Finally, EVidenceModeler v1.1.1 [28] was employed to combine RNA seq-based predictions with ab initio gene predictions to obtain a final set of gene models. Functional annotation to the predicted genes was performed using similarity searches against various publicly available databases. Briefly, BLASTP (E-value 1E-05) was used to search against the NCBI non-redundant (NCBI-nr), SwissProt, and TrEMBL databases with taxonomy filter for "Viridiplantae" (taxonomy id: 33090). The BLAST annotations obtained from these databases were merged to assign functional annotation to protein sequences (Swissprot preferred over TrEMBL and then NCBI nr). Mercator4 V2.0 [29] was used to obtain MapMan4 annotations for the predicted proteins. InterProScan v5.39–77.0 [30] was used to identify conserved domains and motifs in the proteins encoded by gene models. Gene Ontology IDs for each gene were obtained from the corresponding InterPro entry. Further, rRNAs, miRNAs, and snRNAs, were predicted by homology searches against the Rfam database (release 14.2) using Infernal v1.1.3 [31] with default parameters. The tRNA genes in the genome were identified by tRNAscan-SE v2.0 [32] with default parameters. The non-coding RNA encoding genes for Craterostigma plantagineum and Lindernia brevidens were also predicted using Infernal and tRNAscan.
Gene family and synteny analysis
The predicted protein sequences from H. rhodopensis were compared against protein sequences of 20 other plant species using OrthoFinder v2.4.0 [33] to identify sets of orthologous genes, referred to as gene families. Single-copy orthologs obtained were used to construct the phylogenetic tree. The expansion/contraction analysis was performed by CAFE v4.2.1 (parameters: “-p 0.01 -t 4 -r 10,000 -filter” [34]) based on the species tree and gene family statistics. Based on this ortholog analysis, the genes that are specifically present in H. rhodopensis are classified as “orphan genes”. The significantly expanded gene families (FDR < 0.05) in H. rhodopensis were manually annotated based on the annotation of homologs from Arabidopsis or InterProScan annotations. The ELIP gene family members were manually curated to check for presence of PF00504 domain by searching against Pfam using InterProScan. Divergence times between H. rhodopensis and other plant species were estimated using MegaX software (https://www.megasoftware.net/) with default parameters using the known calibration times from the TimeTree database (http://www.timetree.org, [35]). The Synteny analysis was performed using GENESPACE R package (v1.2.3 [36]).
Transcription factors and resistance gene analogs
Transcription factors (TFs) in the H. rhodopensis genome were identified using a similarity search against plant transcription factors from the plant transcription factor database version 5.0 [37]. The RGAugury pipeline v2017-10-21 with default parameters was used to identify resistance gene analogs (RGAs) from the predicted gene set of H. rhodopensis [38]. The identified RGAs were then classified into different sub-classes based on the presence or absence of specific domains.
Transcriptome analysis
The raw reads from RNA-seq datasets were trimmed to remove low-quality bases and sequencing adaptors using Trimmomatic. The trimmed reads were filtered for ribosomal RNA using SortMeRNA. The filtered RNA seq reads were aligned against the genome assembly using HISAT2. The transcript- and gene-level quantifications were obtained using StringTie2. For gene expression analysis, the EdgeR package [39] in R/Bioconductor was used for multiple pairwise comparisons. A false discovery rate (FDR) cutoff (Benjamini–Hochberg correction) of less than 0.05 and a log2 fold change ≥ 1 was used to identify significantly differentially expressed genes. Heat maps and clustering for selected groups of genes were made using the ComplexHeatmap R package [40]. The optimal number of k-means clusters was estimated using XMeans algorithm in the RWeka package [41]. TopGO was used to perform GO enrichment analysis with FDR cutoff of 0.05. The enrichment analysis for MapMan4 annotations was performed using hypergeometric test [42] with a significance threshold of FDR 0.05. The local FDR correction method was used to adjust the P-values obtained from GO and MapMan bin enrichment analysis. For gene set enrichment analysis, the genes that had a GO and MapMan annotation were considered as background. For enrichment analysis of the ortho groups, the ortho groups were assigned a GO based on the longest member of each ortho group and the ortho groups that had a GO annotation were used as background.
Results
Sequencing and assembly of the Haberlea rhodopensis genome
Using a combination of PacBio (~ 91X) and HiC sequencing (~ 176 million read pairs), we developed a high-quality ~ 1.27 Gb (N50 = 2.92 Mb) genome assembly of H. rhodopensis consisting of 3499 pseudo-scaffolds (Fig. 1; Table 1; Table S2; Data S1; Fig. S1). The assembly accounts for 92.7% of the estimated genome size of ~ 1.37 Gb [43]. Several scaffolds did not attain chromosome lengths possibly due to chimeric read mapping. The GC content of the H. rhodopensis’ genome (38.08%) is similar to that of Arabidopsis thaliana (36%), Streptocarpus rexii (38.89%; member of Gesneriaceae family; [44]) and Xerophyta viscosa (36.51%), a resurrection species from the Velloziaceae family [45], but lower than that of other resurrection species including Boea hygrometrica (42.30%; Gesneriaceae family; [46]), and two from the Linderniaceae family (C. plantagineum (40.26%; [47]) and L. brevidens (39.26%; [48])) (Fig. S2A).
Repeat content and gene annotation
Applying de novo repeat identification, we found that 872.08 Mb (68.68%) of the H. rhodopensis genome was repetitive, similar to the repeat content estimated for B. hygrometrica [46] and Solanum lycopersicum [49]. In accordance with the pattern observed in other plant genomes, long-terminal repeat (LTR) retrotransposons were the most abundant class of repetitive DNA and comprised nearly 41% of the H. rhodopensis genome, which was slightly higher than the LTR composition in other resurrection species including C. plantagineum (35%) and L. brevidens (34%). Among LTR transposons, Gypsy and Copia elements represented 22.88% and 17.09% of the genome, respectively (Table S3). Furthermore, the synonymous mutation rate (Ks) distribution of the paralogs indicated three distinct major duplication events in H. rhodopensis that could explain its large genome size and high repeat content (Fig. S2B).
By integrating homology searches, ab initio prediction, and mRNA expression evidence, we predicted a total of 44,306 protein-coding genes in the H. rhodopensis genome (Fig. 1). On average, the predicted genes encode transcripts of 1589.5 bp length with 5.2 exons, similar to reports for other plant species (Table 1; [50, 51]). Based on a similarity search against different databases, we annotated 39,538 genes (89.24%, Table S4). We identified 666 genes for ribosomal RNAs (rRNAs), 626 transfer-RNAs (tRNAs), 205 microRNAs (miRNAs), and 160 small nuclear RNAs (snRNAs; Table 1). These numbers were similar to those for B. hygrometrica and L. brevidens. However, C. plantagineum contained significantly higher number of all non-coding genes. Among the rRNA genes, we found 273 5S rRNA genes which is significantly higher than the 5S rRNA genes present in S. lycopersicum. Interestingly, a similar observation was reported in B. hygrometrica; however, in the case of H. rhodopensis 88.3% (241) of these 5S rRNA genes were found in clusters of tandem repeats on three scaffolds which contrasts with B. hygrometrica where the majority of them were interspersed throughout the genome. Furthermore, more than 99% of the 425 core Viridiplantae genes are conserved in the H. rhodopensis genome assembly, indicating a high-quality assembly. Synteny analysis with the closely related species (A. thaliana and S. lycopersicum) further suggested the high quality of the Haberlea genome assembly (Fig. S3).
Comparative analysis of the H. rhodopensis genome with those of other plants
Next, we compared the genes predicted in H. rhodopensis with those of 20 other land plant species from evolutionarily divergent groups, including several other resurrection plants, to identify unique and shared gene families. Reciprocal pairwise BLAST comparisons using OrthoFinder grouped 663,415 proteins into 36,943 ortholog clusters (henceforth referred to as gene families) (Table S5). A total of 4253 gene families were present across all 21 species (Fig. 2A). The phylogenetic analysis suggested that Gesneriaceae species diverged from S. lycopersicum (Solanaceae) around 100–130 million years (myr) ago, in accordance with previous reports [52], and members of Gesneriaceae, H. rhodopensis and B. hygrometrica diverged from each other approximately 56 myr ago (Fig. 2B).
Orphan genes are important for taxonomy-specific developmental adaptations [53]. In H. rhodopensis, based on ortholog analysis, we found 10,435 orphan genes (23.55% of total genes) which is within the expected range observed in eukaryotes [54]. A total of 1075 gene families were specific to H. rhodopensis (Fig. 2A). The highest number of gene families coded for proteins with unknown functions and without any known domains that can suggest biological function. Four of the other families encoded zinc finger proteins (CCHC-type superfamily and SWIM-type, as well as with an integrase zinc-binding domain), suggesting a role in transcription/DNA binding. The other Haberlea-specific gene families include ULP_PROTEASE DOMAIN-CONTAINING PROTEINS, an ASPARTIC PEPTIDASE DOMAIN FAMILY, and GUANYLATE-BINDING PROTEIN 4-LIKE FAMILY.
A total of 89 and 144 gene families in H. rhodopensis were significantly expanded and contracted, respectively (Table S6). Of the 89 expanded gene families, 25 were specifically found in H. rhodopensis. Some of the more notable expanded gene families include: ZINC FINGER CCHC DOMAIN FAMILY, RAC-LIKE, SQUAMOSA PROMOTER BINDING PROTEIN-LIKE, SERINE/THREONINE-PROTEIN PHOSPHATASE 7 (PP7) LONG FORM HOMOLOG (PP7L), TCP FAMILY TRANSCRIPTION FACTOR, WRKY DNA-BINDING PROTEIN, FRS (FAR1-RELATED SEQUENCES) TRANSCRIPTION FACTOR, AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN, and SWIM-TYPE DOMAIN-CONTAINING PROTEIN. Furthermore, 159 gene families were expanded in the Gesneriaceae species that included EARLY LIGHT-INDUCED PROTEINS (ELIPs). ELIPs are known to play a crucial role in desiccation tolerance and are reported to be expanded in other resurrection species as well [55, 56]. Three other gene families that significantly expanded in H. rhodopensis contain homologs of the bHLH, WRKY and FRS transcription factor (TF) families. The FRS TFs have been implicated in the regulation of genes associated with drought, salinity, and temperature fluctuations [57, 58]. The expanded gene families were enriched for genes related to ‘auxin activated signaling pathway’, ‘reactive oxygen species metabolic process’, ‘defense response’, ‘cellular response to cold’, ‘flower development’, ‘cellular response to heat’, and ‘protein folding’ (Fig. 2C; Data S2). In terms of MapMan categories, these gene families are involved in diverse functional categories, such as cell wall organization, SnRK1 kinase regulatory system, RNA processing, carbohydrate metabolism, and chromatin organization (Fig. S4). Similarly, the contracted gene families were enriched for genes involved in “cell recognition”, “recognition of pollen” and “proteolysis” (Data S2).
H. rhodopensis can withstand multiple and combined stresses
We previously observed that H. rhodopensis withstands desiccation and darkness as individual stresses [3]. To study the combined effect of both stresses, we subjected H. rhodopensis plants to desiccation, complete darkness, or the combination of the two, for one month and then returned them for five days to optimal conditions for recovery (Fig. 3A; see Materials and Methods). Plants in darkness were slightly etiolated but preserved most of their chlorophyll, and eventually completely recovered (Fig. 3A). Plants subjected to desiccation in the light became air-dried after seven days and then regained their turgor and relative water content upon rehydration (Fig. 3B). Plants subjected to desiccation in darkness lost their turgor and part of their chlorophyll; however, they fully recovered when returned to optimal growth conditions (Fig. 3A). The desiccated plants (both under normal photoperiod and in darkness) had very low (< 5%) relative water content (RWC), indicating almost complete dehydration. However, cell damage was limited, as revealed by a very low electrolyte leakage (Fig. 3C). Overall, the two stresses given separately, or in combination, led to clear and predictable phenotypic changes; however, all plants fully regained their normal appearance after one week of recovery from stress.
Next, we subjected H. rhodopensis plants to chilling (4 °C), freezing (− 4 °C), desiccation, and a combination of the stresses to inspect changes in their transcriptomes (see Materials and Methods for details). In well-watered and desiccation treated plants, chilling or freezing and their combination did not result in a noticeable change in phenotype and RWC (Fig. 4A–B). Interestingly, plants subjected to chilling did not show any cell damage (based on electrolyte leakage), whereas freezing induced significant cell damage (Fig. 4C). Notably, prior incubation at 4 °C for a day (chilling + freezing) completely rescued the observed rise of conductivity, indicating that this serves as an acclimation cue for the plants. A smaller but still significant elevation of electrolyte leakage, that was not further increased during freezing, was also detected in all desiccated plants. Thus, it appears that dried Haberlea plants exhibit enhanced tolerance to other abiotic stressors.
Comparative transcriptome analysis of H. rhodopensis during desiccation, darkness, and their combination
The total RNAs of leaves collected from plants subjected to darkness, desiccation, and the combination of the two stresses, and plants that have recovered from the three stress conditions, were sequenced in three biological replicates and compared to plants grown under control conditions (Fig. 3A). Principal component analysis (PCA) revealed different transcriptome profiles in plants from the stress conditions compared to control and recovered plants (Fig. 5A). Interestingly, desiccated samples and samples from the combined stresses closely grouped along PC1 (59.15%) and were different from the darkness samples, indicating that desiccation was a major contributor to the transcriptional changes compared to darkness (Fig. S4). All recovery samples exhibited transcriptome profiles similar to those of control samples, in concordance with the observed growth phenotypes.
We identified a total of 29,481 differentially expressed genes (DEGs) across all pairwise comparisons of conditions (Fig. S4; Data S3). The maximum DEGs were observed in desiccated samples and samples under combined stresses. Next, we clustered all DEGs using k-means clustering (k = 18) followed by GO and MapMan bin enrichment to identify affected biological processes (Data S2). The DEGs were grouped into 18 clusters based on their expression profiles (Fig. 5B). The clusters 2, 15, and 18 contained genes predominantly induced under desiccation in darkness, desiccation, and darkness, respectively (Fig. 5C–E). Cluster 2 mainly included genes related to ‘fatty acid beta-oxidation’, ‘chromatin remodeling’, ‘protein transport’, ‘ABA-activated signaling pathway involved in stomatal movement’, and ‘positive regulation of seed germination’. Genes predominantly induced during desiccation (cluster 15) were involved in ‘autophagy’, ‘stomatal closure’, ‘ABA metabolic process’, ‘postreplication repair’ and ‘lipid storage’. Dark stress resulted in up-regulation of genes (cluster 18) related to ‘carbohydrate storage’, ‘proteolysis’, ‘auxin catabolic process’, ‘lateral root formation’ and ‘regulation of flower development’.
Desiccation stress alone or in combination with darkness resulted in massive induction of genes encoding for ELIPs, heat shock proteins, such as HSP17.8, HSP17.9, HSP23.6, and HSP70, and enzymes related to sugar metabolism, such as STACHYOSE SYNTHASE and AMYLASES (ALPHA and BETA) (Data S3). Interestingly, 18KDa seed maturation protein was significantly induced (8.5-log2 fold change) under both desiccation and desiccation in darkness. Two more genes related to sugar transport, encoding the sucrose transport proteins SUC3 and SUC4, were induced exclusively by desiccation and desiccation in darkness. Late embryogenesis abundant (LEA) proteins, known to respond to dehydration, were substantially induced by desiccation regardless of the light regime, and a few LEA genes were also induced in darkness. Significant induction by desiccation, irrespective of the light regime, was also observed for several genes encoding signaling proteins, such as CBL-INTERACTING PROTEIN KINASE 2 and PROTEIN PHOSPHATASE 2C (PP2C).
On the other hand, genes encoding PHYTOCHROME INTERACTING FACTOR 1 (PIF1) and GROWTH RESPONSE FACTOR 4 (GRF4) were induced only in darkness and were unaltered by desiccation and desiccation in darkness (Data S3). Moreover, GRF3 showed a rather contrasting expression pattern: it was significantly induced (3.6-log2 fold change) by darkness alone but repressed by desiccation or by desiccation in darkness. Similar expression behavior was observed for some phytochrome genes: PHYTOCHROME A1 and PHYTOCHROME B were induced by darkness alone but repressed by desiccation and by desiccation in darkness. Contrasting expression patterns were also observed for several chlorophyll catabolism genes. One PHEOPHYTINASE gene was induced by desiccation and by desiccation in darkness but repressed by darkness. STAY GREEN ONE (SGR1), PHEOPHORBIDE A OXYGENASE (PAO), and RED CHLOROPHYLL CATABOLITE REDUCTASE (RCCR) were induced by desiccation alone and by desiccation in darkness, but remained unaltered in the darkness alone.
The genes related to photosynthesis, including genes encoding CHLOROPHYLL a/b BINDING PROTEINS, LIGHT HARVESTING COMPLEX PROTEINS, and PROTOCHLOROPHYLLIDE REDUCTASE, and various other components of the photosynthetic machinery, were repressed by all stress conditions. The repression was the strongest by the combination of the two stress factors (desiccation in darkness), likely resulting in a complete photosynthesis shut-off.
Next, we investigated the DEGs affected by desiccation in darkness. We found 1532 genes specifically induced upon desiccation in darkness (Fig. 6A). Among these were genes encoding for transcription factors (bHLH30 and GTE12), glycine-rich RNA binding proteins (GRP1A and GRP2A), and autophagy-related protein 13b (ATG13B). Furthermore, 2439 genes including ASCORBATE PEROXIDASE 2, GLUTATHIONE S-TRANSFERASE T1 (GSTT1), CYTOCHROME P450 83B1 (CYP83B1), and a LEA gene were repressed upon desiccation in darkness only (Fig. 6B–C).
Comparative transcriptome analysis of H. rhodopensis during chilling, freezing, desiccation, and combinations of these stress factors
To study the transcriptional changes of H. rhodopensis plants subjected to low temperatures, the total RNAs from plants from the low-temperature experiment were sequenced (Fig. 7). In principle, the most dramatic reprogramming was induced by each of the four desiccation-related treatments, where ~ 6000 genes were upregulated, and ~ 8000 were downregulated (Figs. 7A and S6A–B; Data S4). Intriguingly, these sets of modulated genes overlapped to a very large extent, leaving relatively few condition-specific genes. This suggests that desiccation is the “master” stress factor and low temperatures cannot override it. Plants treated with freezing alone (followed by the combination of chilling and freezing) showed the lowest number of DEGs, whereas chilling alone resulted in much higher number of DEGs. Chilling, which in natural conditions is the first stressor that plants will experience in winter, may serve as a priming cue triggering a longer-lasting transcriptional reprogramming in order to prepare the plants for the subsequent sub-zero temperatures. Chilling and freezing do not share a large set of DEGs, pointing to distinct responses at the two temperatures.
In accordance with the small number of DEGs, freezing and chilling as well as freezing cluster relatively close with the controls (Fig. 7A). In turn, all desiccation-related samples form one supercluster, with a very well pronounced overlap, corroborating the observation above that low temperatures have negligible impact on dried Haberlea plants. The third group on the PCA, which is very well separated from the others, corresponds to the chilling stress and shows the unique characteristics of the Haberlea transcriptome in this condition, despite the lack of an observable growth phenotype.
Next, k-means clustering (k = 11) of the DEGs identified two clusters (clusters 7 and 9) consisting of genes specifically induced upon chilling (Data S4; Fig. 7B–C). Enrichment analysis suggested that these genes are involved in RNA biosynthesis, carbohydrate metabolism, nutrient uptake and multi-process regulation (Data S5). Cluster 11 was enriched for genes related to cell wall organization and phytohormone action; they were specifically induced by chilling and freezing (Fig. 7C). Clusters 2, 5, and 8 included genes encoding for signaling components that presumably act at the beginning of the signaling cascades and they were highly upregulated by desiccation, chilling, freezing, and the combinations of these stress factors (Fig. 7C). The genes included two PP2C protein phosphatases, whose homologs in Arabidopsis are known to be involved in ABA signaling, and a SRC2 homolog, which may act as an activator of the calcium-dependent activation of the NADPH oxidase RbohF that mediates reactive oxygen species (ROS) production (Data S5).
Several genes involved in selective protein degradation through the proteasome pathway, including the F-box stress-induced protein FBS1 and the E3 ubiquitin-protein ligase PUB18 that regulate ABA-mediated stomatal movement, were highly induced by dehydration, low temperatures, and the combination of these stresses (Data S4). At the same time, these stresses repressed the ubiquitin-protein ligase PUB23, known to negatively regulate water stress responses. The role of ABA was further substantiated by the induction of LTI65 and RCI2A (Rare Cold-Inducible 2A) genes, known to be induced not only by ABA but also by low temperatures, dehydration, and salt stress [59,60,61]. The link between ABA signaling and protein degradation was confirmed by the induction of EDL3 (EID1-LIKE 3) which encodes an F-box protein involved in mediating the regulation of abscisic acid signaling. EDL3 is known to regulate anthocyanin accumulation under drought stress [62]. Collectively, the data indicate roles of ABA signaling and protein degradation in both dehydration and low-temperature stress.
Transcription factor genes acting downstream in the gene regulatory networks were also significantly upregulated by all of these stress factors. These included AZF2, DREB2A, and ZAT6, reportedly involved in mediating the tolerance to other abiotic and oxidative stresses as well. These TFs in turn induce the expression of downstream stress-related genes that can contribute to the observed multiple stress tolerance. At the same time, other TF encoding genes, such as ATHB-52, OFP4, and WRKY46, were repressed by all stress factors studied. These TFs are likely to govern the stress-associated transcriptional reprogramming and confer multiple stress tolerance.
Genes encoding LEA proteins and ELIPs, acting further downstream the stress network, were among the most responsive in this experiment, and they were upregulated in all cases. Both families are typically associated with desiccation [63, 64], but here it is shown that some representatives are induced by low temperatures as well. The transcripts of some of the ELIPs, as well as LEA D-29, accumulate in all stress conditions, while the LEAs D-34 and SLE2 remain unchanged only during freezing and chilling + freezing. Interestingly, one of the LEAs–LEA 47, seems to be specific for low temperatures only since its transcription is not influenced by desiccation.
Two crucial enzymes involved in the synthesis of raffinose and raffinose family oligosaccharides (GOLS2 and RFS2) are considerably induced in all tested conditions. Thus, accumulation of raffinose family oligosaccharides (RFOs), previously associated mainly with desiccation, might be a universal stress response in Haberlea.
At the same time, genes involved in growth and development were repressed. These included SAUR76, encoding an auxin-responsive protein that promotes cell expansion, cell elongation, and plant growth in Arabidopsis [65], HSD1, involved in regulating plant growth and development by promoting or mediating brassinosteroid effects, and EXPA2, encoding an expansin. Many photosynthesis-related genes were repressed as well, indicating that inhibition of photosynthesis is a common response to all stresses.
In general, desiccation and chilling appear to inhibit auxin signaling and responses to auxin, as in the case of SAUR76, mentioned above, but also homologues of SAUR32 and SAUR50, the auxin-induced protein 15A (AX15A) and BIG-GRAIN 1-like B (BG3), involved in auxin transport. The situation with ABA is more complex, because while some aspects of the ABA pathways are induced, as EDL3 described above, others appear to be turned down in these conditions. For example, the receptor PYL4, known to inhibit the activity of group-A protein phosphatases type 2C (PP2Cs), is considerably downregulated. This is coupled to the measured hyperinduction of PP2Cs. In turn, abscisic acid 8'-hydroxylase 2, involved in the oxidative degradation of ABA, accumulated in all desiccated samples.
Somewhat unexpectedly, typical ROS scavengers are not among the most modulated in this experiment, which suggests that their expression is relatively stable, responsible for a constitutively active and highly effective antioxidant system. There are even some significantly downregulated ones during stresses: for example, the glutaredoxin GRXC13 and a peroxidase (PER42). Previously, it was suggested that H. rhodopensis might be preliminarily primed for drought/desiccation events [64], but this feature may be true for other abiotic stresses as well.
Among the most modulated genes, especially during desiccation and chilling, there is a large portion of cell wall-related ones. In most cases, they are downregulated—for example, expansins involved in cell expansion. The exception is a xyloglucan endotransglucosylase, which is induced by low temperatures and is normally associated with cell wall loosening by modification of the hemicellulose component.
Comparative transcriptome analysis of H. rhodopensis subjected to different stresses
Next, we compared the DEGs from the darkness and low-temperature experiments to identify key genes commonly regulated by different stresses. For this, the DEGs were obtained using the pairwise comparisons of the stress time points against the respective controls (Fig. S7). We observed a significant overlap of the DEGs between the two experiments. The maximum overlap between the DEGs from the two experiments was observed for all time points involving desiccation stress, further suggesting that desiccation acts as a “master” stress factor. Furthermore, a total of 58 genes were either induced (51) or repressed (7) in all stresses, indicating these as common stress-responsive genes in Haberlea. Some of the commonly down-regulated DEGs included those encoding for expansins (EXPA2 and EXPA8) and NAC TFs. Among the commonly up-regulated DEGs, eight encoded for TFs including members of NAC, MYB, C3H and C2H2 families.
Transcription factors and resistance gene analogs
A total of 2507 transcription factors (TFs) belonging to 55 different families and representing 5.6% of the predicted protein-coding genes were identified in the H. rhodopensis genome (Table S7). TFs of the bHLH (218), MYB (195), ERF (174), and WRKY (151) families were the most abundant (Fig. S8A). The distribution of TF families is similar to those of other eukaryotes [37]. A significant number of the TFs (83.13%, 2084) were differentially expressed during different stages (Fig. S8B; Data S3 and S5) and 37 of these were specific to Haberlea.
Resistance gene analogs (RGAs) are activated by various biotic and abiotic stresses [66]. We predicted a total of 873 RGAs in the genome of H. rhodopensis. These RGAs included 102 NBS-encoding proteins, 51 receptor-like proteins (RLPs), 575 receptor-like kinases (RLKs), and 145 transmembrane coiled-coil (TM-CC) proteins (Table S8). Out of the 102 NBS-encoding genes, 77 (75.49%) were differentially expressed in one of the pairwise comparisons (Fig. S9) and two of these were Haberlea-specific. Considering the importance of ELIPs in desiccation tolerance, we investigated these genes in the H. rhodopensis genome. Using similarity searches with other plant species, we identified a total of 23 ELIPs, arranged in tandem arrays across the genome (Fig. 8). All ELIPs were differentially expressed in at least one pairwise comparison with 20 ELIPs being significantly induced by desiccation alone or in combination with other stresses.
Discussion
Haberlea can withstand a combination of abiotic stresses
Here we showed that the desiccation tolerance in H. rhodopensis is fully preserved in darkness, as well as in chilling/freezing temperatures. Prolonged darkness combined with low temperatures occurs, for example, around and beyond the polar circles. Air-dried plants grown under photoperiod or in complete darkness withstand desiccation to just 5% RWC, which would otherwise kill non-resurrection plants. Measurements of the electrolyte leakage confirmed that neither the two stresses alone nor their combination inflicted serious damage on the plants. The full recovery of Haberlea rhodopensis from full and fast air-drying seems a special feature for this species, as the recovery from dehydration of other resurrection species, such as Boea hygrometrica and Craterostigma plantagineum, is more problematic after fast air-drying. Additionally, we demonstrated that H. rhodopensis is tolerant to chilling and once in a dehydrated state, it can also tolerate freezing (sub-zero) temperatures.
Haberlea-specific genes and expansion of gene families involved in stress tolerance
Here, we report a high-quality sequencing and assembly of the H. rhodopensis genome, whose size (~ 1.27 Gb) is similar to the genome of its close relative B. hygrometrica [46]. We identified 10,435 genes specific for Haberlea (not sharing sequence homology with other species), constituting a very high percentage of the predicted genes (23.55%). Genes specific to only one or a few closely related species are known as taxonomically restricted genes (TRGs) [67]. TRGs are implicated in the adaptation to unfavorable environments [68]. It is likely that some of the TRGs of H. rhodopensis are important for the tolerance to extreme abiotic stress factors, such as desiccation, darkness, and chilling/freezing. Many genes specific to H. rhodopensis encode proteins with unknown functions. Future functional studies may reveal new and important players in the adaptation to extreme environments.
Expansion of particular gene families is related with their increased importance for adaptation to new or/and extreme environments. The large number of gene families expanded in H. rhodopensis (89 in total) supports this notion. Members of some of these gene families are known to be involved in responses to abiotic factors, such as drought/desiccation and photoperiod (light/darkness). For example, a SERINE/THREONINE-PROTEIN PHOSPHATASE, DISEASE RESISTANCE PROTEINs, and two ELIP genes were the highest induced genes in Haberlea exposed to desiccation [1]. The ELIP family was shown to have expanded in all resurrection species [56]. Protein phosphatase 7 (PP7) proteins are specifically present in the plant kingdom and are localized mainly in the nucleus [69]. In Arabidopsis, AtPP7L is involved in chloroplast development, and its overexpression confers resistance to highlight stress [70]. In H. rhodopensis, we found a significant expansion of the SERINE/THREONINE-PROTEIN PHOSPHATASE 7 LONG FORM family. The gene families expanded in H. rhodopensis are also involved in pathways such as the SnRK1 kinase regulatory system, known for its role in adjusting cellular metabolism during starvation and stress conditions [71]. Furthermore, in the recently sequenced cold-tolerant extremophile Eutrema salsugineum [72], the expanded gene families were enriched for genes related to cold response and hormone signaling which was also the case in H. rhodopensis, suggesting similarities in adaptation to cold.
Transcriptional rewiring during desiccation in darkness
Very low light and even darkness can occur in different habitats, for example in the undergrowth under dense forests or around the polar circles during the long winters. H. rhodopensis is the only species with a sequenced genome that can withstand extended darkness. Furthermore, it is one of the very few species in which comprehensive molecular studies on adaptation to darkness have been performed. Overall, transcriptional regulation during desiccation was more prominent than during darkness. Some of the key genes are highlighted in Fig. 9. The massive induction of ELIP, LEA, and HSP genes in the desiccated Haberlea plants confirmed earlier studies showing that these genes are induced by dehydration in vegetative tissues of both model and resurrection species, including H. rhodopensis [1, 73]. The ELIP gene family, as noted above, expanded in all resurrection plants, including Haberlea, and the ELIP genes are switched on during desiccation and light stress to protect against these abiotic stresses [55, 56]. However, as seen in Fig. 8, our results demonstrate that the expression of the majority of ELIP genes is significantly upregulated also during desiccation in the dark. This suggests that ELIPs are an indispensable component of the desiccation response and that their role is not limited to the presence of light. Moreover, it appears that there is a bypassing mechanism, which, upon desiccation, activates the expression of ELIPs and other light-related genes independently of light, as their usual stimulus. LEAs are known to respond to dehydration in drying seeds and desiccated vegetative tissues. However, some LEA genes in Haberlea (LEA 2 and LEA D-29) are highly induced by darkness as well suggesting that they are highly important for plant survival in darkness [3]. Interestingly, one of the LEA genes (AT1G64065_LEA, Fig. 8) was specifically repressed upon desiccation in darkness.
Sugars are implicated in the defense against dehydration in many ways: accumulation of sucrose as water replacement is a universal response to desiccation of many resurrection species, and raffinose family oligosaccharides (RFOs), such as raffinose, stachyose, and verbascose, can protect against drought-induced oxidative stress [1, 74,75,76]. In line with these previous studies, it was not surprising to see that many genes related to sugar metabolism and transport respond to desiccation in Haberlea. The induction of stachyose synthase may contribute to the accumulation of RFOs, whereas the induction of the sucrose transport protein genes SUC3 and SUC4 may contribute to the required increase in sucrose mobility during the stress response [1, 77, 78].
Several genes encoding for signaling proteins seem to be specific for drought/desiccation. These include a CBL-INTERACTING PROTEIN KINASE 2, a PROTEIN PHOSPHATASE 1 REGULATORY SUBUNIT, and a PP2C gene. The PP2C gene is also the highest induced gene at both the earliest and the latest time points during dehydration [1]. This, together with its presumed function as a component situated at the beginning of the signaling cascade, makes it an ideal candidate for further functional studies.
Moreover, in H. rhodopensis, we found upregulation of several bHLH encoding genes during desiccation and desiccation in darkness. On the other hand, GRF3, GRF4, and PIF1 are specific for darkness. In Arabidopsis, GRF3 is implicated in integrating environmental stimuli into developmental programs: genes downstream of GRF3 are related to plant growth, development, phytohormone biosynthesis and signaling, and the cell cycle [79]. In rice, GRF4 promotes and integrates nitrogen assimilation, carbon fixation, and growth [80]. Regulation of growth is certainly critical in Haberlea exposed to long-term darkness. Darkness and desiccation activate the autophagy pathway, as several genes specific for autophagy were upregulated in the absence of light and/or in dehydrated samples. This seems to reflect the starvation specifically caused by darkness or/and desiccation, as the low temperatures alone did not upregulate these autophagy-related genes.
Transcriptional reprogramming at low temperatures
The RNA seq analysis of the samples from the low-temperature experiment indicated that chilling and freezing induce a very dissimilar transcriptomic response in H. rhodopensis. Many of the genes typically associated with desiccation tolerance, such as ELIPs and LEAs, and representatives of signal transduction pathways, such as F-box proteins and EARLY RESPONSIVE TO DEHYDRATION 7 (ERD7), were also induced by low temperatures, especially chilling. Conversely, other genes mostly associated with cold responses were upregulated by desiccation as well. This includes RARE COLD-INDUCIBLE 2A (RCI2A), COLD REGULATED GENE 27 (COR27), a temperature-induced lipocalin (TIL), and a low temperature-induced 65 kDa protein (LTI65). Additionally, several TFs known to regulate cold tolerance including INDUCER OF CBF EXPRESSION 1 (ICE1), and DREB3 were induced in low-temperature conditions. Interestingly, a DREB3 homolog and a bHLH-domain containing protein were induced only during all low-temperature time points. The bHLH and DREB TFs are also shown to be induced in cold-tolerant halophyte (E. salsugineum [81]). Furthermore, in transgenic tomato DREB3 overexpression improves tolerance to cold stress [82]. This shows that H. rhodopensis utilizes the same protective strategy elements for different purposes. In turn, genes related to multiple aspects of photosynthesis are always downregulated in response to these stressors, demonstrating that photosynthesis shutdown is a standard reaction against abiotic challenges.
Overall, by combining genome assembly and transcriptome analyses, the study presented here reveals some of the important players, such as PP2C, ERD7, and TIL, involved in the responses of H. rhodopensis to several different abiotic stresses. These are suitable candidates for future functional studies and are a valuable resource for the scientific community that enables a better understanding of mechanistic aspects of desiccation and the response to long-term darkness and low temperatures in plants. The information obtained by dissecting the molecular responses of different stresses in naturally tolerant plants, like key genes and/or genetic variants, as well as specific regulation mechanisms, can be translated into designing strategies for increasing stress resistance in other food crops. For example, factors simultaneously contributing to desiccation tolerance in vegetative tissues in resurrection species and in dehydrating seeds in crops are promising candidates for that purpose. Genes from desiccation-tolerant species have already been shown to function in model plants and crops and mitigate abiotic stresses, such as drought and salinity [83, 84]. Identifying novel Haberlea genes and pathways that potentially contribute to tolerance to extreme environments may also pave the way for engineering crops with multiple stress tolerance and higher productivity under unfavorable conditions.
Data availability
Genome and transcriptome sequencing data, and genome assembly and annotation from this article can be found in the EBI ENA database under accession number PRJEB40105.
References
Gechev TS, Benina M, Obata T, Tohge T, Sujeeth N, Minkov I, Hille J, Temanni MR, Marriott AS, Bergström E, Thomas-Oates J et al (2013) Molecular mechanisms of desiccation tolerance in the resurrection glacial relic Haberlea rhodopensis. Cell Mol Life Sci 70:689–709
Giarola V, Hou Q, Bartels D (2017) Angiosperm plant desiccation tolerance: hints from transcriptomics and genome sequencing. Trends Plant Sci 22:705–717
Durgud M, Gupta S, Ivanov I, Omidbakhshfard MA, Benina M, Alseekh S, Staykov N, Hauenstein M, Dijkwel PP, Hörtensteiner S, Toneva V et al (2018) Molecular mechanisms preventing senescence in response to prolonged darkness in a desiccation-tolerant plant. Plant Physiol 177:1319–1338
Stolárik T, Hedtke B, Šantrůček J, Ilík P, Grimm B, Pavlovič A (2017) Transcriptional and post-translational control of chlorophyll biosynthesis by dark-operative protochlorophyllide oxidoreductase in Norway spruce. Photosynth Res 132:165–179
Benina M, Obata T, Mehterov N, Ivanov I, Petrov V, Toneva V, Fernie AR, Gechev TS (2013) Comparative metabolic profiling of Haberlea rhodopensis, Thellungiella halophyla, and Arabidopsis thaliana exposed to low temperature. Front Plant Sci 4:499
Mihailova G, Solti Á, Sárvári É, Keresztes Á, Rapparini F, Velitchkova M, Simova-Stoilova L, Aleksandrov V, Georgieva K (2020) Freezing tolerance of photosynthetic apparatus in the homoiochlorophyllous resurrection plant Haberlea rhodopensis. Environ Exp Bot 178:104157
Garg V, Dudchenko O, Wang J, Khan AW, Gupta S, Kaur P, Han K, Saxena RS, Kale SM, Pham M, Yu J et al (2022) Chromosome-length genome assemblies of six legume species provide insights into genome organization, evolution, and agronomic traits for crop improvement. J Adv Res 42:315–329
Garg V, Khan AW, Fengler K, Llaca V, Yuan Y, Vuong TD, Harris C, Chan T-F, Lam HM, Varshney RK (2023) Near-gapless genome assemblies of Williams 82 and Lee cultivars for accelerating global soybean research. Plant Genome 16:e20382 https://doi.org/10.1002/tpg2.20382
Mascher M, Gundlach H, Himmelbach A, Beier S, Twardziok SO, Wicker T, Radchuk V, Dockter C, Hedley PE, Russell J, Bayer M et al (2017) A chromosome conformation capture ordered sequence of the barley genome. Nature 544:427–433
Schmidt MH, Vogel A, Denton AK, Istace B, Wormit A, van de Geest H, Bolger ME, Alseekh S, Maß J, Pfaff C, Schurr U, Chetelat R, Maumus F, Aury JM, Koren S, Fernie AR, Zamir D, Bolger AM, Usadel B (2017) De novo assembly of a new Solanum pennellii accession using Nanopore sequencing. Plant Cell 29:2336–2348
Zhang L, Hu J, Han X, Li J, Gao Y, Richards CM, Zhang C, Tian Y, Liu G, Gul H, Wang D et al (2019) A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat Commun 10:1494
Dong Y, Gupta S, Sievers R, Wargent JJ, Wheeler D, Putterill J, Macknight R, Gechev T, Mueller-Roeber B, Dijkwel PP (2019) Genome draft of the Arabidopsis relative Pachycladon cheesemanii reveals novel strategies to tolerate New Zealand’s high ultraviolet B radiation environment. BMC Genomics 20:838
Chin CS, Peluso P, Sedlazeck FJ, Nattestad M, Concepcion GT, Clum A, Dunn C, O’Malley R, Figueroa-Balderas R, Morales-Cruz A, Cramer GR et al (2016) Phased diploid genome assembly with single-molecule real-time sequencing. Nat Methods 13:1050–1054
Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736
Vurture GW, Sedlazeck FJ, Nattestad M, Underwood CJ, Fang H, Gurtowski J, Schatz MC (2017) GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33:2202–2204
Roach MJ, Schmidt SA, Borneman AR (2018) Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinform 19:460
Ghurye J, Pop M, Koren S, Bickhart D, Chin CS (2017) Scaffolding of long read assemblies using long range contact information. BMC Genomics 18:527
Gurevich A, Saveliev V, Vyahhi N, Tesler G (2013) QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212
Smit A, Hubley R, RepeatModeler Open-1.0.10. (2008) Available from http://www.repeatmasker.org/
Kapitonov VV, Jurka J (2008) A universal classification of eukaryotic transposable elements implemented in Repbase. Nat Rev Genet 9:411–412
Smit A, Hubley R, RepeatMasker Open-4.0. (2015) Available from http://www.repeatmasker.org/
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120
Kopylova E, Noé L, Touzet H (2012) SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 28:3211–3217
Kim D, Paggi JM, Park C, Bennett C, Salzberg SL (2019) Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37:907–915
Kovaka S, Zimin AV, Pertea GM, Razaghi R, Salzberg SL, Pertea M (2019) Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol 20:278
Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M (2016) BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32:767–769
Haas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, White O, Buell CR, Wortman JR (2008) Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol 9:R7
Schwacke R, Ponce-Soto GY, Krause K, Bolger AM, Arsova B, Hallab A, Gruden K, Stitt M, Bolger ME, Usadel B (2019) MapMan4: a refined protein classification and annotation framework applicable to multi-omics data analysis. Mol Plant 12:879–892
Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S et al (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240
Nawrocki EP, Eddy SR (2013) Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29:2933–2935
Chan PP, Lin BY, Mak AJ, Lowe TM (2019) tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res 49:9077–9096
Emms DM, Kelly S (2019) OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20:238
Han MV, Thomas GW, Lugo-Martinez J, Hahn MW (2013) Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol Biol Evol 30:1987–1997
Kumar S, Stecher G, Suleski M, Hedges SB (2017) TimeTree: a resource for timelines, timetrees, and divergence times. Mol Biol Evol 34:1812–1819
Lovell JT, Sreedasyam A, Schranz ME, Wilson M, Carlson JW, Harkess A, Emms D, Goodstein DM, Schmutz J (2022) GENESPACE tracks regions of interest and gene copy number variation across multiple genomes. Elife 11:e78526
Tian F, Yang DC, Meng YQ, Jin J, Gao G (2020) PlantRegMap: charting functional regulatory maps in plants. Nucleic Acids Res 48:D1104–D1113
Li P, Quan X, Jia G, Xiao J, Cloutier S, You FM (2016) RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants. BMC Genomics 17:852
Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139–140
Gu Z, Eils R, Schlesner M (2016) Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32:2847–2849
Hornik K, Buchta C, Zeileis A (2009) Open-source machine learning: R meets Weka. Comput Stat 24:225–232
Boyle EI, Weng S, Gollub J, Jin H, Botstein D, Cherry JM, Sherlock G (2004) GO: TermFinder—open source software for accessing gene ontology information and finding significantly enriched gene ontology terms associated with a list of genes. Bioinformatics 20:3710–3715
Zonneveld BJ, Leitch IJ, Bennett MD (2005) First Nuclear DNA Amounts in more than 300 Angiosperms. Ann Bot 96:229–24477
Nishii K, Hart M, Kelso N, Barber S, Chen Y-Y, Thomson M, Trivedi U, Twyford AD, Möller M (2022) The first genome for the Cape Primrose Streptocarpus rexii (Gesneriaceae), a model plant for studying meristem-driven shoot diversity. Plant Direct 6:e388
Costa MC, Artur MA, Maia J, Jonkheer E, Derks MF, Nijveen H, Williams B, Mundree SG, Jiménez-Gómez JM, Hesselink T, Schijlen EG et al (2017) A footprint of desiccation tolerance in the genome of Xerophyta viscosa. Nature Plants 3:17038
Xiao L, Yang G, Zhang L, Yang X, Zhao S, Ji Z, Zhou Q, Hu M, Wang Y, Chen M, Xu Y et al (2015) The resurrection genome of Boea hygrometrica: a blueprint for survival of dehydration. Proc Natl Acad Sci 112:5833–5837
VanBuren R, Wai CM, Giarola V, Župunski M, Pardo J, Kalinowski M, Grossmann G, Bartels D (2023) Core cellular and tissue-specific mechanisms enable desiccation tolerance in Craterostigma. Plant J 114:231–245
VanBuren R, Wai CM, Pardo J, Giarola V, Ambrosini S, Song X, Bartels D (2018) Desiccation tolerance evolved through gene duplication and network rewiring in Lindernia. Plant Cell 30:2943–2958
Hosmani PS, Flores-Gonzalez M, van de Geest H, Maumus F, Bakker LV, Schijlen E, van Haarst J, Cordewener J, Sanchez-Perez G, Peters S et al (2019) An improved de novo assembly and annotation of the tomato reference genome using single-molecule sequencing Hi-C proximity ligation and optical maps. bioRxiv. https://doi.org/10.1101/767764
Ramírez-Sánchez O, Pérez-Rodríguez P, Delaye L, Tiessen A (2016) Plant proteins are smaller because they are encoded by fewer exons than animal proteins. Genomics Proteomics Bioinform 14:357–370
Zhuang W, Chen H, Yang M, Wang J, Pandey MK, Zhang C, Chang WC, Zhang L, Zhang X, Tang R, Garg V et al (2019) The genome of cultivated peanut provides insight into legume karyotypes, polyploid evolution and crop domestication. Nat Genet 51:865–876
Bremer B (2009) Asterids. In: Hedges SB, Kumar S (eds) The timetree of life. Oxford University Press, pp 177–187
Long M, Betran E, Thornton K, Wang W (2003) The origin of new genes: glimpses from the young and old. Nat Rev Genet 4:865–875
Tautz D, Domazet-Lošo T (2011) The evolutionary origin of orphan genes. Nat Rev Genet 12:692–702
Hutin C, Nussame L, Moise N, Moya I, Kloppstech K, Havaux M (2003) Early light-inducd proteins protect Arabidopsis from photooxidative stress. Proc Natl Acad Sci 100:4921–4926
VanBuren R, Pardo J, Man Wai C, Evans S, Bartels D (2019) Massive tandem proliferation of ELIPs supports convergent evolution of desiccation tolerance across land plants. Plant Physiol 179:1040–1049
Dai J, Sun J, Peng W, Liao W, Zhou Y, Zhou XR, Qin Y, Cheng Y, Cao S (2022) FAR1/FHY3 transcription factors positively regulate the salt and temperature stress responses in Eucalyptus grandis. Front Plant Sci 13:883654
St. Aubin B, Wai CM, Raju SKK, Niederhuth CE, VanBuren R (2022) Regulatory dynamics distinguishing desiccation tolerance strategies within resurrection grasses. Plant Direct 6:e457
Kong F, Ramonell KM (2022) Arabidopsis Toxicos en Levadura 12 modulates salt stress and ABA responses in Arabidopsis thaliana. Int J Mol Sci 23:7290
Medina J, Catalá R, Salinas J (2001) Developmental and stress regulation of RCI2A and RCI2B, two cold-inducible genes of arabidopsis encoding highly conserved hydrophobic proteins. Plant Physiol 125:1655–1666
Nylander M, Heino P, Helenius E, Palva ET, Ronne H, Welin BV (2001) The low-temperature- and salt-induced RCI2A gene of Arabidopsis complements the sodium sensitivity caused by a deletion of the homologous yeast gene SNA1. Plant Mol Biol 45:341–352
Koops P, Pelser S, Ignatz M, Klose C, Marrocoo-Selden K, Kretsch T (2011) EDL3 is an F-box protein involved in the regulation of abscisic acid signalling in Arabidopsis thaliana. J Exp Bot 62:5547–5560
Gao Z-Y, Li Z-H, Lin D-L, Jin X-H (2021) Chromosome-scale genome assembly of the resurrection plant Acanthochlamys bracteata (Velloziaceae). Genome Biol Evol 13:147
Gechev T, Lyall R, Petrov V, Bartels D (2021) Systems biology of resurrection plants. Cell Mol Life Sci 78:6365–6394
Markakis MN, Boron AK, Loock BV, Saini K, Cirera S, Verbelen J-P, Vissenberg K (2013) Characterization of a small auxin-up RNA (SAUR)-like gene involved in Arabidopsis thaliana development. PLoS ONE 8:e82596
Sekhwal MK, Li P, Lam I, Wang X, Cloutier S, You FM (2015) Disease resistance gene analogs (RGAs) in plants. Int J Mol Sci 16:19248–19290
Giarola V, Krey S, Frerichs A, Bartels D (2015) Taxonomically restricted genes of Craterostigma plantagineum are modulated in their expression during dehydration and rehydration. Planta 241:193–208
Giarola V, Bartels D (2015) What can we learn from the transcriptome of the resurrection plant Craterostigma plantagineum? Planta 242:427–434
Farkas I, Dombrádi V, Miskei M, Szabados L, Koncz C (2007) Arabidopsis PPP family of serine/threonine phosphatases. Trends Plant Sci 12:169–176
Xu D, Marino G, Klingl A, Enderle B, Monte E, Kurth J, Hiltbrunner A, Leister D, Kleine T (2019) Extrachloroplastic PP7L functions in chloroplast development and abiotic stress tolerance. Plant Physiol 180:323–341
Wurzinger B, Nukarinen E, Nägele T, Weckwerth W, Teige M (2018) The SnRK1 kinase as central mediator of energy signaling between different organelles. Plant Physiol 176:1085–1094
Xiao M, Hao G, Guo X, Feng L, Lin H, Yang W, Chen Y, Zhao K, Xiang L, Jiang X, Mei D, Hi Q (2023) A high-quality chromosome-level Eutrema salsugineum genome, an extremophile plant model. BMC Genomics 24:174
Dinakar C, Bartels D (2013) Desiccation tolerance in resurrection plants: new insights from transcriptome, proteome and metabolome analysis. Front Plant Sci 4:482
Ali Q, Ashraf M (2011) Induction of drought tolerance in maize (Zea mays L.) due to exogenous application of trehalose: growth, photosynthesis, water relations and oxidative defence mechanism. J Agron Crop Sci 197:258–271
Gechev TS, Dinakar C, Benina M, Toneva V, Bartels D (2012) Molecular mechanisms of desiccation tolerance in resurrection plants. Cell Mol Life Sci 69:3175–3186
Moyankova D, Mladenov P, Berkov S, Peshev D, Georgieva D, Djilianov D (2014) Metabolic profiling of the resurrection plant Haberlea rhodopensis during desiccation and recovery. Physiol Plant 152:675–687
Barth I, Meyer S, Sauer N (2003) PmSUC3: characterization of a SUT2/SUC3-type sucrose transporter from Plantago major. Plant Cell 15:1375–1385
Zheng W, Jiang Y, Wang X, Huang S, Yuan M, Guo Y (2019) AP3M harbors actin filament binding activity that is crucial for vacuole morphology and stomatal closure in Arabidopsis. Proc Natl Acad Sci 116:18132–18141
Piya S, Liu J, Burch-Smith T, Baum TJ, Hewezi T (2020) A role for Arabidopsis growth-regulating factors 1 and 3 in growth-stress antagonism. J Exp Bot 71:1402–1417
Li S, Tian Y, Wu K, Ye Y, Yu J, Zhang J, Liu Q, Hu M, Li H, Tong Y, Harberd NP, Fu X (2018) Modulating plant growth-metabolism coordination for sustainable agriculture. Nature 560:595–600
Wang J, Zhang Q, Cui F, Hou L, Zhao S, Xia H, Qiu J, Li T, Zhang Y, Wang X, Zhao C (2017) Genome-wide analysis of gene expression provides new insights into cold responses in Thellungiella salsuginea. Front Plant Sci 8:713
Wang G, Xu X, Wang H, Liu Q, Yang X, Liao L, Cai G (2019) A tomato transcription factor, SlDREB3 enhances the tolerance to chilling in transgenic tomato. Plant Physiol Biochem 142:254–262
Huang Z, Tang R, Yi X, Xu W, Zhu P, Jiang CZ (2022) Overexpressing Phytochrome Interacting Factor 8 of Myrothamnus flabellifolia enhanced drought and salt tolerance in Arabidopsis. Int J Mol Sci 23:8155
Wang Z, Zhu Y, Wang L, Liu X, Liu Y, Phillips J, Deng X (2009) A WRKY transcription factor participates in dehydration tolerance in Boea hygrometrica by binding to the W-box elements of the galactinol synthase (BhGolS1) promoter. Planta 230:1155–1166
Acknowledgements
Z.N. acknowledges the financial support from the Max Planck-Gesellschaft (MPG) sequencing fund. B.M-R. acknowledges funding from the Max Planck Institute of Molecular Plant Physiology and the University of Potsdam. This work was supported by resources provided by high-performance computing infrastructure at the Max Planck Institute of Molecular Plant Physiology, and the Pawsey Supercomputing Research Centre with funding from the Australian Government and the Government of Western Australia.
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was funded by the European Union's Horizon 2020 research and innovation programs PlantaSYST (SGA-CSA No. 739582 under FPA No. 664620), CropStrengthen (GA No. 642901), RESIST No. (823746), and by the European Regional Development Fund through the Bulgarian “Science and Education for Smart Growth” Operational Programme (project BG05M2OP001-1.003-001-C01).
Author information
Authors and Affiliations
Contributions
Conceptualization, S.G., A.R.F., and T.G.; methodology, S.G., V.G., and T.G.; data acquisition, analysis, and interpretation, S.G., V.P., V.G., Z.N., and T.G.; writing- original draft preparation, S.G., V.P., V.G., and T.G.; writing- review and editing, all authors; funding acquisition, T.G., Z.N., A.R.F., V.P., and B.M-R. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gupta, S., Petrov, V., Garg, V. et al. The genome of Haberlea rhodopensis provides insights into the mechanisms for tolerance to multiple extreme environments. Cell. Mol. Life Sci. 81, 117 (2024). https://doi.org/10.1007/s00018-024-05140-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00018-024-05140-3