Chromosome-level genome assembly of the Siberian chipmunk (Tamias sibiricus)

Li, Ran; Zhang, Mingfei; Cha, Muha; Xiang, Jishan; Yi, Xianfeng

doi:10.1038/s41597-022-01910-5

Chromosome-level genome assembly of the Siberian chipmunk (Tamias sibiricus)

Data Descriptor
Open access
Published: 24 December 2022

Volume 9, article number 783, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

Chromosome-level genome assembly of the Siberian chipmunk (Tamias sibiricus)

Download PDF

Ran Li ORCID: orcid.org/0000-0001-8567-3527¹,
Mingfei Zhang²,
Muha Cha²,
Jishan Xiang² &
…
Xianfeng Yi¹

1947 Accesses
4 Citations
1 Altmetric
Explore all metrics

Abstract

Tamias sibiricus is regarded as one predominant scatter-hoarder that stores their food items both in small scattered caches and underground larder-hoards. This unique behavior, though providing essential seed dispersal services for many plant species worldwide, relies highly on accurate spatial memory and acute sense of olfaction. Here, we assembled a chromosome-scale genome of T. sibiricus using Illumina sequencing, PacBio sequencing and chromosome structure capture technique. The genome was 2.64 Gb in size with scaffold N50 length of 172.61 Mb. A total of 2.59 Gb genome data was anchored and orientated onto 19 chromosomes (ranging from 28.70 to 222.90 Mb) with a mounting rate of up to 98.03%. Meanwhile, 25,311 protein-coding genes were predicted with an average gene length of 32,936 bp, and 94.73% of these genes were functionally annotated. This reference genome will be a valuable resource for in-depth studies on basic biological possess and environmental adaptation of the Siberian chipmunk, as well as promoting comparative genomic analyses with other species within Rodentia.

Chromosome-level genome assembly of the caenogastropod snail Rapana venosa

Article Open access 16 August 2023

A chromosome-level genome assembly of Prosopocoilus inquinatus Westwood, 1848 (Coleoptera: Lucanidae)

Article Open access 20 July 2024

A chromosomal-level genome assembly of Serrognathus titanus Boisduval, 1835 (Coleoptera: Lucanidae)

Article Open access 15 August 2024

Background & Summary

The Siberian chipmunk, Tamias sibiricus (Laxmann, 1769) belongs to the subfamily Xerinae, within the family Sciuridae of the order Rodentia¹. This species is a small, diurnal and ground-dwelling squirrel that lives in mountain and forest habitats with bushy understory². The wild populations of T. sibiricus are naturally distributed in Russia and several east Asian countries (China, Mongolia, Korea and Japan). Meanwhile, this squirrel is one of most popular companion animals because of its attractive appearance and unique behavior³. Hence, it has been introduced as pets into European countries for decades and the accidentally escaped individuals have successfully established their populations in the wild⁴. Additionally, as important seed dispersal agents adopting the primary strategies of scatter- and larder-hoarding behavior, T. sibiricus provides essential seed dispersal services in many ecosystems across the world⁵. Over the past decades, studies of T. sibiricus have mainly focused on biology, behavior, ecology, and phylogeography^5,6,7,8. However, little is known about the genetic basis and mechanism of its environmental adaptation because of limited molecular information.

In the present study, we constructed a high-quality genome assembly for the Siberian chipmunk using the integration of short reads (Illumina sequencing), long reads (PacBio sequencing) and Hi-C reads (proximity ligation chromatin conformation capture). The final assembled genome size of T. sibiricus was 2.64 Gb with the scaffold N50 length of 172.61 Mb. A total of 2.59 Gb assembled genome sequences were successfully anchored on 19 chromosomes. This number of chromosomes was consistent with the outputs of the karyotype ananlysis⁹. 1.03 Gb repetitive sequences were identified, constituting 38.87% of this reference genome. A total of 25,311 protein-coding genes were predicted, and 97.69% of these genes were functionally annotated.

Methods

Sample collection and ethics statement

An adult female specimens of T. sibiricus was originally collected from a forestry farm in Chifeng, Inner Mongolia Autonomous Region of China (41°39′N, 118°22′E) in October 2020. The sample was then maintained at Qufu Normal University, and stored at −80°C prior to DNA and RNA extraction. All experiments were performed according to the Guidelines for the Care and Use of Laboratory Animals in China. The sampled squirrel in this study was approved by the Institutional Animal Care and Use Committee (IACUC) of Qufu Normal University, Shandong, China, under permit No. 2021095.

Sequencing

Muscle tissue of the female body was prepared for transcriptome, Illumina, PacBio whole-genome and Hi-C sequencing. All sequencing analyses were performed by the Shanghai Origingene Bio-pharm Technology Co. Ltd. (Shanghai, China). Genome DNA was extracted using a Blood & Cell Culture DNA Mini Kit (Qiagen, Germany). Quantity and quality of the total DNA were determined by 2100 Bioanalyzer (Agilent, USA) and Qubit 3.0 Fluorometer (Invitrogen, USA), respectively. Total RNA was isolated using a TRIzol Total RNA Isolation Kit (Takara, USA) following the manufacturer’s protocols¹⁰. The NanoDrop 2000 spectrophotometer (Labtech, USA) and 2100 Bioanalyzer were used to check RNA quality.

Whole-genome shotgun sequencing was performed with a single molecule real-time (SMRT) PacBio system. PacBio Sequel II libraries with an insert size of 30 kb were prepared using a SMRTbell Template Prep Kit 2.0. For survey analysis and the error rates associated with long reads, two short paired-end libraries with an insert size of 350 bp were constructed using Truseq DNA PCR-free Kit (Illumina, USA). The next-generation sequence data was generated on the Illumina Hiseq X10 platform. To construct pseudo-chromosomes, the Hi-C library was constructed according to the standard protocols described previously¹¹. After quality control, 150 bp paired-end reads (PE150) were obtained using the Illumina Hiseq X10 platform. The cDNA library was constructed using a TruSeq RNA Sample Prep Kit v2 (Illumina, USA) and sequenced on the Illumina Hiseq X10 system using the paired-end strategy.

Genome survey and assembly

A total of 132.39 Gb Illumina short-insert-size data was firstly generated to get a preliminary understanding of the genome characteristics (Table 1). Based on the clean data with duplications removed, the K-mer frequency distribution was calculated with Jellyfish v2.2.6¹² and the results were subsequently analyzed by GenomeScope v2.0¹³. The genome size of T. sibiricus was estimated to be 2.51 Gb with the number of unique K-mers peaked at 21 (Fig. 1). Evaluation of genome characteristics showed the heterozygosity rate of the assembled genome was 0.21% (Table S1).

Table 1 Statistics of the DNA sequence data used for genome assembly.

Full size table

For PacBio sequencing, approximately 111.63 Gb long reads were obtained after removing adaptors in polymerase reads with default parameters. The mean length and N50 length of PacBio subreads was 35.62 and 24.13 kb, respectively (Table 1). After self-corrected and long read polished, genome initial assembly was performed using Canu v1.8¹⁴. As a result, we generated a 2.65 Gb genome assembly with the contig N50 of 9.40 Mb (Table 2). To further improve the quality and accuracy of the genome assembly, we corrected the genome by short-read polishing with high coverage of Illumina reads using Pilon v1.23¹⁵. Total size of the draft genome assembly was 2.64 Gb with an N50 length of 9.43 Mb. For the chromosome-level assembly, 217.38 Gb Hi-C sequencing data was generated and used to anchor contigs into pseudo-chromosomes (Table 1). 3D-DNA v180922 pipeline was used to generate a chromosome-level assembly of the genome¹⁶. After removing the duplicates, the Hi-C contact map was directly taken as input for 3D-DNA, the location and direction of each contig was determined, and the neighboring contigs were connected using 100 N gaps (100 Ns). Juicebox v1.11.08 (Juicebox Assembly Tools, JBAT) was subsequently used to review and manually curate scaffolding errors¹⁷. The final size of this genome was 2.64 Gb with a scaffold N50 of 172.61 Mb (Table 2). Results showed that the size of the assembled Siberian chipmunk genome was near to that estimated from the genome survey analysis. Meanwhile, 2.59 Gb data on the base level was anchored and orientated onto 19 chromosomes with a mounting rate of up to 98.03%, and the chromosome lengths ranged from 28.70 to 222.90 Mb (Table 3 and Fig. 2). After scaffolds were clustered, ordered and orientated to restore their relative locations, the heatmap of chromosome crosstalk indicated that the genome assembly was complete and robust (Fig. 1B).

Table 2 Summary of each step in construction of the T. sibiricus genome assembly.

Full size table

Table 3 Statistics of chromosomal level assembly of T. sibiricus.

Full size table

Chromosome synteny

Collinearity analysis of chromosomes between T. sibiricus and two other Xerinae species (Sciurus vulgaris and Sciurus carolinensis) was conducted with LASTZ v1.02.00¹⁸. As shown in Fig. 3, all 19 pseudochromosomes of T. sibiricus displayed high homology with the corresponding chromosomes of another two squirrels, and two chromosomes (chr11 and chr15 of S. vulgaris, chr11 and chr14 of S. carolinensis) were fused to the chromosome (chr11) in the Siberian chipmunk. Previous studies, using cross-species chromosome painting, showed that the diploid number of chromosomes vary among the species in the superorder Glires (Rodentia and Lagomorpha)^19,20, with the Siberian chipmunks having 38 chromosomes⁹. Interestingly, the variation seems to follow a certain pattern, such as chromosome 32,34,36,38,40. Combine that with our results of chromosome synteny, chromosome fusions and fissions might occur frequent among genome evolution of Glires. Thus, further studies are needed to determine the molecular mechanism of chromosomal rearrangements and evolution with more available chromosome-level genomic data.

Repeat annotation

After the genome assembly, annotation with 3 different types of repetitive sequences, non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) was performed. RepeatModeler v2.0.1 was used to identify the repetitive elements with default parameters, and a de novo repeat sequence library was built using the results²¹. Then, a custom library was constructed combining with Dfam 3.1²² and RepBase 20181026 databases²³. For the homology prediction, repetitive elements were masked using RepeatMasker v4.1.0 on the custom library²⁴. A total of 1.03 Gb repetitive sequences were identified, constituting 38.87% of T. sibiricus genome. The predominant four categories of transposable elements (TEs) consisted of long interspersed nuclear elements (LINEs, 18.63%), DNA transposon elements (2.71%), long terminal repeats (LTRs, 10.11%), and short interspersed nuclear elements (SINEs, 8.90%) (Table 4 and Fig. 4). All ncRNAs (rRNAs, snRNAs and miRNAs) were annotated using Infernal v1.1.3²⁵ and tRNAscan-SE v2.0.7²⁶. Only high-confidence tRNAs were retained using the tRNAscan-SE script ‘EukHighConfidenceFilter’. Different types of noncoding RNAs (ncRNAs) were also annotated, yielding 6,265 tRNAs, 830 small nuclear RNAs (snRNAs), 92 ribosomal RNAs (rRNAs) and 595 micro RNAs (miRNAs) (Table S2).

Table 4 Repeat annotation in the T. sibiricus genome.

Full size table

Protein-coding gene annotation

MAKER v3.01.03 pipeline was used to predict protein-coding genes with an integration of three strategies, including ab initio prediction, transcriptome-based annotation and homology-based annotation²⁷. The ab initio prediction was generated using the pipeline BRAKER v2.1.5²⁸, which automatically trained the predictors Augustus v3.3.4²⁹ and GeneMark-ET³⁰, and made use of the mapped transcriptome data and protein homology information. The transcriptome information in BAM alignments was produced by HISAT2 v2.2.0³¹, and the protein sequences were extracted from the database OrthoDB10 v1³². For transcriptome-based annotation, the data of RNA-seq was firstly mapped to our assembly with HISAT2, and the transcriptome information in BAM alignments was produced. With the reference genome of our assembly, the RNA-seq data were further assembled into transcripts using StringTie v2.1.4³³. Protein sequences of five model rodentian species (Cricetulus griseus, Dipodomys ordii, Ictidomys tridecemlineatus, Marmota marmota and Rattus norvegicus) were downloaded from NCBI Refseq database. And all sequences were used as reference required by MAKER for the homology-based prediction. Overall, 25,311 protein-coding genes were predicted with an average gene length of 32,936 bp. The average exon number per gene was 7.52, with average exon length of 171.85 bp, and average intron length of 4850.84 bp. The final gene models predicted above were then annotated using the non-redundant (NR) protein database of NCBI, Swissprot, Pfam, the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) databases. In total, 23,995 (94.73%) were successfully annotated for at least one homologous hit by searching against these five public databases. Based on BUSCO analysis, 94.4% of the BUSCO database (mammalia_odb10) genes were identified (complete single-copy genes: 92.2%, fragmented genes: 1.5%), further underlining the accuracy and completeness of gene prediction.

Gene family

OrthoFinder v2.3.8 was used to inferred gene families (orthologue groups, orthogroups) with Diamond as the sequence aligner³⁴. The protein sequences in the T. sibiricus genome and high-quality protein annotation sequences from assembled genomes of 19 rodents were used for analysis, including the naked mole‐rat (Heterocephalus glaber), Eurasian squirrel (S. vulgaris), eastern grey squirrel (S. carolinensis), alpine marmot (M. marmota), thirteen‐lined ground squirrel (I. tridecemlineatus), Arctic ground squirrel (Urocitellus parryii), Daurian ground squirrel (Spermophilus dauricus), Iberian mole (Talpa occidentalis), Ord’s kangaroo rat (D. ordii), European blind mole (Nannospalax galili), white-footed mice (Peromyscus leucopus), deer mouse (Peromyscus maniculatus), southern grasshopper mouse (Onychomys torridus), prairie vole (Microtus ochrogaster), Chinese hamster (Cricetulus griseus), golden hamster (Mesocricetus auratus), Norway rat (R. norvegicus), mouse (Mus musculus) and degu (Octodon degus). 20,952 gene families were identified among 20 species, and a total of 433,351 genes were obtained and assigned to the orthogroups (gene families) using OrthoFinder (Table S3). Gene family analysis also showed that the genes of single-copy orthologs was 5,277. Out of the 25,311 genes of T. sibiricus, 18,863 were clustered into 15,629 orthogroups, and 148 gene families and 502 genes were unique to T. sibiricus. The number of genes assigned to different orthologous groups was displayed in Fig. S1 and Table S4.

Data Records

The genomic Illumina sequencing data was deposited in the NCBI Sequence Read Archive (SRA) database under accession No. SRR19929230³⁵.

The genomic Pacbio sequencing data was deposited in SRA database under accession No. SRR19961223³⁶.

The transcriptome Illumina sequencing data was deposited in SRA database under accession No. SRR19961278³⁷.

The Hi-C sequencing data was deposited in SRA database under accession No. SRR19960530³⁸.

The assembled genome was deposited in the GenBank at NCBI under accession No. GCA_025594165.1³⁹.

Genome annotation information of repeated sequences, gene structure and functional prediction is available in the Figshare database⁴⁰.

Technical Validation

The completeness and accuracy of the assembled genome were evaluated using two different strategies. First, BUSCO analysis revealed that 92.9% (single-copied gene: 92.2%, duplicated gene: 0.7%) of 9226 single-copy orthologues (in the mammalia_odb10 database) were successfully identified as complete, 1.5% were fragmented and 5.6% were missing in the assembly (BUSCO v4.0.5). Second, we mapped the sequencing data to the assembled genome for verifying the accuracy. The mapping rates was 97.42%, 98.00% and 96.03% for the Illumina, RNA-seq and PacBio data, respectively.

Code availability

No specific script was used in this work. The codes and pipelines used in data processing were all executed according to the manual and protocols of the corresponding bioinformatics software.

References

Wilson, D. E. & Reeder, D. M. Mammal Species of The World. A Taxonomic and Geographic Reference. (Smithsonian Institution Press, (1993).
Oshida, T., Masuda, R. & Yoshida, M. C. Phylogenetic relationships among Japanese species of the family Sciuridae Mammalia, Rodentia), inferred from nucleotide sequences of mitochondrial 12S ribosomal RNA genes. Zool. Sci. 13, 615–620 (1996).
Article CAS Google Scholar
Lee, S. J. et al. Genetic origin identification of Siberian chipmunks (Tamias sibiricus) in pet shops of South Korea. Anim. Cells Syst. 15, 161–168 (2011).
Article Google Scholar
Chapuis, J. L. Distribution in France of a naturalized pet, the Siberian chipmunk (Tamias sibiricus). Revue d’Ecologie 60, 239–253 (2005).
Google Scholar
Wang, Z.-Y. et al. Scatter-hoarding behavior in Siberian chipmunks (Tamias sibiricus): An examination of four hypotheses. Acta Ecol. Sin. 37, 173–179 (2017).
Article CAS Google Scholar
Kawamichi, M. Ecological factors affecting annual variation in commencement of hibernation in wild chipmunks Tamias sibiricus. J. Mammal. 77, 731–744 (1996).
Article Google Scholar
Marsot, M. et al. Introduced Siberian chipmunks (Tamias sibiricus barberi) contribute more to Lyme borreliosis risk than native reservoir rodents. PloS One 8, e55377 (2013).
Article ADS CAS Google Scholar
Pisanu, B., Obolenskaya, E. V., Baudry, E., Lissovsky, A. A. & Chapuis, J. L. Narrow phylogeographic origin of five introduced populations of the Siberian chipmunk Tamias (Eutamias) sibiricus (Laxmann, 1769) (Rodentia: Sciuridae) established in France. Biol. Invasions 15, 1201–1207 (2013).
Article Google Scholar
Beklemisheva, V. R. et al. Reconstruction of karyotype evoluti1on in core Glires. I. The genome homology revealed by comparative chromosome painting. Chromosome Res. 19, 549–565 (2011).
Article CAS Google Scholar
Rio, D. C., Ares, M., Hannon, G. J. & Nilsen, T. W. Purification of RNA using TRIzol (TRI reagent). Cold Spring Harbor Protocols 2010, 5439 (2010).
Article Google Scholar
Belton, J. M. et al. Hi-C: a comprehensive technique to capture the conformation of genomes. Methods 58, 268–276 (2012).
Article CAS Google Scholar
Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).
Article Google Scholar
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
Article CAS Google Scholar
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS Google Scholar
Walker, B. J. et al. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9, e112963 (2014).
Article ADS Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Article ADS CAS Google Scholar
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Article CAS Google Scholar
Harris, R. S. Improved Pairwise Alignment of Genomic DNA. Ph.D. dissertation, The Pennsylvania State University, Pennsylvania (2017).
Li, T. L. et al. Evolution of genome organizations of squirrels (Sciuridae) revealed by cross-species chromosome painting. Chromosome Res. 12, 317–335 (2004).
Article Google Scholar
Li, T. L., Wang, J. Z., Su, W., Nie, W. H. & Yang, F. Karyotypic evolution of the family sciuridae: inferences from the genome organizations of ground squirrels. Cytogenet. Genome Res. 112, 270–276 (2006).
Article CAS Google Scholar
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 117, 9451–9457 (2020).
Article ADS CAS Google Scholar
Hubley, R. et al. The Dfam database of repetitive DNA families. Nucleic Acids Res. 44, D81–D89 (2016).
Article CAS Google Scholar
Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA 6, 1–6 (2015).
Article Google Scholar
Smit, A. F., Hubley, R. & Green, P. Repeat Masker Open-4.0. http://www.repeatmasker.org (2015).
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).
Article CAS Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Article CAS Google Scholar
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 12, 1–14 (2011).
Article Google Scholar
Hoff, K. J., Lange, S., Lomsadze, A., Borodovsky, M. & Stanke, M. BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32, 767–769 (2016).
Article CAS Google Scholar
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 32, W309–W312 (2004).
Article CAS Google Scholar
Brůna, T., Lomsadze, A. & Borodovsky, M. GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genomics Bioinf. 2, lqaa026 (2020).
Article Google Scholar
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
Article CAS Google Scholar
Kriventseva, E. V. et al. OrthoDB v10: sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs. Nucleic Acids Res. 47, D807–D811 (2019).
Article CAS Google Scholar
Kovaka, S. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 20, 1–3 (2019).
Article Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 1–4 (2019).
Article Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR19929230 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR19961223 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR19961278 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR19960530 (2022).
Yi, X.-F. Tamias sibiricus isolate XY-2022, whole genome shotgun sequencing project. GenBank https://identifiers.org/insdc.gca:GCA_025594165.1 (2022).
Li, R. & Yi, X.-F. Chromosome-level genome assembly of the Siberian chipmunk, Tamias sibiricus (Rodentia: Sciuridae). figshare https://doi.org/10.6084/m9.figshare.20219664 (2022).

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 32070447 and No. 32200359) and the Young Talents Invitation Program of Shandong Provincial Colleges and Universities (No. 20190601).

Author information

Authors and Affiliations

School of Life Sciences, Qufu Normal University, Qufu, 273165, China
Ran Li & Xianfeng Yi
Key Laboratory of Agro-Ecological Protection & Exploitation and Utilization of Animal and Plant Resources in Eastern Inner Mongolia, Chifeng University, Chifeng, 024000, China
Mingfei Zhang, Muha Cha & Jishan Xiang

Authors

Ran Li
View author publications
You can also search for this author in PubMed Google Scholar
Mingfei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Muha Cha
View author publications
You can also search for this author in PubMed Google Scholar
Jishan Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Xianfeng Yi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Li R., Xiang J.S. and Yi X.F. conceived and designed the research. Li R., Zhang M.F. and Cha M.H. collected the samples and extracted the genomic DNA. Li R. and Yi X.F. conducted the experiments, analyzed the data, and wrote the manuscript. All authors read, revised and approved the final version of the manuscript.

Corresponding authors

Correspondence to Jishan Xiang or Xianfeng Yi.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, R., Zhang, M., Cha, M. et al. Chromosome-level genome assembly of the Siberian chipmunk (Tamias sibiricus). Sci Data 9, 783 (2022). https://doi.org/10.1038/s41597-022-01910-5

Download citation

Received: 06 October 2022
Accepted: 14 December 2022
Published: 24 December 2022
DOI: https://doi.org/10.1038/s41597-022-01910-5
Springer Nature Limited

This article is cited by

High-tannin food enhances spatial memory and scatter-hoarding in rodents via the microbiota-gut-brain axis
- Xiangyu Zhao
- Jiawei Guo
- Xianfeng Yi
Microbiome (2024)
Chromosome-level genome assembly of Microplitis manilae Ashmead, 1904 (Hymenoptera: Braconidae)
- Xiaohan Shu
- Ruizhong Yuan
- Xuexin Chen
Scientific Data (2023)

Chromosome-level genome assembly of the Siberian chipmunk (Tamias sibiricus)

Abstract

Similar content being viewed by others

Chromosome-level genome assembly of the caenogastropod snail Rapana venosa

A chromosome-level genome assembly of Prosopocoilus inquinatus Westwood, 1848 (Coleoptera: Lucanidae)

A chromosomal-level genome assembly of Serrognathus titanus Boisduval, 1835 (Coleoptera: Lucanidae)

Background & Summary