A database of animal metagenomes

Hu, Ruirui; Yao, Rui; Li, Lei; Xu, Yueren; Lei, Bingbing; Tang, Guohao; Liang, Haowei; Lei, Yunjiao; Li, Cunyuan; Li, Xiaoyue; Liu, Kaiping; Wang, Limin; Zhang, Yunfeng; Wang, Yue; Cui, Yuying; Dai, Jihong; Ni, Wei; Zhou, Ping; Yu, Baohua; Hu, Shengwei

doi:10.1038/s41597-022-01444-w

A database of animal metagenomes

Data Descriptor
Open access
Published: 16 June 2022

Volume 9, article number 312, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

A database of animal metagenomes

Download PDF

Ruirui Hu^1,2^na1,
Rui Yao²^na1,
Lei Li²^na1,
Yueren Xu²,
Bingbing Lei²,
Guohao Tang³,
Haowei Liang³,
Yunjiao Lei²,
Cunyuan Li^2,4,
Xiaoyue Li²,
Kaiping Liu²,
Limin Wang¹,
Yunfeng Zhang¹,
Yue Wang²,
Yuying Cui²,
Jihong Dai²,
Wei Ni²,
Ping Zhou¹,
Baohua Yu³ &
…
Shengwei Hu^1,2

4893 Accesses
9 Citations
3 Altmetric
Explore all metrics

Abstract

With the rapid development of high-throughput sequencing technology, the amount of metagenomic data (including both 16S and whole-genome sequencing data) in public repositories is increasing exponentially. However, owing to the large and decentralized nature of the data, it is still difficult for users to mine, compare, and analyze the data. The animal metagenome database (AnimalMetagenome DB) integrates metagenomic sequencing data with host information, making it easier for users to find data of interest. The AnimalMetagenome DB is designed to contain all public metagenomic data from animals, and the data are divided into domestic and wild animal categories. Users can browse, search, and download animal metagenomic data of interest based on different attributes of the metadata such as animal species, sample site, study purpose, and DNA extraction method. The AnimalMetagenome DB version 1.0 includes metadata for 82,097 metagenomes from 4 domestic animals (pigs, bovines, horses, and sheep) and 540 wild animals. These metagenomes cover 15 years of experiments, 73 countries, 1,044 studies, 63,214 amplicon sequencing data, and 10,672 whole genome sequencing data. All data in the database are hosted and available in figshare https://doi.org/10.6084/m9.figshare.19728619.

Measurement(s)	Metagenome metadata
Technology Type(s)	Collection and integration the metagenomic information of multiple animal species
Factor Type(s)	animal
Sample Characteristic - Organism	animal
Sample Characteristic - Environment	metagenome
Sample Characteristic - Location	United States of America • People’s Republic of China • Canada

Metazen – metadata capture for metagenomes

Article Open access 08 December 2014

The AnimalAssociatedMetagenomeDB reveals a bias towards livestock and developed countries and blind spots in functional-potential studies of animal-associated microbiomes

Article Open access 05 October 2023

Computational Metagenomics: State-of-the-Art, Facts and Artifacts

Background & Summary

Microorganisms play essential roles in specialized niches, including internal host biology as well as the external environment. Microbes are found in diverse habitats, including deep seas, saline marshes, and glaciers¹. The roles of microorganisms in biodiversity have become a focus of interest for researchers due to their omnipresence². This diversity represents a vast genetic resource that could be exploited for the discovery of novel genes, biomolecules for metabolic pathways, and potentially valuable end-products³. The structure and function of the microbial community has received significant attention for decades, notably in association with research concerning human microbiota⁴. However, veterinarians, animal nutritionists, and microbiologists have begun to focus on studying the microbes of domestic (horses, pigs, and ruminants) and wild animals⁵. For domestic animals, a better understanding of disease-causing microbes of livestock can contribute to achieving the goals of better foods and a cleaner environment⁶. This is not only conducive to the healthy development of domestic animal and poultry breeding industries but also reduces the risk of food-borne diseases being transmitted to humans, which is conducive to public health security⁷. Regarding wild animals, wildlife microbiota are also natural hosts for animal and human pathogens; mapping their distributions can shed light on the timing and pathways of their transmission to humans, as is the case in the current COVID-19 pandemic⁸.

With the development of ultra-high throughput metagenomic sequencing technologies, including 16s rRNA gene sequencing and whole-genome sequencing, the number and scope of metagenomic sequencing projects have increased rapidly⁹. This has led to an exponential growth of metagenomics data under different experimental conditions. Therefore, metagenomic data contain an overwhelming volume of complex information, posing challenges not only for data storage but also for metadata annotation and management. Several pioneering studies have been designed to construct resources for storing raw sequencing data, including the National Center for Biotechnology Information (NCBI)¹⁰, the Sequence Read Archive (SRA)¹¹, and the European Nucleotide Archive (ENA)¹². These public resources contain human and animal metagenomic information that will serve as an important reference for current studies¹³. For instance, a previous study¹⁴ meta-analyzed 20 publicly available datasets from 16S rRNA gene-sequencing studies of the swine gut microbiota and demonstrated that GI tract location was the strongest predictor of the swine gut microbiota composition. In addition, eight genus-level gut microbes were identified that could serve as potential markers of swine gut microbiota. Therefore, the inclusion of comprehensive metadata accompanying the sequencing dataset will enhance future meta-analyses and allow researchers to directly compare their results with those of similar studies. Although some databases provide all raw data from the relevant experiments, it is still difficult for users to compare, classify, analyze, and store the data, given the limitations of computing resources and devices¹⁵. To address this problem, various databases have been created. For example, the HumanMetagenome DB 1.0 simplifies the identification and use of public human metagenomes¹⁶. Studies such as TerrestrialMetagenome DB¹⁷ and PlanetMicrobe¹⁸ have been reported. However, there are currently few reports concerning research datasets of animal metagenomes.

The present study found that large-scale population research can usually capture the orientation law of organisms to a certain extent^19,20,21; for example, human microbiome composition^22,23,24. The scale of the research also means that multiple effects can be compared across the same set of samples; for example, rearrangement of the gut bacterial ecosystem during the weaning transition in pigs²⁵. Nevertheless, large-scale data integration can provide a common framework for comparison effects. The establishment of the animal metagenomic database aims to provide large-scale data integration related to the animal microbial datasets²⁶. The goal is to provide researchers with a browsing interface and multiple consideration options so that users can quickly obtain the information they seek. For example, considering large-scale population data is better suited to achieving common research goals and to understanding the factors that are related to the changes of animal intestinal microbial flora²⁷.

Here, we focused on collecting and integrating the metagenome content of multiple animal species to help users understand the ecological underpinnings of microbiomes. The main merit of this work lies in the integrated implementation of the basic project information in the form of a very robust and user-friendly interface²⁸. To summarize, we have organized and integrated the information in the data for ease of use by matching experimental items to microbial enrichment variation; browsing and search functions were implemented, making the database easily used by researcher and biologists. In the future, we will continue to improve the animal metagenomics database, adding common domestic animals in terms of species, and we plan to update the database every two years.

Methods

Database construction

The construction scheme of the AnimalMetagenome DB is shown in Fig. 1. First, we collected the metadata of animal metagenomes from the NCBI database and from published articles. Subsequently, we removed human samples and environmental samples as well as non-metagenomic datasets. Then, we collected project information and standardized the sample attributes. Finally, all data were assembled into the database system, and the web platform was implemented.

Data collection

All data contained in the AnimalMetagenome DB were collected manually. First, we searched for projects related to four domestic animals: pigs, horse, bovines, and sheep, and wild animals from the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject) using the keywords “Pig”, “Swine”, “Bovine”, “Cattle”, “Horse”, “Sheep”, and “Wild animal”. Then, we screened out projects related to animal metagenome research and collected information about these projects, including project description, project accession number, project creation date, and the number of samples contained in the project. Second, according to the project accession number, we collected data of all samples of this project from the NCBI BioSample database (https://www.ncbi.nlm.nih.gov/biosample) and collected other available data of these samples from published articles, such as PubMed ID, DNA extraction method, and the geographic location where the sample was collected. Then, we checked the information collected from the NCBI BioSample database with that of the original article to ensure the accuracy of the data. In addition, we integrated all information about the Experiments and Runs associated with the samples from the NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra/) and matched the sample information with the information of Experiment and Run according to the sample accession number. Finally, the metadata of all animal metagenomes were counted to indicate that the collection was complete.

Confirmation of animal metagenomes based on the metadata

We confirmed animal metagenomic samples by checking the information contained in the experiment title, sample title, and sample name. After screening, we deleted environmental samples, human samples, animal genome sequencing samples, and transcriptome sequencing samples in the animal metagenomic projects.

Standardization of attributes

In the BioSample database, the sample attributes are not standardized, and the attribute names are displayed in many different ways. Therefore, to unify the description of the sample attributes, we referred to the curatedMetagenomicData²⁹ to standardize 10 different attributes, including species, sample site, sex, age, host phenotype, experimental condition, sample collection date, sample geographic location, longitude, and latitude. For species, we divided domestic animals into four types: pigs, bovines, horses, and sheep, and referred to the NCBI Taxonomy database (https://www.ncbi.nlm.nih.gov/taxonomy) to classify wild animals by class, order, family, genus, and species. The sample site1 (host body site) was divided into five main categories: gut (e.g. cecum, ileum), stomach (e.g. abomasum, rumen), skin, oral cavity, and bio-fluids (e.g. saliva, milk). We refer to HumanMetagenomeDB¹⁶ to uniformly convert geographic locations to countries. Meanwhile, geographic location coordinates (longitude and latitude) were standardized to decimal format (rounded to two decimal places). The date was standardized in accordance with the international standard ISO 8601 (YYYY-MM-DD). The age of animals was converted to months, rather than days, weeks, or years. In addition, we referred to the Animal QTLdb database (https://www.animalgenome.org/QTLdb) to classify the study purpose of the project, which comprised health traits, production traits, microbial diversity, and life history traits. Health traits was divided into disease, immune capacity, and pathogens and parasites; life history traits included weaning; and production traits included growth, feed efficiency, feed intake, methane emission, metabolism, and meat quality.

Web platform implementation

The AnimalMetagenome DB web architecture consisted of the front-end Vue and back-end SSM. Vue is a progressive framework for building user interfaces. It uses a data-driven approach that is different from the traditional dom-driven JavaScript. The back end was written in Java, and the integration framework of Spring + Spring MVC + Mybatis was chosen for development. The database was implemented in MySQL, a cross-platform, safe, and efficient database system.

Data Records

Information about metadata in the AnimalMetagenome DB was manually collected from BioProject, BioSample¹⁰, SRA¹¹, Taxonomy³⁰, PubMed, and Google Scholar databases. Each entry contains four parts: sample information (sample ID, species, sample site, DNA extraction method, sex, age, collection date, geographic location, sample type), sequencing attributes (experiment ID, instrument, library source, library strategy, total size, total spots, total bases), project information (project ID, project title, project description, study purpose, creation date), and literature information (PubMed ID). Table 1 shows an overview of the attributes in the AnimalMetagenome DB.

Table 1 List of attributes present in the AnimalMetagenome DB.

Full size table

The current version of the data contains 5 Excel files, which are the metagenome metadata of pigs, horse, bovines, sheep and wild animals. These data are publicly available free-of-charge from the Figshare repository³¹. At the same time, these data can also be browsed, searched and downloaded through our online database website. AnimalMetagenome DB is freely available at http://animalmetagenome.com/.

Technical Validation

Contents of the database

The AnimalMetagenome DB version 1.0 includes metadata of 82,097 metagenomes from more than 540 species of animals, covering 15 years of experiments from May 2006 to May 2021. Of these metagenomes, 77% (63,214) were contained amplicon sequencing data, and 13% (10,672) were whole genome sequencing data. Furthermore, most libraries were sequenced using Illumina sequencing technology (88%), followed by LS454 sequencing technology (7%), Ion Torrent sequencing technology (2%), and other (2%) (Fig. 2a).

The database comprises 69,302 samples from 1,044 projects, of which 32,607 samples were from pigs; 16,500 samples were from wild animals; 9,300 samples were from bovines; 6,959 samples from sheep, and 3,936 samples were from horses (Fig. 2b). To facilitate the search by species, the wild animals were classified by class, order, family, genus, and species. At the level of class, the majority of the wild animal samples were from mammal species, followed by Aves species (Fig. 2c). The DNA extraction methods of some samples in the database were marked, and the DNA of most samples was extracted by DNA extraction kit (86.8%), followed by the phenol-chloroform method (5.5%), and the repeated bead beating plus column (RBB + C) method (3.9%). In addition, the host gender of some samples was also annotated, with 13.3% (9,348) of the samples derived from females and 12.7% (8,774) from males. According to the sample geographic location, metagenomes covered 73 countries; most of the samples were from the United States of America (USA), with 20.3% of the annotated samples, followed by the People’s Republic of China, with 13.2% of the annotated samples (Fig. 2d). According to the sample site information, 69.5% of the samples were from the gut, followed by the stomach at 13.5% (Fig. 2e). From the perspective of different study methods, the database also contains information about the host’s experimental treatments (for example, different diet types: “high-fiber diet” and “low-fiber diet”), and information on chemical administration (for example, the host used antibiotics, drugs, or other specific chemicals).

According to purpose of projects, the 1,044 projects contained in the database were classified into four main types: microbial diversity, production traits, health traits, and life history traits (Fig. 2f). For the health traits, 26 different diseases were included (for example: “diarrhea”, “flu”), of which diarrhea-related projects were the most numerous. Among projects on production traits, 42% of the studies were related to metabolism, followed by the studies on methane emission accounting for 20%, feed efficiency accounting for 20%, and feed intake accounting for 6%.

Functions of the database

The AnimalMetagenome DB provides a platform that can help users to browse, search and download animal metagenome metadata according to their interests. The AnimalMetagenome DB user interface is divided into three parts (Fig. 3).

“Browse”, displays the specific information of species, study purpose, and sample site. Users can choose any category or subcategory of interest to obtain results. In addition, users can further filter the dataset by selecting single or multiple filters of “Species”, “Study Purpose “, and “Sample Site” in the obtained results. After filtering, the metadata of the selected item can be downloaded as an Excel file. The queried information will be displayed below the screening box, including project ID, sample ID, experiment ID, PubMed ID, species, sample site2, study purpose, creation date, library strategy, and DNA extraction method.

“Search”, contains all the data of the current version of the database (8,2097 animal metagenomic metadata in total). The data set can be filtered according to the nine attributes “Species”, “Sample Site”, “DNA Extraction Method”, “Sequencing Platform”, “Library Strategy”, “Study Purpose”, “Creation Date”, “Sex”, and “Age”. All animal metagenomic metadata will be displayed in the search section, including samples that are not marked with geographic location. Users can filter animal metagenomic metadata by selecting single or multiple attributes of interest. After filtering, the metadata of the selected samples can be downloaded as an Excel file. The information about each sample—“Project ID”, “Project Title”, “Experiment ID”, “Sample ID” and “PubMed ID” (if present)—can be hyperlinked to the source database.

“Map”, provides a more intuitive method for reviewing data. Users can directly select samples from the world map according to the location of interest, but only select samples marked with the geographical location. This function includes the rectangular drawing tool to help user to select an area of interest on the map and then select samples. It is worth noting that a single point shown on the map may represent multiple samples. Therefore, users can only use the drawing tools to select samples in a given area. After selecting an area on the map, the selected metagenomic metadata will be displayed in the data set table below the map.

Usage Notes

To better understand the usage of the AnimalMetagenome DB, we give an example. The microbiota is extremely important to the function and health of the gastrointestinal tract³². Porcine epidemic diarrhea is an enteric disease in pig caused by porcine epidemic diarrhea virus (PEDV), which is a member of the family Coronaviridae³³. If we are interested in PEDV caused intestinal dysbiosis in pigs, we can use the AnimalMetagenome DB to find related studies and samples. In the “Search” interface, we can click on “Species” – “Domestic animal” – “Pig”, then select “Sample Site” – “Gut”, and finally select “Study Purpose” – “Health traits” – “Pathogens & Parasites” – “PEDV”. After selecting the filter criteria and clicking “search”, two projects and 44 samples will be shown. We can view all the metadata of the related metagenome and click “Project ID”, “Sample ID”, “Project Title”, “Experiment ID” or “PubMed ID” to open the source database. Finally, we can download the metagenomic data of the selected samples to further analyze the changes of pig intestinal microbiota affected by the porcine epidemic diarrhea virus, as well as the composition and relative abundance of gut microbiota at different taxonomic levels.

Code availability

The code of the AnimalMetagenome DB has been uploaded to GitHub: https://github.com/boyNextDooooor/AnimalMetagenomeDB.git.

References

Hugenholtz, P. & Tyson, G. W. Microbiology: metagenomics. Nature 455(7212), 481–3 (2008).
Article ADS CAS Google Scholar
Singh, B., Bhat, T. K., Kurade, N. P. & Sharma, O. P. Metagenomics in animal gastrointestinal ecosystem: a microbiological and biotechnological perspective. Indian J. Microbiol. 48(2), 216–227 (2008).
Article CAS Google Scholar
Cowan, D. A. Microbial genomes–the untapped resource. Trends Biotechnol. 18(1), 14–16 (2000).
Article CAS Google Scholar
Han, H. et al. From gut microbiota to host appetite: gut microbiota-derived metabolites as key regulators. Microbiome 9(1), 1–16 (2021).
Article Google Scholar
Oakley, B. B. et al. The poultry-associated microbiome: network analysis and farm-to-fork characterizations. PLoS One 8(2), e57190 (2013).
Article ADS CAS Google Scholar
Frank, D. N. Growth and development symposium: promoting healthier humans through healthier livestock: animal agriculture enters the metagenomics era. J. Anim. Sci. 89(3), 835–844 (2011).
Article CAS Google Scholar
Robert, L. Economic burden from health losses due to foodborne illness in the United States. J. Food Protect. 75(1), 123–131 (2012).
Article Google Scholar
Zuo, T. et al. Alterations in Gut Microbiota of Patients With COVID-19 During Time of Hospitalization. Gastroenterology 159(3), 944–955.e8 (2020).
Article CAS Google Scholar
Bentley, D. R. Whole-genome re-sequencing. Curr. Opin. Genet. Dev. 16(6), 545–552 (2006).
Article CAS Google Scholar
Barrett, T. et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 40(D1), D57–D63 (2012).
Article CAS Google Scholar
Kodama, Y., Shumway, M. & Leinonen, R. The Sequence Read Archive: explosive growth of sequencing data. Nucleic Acids Res. 40, D54–D56 (2012).
Article CAS Google Scholar
Harrison, P. W. et al. The European Nucleotide Archive in 2018. Nucleic Acids Res. 47, D84–D88 (2019).
Article CAS Google Scholar
Ley, R. E., Peterson, D. A. & Gordon, J. I. Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell 124(4), 837–848 (2006).
Article CAS Google Scholar
Holman, D. B., Brunelle, B. W., Trachsel, J. & Allen, H. K. Meta-analysis To Define a Core Microbiota in the Swine Gut. mSystems 2(3), e00004–17 (2017).
Article CAS Google Scholar
Zhang, Q. et al. Gut MEGA: a database of the human gut Meta Genome Atlas. Brief. Bioinform. 22(3) (2021).
Kasmanas, J. C. et al. Human Metagenome DB: a public repository of curated and standardized metadata for human metagenomes. Nucleic Acids Res. 49(D1), D743–D750 (2021).
Article CAS Google Scholar
Corrêa, F. B., Saraiva, J. P., Stadler, P. F. & da Rocha, U. N. Terrestrial Metagenome DB: a public repository of curated and standardized metadata for terrestrial metagenomes. Nucleic Acids Res. 48(D1), D626–D632 (2020).
PubMed Google Scholar
Ponsero, A. J. et al. Planet Microbe: a platform for marine microbiology to discover and analyze interconnected ‘omics and environmental data. Nucleic Acids Res. 49(D1), D792–D802 (2021).
Article CAS Google Scholar
Wei, C., Chen, W., Tian, P., Zhang, C. & Zhai, Q. New Progress of Research on Gut Microbiota and Human Health. J. Food Sci. Tech. 17(2), 1–9 (2017).
Google Scholar
Tringe, S. G. et al. Comparative metagenomics of microbial communities. Science 308(5721), 554–557 (2005).
Article ADS CAS Google Scholar
Debelius, J. et al. Tiny microbes, enormous impacts: what matters in gut microbiome studies? Genome Biol. 17(1), 1–12 (2016).
Article Google Scholar
Benson, A. K. et al. Individuality in gut microbiota composition is a complex polygenic trait shaped by multiple environmental and host genetic factors. Proc. Natl. Acad. Sci. USA 107(44), 18933–18938 (2010).
Article ADS CAS Google Scholar
Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7(5), 335–336 (2010).
Article CAS Google Scholar
Elizabeth, K. et al. Bacterial community variation in human body habitats across space and time. Science 326(5960), 1694–1697 (2009).
Article Google Scholar
Motta, V., Luise, D., Bosi, P. & Trevisi, P. Faecal microbiota shift during weaning transition in piglets and evaluation of AO blood types as shaping factor for the bacterial community profile. PLoS One 14(5), e0217001 (2019).
Article CAS Google Scholar
Levin, D. et al. Diversity and functional landscapes in the microbiota of animals in the wild. Science 372(6539), eabb5352 (2021).
Article CAS Google Scholar
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486(7402), 222–227 (2012).
Article ADS CAS Google Scholar
Huson, D. H., Richter, D. C., Mitra, S., Auch, A. F. & Schuster, S. C. Methods for comparative metagenomics. BMC bioinformatics 10(1), 1–10 (2009).
Google Scholar
Pasolli, E. et al. Accessible, curated metagenomic data through ExperimentHub. Nat. Methods 14(11), 1023–1024 (2017).
Article CAS Google Scholar
Federhen, S. The NCBI taxonomy database. Nucleic Acids Res. 40(D1), D136–143 (2012).
Article CAS Google Scholar
Ruirui, H. et al. AnimalMetagenome DB: a database for animal metagenomes. figshare https://doi.org/10.6084/m9.figshare.19728619.v1 (2022).
Turnbaugh, P. J. et al. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature 444(7122), 1027–1031 (2006).
Article ADS Google Scholar
Hanke, D. et al. Porcine Epidemic Diarrhea in Europe: In-Detail Analyses of Disease Dynamics and Molecular Epidemiology. Viruses 9(7), 177 (2017).
Article Google Scholar

Download references

Acknowledgements

Thanks to all contributors to this work. The Bingtuan Science and Technology Project [2021CB033], the foundation of state key laboratory for sheep genetic improvement and healthy production [2021ZD08 and MYSKLKF201901], the Third Xinjiang Scientific Expedition Program [2021xjkk0605 and 2021xjkk0504] and the Young innovative talents [2017CB003, CXRC201603 and CXRC201806] supported this work.

Author information

These authors contributed equally: Ruirui Hu, Rui Yao, Lei Li.

Authors and Affiliations

State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Shihezi, Xinjiang, China
Ruirui Hu, Limin Wang, Yunfeng Zhang, Ping Zhou & Shengwei Hu
College of Life Sciences, Shihezi University, Shihezi, Xinjiang, 832003, China
Ruirui Hu, Rui Yao, Lei Li, Yueren Xu, Bingbing Lei, Yunjiao Lei, Cunyuan Li, Xiaoyue Li, Kaiping Liu, Yue Wang, Yuying Cui, Jihong Dai, Wei Ni & Shengwei Hu
College of Information Science and Technology, Shihezi University, Shihezi, Xinjiang, 832003, China
Guohao Tang, Haowei Liang & Baohua Yu
College of Animal Science and Technology, Shihezi University, Shihezi, Xinjiang, 832003, China
Cunyuan Li

Authors

Ruirui Hu
View author publications
You can also search for this author in PubMed Google Scholar
Rui Yao
View author publications
You can also search for this author in PubMed Google Scholar
Lei Li
View author publications
You can also search for this author in PubMed Google Scholar
Yueren Xu
View author publications
You can also search for this author in PubMed Google Scholar
Bingbing Lei
View author publications
You can also search for this author in PubMed Google Scholar
Guohao Tang
View author publications
You can also search for this author in PubMed Google Scholar
Haowei Liang
View author publications
You can also search for this author in PubMed Google Scholar
Yunjiao Lei
View author publications
You can also search for this author in PubMed Google Scholar
Cunyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyue Li
View author publications
You can also search for this author in PubMed Google Scholar
Kaiping Liu
View author publications
You can also search for this author in PubMed Google Scholar
Limin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yunfeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuying Cui
View author publications
You can also search for this author in PubMed Google Scholar
Jihong Dai
View author publications
You can also search for this author in PubMed Google Scholar
Wei Ni
View author publications
You can also search for this author in PubMed Google Scholar
Ping Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Baohua Yu
View author publications
You can also search for this author in PubMed Google Scholar
Shengwei Hu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.H., W.N. and B.Y. conceived, designed, and supervised the experiments. R.H., R.Y., P.Z. and L.L. performed experiments, data processing, and data statistics. B.Y., H.L. and K.W. developed the database. Y.X., B.L., Y.L., C.L., X.L., K.L., L.W., Y.Z., Y.W., Y.C. and J.D. took part in data processing and data statistics. R.H., R.Y., L.L., W.N. and S.H. wrote the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ping Zhou, Baohua Yu or Shengwei Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hu, R., Yao, R., Li, L. et al. A database of animal metagenomes. Sci Data 9, 312 (2022). https://doi.org/10.1038/s41597-022-01444-w

Download citation

Received: 31 January 2022
Accepted: 01 June 2022
Published: 16 June 2022
DOI: https://doi.org/10.1038/s41597-022-01444-w
Springer Nature Limited

This article is cited by

Big Data for a Small World: A Review on Databases and Resources for Studying Microbiomes
- Pratyay Sengupta
- Shobhan Karthick Muthamilselvi Sivabalan
- Karthik Raman
Journal of the Indian Institute of Science (2023)
The AnimalAssociatedMetagenomeDB reveals a bias towards livestock and developed countries and blind spots in functional-potential studies of animal-associated microbiomes
- Anderson Paulo Avila Santos
- Muhammad Kabiru Nata’ala
- Ulisses Rocha
Animal Microbiome (2023)

A database of animal metagenomes

Abstract

Similar content being viewed by others

Metazen – metadata capture for metagenomes

The AnimalAssociatedMetagenomeDB reveals a bias towards livestock and developed countries and blind spots in functional-potential studies of animal-associated microbiomes

Computational Metagenomics: State-of-the-Art, Facts and Artifacts

Background & Summary