Twnbiome: a public database of the healthy Taiwanese gut microbiome

Chattopadhyay, Amrita; Lee, Chien-Yueh; Lee, Ya-Chin; Liu, Chiang-Lin; Chen, Hsin-Kuang; Li, Yung-Hua; Lai, Liang-Chuan; Tsai, Mong-Hsun; Ni, Yen-Hsuan; Chiu, Han-Mo; Lu, Tzu-Pin; Chuang, Eric Y.

doi:10.1186/s12859-023-05585-6

Twnbiome: a public database of the healthy Taiwanese gut microbiome

Database
Open access
Published: 14 December 2023

Volume 24, article number 474, (2023)
Cite this article

Download PDF

You have full access to this open access article

BMC Bioinformatics Aims and scope Submit manuscript

Twnbiome: a public database of the healthy Taiwanese gut microbiome

Download PDF

Amrita Chattopadhyay¹,
Chien-Yueh Lee²,
Ya-Chin Lee³,
Chiang-Lin Liu⁴,
Hsin-Kuang Chen⁵,
Yung-Hua Li⁴,
Liang-Chuan Lai^1,6,
Mong-Hsun Tsai^1,5,7,
Yen-Hsuan Ni⁸,
Han-Mo Chiu^8,9,
Tzu-Pin Lu^1,3,10 &
…
Eric Y. Chuang^1,4,11,12

1464 Accesses
1 Citation
8 Altmetric
1 Mention
Explore all metrics

Abstract

With new advances in next generation sequencing (NGS) technology at reduced costs, research on bacterial genomes in the environment has become affordable. Compared to traditional methods, NGS provides high-throughput sequencing reads and the ability to identify many species in the microbiome that were previously unknown. Numerous bioinformatics tools and algorithms have been developed to conduct such analyses. However, in order to obtain biologically meaningful results, the researcher must select the proper tools and combine them to construct an efficient pipeline. This complex procedure may include tens of tools, each of which require correct parameter settings. Furthermore, an NGS data analysis involves multiple series of command-line tools and requires extensive computational resources, which imposes a high barrier for biologists and clinicians to conduct NGS analysis and even interpret their own data. Therefore, we established a public gut microbiome database, which we call Twnbiome, created using healthy subjects from Taiwan, with the goal of enabling microbiota research for the Taiwanese population. Twnbiome provides users with a baseline gut microbiome panel from a healthy Taiwanese cohort, which can be utilized as a reference for conducting case-control studies for a variety of diseases. It is an interactive, informative, and user-friendly database. Twnbiome additionally offers an analysis pipeline, where users can upload their data and download analyzed results. Twnbiome offers an online database which non-bioinformatics users such as clinicians and doctors can not only utilize to access a control set of data, but also analyze raw data with a few easy clicks. All results are customizable with ready-made plots and easily downloadable tables. Database URL: http://twnbiome.cgm.ntu.edu.tw/.

View this article's peer review reports

MetaGeneBank: a standardized database to study deep sequenced metagenomic data from human fecal specimen

Article Open access 30 September 2021

Progress of analytical tools and techniques for human gut microbiome research

Article 28 September 2018

A geographically-diverse collection of 418 human gut microbiome pathway genome databases

Article Open access 11 April 2017

Background

With the recent advancements in next generation sequencing (NGS) technology, the study of microbiota has been growing rapidly [1,2,3,4]. Traditionally, microbial genomics utilized cultivation-based methods to study genomes of microbes in the environment [5]. However, such traditional methods miss out on capturing much of the microbial diversity, as it is challenging to culture large groups of microorganisms properly. With the advent of 16S ribosomal RNA (16S rRNA) sequencing techniques, microbiota-related research, databases, and publications have all been growing exponentially [6]. Moreover, since 2008, with the initiation of the Human Microbiome Project (HMP), which has already released more than 2,200 microbial genome sequences isolated from various human body sites, including the feces, nasal cavity, throat, gut, and so on (https://portal.hmpdacc.org), microbiome research has undergone immense and rapid development [7].

Among all sites, the gut microbiota from the gastrointestinal tract are considered to be one of the biggest ensembles of microbial flora in the human body. The adult human gut contains about 10¹⁴ bacterial cells from more than 1,000 different bacterial species, which are involved in numerous human metabolic and physiological functions [8,9,10]. The host benefits from the homeostatic balance provided by the bacteria; however, a change in the microbial composition can lead to a severe imbalance between the beneficial and potentially pathogenic bacteria, thus making the gut vulnerable to microbial alterations. This imbalance, also known as “dysbiosis”, can cause physical symptoms or even diseases [11, 12], which have spawned a plethora of research endeavors towards unraveling the relationship between different diseases and microbiota [13,14,15,16]. Such studies aimed to provide descriptions of characteristic alterations in the composition of the microbiome that may prove useful as diagnostic biomarkers [17]. Furthermore, to elucidate the therapeutic potential of the human gut microbiome through manipulation, administration of a solution of fecal matter from a donor into the intestinal tract of a recipient has been used to directly change the recipient’s microbial composition in attempts to confer a health benefit [18,19,20]. This procedure, known as “fecal microbiota transplantation”, has already been extrapolated from animal models to human beings and has been used to successfully treat recurrent Clostridium difficile infection [21]. Preliminary findings indicate that it may also carry therapeutic potential for other conditions such as inflammatory bowel disease, obesity, metabolic syndrome, and functional gastrointestinal disorders. Therefore, microbiota have implications for human health, making them potential biomarkers for clinical applications.

It is well-known that the microbiome is diverse and heterogeneous in different individuals, even in the same tissue type. Several important factors, such as diet, disease phenotype, and environmental exposures, can have great impact on the abundance and composition of the microbiome [22, 23]. According to a recent study, the geographical location of the recruited subjects had a greater effect on human gut microbiota variations than disease phenotype in the constructed model [24]. This suggests that cross-national studies of potential microbiota biomarkers is not a good strategy. Therefore, it becomes challenging to identify consistent signatures as the geographic scale becomes larger, making it more difficult to establish a standard or a normal reference baseline that would be applicable globally [25]. Hence, as the microbiome segregated across ethnic and geographic populations could potentially lead to differences in disease severity [26, 27], developing population-specific databases containing the microbiome data from healthy individuals is a priority.

Furthermore, combining the demand for fast computation speed and large data storage, NGS analyses are mostly performed remotely on powerful servers using command-line interfaces. Therefore, the tools involved are often developed without a graphical user interface (GUI) to maximize computation efficiency. This kind of terminal-based environment imposes a considerable barrier, especially for biologists and clinicians not trained in bioinformatics. Furthermore, the output of most analysis tools is log files or other file formats that are friendly for computers to parse but may not be easily interpreted by humans without data cleaning and transformation. Therefore, for convenience of access and interpretation, to facilitate specific queries by users without advanced bioinformatics skills, creating a user-friendly database from metagenome resources is a current need in the field of microbiome research.

In this study we established the first public gut microbiota database, from healthy subjects in Taiwan, titled Twnbiome (http://Twnbiome.cgm.ntu.edu.tw), that enables microbiome research for populations in Taiwan. Researchers can easily access and compare the composition of microbiota from diseased subjects using Twnbiome as the healthy reference dataset. Twnbiome further provides a 16S rRNA analysis pipeline and statistically comparable metrics with a GUI, for easy access and convenient interpretation.

Construction and content

Database overview

Figure 1 provides an overview of the Twnbiome database. The database primarily offers two main utilities: (i) Twnbiome healthy baseline and (ii) User uploaded data analysis. The Twnbiome database provides users with information regarding the microbiota composition of healthy Taiwanese subjects. The users can further explore all information under different classifications such as gender, age, body mass index (BMI), and so on (Table 1), through 3 different functions: (a) overview, (b) summary, and (c) browser (Fig. 2). The ‘Data Analysis’ utility requires users to upload their 16S ribosomal RNA (rRNA) sequence data, where the built-in bioinformatics pipeline is utilized to conduct analysis. Twnbiome was developed with Django 2.2 and runs on Python 3.6.7 and MySQL 5.7.29.

Table 1 Demographic characteristics of 119 healthy Taiwanese subjects in Twnbiome

Full size table

Database content

Twnbiome provides a comprehensive gut microbiome landscape of 119 healthy subjects from Taiwan. The healthy volunteers were recruited via the Health Management Center of National Taiwan University Hospital (NTUH), and all participants provided informed consent during their routine health checkups. Fecal samples were collected from each volunteer and all information from the subjects was recorded by a trained interviewer. The study was approved by the institutional review board of NTUH (#201801085RINB).

Subject inclusion and clinical information

Individuals aged 18 years and above, who self-reported as healthy and were of Taiwanese Han Chinese ancestry, were recruited. Clinical information from all subjects was obtained from NTUH. Additionally, each subject was asked to self-report information on their physical activity, meal timings, probiotic/nutritional supplement intake, diet type, frequency of eating out, and sleep/wake regularity (lifestyle), through a customized questionnaire. Participants self-reported as healthy if they did not have any of the following diseases: autoimmune disease (rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, hyper/hypothyroidism, psoriasis, type I diabetes, and multiple sclerosis), neuropsychiatric disease (mood disorders, schizophrenia, autism spectrum disorder), gastrointestinal disease (diarrhea), metabolic diseases (hypertension, atherosclerosis, type II diabetes, non-alcoholic fatty acid), cancers, or other major illnesses. Other basic demographic information including age, sex, and body mass index (BMI) was reported. Considering the potential effects of BMI on microbiota, subjects with BMI < 18.5 (underweight) or BMI ≥ 27 (obese) were excluded from the study. Lastly, subjects with recent drug usage (less than 3 months), including antibiotics, anti-hypertensives, hypolipidemic agents, steroids, antipsychotic and gastrointestinal drugs, were further excluded. Finally, 213 subjects passed all inclusion criteria, out of which 119 fulfilled the criteria of healthy subjects (Table 1).

Biosample collections and 16s rRNA sequencing

One-gram fecal samples were collected with user-friendly sterilized kits, and the sample transition process was performed by transferring the samples to -80 ℃ refrigerators immediately, for undergoing cold-chain transportation. QIAamp DNA Stool Mini Kit (QIAGEN Inc., USA) was used to extract the microbiota DNA via lysis, and purification procedures were conducted following the manufacturer’s protocol. For quality control, all DNA samples were evaluated by NanoDrop Microvolume Spectrophotometers. Sequencing of 16S rRNA was performed with the Illumina HiSeq platform, and library preparation was done according to the manufacturer’s instructions (Illumina, USA). Briefly, 12.5 ng of DNA was used for PCR (polymerase chain reaction) amplification of the V3 and V4 regions of the 16S rRNA gene.

The PCR products were purified with AMPure XP beads (Beckman Coulter, USA) and subjected to a secondary PCR reaction with primers from the Nextera XT Index kit (Illumina, USA) by adding dual indices and Illumina sequencing adapters. The final libraries (~ 630 bp) were purified with AMPure XP beads and sequenced by the Illumina HiSeq machine with paired-end sequencing (2*300 bp).

Bioinformatic analysis

Once the 16S rRNA sequencing experiments were complete, FASTQ files were generated by HiSeq Report and sequence quality control was conducted using FastQC software (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) on the raw FASTQ files. For further analysis, the analysis pipeline in Quantitative Insights Into Microbial Ecology (QIIME, version 1.9.1) was used [28]. First, PEAR (version 0.9.8) merged the paired-end reads [29], where the reads that did not overlap with more than 30 bp were discarded. The merged sequences then underwent the filtering step of QIIME for quality control with the following criteria: (1) maximum 3 consecutive low-quality base calls were allowed before truncating the reads, (2) at least 75% of consecutive high-quality base calls were included in a read, and (3) no ambiguous characters were allowed in the sequence, while setting < Q20 as the low quality threshold. After filtering, the sequences were clustered with USEARCH [30] to identify the operational taxonomic units (OTUs) with 97% similarity to the reference from the Greengenes taxonomic database (May 2013 version, http://greengenes.lbl.gov/). Then the OTUs were classified into different taxonomic levels: kingdom, phylum, class, order, family, genus, and species. The OTU information was then used to calculate the relative abundance and the bacterial diversity using alpha and beta diversity indexes. The alpha diversity was mainly calculated using the observed OTUs, whereas the beta diversity was calculated using two different methods: unweighted and weighted UniFrac. Furthermore, the enterotype was determined by previously established methods [31].

Features of Twnbiome database

Figure 2 provides a screen shot of the welcome page of the Twnbiome database (http://Twnbiome.cgm.ntu.edu.tw). The content of the database is offered to users through various tabs on the navigation bar. The information includes (1) exclusion criteria of our recruited subjects; (2) the basic quality-control information for the sequencing data, including raw reads, effective reads, and Q30 value; (3) detailed sample information and classifications using pie charts; (4) beta diversity plots; (5) detailed taxonomic composition under different classifications; and (6) browsers for specific samples and taxa searches. Details of the sample exclusion criteria are provided under the “About” tab. The “Overview” tab includes quality control distribution plots and gut microbiota composition pie charts for healthy subjects, with classifications based on gender, age, BMI, physical activity, alpha-diversity phylum level, enterotypes, probiotic and supplement intake, diet type, meal on-time frequency, frequency of eating out, and lifestyle. Each pie chart is coded with an interactive function that allows users to see the exact number of the recruited subjects. The "Overview" section further offers the beta diversity principal co-ordinate analysis (PCoA) plot of the 119 healthy subjects, to visualize similarities or dissimilarities between the samples. The PCoA plots are also interactive, can be rotated or zoomed in/out, and can be modified by adding legends. In the “Summary” section, a detailed taxonomic composition, from phylum to genus, among the groups with different gender or BMI is provided. Users can use the drop-down list to choose the taxa of interest to obtain the detailed composition. Finally, in the “Browser” section, specific taxa searches are offered where the users can directly type the name of the taxa (Kingdom, Phylum, Class, Order, Family, Genus, and Species) that they are interested in to explore the average and median abundance among the healthy subjects.

16S rRNA sequencing data upload and analysis

For the advancement of metagenomic research among the Taiwanese population, the database further offers a user-friendly interface in the “Tools” section for users to analyze their own data. In the upload data section, the user is required to upload their FASTQ files, and when the analysis is done, the user receives a link to the results for them to explore and download. All analyses are conducted using the same protocol as described before in the ‘methods’ section.

Utility and discussion

Subject demography

Table 1 gives full demographic details of the subjects in the database. The majority of participants were female (68.07%) and the age of the participants ranged between 18 and 94 years, with 55 adults, 45 middle aged (45 to < 65), and 19 old aged (65 to < 95) participants with a mean age of 47.1 years. Participants were further stratified based on their BMI, physical activity, dietary habits, and lifestyle/sleep patterns. A majority of subjects were in the normal range for BMI (68.07%), had never used probiotics/nutritional supplements (57.98%), reported a balanced diet (59.66%) with regular meal times (51.26%), and had abnormal sleep patterns (58.82%).

Twnbiome database

Twnbiome was created with the expectation of enhancing microbiome research in Taiwan and facilitating the improvement of patients suffering from microbial-related health problems. Background information on the project and the subject inclusion/exclusion criteria are provided in the “About” section of the web site (Fig. 2).

Overview section

This section covers the quality plots for the sequencing reads (Fig. 3a), pie charts giving a pictorial description of subject demography as described before (Fig. 3b) under different classifications, a sample browser for users to stratify samples based on the demographics and lifestyle factors and beta diversity plots depicting the amount of species change among subjects with various stratifications (Fig. 3c). The average number of raw reads per sample was 169,252, the average Q30 distribution was 87.42%, and the average number of effective reads was 120,666. Overall, the most abundant phylum among healthy Taiwanese subjects was Firmicutes (51.76%) followed by Bacteroidetes (35.25%) (Fig. 3b). The dominating presence of Firmicutes and Bacteroidetes in the gut was consistent with prior findings from studies focusing on healthy microbiome composition [32, 33]. Specifically, when stratified by enterotype composition, it was found that the healthy subjects were segregated into three different enterotypes enriched with Bacteroides, Prevotella, and Ruminococcus (Fig. 3b), again consistent with prior enterotype research findings [34, 35]. Furthermore, the beta diversity PCoA plots for the study subjects displayed no visible significant clusters when stratified by age (Fig. 3c) or any other variable (results not shown), indicating the homogeneity of the healthy subjects that were recruited for this study.

Summary section

Figure 4a displays a screenshot of the distribution of microbiome abundance at the genus level for the study subjects when stratified by sex and BMI. These figures can be accessed under the “Summary” section of the Twnbiome web site. The user can similarly select any of the other taxonomic levels (phylum, class, order, or family) to get the distribution of microbiome abundance for both sex and BMI stratification. The user can move the mouse over each of the colored bars to view the name of the corresponding microbiome and its relative abundance. Bacteroides, Faecalbacterium, Acidobacteria, Chlostridiales, Lachnospiraceae, Prevotella, Megamonas, Blautia, Phascolarotobacterium, Megasphaera, Ruminococcus, and Bifidobacterium were among the most abundant genera that were observed in both males/females and normal/overweight groups.

Browser section: example 1

Figure 4b provides a screenshot of the “Browser” section of the Twnbiome database. The figure shows the search results for Bacteroides at the genus level, ranked by average abundance. The web page displays both average abundance and median abundance for Bacteroides as > 20%, consistent with the findings in the “Summary” section, thus making it the most abundant genera among healthy Taiwanese subjects. Furthermore, Plebeius and Uniformis, as the top abundant species for genus Bacterioides, have been found to exist in the guts of healthy Asian individuals and have been extracted for various research purposes in prior studies [36,37,38].

Data analysis: example 2

Figure 5 gives an example of the procedure for conducting data analysis on a user-uploaded data, along with the analysis output. The user needs to first click on the “Tools” link at top-right corner of the welcome page (Fig. 2), and will be taken to a data upload page (Fig. 5a) where the user is required to upload an e-mail address and the paired-end FASTQ files. On clicking the submit button, the user will be forwarded to the next page (Fig. 5b), which provides a notification of the status of the submission. Once the analysis is complete the user will receive an e-mail with a link to download the results of the analysis. The results include the information of the data analyzed (Fig. 5c) along with an OTU table containing the taxa information and the corresponding relative abundance (Fig. 5d). Users can download the results as .xls or .csv tables with a single click.

Research on microbiota is on the rise like never before. There are more than 50,000 research articles, just on the gut microbiome, in the Web of Science database since 2000, and as of December 18, 2019, there are already more than 9,500 publications in 2019, a growth of about 30-fold since 2000, sixfold since 2010, and twofold since 2015 [39]. A search in the Web of Science database further reveals a total of 16,716 research articles since 2020 and a total of 5609 publications in 2023 alone (as of 13th November 2023). With such a huge accumulation of research, it is common knowledge that microbiotic composition is affected by many factors, including diet, geographic location, genetics, antibiotics, lifestyle, and so on. Therefore, for conducting microbiome studies under a case-control design, it is very important to have the right comparative groups to explore the potential targets. This along with recent advances in high-throughput sequencing techniques at affordable costs, has led to an enormous amount of gut microbiome data being generated and curated [40]. One significantly important human gut microbiome database is GMrepo [41, 42], which is manually curated from 33 sources with special focus on disease markers. It allows cross-dataset or cross phenotype comparisons of identified markers enabling systematic demonstration of consistent and inconsistent disease-associated microbial markers across datasets. Human gut microbiome atlas (HGMA) (https://www.microbiomeatlas.org/) is another repository that contains health and disease datasets from 20 different countries across five continents and provides region enriched microbial species for different geographical locations. However, both GMrepo and HGMA yet lacks an analysis platform, which can enable users to upload data to conduct metagenomics analyses. Twnbiome is one of the first public databases that shares the microbiota information from more than a hundred healthy Taiwanese individuals and provides population specific enriched species of the gut microbiota. The database, which is still growing, can not only be used to query and obtain summary level data based on phylum, class, order, family, genus, species, but it is the first time that the comprehensive summary statistics of microbiome diversity related to age, sex, lifestyle, and other factors are provided for healthy Taiwanese individuals. Users can further customize the figures and results through interactive platforms for research purposes. It is an unprecedented effort in microbiome research with accurate data acquisition, an established pipeline for data analyses, and the ability to organize, store, access, and share/integrate processed datasets. The release of the database would firmly accelerate microbiota research in Taiwan.

Conclusion

Twnbiome provides users with a baseline gut microbiome panel from a healthy Taiwanese cohort, which can be utilized as a reference control for conducting case-control studies for a variety of diseases. Twnbiome is an interactive, informative, and user-friendly database that not only provides users with a control dataset, ready to be utilized, but also offers an analysis pipeline, which non-bioinformatics users such as clinicians and doctors can utilize to conduct their analysis with a few clicks, and obtain customizable ready-made plots and easily downloadable tables.

Data availability

The database contains all summary statistics and analyzed findings. The raw data can be available on reasonable request from the corresponding author.

Abbreviations

NGS:: Next generation sequencing
rRNA:: Ribosomal RNA
HMP:: Human microbiome project
GUI:: Graphical user interface
NTUH:: National Taiwan University Hospital
BMI:: Body mass index
QIIME:: Quantitative insights into microbial ecology
OTU:: Operational taxonomic units
PCoA:: Principal co-ordinate analysis

References

Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, et al. Environmental genome shotgun sequencing of the Sargasso Sea. Science. 2004;304(5667):66–74.
Article CAS PubMed Google Scholar
Thomas T, Gilbert J, Meyer F. Metagenomics-a guide from sampling to data analysis. Microb Inform Exp. 2012;2(1):1–12.
Article Google Scholar
Jovel J, Patterson J, Wang W, Hotte N, O’Keefe S, Mitchel T, et al. Characterization of the gut microbiome using 16S or shotgun metagenomics. Front Microbiol. 2016;7:459.
Article PubMed PubMed Central Google Scholar
Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, et al. Moving pictures of the human microbiome. Genome Biol. 2011;12(5):1–8.
Article Google Scholar
Hugenholtz P, Goebel BM, Pace NR. Impact of culture-independent studies on the emerging phylogenetic view of bacterial diversity. J Bacteriol. 1998;180(18):4765–74.
Article CAS PubMed PubMed Central Google Scholar
Jones S. Trends in microbiome research. Nature Publishing Group; 2013.
Book Google Scholar
Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. The human microbiome project. Nature. 2007;449(7164):804–10.
Article CAS PubMed PubMed Central Google Scholar
Quigley EM. Gut bacteria in health and Disease. Gastroenterol Hepatol (N Y). 2013;9(9):560.
PubMed Google Scholar
DeGruttola AK, Low D, Mizoguchi A, Mizoguchi E. Current understanding of dysbiosis in Disease in human and animal models. Inflamm Bowel Dis. 2016;22(5):1137–50.
Article PubMed Google Scholar
Clarke G, Stilling RM, Kennedy PJ, Stanton C, Cryan JF, Dinan TG. Minireview: gut microbiota: the neglected endocrine organ. Mol Endocrinol. 2014;28(8):1221–38.
Article PubMed PubMed Central Google Scholar
Moos WH, Faller DV, Harpp DN, Kanara I, Pernokas J, Powers WR, et al. Microbiota and neurological disorders: a gut feeling. BioRes open Access. 2016;5(1):137–45.
Article CAS PubMed PubMed Central Google Scholar
Tamboli CP, Neut C, Desreumaux P, Colombel JF. Dysbiosis in inflammatory bowel Disease. Gut. 2004;53(1):1–4.
Article CAS PubMed PubMed Central Google Scholar
Stecher B, Hardt W-D. The role of microbiota in Infectious Disease. Trends Microbiol. 2008;16(3):107–14.
Article CAS PubMed Google Scholar
Round JL, Mazmanian SK. The gut microbiota shapes intestinal immune responses during health and Disease. Nat Rev Immunol. 2009;9(5):313–23.
Article CAS PubMed PubMed Central Google Scholar
Scheperjans F, Aho V, Pereira PA, Koskinen K, Paulin L, Pekkonen E, et al. Gut microbiota are related to Parkinson’s Disease and clinical phenotype. Mov Disord. 2015;30(3):350–8.
Article PubMed Google Scholar
Kamada N, Seo S-U, Chen GY, Núñez G. Role of the gut microbiota in immunity and inflammatory Disease. Nat Rev Immunol. 2013;13(5):321–35.
Article CAS PubMed Google Scholar
Berry D, Reinisch W. Intestinal microbiota: a source of novel biomarkers in inflammatory bowel Diseases? Best Pract Res Clin Gastroenterol. 2013;27(1):47–58.
Article CAS PubMed Google Scholar
Kang D-W, Adams JB, Gregory AC, Borody T, Chittick L, Fasano A, et al. Microbiota transfer therapy alters gut ecosystem and improves gastrointestinal and autism symptoms: an open-label study. Microbiome. 2017;5(1):1–16.
Article Google Scholar
Smits LP, Bouter KE, de Vos WM, Borody TJ, Nieuwdorp M. Therapeutic potential of fecal microbiota transplantation. Gastroenterology. 2013;145(5):946–53.
Article PubMed Google Scholar
Gupta S, Allen-Vercoe E, Petrof EO. Fecal microbiota transplantation: in perspective. Th Adv Gastroenterol. 2016;9(2):229–39.
Article Google Scholar
Youngster I, Russell GH, Pindar C, Ziv-Baran T, Sauk J, Hohmann EL. Oral, capsulized, frozen fecal microbiota transplantation for relapsing Clostridium difficile Infection. JAMA. 2014;312(17):1772–8.
Article CAS PubMed Google Scholar
Zmora N, Suez J, Elinav E. You are what you eat: diet, health and the gut microbiota. Nat Rev Gastroenterol Hepatol. 2019;16(1):35–56.
Article CAS PubMed Google Scholar
Rothschild D, Weissbrod O, Barkan E, Kurilshikov A, Korem T, Zeevi D, et al. Environment dominates over host genetics in shaping human gut microbiota. Nature. 2018;555(7695):210–5.
Article CAS PubMed Google Scholar
He Y, Wu W, Zheng H-M, Li P, McDonald D, Sheng H-F, et al. Regional variation limits applications of healthy gut microbiome reference ranges and Disease models. Nat Med. 2018;24(10):1532–5.
Article CAS PubMed Google Scholar
Gupta VK, Paul S, Dutta C. Geography, ethnicity or subsistence-specific variations in human microbiome composition and diversity. Front Microbiol. 2017;8:1162.
Article PubMed PubMed Central Google Scholar
Deschasaux M, Bouter KE, Prodan A, Levin E, Groen AK, Herrema H, et al. Depicting the composition of gut microbiota in a population with varied ethnic origins but shared geography. Nat Med. 2018;24(10):1526–31.
Article CAS PubMed Google Scholar
King CH, Desai H, Sylvetsky AC, LoTempio J, Ayanyan S, Carrie J, et al. Baseline human gut microbiota profile in healthy people and standard reporting template. PLoS ONE. 2019;14(9):e0206484.
Article CAS PubMed PubMed Central Google Scholar
Lawley B, Tannock GW. Analysis of 16S rRNA gene amplicon sequences using the QIIME software package. Berlin: Springer Oral Biology; 2017. pp. 153–63.
Google Scholar
Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: a fast and accurate Illumina paired-end reAd mergeR. Bioinformatics. 2014;30(5):614–20.
Article CAS PubMed Google Scholar
Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26(19):2460–1.
Article CAS PubMed Google Scholar
Le Chatelier E, Nielsen T, Qin J, Prifti E, Hildebrand F, Falony G, et al. Richness of human gut microbiome correlates with metabolic markers. Nature. 2013;500(7464):541–6.
Article PubMed Google Scholar
Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464(7285):59–65.
Article CAS PubMed PubMed Central Google Scholar
Huttenhower C, Gevers D, Knight R, Abubucker S, Badger JH, Chinwalla AT, et al. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486(7402):207.
Article CAS Google Scholar
Arumugam M, Raes J, Pelletier E, Le Paslier D, Yamada T, Mende DR, et al. Enterotypes of the human gut microbiome. Nature. 2011;473(7346):174–80.
Article CAS PubMed PubMed Central Google Scholar
Costea PI, Hildebrand F, Arumugam M, Bäckhed F, Blaser MJ, Bushman FD, et al. Enterotypes in the landscape of gut microbial community composition. Nat Microbiol. 2018;3(1):8–16.
Article CAS PubMed Google Scholar
Wang T, Cai G, Qiu Y, Fei N, Zhang M, Pang X, et al. Structural segregation of gut microbiota between Colorectal cancer patients and healthy volunteers. ISME J. 2012;6(2):320–9.
Article CAS PubMed Google Scholar
Benítez-Páez A, Gómez del Pulgar EM, Sanz Y. The glycolytic versatility of Bacteroides uniformis CECT 7771 and its genome response to oligo and polysaccharides. Front Cell Infect Microbiol. 2017;7:383.
Article PubMed PubMed Central Google Scholar
Kitahara M, Sakamoto M, Ike M, Sakata S, Benno Y. Bacteroides plebeius sp. nov. and Bacteroides coprocola sp. nov., isolated from human faeces. Int J Syst Evol Microbiol. 2005;55(5):2143–7.
Article CAS PubMed Google Scholar
Li D, Gao C, Zhang F, Yang R, Lan C, Ma Y et al. Seven facts and five initiatives for gut microbiome research. Protein Cell. 2020:1–10.
Sengupta P, Sivabalan SKM, Mahesh A, Palanikumar I, Kuppa Baskaran DK, Raman K. Big data for a small world: a review on databases and resources for studying microbiomes. J Indian Inst Sci. 2023:1–17.
Dai D, Zhu J, Sun C, Li M, Liu J, Wu S, et al. GMrepo v2: a curated human gut microbiome database with special focus on disease markers and cross-dataset comparison. Nucleic Acids Res. 2022;50(D1):D777–D84.
Article CAS PubMed Google Scholar
Wu S, Sun C, Li Y, Wang T, Jia L, Lai S, et al. GMrepo: a database of curated and consistently annotated human gut metagenomes. Nucleic Acids Res. 2020;48(D1):D545–D53.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We acknowledge National Taiwan University’s Centers of Genomic and Precision Medicine for conducting the next-generation sequencing. We thank Melissa Stauffer, Ph.D. for English editing our manuscript.

Funding

This work was partly supported by Grant No. 111RB07, Central Taiwan Science Park Bureau, National Science and Technology Council; Research and Development Center for Medical Devices, National Taiwan University (112KKZA3T1); and Population Health and Welfare Research Center from Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan (NTU-112L9004) .

Author information

Authors and Affiliations

Bioinformatics and Biostatistics Core, Centers of Genomic and Precision Medicine, National Taiwan University, Taipei, Taiwan
Amrita Chattopadhyay, Liang-Chuan Lai, Mong-Hsun Tsai, Tzu-Pin Lu & Eric Y. Chuang
Department of Biomedical Engineering, China Medical University, Taichung, Taiwan
Chien-Yueh Lee
Department of Public Health, Institute of Health Data Analytics and Statistics, National Taiwan University, Taipei, Taiwan
Ya-Chin Lee & Tzu-Pin Lu
Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
Chiang-Lin Liu, Yung-Hua Li & Eric Y. Chuang
Center for Biotechnology, National Taiwan University, Taipei, Taiwan
Hsin-Kuang Chen & Mong-Hsun Tsai
Graduate Institute of Physiology, National Taiwan University, Taipei, Taiwan
Liang-Chuan Lai
Institute of Biotechnology, National Taiwan University, Taipei, Taiwan
Mong-Hsun Tsai
College of Medicine, National Taiwan University, Taipei, Taiwan
Yen-Hsuan Ni & Han-Mo Chiu
Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
Han-Mo Chiu
Institute of Health Data Analytics and Statistics, National Taiwan University, Taipei, Taiwan
Tzu-Pin Lu
Biomedical Technology and Device Research Laboratories, Industrial Technology Research Institute, Hsinchu, Taiwan
Eric Y. Chuang
Division Research and Development Center for Medical Devices, National Taiwan University, Taipei, Taiwan
Eric Y. Chuang

Authors

Amrita Chattopadhyay
View author publications
You can also search for this author in PubMed Google Scholar
Chien-Yueh Lee
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Chin Lee
View author publications
You can also search for this author in PubMed Google Scholar
Chiang-Lin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hsin-Kuang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yung-Hua Li
View author publications
You can also search for this author in PubMed Google Scholar
Liang-Chuan Lai
View author publications
You can also search for this author in PubMed Google Scholar
Mong-Hsun Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Yen-Hsuan Ni
View author publications
You can also search for this author in PubMed Google Scholar
Han-Mo Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Tzu-Pin Lu
View author publications
You can also search for this author in PubMed Google Scholar
Eric Y. Chuang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.P.L, E.Y.C conceived the study. Y.C.L., H.K.C., Y.H.L., Y.H.N., and H.M.C. collected samples and curated the data. A.C. and C.Y.L. did the formal analysis and interpreted the results. C.Y.L., C.L.L. and A.C. developed the database. T.P.L, E.Y.C, M.H.T, and L.C.L. provided the administrative support and supervised the project. A.C prepared the manuscript. A.C., C.Y.L. and T.P.L revised the manuscript. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Tzu-Pin Lu or Eric Y. Chuang.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the institutional review board of National Taiwan University Hospital (#201801085RINB) and all patients gave informed consent. All methods were performed in accordance with the relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Chattopadhyay, A., Lee, CY., Lee, YC. et al. Twnbiome: a public database of the healthy Taiwanese gut microbiome. BMC Bioinformatics 24, 474 (2023). https://doi.org/10.1186/s12859-023-05585-6

Download citation

Received: 31 May 2023
Accepted: 27 November 2023
Published: 14 December 2023
DOI: https://doi.org/10.1186/s12859-023-05585-6

Twnbiome: a public database of the healthy Taiwanese gut microbiome

Abstract

Similar content being viewed by others

MetaGeneBank: a standardized database to study deep sequenced metagenomic data from human fecal specimen

Progress of analytical tools and techniques for human gut microbiome research

A geographically-diverse collection of 418 human gut microbiome pathway genome databases

Background

Construction and content

Database overview

Database content

Subject inclusion and clinical information

Biosample collections and 16s rRNA sequencing

Bioinformatic analysis

Features of Twnbiome database

16S rRNA sequencing data upload and analysis

Utility and discussion

Subject demography

Twnbiome database

Overview section

Summary section

Browser section: example 1

Data analysis: example 2

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation