Abstract
Current asthma treatments have been shown to decrease the risk of disease progression. Herein, we aimed to characterize novel potential therapeutic targets for asthma. Differentially expressed genes (DEGs) for the GSE64913 and GSE137268 datasets were characterized. Weighted correlation network analysis (WGCNA) was used to identify trait-related module genes within the GSE67472 dataset. The intersection of the module genes of interest and the DEGs comprised the key module genes, which underwent additional candidate gene screening using machine learning. In addition, a bioinformatics-based approach was used to analyze the relative expression levels, diagnostic values, and relevant enriched pathways of the screened candidate genes. Furthermore, a candidate gene was silenced in asthmatic mice, and inflammation and lung injury in the mice were assessed. A total of 1710 DEGs were characterized in GSE64913 and GSE137268 for asthma patients. WGCNA identified 2367 asthma module genes, of which 285 overlapped with the 1710 DEGs. Four candidate genes, CCDC167, POSTN, SEC14L1, and SERPINB2, were identified as the intersection of the genes selected by three machine learning algorithms: Least Absolute Shrinkage and Selection Operator, Random Forest, and Support Vector Machine. All the candidate genes were significantly upregulated in asthma patients and demonstrated diagnostic utility for asthma. Furthermore, silencing CCDC167 significantly reduced the levels of inflammatory cytokines and alleviated lung injury in ovalbumin (OVA)-induced asthmatic mice. Our study demonstrated that CCDC167 exhibits potential as a diagnostic marker and therapeutic target for asthma patients.
Introduction
Asthma is a highly prevalent chronic respiratory condition that is observed on a global scale. According to available data, there has been a decline in the estimated prevalence of asthma, with rates decreasing from 601.2 to 477.9 per 1,000,000 individuals between the years 1990 and 2019 (Cao et al. 2022). Nevertheless, the frequency of these incidents continues to increase, highlighting the ongoing need to address the burden of disease, particularly among school-aged children (Asher et al. 2021). The primary approach to managing asthma includes the use of short-acting β(2)-agonist (SABA) medications and inhaled corticosteroids (Fuhlbrigge and Sharma 2021; Nwaru et al. 2020). These treatments are intended to prevent or minimize asthma symptoms. However, it is important to note that the use of these medications has been associated with certain adverse effects, such as an increased likelihood of exacerbations and even higher mortality rates (Fuhlbrigge and Sharma 2021; Nwaru et al. 2020). The pathophysiology of asthma encompasses inflammation, hyperresponsiveness, and airway remodeling. However, the considerable heterogeneity of the disease makes it difficult to identify biomarkers and develop novel therapeutic approaches (Busse et al. 2022). In addition, the characterized subtypes are not consistently stable; for instance, Type-2 (T2 or Th2)-low asthma can transition to the T2-high subtype as the disease progresses, and vice versa, based on the T2 inflammation-based classification standards (Habib et al. 2022). As a result, there is an urgent need to characterize novel biomarkers and therapeutic targets in order to enhance the diagnosis and treatment of asthma.
The utilization of bioinformatics analysis has demonstrated significant promise in the characterization of biomarkers and therapeutic targets in various diseases, including asthma (Abdel-Aziz et al. 2020). Accordingly, ~ 38% of childhood asthma cases have been attributed to various hereditary factors, which have been associated with immune responses, muscle function, and lung function (Zayed 2020). Furthermore, the presence of disease heterogeneity, variations in datasets or sample sources, individual differences among patients, and differences in analytical methodology contribute to the variability observed in various studies. For example, weighted gene co-expression network analysis (WGCNA) was applied using the GSE43696 dataset, resulting in the identification and characterization of 15 hub genes. These hub genes include BIRC5, CCNB2, CDCA2, MELK, UBE2C, and KIF20A, among others (He et al. 2020). The GSE89809 dataset was subjected to comparable analyses, yielding hub genes such as CCR1, CCR7, CXCR1, CXCR2, TLR2, FCGR3B, and FPR1 (Zhang et al. 2021). However, these findings diverged significantly from previous analyses despite some shared characteristic biological processes. Additionally, the presence of heterogeneity was observed in various studies, necessitating the utilization of multiple datasets and comprehensive analytical methodologies to identify universal biomarkers and therapeutic targets for further characterization.
The objective of this study was to discover new potential biomarkers using three machine learning algorithms: Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest, and Support Vector Machine (SVM). The differentially expressed genes (DEGs) from the GSE64913 and GSE137268 datasets were combined. These datasets consisted of samples collected from the airway epithelium and induced sputum of individuals diagnosed with asthma, respectively. The genes belonging to the module that was highly correlated with asthma were characterized using the GSE67472 dataset. In addition, the genes shared by the module genes and the DEGs were analyzed with machine learning in order to identify hub genes associated with asthma. Furthermore, the hub genes were confirmed by examining their expression profiles in both the dataset and asthmatic mice. The findings demonstrate that CCDC167, POSTN, SEC14L1, and SERPINB2 exhibit potential as diagnostic markers and therapeutic targets for individuals with asthma.
Materials and methods
Acquisition of asthma-related datasets
The workflow of our bioinformatics analyses is summarized in Fig. 1. The asthma-related datasets were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) and were used to identify differential gene expression and pathways in asthma and to find potential diagnostic markers. The GSE64913 dataset comprises airway epithelial biopsy samples obtained from a cohort of 42 healthy individuals and 28 patients diagnosed with asthma. Similarly, the GSE67472 dataset encompasses epithelium samples collected from 43 healthy individuals and 62 asthma patients. Lastly, the GSE137268 dataset consists of induced sputum samples obtained from 15 healthy individuals and 54 patients diagnosed with asthma. The basic information of the GEO datasets is shown in Table 1.
Characterization of DEGs
The “limma” package in the R software was used to screen the DEGs that were upregulated and downregulated in the GSE64913 and GSE137268 datasets. The screening criteria were set as |log2FC| > 1.2 and P value < 0.05. The volcano plot was generated using the “ggplot2” package. All the DEGs from both datasets were collected for subsequent analysis.
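The screening rule above can be illustrated with a minimal sketch. It is written in Python rather than with the “limma” R package and is not the pipeline used in this study; the gene names, the `GeneStat` type, and the `screen_degs` helper are hypothetical, and only the two cut-offs (|log2FC| > 1.2 and P value < 0.05) are taken from the text.

```python
from typing import Dict, List, NamedTuple


class GeneStat(NamedTuple):
    """Per-gene summary statistics (hypothetical container for this sketch)."""
    log2_fc: float
    p_value: float


def screen_degs(
    stats: Dict[str, GeneStat],
    lfc_cutoff: float = 1.2,
    p_cutoff: float = 0.05,
) -> Dict[str, List[str]]:
    """Split genes into up- and downregulated DEGs using the stated cut-offs."""
    up: List[str] = []
    down: List[str] = []
    for gene, s in stats.items():
        # A gene passes only if BOTH criteria hold: |log2FC| > cutoff and P < cutoff.
        if s.p_value >= p_cutoff or abs(s.log2_fc) <= lfc_cutoff:
            continue
        (up if s.log2_fc > 0 else down).append(gene)
    return {"up": sorted(up), "down": sorted(down)}


# Toy input with made-up gene labels and values, for illustration only.
example = {
    "GENE_A": GeneStat(2.1, 0.001),    # passes, upregulated
    "GENE_B": GeneStat(-1.5, 0.01),    # passes, downregulated
    "GENE_C": GeneStat(0.8, 0.0001),   # fails the fold-change cut-off
    "GENE_D": GeneStat(3.0, 0.20),     # fails the P value cut-off
}
result = screen_degs(example)
print(result)
```

Requiring both criteria at once keeps genes with large but statistically unsupported changes, and genes with significant but small changes, out of the DEG list.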
Weighted correlation network analysis (WGCNA)
According to the WGCNA method (Botía et al. 2017; Langfelder and Horvath 2008), the GSE67472 dataset, which had the most varied genes, was employed to examine the genes most likely associated with asthma. The “goodSamplesGenes” function from the “WGCNA” R package was employed to assess the presence of missing and discrete values. Subsequently, the “hclust” function was utilized in hierarchical cluster analysis to eliminate outliers, and a heat map illustrating the correlations between modules and traits was generated. To circumvent the issue of arbitrary thresholds, a soft threshold was employed, while ensuring scale independence and mean connectivity by keeping the powers fixed. When the soft threshold power was defined as 10, the scale-free topology index was 0.9. Thus, the network conformed to a power-law distribution and was closer to the real biological network state (Yang et al. 2018). For gene clustering, the “TOMsimilarity” and “hclust” functions were used to create the weighted adjacency matrix and the transformed topological overlap matrix (TOM). The “cutreeDynamic” function was used to self-adaptively prune the dynamically identified modules of the hierarchical clustering tree, with the module size set to 60 genes. The co-expressed modules were generated using the “dynamicTreeCut” algorithm, and their eigengenes were computed using the “moduleEigengenes” function. Modules that exhibited similarities were then clustered using the “hclust” function, with a height threshold of 0.25. Next, the clinical correlations of the genes within each module were assessed using the “corPvalueStudent” function, and gene significance and module membership were computed for each module. Finally, the module genes of interest, defined as the genes identified both among the previously characterized DEGs and in the most prominent asthma-associated module, were selected for subsequent analysis.
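The effect of the soft threshold can be sketched as follows. This Python example is not the “WGCNA” R implementation used in this study; it assumes an unsigned network, in which the adjacency between two genes is the absolute Pearson correlation raised to the soft threshold power (10 in the text). The helper names and the toy expression values are hypothetical.

```python
import math
from typing import List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation of two equally long expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def soft_threshold_adjacency(expr: List[List[float]], power: int = 10) -> List[List[float]]:
    """Unsigned weighted adjacency: a_ij = |cor(gene_i, gene_j)| ** power."""
    n = len(expr)
    adj = [[1.0] * n for _ in range(n)]  # diagonal: each gene with itself
    for i in range(n):
        for j in range(i + 1, n):
            a = abs(pearson(expr[i], expr[j])) ** power
            adj[i][j] = adj[j][i] = a  # the matrix is symmetric
    return adj


# Toy data: three genes (rows) measured in four samples (columns).
expr = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.0],  # perfectly correlated with gene 0
    [1.0, 3.0, 2.0, 4.0],  # correlation 0.8 with gene 0
]
adj = soft_threshold_adjacency(expr, power=10)
print(adj[0][1], adj[0][2])
```

In this toy case a correlation of 0.8 is reduced to an adjacency of about 0.107, whereas a perfect correlation stays at 1. Raising correlations to a power in this way down-weights moderate correlations without imposing a hard cut-off.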
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment
The “clusterProfiler” package was used to apply GO and KEGG enrichment to the DEGs as well as the relevant module genes. GO enrichment included biological process (BP), cellular component (CC), and molecular function (MF) terms. A threshold of P value < 0.05 was applied to both enrichments.
Machine learning
The LASSO, Random Forest, and SVM algorithms were utilized to identify hub genes among the module genes of interest. The module genes of interest were normalized with the DEGs. LASSO is a conventional feature selection method that can screen important differential genes (Alhamzawi and Ali 2018). Random Forest can be highly parallelized to obtain a more accurate and stable model (Asadi et al. 2021). SVM algorithms help with gene classification and regression tasks (Uddin et al. 2019). The “glmnet” function in R was used to conduct the LASSO regression analysis. Subsequently, the “cv.glmnet” function was used for cross-validation, and the “coef” function was used to extract the optimized genes. For Random Forest analysis, the R package “randomForest” was used to construct the forest, and the 30 most important asthma-related genes were selected as potential hub genes. For SVM, SVM recursive feature elimination (SVM-RFE) was constructed using the “caret” and “randomForest” R packages. Finally, the potential hub genes characterized by all three algorithms were deemed hub genes, and their overlap was visualized using the “VennDiagram” R package.
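The final consensus step, in which only genes selected by all three algorithms are retained, amounts to a set intersection. A minimal Python sketch is given below; the `consensus_genes` helper and the placeholder gene labels are hypothetical and do not reproduce the candidate lists obtained in this study.

```python
from typing import Iterable, List


def consensus_genes(*selections: Iterable[str]) -> List[str]:
    """Return the genes present in every selection, sorted alphabetically."""
    sets = [set(s) for s in selections]
    if not sets:
        return []
    return sorted(set.intersection(*sets))


# Placeholder candidate lists standing in for the three algorithm outputs.
lasso_hits = ["G1", "G2", "G3", "G5"]
random_forest_hits = ["G2", "G3", "G4", "G5"]
svm_rfe_hits = ["G0", "G2", "G5", "G6"]

hub_genes = consensus_genes(lasso_hits, random_forest_hits, svm_rfe_hits)
print(hub_genes)
```

Requiring agreement among all three methods is a conservative choice: a gene favored by only one algorithm, possibly because of that algorithm's particular bias, is excluded.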
ROC evaluation
The module genes of interest were normalized using the DEGs. Subsequently, the predictive value of the predicted hub genes was evaluated with ROC curves generated using the “pROC” R package, and the area under the curve (AUC) and 95% confidence interval (CI) were calculated.
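For intuition, the AUC of a single gene can be read as the probability that a randomly chosen patient has a higher expression value than a randomly chosen control, with ties counted as one half. The Python sketch below computes this quantity directly; it is not the “pROC” implementation, it does not compute a confidence interval, and the expression values are made up for illustration.

```python
from typing import Sequence


def roc_auc(case_values: Sequence[float], control_values: Sequence[float]) -> float:
    """AUC as the fraction of (case, control) pairs in which the case ranks higher.

    Ties contribute 0.5, which matches the area under the empirical ROC curve.
    """
    wins = 0.0
    for c in case_values:
        for h in control_values:
            if c > h:
                wins += 1.0
            elif c == h:
                wins += 0.5
    return wins / (len(case_values) * len(control_values))


# Hypothetical expression values of one gene in patients and healthy controls.
asthma = [5.1, 6.0, 4.2, 7.3]
healthy = [3.9, 4.2, 5.0]

auc = roc_auc(asthma, healthy)
print(auc)
```

An AUC of 0.5 corresponds to no discrimination between the two groups, and an AUC of 1.0 to perfect separation.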
Gene-set enrichment analysis (GSEA)
The R packages “clusterProfiler” and “org.Hs.eg.db” were utilized to perform GSEA in order to identify the signaling pathways associated with hub genes in all three characterized datasets.
Animal experiments
In order to validate the hub gene CCDC167, an in vivo verification was conducted by referencing a previously published report (Wu et al. 2022). In this study, female BALB/c mice (aged 5 ± 1 weeks) were procured from the Experimental Animal Centre of East China Normal University [SCXK (Shanghai) 2021-0006] and subsequently housed in animal facilities that maintained specific pathogen-free conditions. All experimental procedures were reviewed and approved by the Animal Care and Use Committee of Kongjiang Hospital. After a period of acclimation lasting one week, a total of 10 mice were randomly assigned to one of three groups: the control group, which received treatment with saline; the shNC group, in which asthma was induced and the mice received short hairpin RNA (shRNA) as a negative control; and the shCCDC167 group, in which asthma was induced and the mice received shCCDC167. To induce asthma, a solution consisting of 20 μg ovalbumin (OVA, Sigma-Aldrich, St. Louis, MO) and 2 mg aluminum hydroxide (769460, Sigma-Aldrich) was administered via intraperitoneal injection in a volume of 1 mL on day 7, day 14, and day 21. Subsequently, the mice with asthma were confined within a transparent enclosed cage measuring 20 × 20 × 30 cm from day 27 to day 30. During this period, they were subjected to daily treatment with a 1% OVA solution administered via atomization for 20 min. The mice in the control group were administered a comparable quantity of normal saline. Furthermore, in the shNC and shCCDC167 experimental groups, a total volume of 50 μL of AAV6 recombinant vector (obtained from Genepharma Co., Ltd., Shanghai, China) containing either shNC or shCCDC167 was administered to the lungs of the mice via endotracheal intubation on day 1 and day 7. On the 34th day of the experiment, all mice were administered an intraperitoneal injection of pentobarbital sodium (100 mg/kg, Vetoquinol, Cedex, France). 
Subsequently, the bronchoalveolar lavage fluid (BALF) and lung tissues were collected for subsequent analysis. The shRNAs (Genepharma) employed in this study were as follows: CCDC167 shRNA: 5ʹ -CCG GCC TAG TGT TCA AGC ATG GCT TCT CGA GAA GCC ATG CTT GAA CAC TAG GTT TTT TG-3ʹ, and control shRNA: 5ʹ -CCG GCC TAG TGT TCA AGC ATG GCT TCT CGA GAA GCC ATG CTT GAA CAC TAG GTT TTT TG-3ʹ. IgE (J24307, Giled Biotechnology, Wuhan, China) and the cytokines IL-4 (J24113, Giled), IL-5 (J24112, Giled), and IL-13 (J24122, Giled) in BALF were measured using ELISA kits as per the manufacturer's guidelines. The mouse lung tissues were fixed using a 10% formalin solution, followed by dehydration, paraffin embedding, and slicing into sections 5 μm in thickness. The sections underwent regular dewaxing and staining procedures using hematoxylin and eosin (H&E), periodic acid-Schiff stain (PAS), and Masson's trichrome stain to observe the presence of inflammatory cell infiltration, smooth muscle cell hyperplasia, and airway mucus secretion in the samples. The inner area of the bronchial wall (WAi), airway smooth muscle area (WAm), number of bronchial smooth muscle cells (N), and inner perimeter of the bronchial wall (Pi) were measured using Image Pro Plus software (Media Cybernetics Inc., MD, USA). These measurements were obtained after the sections were observed at 40× magnification and photographed using a microscope (Olympus CX41, Tokyo, Japan).
Statistics
Statistical analyses were conducted using R software (v4.2.1) and GraphPad Prism (V9.4.0, GraphPad Software, San Diego, CA, USA). Unless otherwise specified, the Student's t-test was employed for comparing between two groups, while one-way analysis of variance (ANOVA) was utilized for comparing among three or more groups. A P value < 0.05 was deemed to be statistically significant.
Results
Characterization of module genes
The DEGs for the GSE64913 and GSE137268 datasets, which include samples of airway epithelium and induced sputum, respectively, were characterized using the “limma” package of the R software. A total of 720 DEGs were identified for GSE137268, of which 488 were upregulated and 232 were downregulated (Fig. 2A). For GSE64913, 1035 DEGs were identified, of which 516 were upregulated and 519 were downregulated (Fig. 2B). The DEGs of the two datasets differed substantially: of the 1035 DEGs for GSE64913 and the 720 DEGs for GSE137268, only 45 were shared between the two datasets (Fig. 2C). Subsequent analysis was therefore conducted on all DEGs from both datasets. The GSE67472 dataset, which contains airway epithelium samples from healthy donors and asthma patients, was normalized using DEGs and analyzed using the WGCNA package. The resulting gene dendrograms and respective module colors are displayed in Fig. 2D and E. The disease-related modules with the greatest significance were identified (Fig. 2F). Among the characterized modules, the light-cyan, cyan, and salmon modules were significantly downregulated in asthma patients, whereas the purple, brown, and dark-green modules were significantly upregulated. The correlation and importance between genes and modules for the dark-green module are depicted in Fig. 2G; accordingly, the 285 genes found both in the dark-green module and among the DEGs were chosen as the module genes of interest for subsequent analyses (Fig. 2H). By GO analysis, the module genes of interest were primarily enriched in negative regulation of proteolysis, hydrolase, peptidase and endopeptidase activities (BP), regulation of endopeptidase activity (BP), apical part of cell (CC), apical plasma membrane (CC), and peptidase activities (MF) (Fig. 2I). 
Furthermore, via KEGG enrichment, the genes of interest were predominantly enriched in the chemokine signaling pathway, serotonergic synapse, glutathione metabolism, and salivary secretion (Fig. 2J). Collectively, we characterized 285 module genes of interest for future analyses.
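The total of 1710 combined DEGs reported in the abstract is consistent with the per-dataset counts given above once the 45 shared genes are counted only once. The set symbols used below are notation introduced here for illustration.

```latex
\[
\bigl|D_{\mathrm{GSE64913}} \cup D_{\mathrm{GSE137268}}\bigr|
  = \bigl|D_{\mathrm{GSE64913}}\bigr| + \bigl|D_{\mathrm{GSE137268}}\bigr|
    - \bigl|D_{\mathrm{GSE64913}} \cap D_{\mathrm{GSE137268}}\bigr|
  = 1035 + 720 - 45 = 1710
\]
```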
Identification of asthma-related hub genes via machine learning algorithms
The module genes of interest were then trained using LASSO, Random Forest, and SVM algorithms in order to identify the hub genes for asthma. Following normalization to the DEGs, LASSO regression was used to identify the most relevant genes via the minimum criteria (Fig. 3A), and 20 genes were identified as potential hub genes with the optimal lambda value using the LASSO algorithm (Fig. 3B). Random Forest was used to calculate the importance of the module genes of interest. Importantly, the 30 most important genes were chosen as the potential hub genes with mean decrease gini > 0.4 (Fig. 3C). As the number of trees increases, the error rate rapidly decreases. When the number of trees exceeds 30, the decrease in error rate begins to slow down and gradually stabilizes (Fig. 3D). In addition, 18 candidate hub genes were validated using the SVM algorithm, which displayed the lowest 5 × CV error (0.19, Fig. 3E) and highest 5 × CV accuracy (0.81, Fig. 3F). Finally, CCDC167, POSTN, SEC14L1, and SERPINB2 were designated as the asthma hub genes because they were characterized by all three machine learning algorithms (Fig. 3G).
Validation of the hub genes
Next, the hub gene expression levels in the GSE67472 dataset were verified to validate the prediction. The expression of the hub genes CCDC167, POSTN, SEC14L1, and SERPINB2 was significantly lower in healthy donors (controls) compared to asthma patients (Fig. 4A–D). The predictive value of the hub genes was assessed using ROC curves, and the areas under the curve for CCDC167, POSTN, SEC14L1, and SERPINB2 were approximately 0.79, 0.87, 0.86, and 0.89, respectively, supporting their sensitivity and specificity in asthma diagnosis (Fig. 4E–H). The GSEA showed that in patients having a high expression of CCDC167, amino sugar and nucleotide sugar metabolism, biosynthesis of nucleotide sugars, glycosphingolipid biosynthesis - lacto and neolacto series, mucin-type O-glycan biosynthesis, and terpenoid backbone biosynthesis were upregulated. Conversely, allograft rejection, ascorbate and aldarate metabolism, graft-versus-host disease, pentose and glucuronate interconversions, and type I diabetes mellitus were downregulated (Fig. 5A). In patients having a high expression of POSTN, amino sugar and nucleotide sugar metabolism, biosynthesis of nucleotide sugars, oxidative phosphorylation, proteasome, and ribosome were upregulated. Conversely, allograft rejection, autoimmune thyroid disease, graft-versus-host disease, intestinal immune network for IgA production, and type I diabetes mellitus were downregulated (Fig. 5B). Moreover, in patients having a high expression of SEC14L1, biosynthesis of nucleotide sugars, mucin-type O-glycan biosynthesis, oxidative phosphorylation, proteasome, and ribosome were upregulated. Conversely, IL-17 signaling pathway, legionellosis, mineral absorption, rheumatoid arthritis, and taurine and hypotaurine metabolism were downregulated (Fig. 5C). Furthermore, in patients with elevated SERPINB2 expression, biosynthesis of nucleotide sugars, mucin-type O-glycan biosynthesis, proteasome, protein export, and ribosome were upregulated. 
On the other hand, allograft rejection, autoimmune thyroid disease, graft-versus-host disease, intestinal immune network for IgA production, and rheumatoid arthritis were downregulated (Fig. 5D). In addition, GO analysis (Fig. 5E) revealed that the hub genes were enriched in fibrinolysis (BP), regulation of viral-induced cytoplasmic pattern recognition receptor signaling pathway (BP), regulation of RIG-I signaling pathway (BP), negative regulation of viral-induced cytoplasmic pattern recognition receptor signaling pathway (BP), negative regulation of response to external stimulus (BP), collagen-containing extracellular matrix (CC), Golgi apparatus subcompartment (CC), trans-Golgi network (CC), peptidase inhibitor activity (MF), extracellular matrix structural constituent (MF), endopeptidase inhibitor activity (MF), serine-type endopeptidase inhibitor activity (MF), and heparin-binding (MF). These results suggest that the characterized hub genes were upregulated in asthma patients and were involved in disease-related signal transduction. Among these enrichments, coiled-coil domain-containing protein 167 (CCDC167) was the gene involved in the greatest number of processes.
In vivo validation of the hub gene CCDC167
CCDC167 has been reported to be upregulated in different types of tumors (Chen et al. 2021). However, the role of CCDC167 in the development of asthma remains unclear. To further study the mechanism of CCDC167 in asthma, shCCDC167 was used to downregulate the expression of CCDC167 in mice with asthma, and the resultant impacts were examined. The suppression of gene expression was confirmed in mice treated with shCCDC167, as demonstrated in Fig. 6A. In comparison to the control group (shNC), the experimental group (shCCDC167) exhibited decreased levels of total cell counts (Fig. 6B), eosinophils (Fig. 6C), macrophages (Fig. 6D), and lymphocytes (Fig. 6E) in BALF. Additionally, the shCCDC167 group demonstrated significantly lower levels of IgE (Fig. 6F) and the inflammatory cytokines IL-4 (Fig. 6G), IL-5 (Fig. 6H), and IL-13 (Fig. 6I) in BALF.
Subsequently, the lung histology of the mice was examined. The airways of the shNC mice exhibited significant cellular infiltration and damage to alveolar structures, as observed through H&E staining. However, these effects were mitigated in the shCCDC167 group (Fig. 7A). Goblet cell hyperplasia and the production of airway mucus were detected in shNC mice using PAS staining. However, these effects were notably reduced in the shCCDC167 group, as shown in Fig. 7B. Furthermore, there was a notable presence of smooth muscle hyperplasia and fibrosis in the small airway tissues of the shNC group. However, this observation was reversed in the shCCDC167 group, as depicted in Fig. 7C. In the asthmatic mice treated with shNC, there was a notable reduction in WAi/WAm levels, accompanied by a noteworthy elevation in N/Pi, WAi/Pi, and WAm/Pi levels. However, these changes were reversed upon treatment with shCCDC167, as depicted in Fig. 7D–G. The observations made in vivo provide empirical support for our initial hypothesis that CCDC167 has the potential to serve as both a biomarker and a therapeutic target for asthma.
Discussion
Asthma has been one of the most prevalent airway diseases in the world, affecting more than 2.3 million people and causing approximately 37,600 deaths in 2016 (2020). Unfortunately, this disease remains incurable, and the treatment has long been “symptom-control” oriented, which requires long-term medication and results in a series of adverse reactions (Moore et al. 2022; Skoner et al. 2022; Skov et al. 2022). Consequently, the identification of novel biomarkers and therapeutic targets for the treatment of asthma is urgent and essential. With the inclusion of datasets of epithelial samples and induced sputum samples, as well as the application of different training algorithms, we aimed to reduce the influence of heterogeneity in the current study. Four hub genes, CCDC167, POSTN, SEC14L1, and SERPINB2, have been characterized and confirmed with training databases and in vivo.
Asthma is a heterogeneous disease with many phenotypes, such as allergic and non-allergic, T2 subtype, and eosinophilic asthma, and with endotypes defined by different immunological mechanisms (Calvén et al. 2020). Therefore, it is difficult to characterize asthma biomarkers and therapeutic targets. In addition, asthma is stimuli-sensitive, which could potentially lead to heterogeneity in clinical samples under varying environmental conditions, such as humidity and air pollution (Lam et al. 2016; Strauss et al. 1978). Inclusion of specific subtypes and restriction of the environment may increase consistency and improve the quality of analyzed hub genes but may limit their applicability to other subtypes of asthma. In addition, over-subdivided phenotypes would cause difficulties in drug development and clinical decision-making. Several types of samples have been collected for asthma in different studies, including exhaled air, airway epithelial biopsies, induced sputum, serum, and urine (Popović-Grle et al. 2021). Among these samples, airway epithelial biopsies and induced sputum are obtained directly from the site of disease and are therefore expected to more accurately represent the pathology underlying the symptoms of asthma patients, such as cough, wheezing, and shortness of breath (Aegerter and Lambrecht 2023; Bakakos et al. 2011; Bradley et al. 1991). Asthma involves abnormalities of immune cells, the epithelium of the airways, and their interactions (Calvén et al. 2020). Both airway epithelial biopsy and induced sputum are important diagnostic samples involving airway epithelial cells and inflammatory cells at different levels (Maestrelli et al. 1995). In the present study, we demonstrated that the DEGs for epithelial biopsy and induced sputum were sufficiently distinct to ensure relevance and heterogeneity and included DEGs from both the epithelium and immune cells.
WGCNA and machine learning algorithms have been employed in the characterization of hub genes associated with various diseases, including asthma. WGCNA has the ability to identify and characterize gene modules that are most pertinent to specific traits (Li et al. 2021). This allows for the exclusion of DEGs that may be statistically significant but not relevant to the traits under investigation. In their study, Yan Li et al. conducted an analysis of DEGs in the GSE64913 dataset and characterized the asthma-related traits within the same dataset. Their findings demonstrated that ANXA8, ATF4, CD44, CYCS, DDIT3, FKBP5, LDHA, PMAIP1, S100A2, and SFN exhibited potential as hub genes associated with asthma (Li et al. 2023). However, the majority of their predicted genes demonstrated limited prognostic value when referencing the same dataset, which underscores the limited efficacy of hub gene characterization solely through the use of WGCNA. In their study, Ding et al. employed the WGCNA technique along with five machine learning algorithms to identify hub genes associated with lipid metabolism. Their findings indicated that CH25H exhibited potential as a biomarker for asthma in relation to lipid metabolism (Ding et al. 2023). Nevertheless, a single biomarker is likely to be inadequate for such a diverse disease. In this study, we have identified four hub genes that are associated with asthma. Previous research has demonstrated that two of these genes, namely POSTN and SERPINB2, are considered T2 signature genes and have shown promise as potential biomarkers (Burgess et al. 2021; Du et al. 2022; Mo et al. 2019). However, the functional roles of CCDC167 and SEC14L1 in the context of asthma have yet to be fully elucidated.
POSTN is a well-characterized biomarker and crucial asthma regulator. POSTN promotes mucin hypersecretion and sustains eosinophilic inflammation, both of which are essential pathologies of asthma (Burgess et al. 2021). POSTN is also a biomarker and a key player in tissue remodeling and fibrosis, which are important processes in bronchial asthma (Izuhara et al. 2015). SERPINB2 has been identified as an adaptive immunity regulator. Compared to wild-type SERPINB2(+/+) macrophages and mice, OVA significantly induced OVA-specific IFN-gamma-secreting T cells and enhanced the secretion of Th1-promoting cytokines in SERPINB2(−/−) macrophages and mice (Schroder et al. 2010). SERPINB2 was positively correlated with the severity of bronchial asthma (NE et al. 2017), and antagonizing SERPINB2 could inhibit the differentiation of Th2 cells in vitro (Zhou et al. 2021). Our analysis strategy identified POSTN and SERPINB2 as hub genes, which highlights the significance of the two genes and supports the efficacy of our analyses.
As determined by GSEA, the hub genes were primarily enriched in the biosynthesis of nucleotide sugars, mucin-type O-glycan biosynthesis, proteasome, and ribosome, whereas they were negatively correlated with allograft rejection and graft-versus-host disease. CCDC167 was associated with the majority of these predicted biological processes, emphasizing its central role in asthma. However, CCDC167 has not previously been characterized in asthma. CCDC167 was reported to be significantly upregulated as a hub gene in the lungs of chloroprene-treated mice (Guo and Xing 2016), and to be downregulated by multiple antitumor therapeutics in breast cancer patients (Chen et al. 2021). In the present study, CCDC167 was silenced in OVA-induced asthmatic mice, and the inhibitory effect of CCDC167 silencing in asthma was confirmed by decreased inflammatory cytokines and an improvement in airway injuries.
Conclusion
The four hub genes, namely CCDC167, POSTN, SEC14L1, and SERPINB2, were subjected to characterization and validation using the training databases. In particular, the role of CCDC167 as a crucial factor and a possible therapeutic target for asthma was confirmed through experimentation on mice with OVA-induced asthma.
Data availability
The datasets obtained and analyzed during the current study are available from the corresponding authors upon request. Gene expression data (GSE67472, GSE64913 and GSE137268) were downloaded from the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo).
References
Abdel-Aziz MI, Neerincx AH, Vijverberg SJ, Kraneveld AD, Maitland-van der Zee AH (2020) Omics for the future in asthma. Semin Immunopathol 42:111–126. https://doi.org/10.1007/s00281-019-00776-x
Aegerter H, Lambrecht BN (2023) The pathology of asthma: what is obstructing our view? Annu Rev Pathol 18:387–409. https://doi.org/10.1146/annurev-pathol-042220-015902
Alhamzawi R, Ali HTM (2018) The Bayesian adaptive lasso regression. Math Biosci 303:75–82. https://doi.org/10.1016/j.mbs.2018.06.004
Asadi S, Roshan S, Kattan MW (2021) Random forest swarm optimization-based for heart diseases diagnosis. J Biomed Inform 115:103690. https://doi.org/10.1016/j.jbi.2021.103690
Asher MI, Rutter CE, Bissell K, Chiang CY, El Sony A, Ellwood E, Ellwood P, García-Marcos L, Marks GB, Morales E, Mortimer K, Pérez-Fernández V, Robertson S, Silverwood RJ, Strachan DP, Pearce N (2021) Worldwide trends in the burden of asthma symptoms in school-aged children: Global Asthma Network Phase I cross-sectional study. Lancet 398:1569–1580. https://doi.org/10.1016/s0140-6736(21)01450-1
Bakakos P, Schleich F, Alchanatis M, Louis R (2011) Induced sputum in asthma: from bench to bedside. Curr Med Chem 18:1415–1422. https://doi.org/10.2174/092986711795328337
Botía JA, Vandrovcova J, Forabosco P, Guelfi S, D’Sa K, Hardy J, Lewis CM, Ryten M, Weale ME (2017) An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks. BMC Syst Biol 11:47. https://doi.org/10.1186/s12918-017-0420-6
Bradley BL, Azzawi M, Jacobson M, Assoufi B, Collins JV, Irani AM, Schwartz LB, Durham SR, Jeffery PK, Kay AB (1991) Eosinophils, T-lymphocytes, mast cells, neutrophils, and macrophages in bronchial biopsy specimens from atopic subjects with asthma: comparison with biopsy specimens from atopic subjects without asthma and normal control subjects and relationship to bronchial hyperresponsiveness. J Allergy Clin Immunol 88:661–674. https://doi.org/10.1016/0091-6749(91)90160-p
Burgess JK, Jonker MR, Berg M, Ten Hacken NTH, Meyer KB, van den Berge M, Nawijn MC, Heijink IH (2021) Periostin: contributor to abnormal airway epithelial function in asthma? Eur Respir J 57:2001286. https://doi.org/10.1183/13993003.01286-2020
Busse WW, Melén E, Menzies-Gow AN (2022) Holy Grail: the journey towards disease modification in asthma. Eur Respir Rev 31:210183. https://doi.org/10.1183/16000617.0183-2021
Calvén J, Ax E, Rådinger M (2020) The airway epithelium-A central player in asthma pathogenesis. Int J Mol Sci 21:8907. https://doi.org/10.3390/ijms21238907
Cao Y, Chen S, Chen X, Zou W, Liu Z, Wu Y, Hu S (2022) Global trends in the incidence and mortality of asthma from 1990 to 2019: an age-period-cohort analysis using the global burden of disease study 2019. Front Public Health 10:1036674. https://doi.org/10.3389/fpubh.2022.1036674
Chen PS, Hsu HP, Phan NN, Yen MC, Chen FW, Liu YW, Lin FP, Feng SY, Cheng TL, Yeh PH, Omar HA, Sun Z, Jiang JZ, Chan YS, Lai MD, Wang CY, Hung JH (2021) CCDC167 as a potential therapeutic target and regulator of cell cycle-related networks in breast cancer. Aging (Albany NY) 13:4157–4181. https://doi.org/10.18632/aging.202382
Ding X, Qin J, Huang F, Feng F, Luo L (2023) The combination of machine learning and untargeted metabolomics identifies the lipid metabolism-related gene CH25H as a potential biomarker in asthma. Inflamm Res. https://doi.org/10.1007/s00011-023-01732-0
Du L, Xu C, Shi J, Tang L, Xiao L, Lei C, Liu H, Liang Y, Guo Y, Tang K (2022) Elevated CXCL14 in induced sputum was associated with eosinophilic inflammation and airway obstruction in patients with asthma. Int Arch Allergy Immunol 183:1216–1225. https://doi.org/10.1159/000526367
Fuhlbrigge AL, Sharma S (2021) Oral corticosteroid use in asthma: a wolf in sheep’s clothing. J Allergy Clin Immunol Pract 9:347–348. https://doi.org/10.1016/j.jaip.2020.10.014
GBD 2016 Occupational Chronic Respiratory Risk Factors Collaborators (2020) Global and regional burden of chronic respiratory disease in 2016 arising from non-infectious airborne occupational exposures: a systematic analysis for the Global Burden of Disease Study 2016. Occup Environ Med 77:142–150. https://doi.org/10.1136/oemed-2019-106013
Guo Y, Xing Y (2016) Weighted gene co-expression network analysis of pneumocytes under exposure to a carcinogenic dose of chloroprene. Life Sci 151:339–347. https://doi.org/10.1016/j.lfs.2016.02.074
Habib N, Pasha MA, Tang DD (2022) Current understanding of asthma pathogenesis and biomarkers. Cells 11:2764. https://doi.org/10.3390/cells11172764
He LL, Xu F, Zhan XQ, Chen ZH, Shen HH (2020) Identification of critical genes associated with the development of asthma by co-expression modules construction. Mol Immunol 123:18–25. https://doi.org/10.1016/j.molimm.2020.01.015
Izuhara K, Matsumoto H, Ohta S, Ono J, Arima K, Ogawa M (2015) Recent developments regarding periostin in bronchial asthma. Allergol Int 64(Suppl):S3-10. https://doi.org/10.1016/j.alit.2015.04.012
Lam HC, Li AM, Chan EY, Goggins WB (2016) The short-term association between asthma hospitalisations, ambient temperature, other meteorological factors and air pollutants in Hong Kong: a time-series study. Thorax 71:1097–1109. https://doi.org/10.1136/thoraxjnl-2015-208054
Langfelder P, Horvath S (2008) WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9:559. https://doi.org/10.1186/1471-2105-9-559
Li M, Zhu W, Wang C, Zheng Y, Sun S, Fang Y, Luo Z (2021) Weighted gene co-expression network analysis to identify key modules and hub genes associated with paucigranulocytic asthma. BMC Pulm Med 21:343. https://doi.org/10.1186/s12890-021-01711-3
Li Y, Li L, Zhao H, Gao X, Li S (2023) The identification and clinical value evaluation of CYCS related to asthma through bioinformatics analysis and functional experiments. Dis Markers 2023:5746940. https://doi.org/10.1155/2023/5746940
Maestrelli P, Saetta M, Di Stefano A, Calcagni PG, Turato G, Ruggieri MP, Roggeri A, Mapp CE, Fabbri LM (1995) Comparison of leukocyte counts in sputum, bronchial biopsies, and bronchoalveolar lavage. Am J Respir Crit Care Med 152:1926–1931. https://doi.org/10.1164/ajrccm.152.6.8520757
Mo Y, Zhang K, Feng Y, Yi L, Liang Y, Wu W, Zhao J, Zhang Z, Xu Y, Hu Q, He J, Zhen G (2019) Epithelial SERPINB10, a novel marker of airway eosinophilia in asthma, contributes to allergic airway inflammation. Am J Physiol Lung Cell Mol Physiol 316:L245–L254. https://doi.org/10.1152/ajplung.00362.2017
Moore WC, Kornmann O, Humbert M, Poirier C, Bel EH, Kaneko N, Smith SG, Martin N, Gilson MJ, Price RG, Bradford ES, Liu MC (2022) Stopping versus continuing long-term mepolizumab treatment in severe eosinophilic asthma (COMET study). Eur Respir J 59:2100396. https://doi.org/10.1183/13993003.00396-2021
Ne EL, Abdel-Latif RS, El-Hady HA (2017) Association between SERPINB2 gene expression by real time PCR in respiratory epithelial cells and atopic bronchial asthma severity. Egypt J Immunol 24:165–181
Nwaru BI, Ekström M, Hasvold P, Wiklund F, Telg G, Janson C (2020) Overuse of short-acting β(2)-agonists in asthma is associated with increased risk of exacerbation and mortality: a nationwide cohort study of the global SABINA programme. Eur Respir J 55:1901872. https://doi.org/10.1183/13993003.01872-2019
Popović-Grle S, Štajduhar A, Lampalo M, Rnjak D (2021) Biomarkers in different asthma phenotypes. Genes (Basel) 12:801. https://doi.org/10.3390/genes12060801
Schroder WA, Le TT, Major L, Street S, Gardner J, Lambley E, Markey K, MacDonald KP, Fish RJ, Thomas R, Suhrbier A (2010) A physiological function of inflammation-associated SerpinB2 is regulation of adaptive immunity. J Immunol 184:2663–2670. https://doi.org/10.4049/jimmunol.0902187
Skoner DP, Golant AK, Norton AE, Stukus DR (2022) Is this medication safe for my child? How to discuss safety of commonly used medications with parents. J Allergy Clin Immunol Pract 10:3064–3072. https://doi.org/10.1016/j.jaip.2022.07.032
Skov IR, Madsen H, Henriksen DP, Andersen JH, Pottegård A, Davidsen JR (2022) Low-dose oral corticosteroids in asthma associates with increased morbidity and mortality. Eur Respir J 60:2103054. https://doi.org/10.1183/13993003.03054-2021
Strauss RH, McFadden ER, Ingram RH, Deal EC, Jaeger JJ (1978) Influence of heat and humidity on the airway obstruction induced by exercise in asthma. J Clin Invest 61:433–440. https://doi.org/10.1172/jci108954
Uddin S, Khan A, Hossain ME, Moni MA (2019) Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak 19:281. https://doi.org/10.1186/s12911-019-1004-8
Wu Q, Liu J, Deng J, Chen Y (2022) Long non-coding RNA HOTTIP induces inflammation in asthma by promoting EFNA3 transcription by CCCTC-binding factor. Am J Transl Res 14:8903–8917
Yang Q, Wang R, Wei B, Peng C, Wang L, Hu G, Kong D, Du C (2018) Candidate biomarkers and molecular mechanism investigation for glioblastoma multiforme utilizing WGCNA. BioMed Res Int. https://doi.org/10.1155/2018/4246703
Zayed H (2020) Novel comprehensive bioinformatics approaches to determine the molecular genetic susceptibility profile of moderate and severe asthma. Int J Mol Sci 21:4022. https://doi.org/10.3390/ijms21114022
Zhang Z, Wang J, Chen O (2021) Identification of biomarkers and pathogenesis in severe asthma by coexpression network analysis. BMC Med Genomics 14:51. https://doi.org/10.1186/s12920-021-00892-4
Zhou J, Lu Y, Wu W, Feng Y (2021) HMSC-derived exosome inhibited Th2 cell differentiation via regulating miR-146a-5p/SERPINB2 pathway. J Immunol Res 2021:6696525. https://doi.org/10.1155/2021/6696525
Funding
The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.
Author information
Contributions
Conception and design of the study were performed by Yukai Zhong, Yuanjing Chen and Qi Shen. The experiments were performed by Yukai Zhong and Qiong Wu. Data analysis was performed by Yukai Zhong, Qiong Wu and Li Cai. Yukai Zhong wrote the paper. Yuanjing Chen and Qi Shen were responsible for its revision. All authors read and approved the final manuscript.
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethical approval
All experimental procedures were reviewed and approved by the Animal Care and Use Committee of Kongjiang Hospital.
Consent to participate
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhong, Y., Wu, Q., Cai, L. et al. CCDC167 exhibits potential as a biomarker for airway inflammation in asthma. Mamm Genome (2024). https://doi.org/10.1007/s00335-024-10037-4
DOI: https://doi.org/10.1007/s00335-024-10037-4