FormalPara Key Points

The seven circulating vascular-related RNA transcripts under study (lncRNA ANRIL, miRNA-126-5p, CDK4, CDK6, TGF-β, E-cadherin, and TNF-α) could serve as potential coronary artery disease (CAD) biomarkers.

The studied transcripts are a promising non-invasive panel that can be used for reflecting the cellular vascular function, proliferation, and inflammatory processes in CAD patients.

Blood transcriptome analysis via investigating different types of RNA transcripts in CAD patients opens new frontiers for the non-invasive diagnosis of CAD.

1 Introduction

Coronary artery disease (CAD), a subtype of cardiovascular disease, is considered one of the leading causes of death worldwide [1]. Several risk factors are associated with CAD, including smoking, obesity, high blood pressure, high blood levels of lipids, family history, and genetic predisposition [2]. Previous epidemiological studies indicated that CAD susceptibility is 40–60% of genetic origin [3]. In Egypt, cardiac genetic and genomic changes are currently considered a priority with the rise of the Egyptian Collaborative Cardiac Genomics (ECCO-GEN) project supported by the Aswan Heart Center and the Magdi Yacoub Global Heart Foundation [4].

Atherosclerosis is the primary pathogenesis of coronary vasculature [5]. Coronary atherosclerosis (CAS) is characterized by various clinical manifestations, such as acute coronary syndrome, stable angina, and heart failure [6]. CAS is a prolonged, inflammatory, multifaceted condition where atherosclerotic plaque is considered the hallmark of atherosclerotic lesions [7]. CAS develops and progresses through a complex interplay between immune cells, vascular cells, and endothelial shear stresses [8].

Immune cells produce both pro- and anti-inflammatory cytokines and thus are involved in the initiation of the atherogenic process and affect plaque formation as well as stability [9]. Vascular smooth muscle cells (VSMCs) and vascular endothelial cells (EC) are major constituents of the arterial wall. They are essential in maintaining blood flow via vascular wall remodeling [5]. VSMC proliferation and phenotype switch are prominent contributors to atherosclerotic plaque formation [10], where plaque content of VSMCs is inversely proportional to plaque vulnerability [10]. Furthermore, endothelial cell activation and dysfunction are significant features during atherogenesis, where endothelial dysfunction plays a central role in the formation, progression, and complications of atherosclerotic plaque [11].

Early diagnosis of CAS is critical for avoiding plaque rupture that may result in acute myocardial infarction (MI) or sudden cardiac death [12]. Therefore, identifying coronary artery biomarkers linked to vessel wall functions is a hot spot in CAS research. Earlier studies indicated that RNAs have a distinct signature in the blood of CAD patients [13,14,15]. These RNA transcripts are probably linked to genes involved in inflammation and cell cycle control [16,17,18].

Long ncRNA (lncRNA), a class of non-coding RNA (ncRNA) molecules with lengths of >200 nucleotides, are thought to serve an essential role in the development of human diseases; however, only a few have been well characterized with regards to their functional roles [19]. ANRIL, also known as cyclin-dependent kinase inhibitor 2B antisense RNA 1 (CDKN2B-AS1), is implicated in various malignant and nonmalignant diseases [20] and is known previously to have a role in cardiovascular diseases (CVDs) by regulating inflammation responses and cell cycle [11]. Regarding CAD, ANRIL is expressed in vascular endothelial cells and smooth muscles and has a vital role in plaque formation [19,20,21,22].

Another subtype of ncRNAs is the microRNAs (miRNAs), which have a role in regulating gene expression on post-transcription or post-translation levels [23] and were also found to be altered in CAD patients and implicated in atherosclerosis development [24]. MiRNA-126-5p was among the early discovered miRNAs expressed specifically in the vasculature and have critical functions in vascular development [25]. Recent studies have unveiled the potentially important function of miRNA-126-5p in vascular diseases, including atherosclerosis, coronary artery disease, stroke, and diabetic vasculopathy. The action of miRNA-126-5p reflects its function in angiogenesis and inflammation [24,25,26].

Several publications illustrated the interplay and cross-regulation between different RNA classes, especially in chronic diseases [27,28,29,30]. Following on from these, the present study investigates the expression level of some of the vascular-related RNA transcripts (lncRNA ANRIL, miRNA-126-5p, CDK4, CDK6, TGF-β, E-cadherin, and TNF-α) implicated in the cellular vascular function, proliferation, and inflammatory processes in the plasma of CAD patients to elucidate their role as potential CAD biomarkers.

2 Study Participants and Methodological Approach

2.1 Study Participants

A case-control study design with 180 participants was classified into two groups. The CAD group included 90 patients presenting with either complaints or signs of CAD revealed by examination, ECG, and echocardiography and recruited from the Suez Canal University Hospital (SCUH) cardiology outpatient clinic from June 2020 to December 2021. Patients were enrolled from both genders, and their ages ranged from 18 to 85 years. Patient history was taken for each patient, including demographic data, CAD family history, past history, and lifestyle habits. We excluded patients with other chronic CVD or congenital heart disease. The risk factors were detected by clinical diagnosis and/or treatment for hyperlipidemia, diabetes mellitus, and hypertension. All patients under study underwent a clinical examination, including vital signs and general and local examinations. For calculating body mass index (BMI) (kg/m2), anthropometrical height and body weight were estimated. Two measurements were performed for blood pressure and resting heart rate using an automatic sphygmomanometer, and average values were recorded. left ventricular (LV) size, LV systolic function (ejection fraction), and diastolic dysfunction was evaluated by echocardiography with two independent operators. The left ventricular ejection fraction (LVEF) and wall motion score index (WMSI) were estimated for all CAD patients. CAD patients under study were selected based on at least one of the following findings: positive stress imaging, whether by stress echocardiography or exercise stress ECG or single-photon emission computerized tomography (SPECT), wall motion alterations in resting echocardiography, prior history of admission to a coronary care unit with CAD or previous coronary stenting or prior coronary angiography diagnosing obstructive CAD.

The control group included 90 healthy volunteers from the SCUH blood bank of both genders, aged over 18 years old, with no current treatments or complaints from any CVD or chronic disease. All control subjects aged over 45 years old were subjected to echocardiography to assess their cardiac condition and to ensure they were free from any CAD signs.

2.2 Selection of Circulating Vasculature-Related Transcripts Using Bioinformatics Tools

The transcripts under study were selected using online bioinformatics tools such as HMDD (http://www.cuilab.cn/) [31], miR2Disease (http://www.mir2disease.org/) [32], LncRNAdisease (http://www.rnanut.net/lncrnadisease/) [33], and LncBase v3 (https://diana.e-ce.uth.gr/lncbasev3) [34]. In addition, current literature results for the most common ncRNAs and their target genes involved in vascular heart disease were considered [35,36,37].

2.3 Plasma Samples Collection

A vacutainer tube containing ethylene diamine tetraacetic acid (EDTA) anticoagulant was used to collect 3 mL of blood from all study participants. Plasma was obtained via centrifugation for 15 minutes at 3500 rpm, and qiazol was added to plasma with a ratio of 5:1 and stored at − 80 °C until transcript analysis. To ensure there was no sample hemolysis, the blood sample was slowly withdrawn and slowly shacked (7–8 times) inside the EDTA vacutainer. Fresh samples were centrifuged within 1 hour after withdrawal. In case of any visible hemolysis, the sample was assessed using spectrophotometer and samples with a significant degree of hemolysis were discarded.

2.4 Total RNA Including miRNA Extraction and Quality Analysis

Qiagen miRNeasy Mini kit (cat no 217004, QIAGEN, Hilden, Germany) was used for total RNA extraction using the manufacturer protocol. NanoDrop 2000 1C spectrophotometer (NanoDrop Tech., Inc. Wilmington, DE, USA) was used to determine RNA concentration and purity.

2.5 Circulating Vasculature-Related Transcripts Expression Analysis Using Quantitative Real-Time PCR Assay

The expression levels of the seven selected vasculature circulating transcripts were assessed in the plasma of all study participants. RT-qPCR was done via two steps:

  1. (a)

    Reverse transcription (RT), where miScript II RT Kit (Qiagen, Catalog no. 218161) was used to generate complementary DNA (cDNA) from total RNA, in which mRNAs, miRNAs, and lncRNAs were polyadenylated by poly(A) polymerase and converted into cDNA. RT was carried out at 37 °C for 1 hour, followed by inactivation of the reaction by briefly incubating at 95 °C using a Veriti™ 96-Well Thermal Cycler (Applied Biosystems, USA)

  2. (b)

    qPCR, where the cDNA premix was used as a template for relative quantification of the seven transcripts under study (ANRIL, miR-126-5p, CDK4, CDK6, E-cadherin, TNF-α, and TGF-β) using miScript SYBR Green PCR Kit (Qiagen, cat. no 218076) with a universal reverse primer and a specific forward primer sequence for miRNAs, while other genes had their specific forward and reverse primers as presented in Table 1. For ANRIL, Taqman ncRNA assay CDKN2B-AS1 (Thermofisher, USA) was used following the manufacturer's instructions. Minimum information for publication of quantitative RT-PCR experiments (MIQE) guidelines were followed throughout the experiment. The reactions were run in triplicate with ‘No-RT’, and ‘No-template’ controls present in each run. The qPCR cycling conditions started at 95 °C for 5 minutes, followed by 40 cycles of denaturation, annealing, and elongation at 95 °C (15 s), 55 °C (1 min), and 72 °C (1 min), respectively. Endogenous control β-actin was used to normalize ANRIL and mRNAs, while SNORD68 was used to normalize miRNA-126-5p to enable data analysis using the ΔΔCT method of relative quantification.

Table 1 Primer sequences of the transcripts under study

2.6 Circulating Vasculature-Related Transcripts Predictive Significance Testing

The discriminative power of the vasculature circulating transcripts was analyzed individually and in groups using receiver operating characteristic (ROC) curves to evaluate the diagnostic power of the transcripts under study as biomarkers in CAD. Statistical significance was considered for a p value of < 0.05.

2.7 Scoring of Disease Prioritization for Circulating Vasculature-Related Transcripts Under Study in Relation to CAD

For disease prioritization, the gene set option in GeneAnalytics software [38] was used to perform the disease relationship analysis using the information available in MalaCards database, the human disease database. The following parameters determined the disease-matching score: the number of transcripts studied that matched CAD normalized by the total number of transcripts associated explicitly with CAD, the quality and type of the transcript–CAD relations, and each gene in CAD has a score derived from its relationship to the disease, as annotated in MalaCards sources. We also chose the anatomical disease categorization option and selected CVDs; then, we presented the score of the top 12 CVDs associated with the transcripts under study. As for gene prioritization concerning CAD, VarElect software [39] was used for the prioritization of the genes for the transcripts under study based on their contribution to CAD. The VarElect scoring system depends on giving each gene a relevance score depending (in part) on its current weight with CAD based on previous research. The weight with CAD was determined by the frequency with which the gene appears in association with CAD (term frequency) compared with all genes under study (inverse document frequency).

2.8 Enrichment Analysis of Circulating Vasculature-Related Transcripts in CAD

GeneAnalytics software was used to conduct the functional enrichment analysis to annotate the seven vasculature circulating transcripts into different pathways and gene ontology (GO) terms (biological process, cellular component, and molecular function). The results were prioritized by relevance to the analyzed transcripts under study. Considering pathway enrichment analysis, GeneAnalytics software was also used to analyze the seven transcripts under study—pathway relationships using PathCards available information from the biological pathways database. PathCards applies a gene-content algorithm enabling the unification of thousands of pathways from multiple pathway sources available in GeneCards and clusters them into SuperPaths. Considering GO analysis, novel and supporting information were retrieved regarding the functional roles of the genes under study in biological processes such as development, differentiation, diseases, and pathways, as well as in molecular functions such as binding or catalysis. The presented score in each item is a transformation of the resulting p value, corrected for multiple comparisons using the false discovery rate (FDR) method, with higher scores indicating a better match.

2.9 Vasculature-Related miRNA–mRNA Regulatory Network Construction

The targets of the vasculature circulating transcripts in CAD were predicted using miRTargetLink 2.0 (Version 2.0, https://ccb-compute.cs.uni-saarland.de/) [40]. Statistically homogenous and significant differentially expressed transcript targets were kept, and different filters were used to retrieve strong validated targets only.

2.10 Statistical Analysis

SPSS software version 23.0 and Excel 2016 were used for data analysis. G*Power 3.1.9.2 (http://www.gpower.hhu.de/A) [41] was used for sample size calculation. Gene expression fold change of the transcripts was estimated using the LIVAC method (= 2−ΔΔCq) [42]. Quantitative data were expressed as median and quartiles or means ± standard deviation, while qualitative data were expressed as numbers and percentages. Student-t, Two-sided Chi-square, ANOVA, Kruskal-Wallis, and Mann-Whitney U tests were used as appropriate. Correlation analysis by Spearman's rank test was carried out. Different statistical methods were used to decrease the confounding effect as follows:

  1. 1.

    Stratification for producing different groups within which some of the confounders do not vary. Then, the exposure–outcome association was evaluated within each stratum of the confounder. So, within each stratum, the confounder could not confound because it does not vary across the exposure–outcome.

  2. 2.

    ROC curve of the circulating transcripts and other study variables.

  3. 3.

    Spearman’s correlation analysis for the correlation between CAD and various variables in the study.

3 Results

3.1 Main Features and Risk Factors Among the Study Population

The main feature data from all study participants are presented in Table 2. The average age of participants was 49.4 ± 6.1 and 53.9 ± 9.7 years among controls and CAD groups, respectively. The CAD group showed significantly higher ages (p < 0.001) than the control group. On the contrary, there was no significant difference among different ages. The male gender was more represented among the study participants, with 54 (60.0%) and 65 (72.2%) males in the control and the CAD group, respectively, with a non-statistical difference (p > 0.05) among the two groups. Regarding CAD risk factors, smokers were significantly (p = 0.011) higher in the CAD group with a total of 51 (56.7%) smokers compared with the control group, which included 33 (36.7%) smokers. Family history was found in 57 (63.3%) patients compared with 35 (38.9%) participants in the control group, with a significant difference between the two groups. Concerning dyslipidemia, CAD patients had a significantly higher incidence, representing 80.0% of the CAD population compared with 7.8% in the control group. The average (mean ± SD) BMI of the CAD group was 30.4 ± 6.2, with no statistical difference (p = 0.158) compared with the control group (29.3 ± 2.7). At the same time, there was a statistically significant difference among different obesity grades, as shown in Table 2.

Table 2 Main features and risk factors among the study participants

3.2 Comorbidities and Cardiac Findings in CAD Patients

Table 3 shows the comorbidities and cardiac findings among the CAD patients in the study. Clinical examination revealed an average ± SD for body surface area (BSA) of 1.83 ± 0.17, systolic blood pressure of 126.78 ± 16.46, and diastolic blood pressure of 80.89 ± 14.8. At the same time, echocardiography revealed an average ± SD for LVEF (46.02 ± 12.99) and WMSI (1.5 ± 0.42) with statistically significant results. Concerning comorbidities, there was no statistical significance between CAD and either diabetes, hypertension, or ischemic heart disease among our study patients. Regarding LVEF grades, the chi-squared test revealed a statistically significant difference among the different grades. The diastolic function grades are represented in grades normal 50 (26.3%), mild 30 (15.8%), moderate 4 (2.1%), and severe 6 (3.2%), with a highly significant difference between them.

Table 3 Comorbidities and cardiac findings in CAD patients

3.3 Relative Expression Analysis of Circulating Vasculature-Related Transcripts

The relative expression patterns of ANRIL, miR-126-5p, CDK4, CDK6, E-cadherin, TNF-α, and TGF-β were assessed using qRT‐PCR and are shown in Fig. 1 as a heatmap. The transcripts’ relative expression is presented as median (Q1 and Q3) in Fig. 2. The scores were as follows: ANRIL 3.6 (1.9–5.9), miR-126 4.2 (1.7–8.2), CDK4 7.9 (4.33–10.9), CDK6 1.8 (− 0.6 to 3.1), E-cadherin 7.7 (4.1–11.4), TNF-α 2.1 (0.1–4.4), and TGF-β − 0.1 (− 3.1 to 2.2) (Fig. 2).

Fig. 1
figure 1

A heatmap illustrating the relative gene expression of the circulating vasculature-related transcripts under study. Heatmap scale from red to blue where red indicates higher expression (log2FC), and blue reflects low expression

Fig. 2
figure 2

Relative gene expression of the seven circulating vasculature-related transcripts under study in CAD patients. Data are represented as median (Q1 and Q3) of log2FC in the form of whiskers and boxes. Significant p-values are red. CAD coronary artery disease

3.4 Circulating Vasculature-Related Transcripts Predictive Significance by ROC Analysis

ROC, including AUC and confidence interval levels, are presented in Table 4. The results indicated the statistical significance of all the circulating transcripts under study. CDK4 showed the highest AUC equivalent to 0.91, followed by E-cadherin (0.90), miR-126-5p (0.83), ANRIL (0.82), TNF-α (0.63), TGF-β (0.62), and CDK6 (0.59) in descending order.

Table 4 ROC curve analysis for circulating vasculature-related transcripts understudy

The combined predictive ROC curve analysis for the transcripts under study is presented in Fig. 3, showing high predictive significant results for each group. Groups were combined based on the different roles of these transcripts on vasculature pathogenesis. In group A, CDK4 and CDK6 are implicated in vascular smooth muscle proliferation, while E-cadherin is responsible for the proliferation and integrity of endothelial cells. In group B, ANRIL is implicated in vascular smooth muscle proliferation, while miR-126-5p is responsible for the proliferation and integrity of endothelial cells. In group C, TNF-α and TGF-β are implicated in the vascular inflammatory process with opposite atherogenic and athero-protective actions.

Fig. 3
figure 3

Combined predictive significance by ROC curve analysis for different circulating vasculature transcripts under study. A Combined ROC curve analysis for E-cadherin, CDK4, and CDK6. B Combined ROC curve analysis for ANRIL and miR-126-5p. C Combined ROC curve analysis for TNF-α and TGF-β. ROC receiver operating characteristic

3.5 Correlation Analysis Among the Circulating Vasculature-Related Transcripts’ Expression Levels and CAD Patients’ Clinical Features

The seven circulating vasculature-related transcripts showed various distributions among all CAD patients, as shown in Table 5. Although none of the transcripts under study is a potential target gene for miR-126-5p or ANRIL, a strong association was detected between most of the transcripts in CAD patients with a significant Spearman’s correlation coefficient with a two-tailed significance of p < 0.001. The most significant correlation was between ANRIL and CDK4 (r = 0.74), ANRIL and CDK6 (r = 0.70), ANRIL and E-cadherin (r = 0.68), miR-126-5p and CDK4 (r = 0.55), E-cadherin and CDK4 (r = 0.64), and TNF-α and CDK6 (r = 0.54). This could suggest new predicted targets for both miR-126-5p and ANRIL that need further validation in future studies.

Table 5 Correlation matrix among the seven circulating vasculature-related transcripts in this study

Considering the relation between the seven circulating vasculature-related transcripts under study and the clinical findings shown in Table 6, all the studied genes were statistically significant with CAD. ANRIL, CDK4, CDK6, and E-cadherin showed a positive statistically significant relationship with age. ANRIL, CDK4, CDK6, E-cadherin, and TNF-α showed a positive, statistically significant relationship with gender. All the genes studied showed positive, statistically significant results with smoking except TGF-β. MiR-126-5p, CDK4, and E-cadherin showed positive, statistically significant results with family history. Dyslipidemia correlated significantly with all genes under study except TGF-β. Finally, ANRIL and CDK4 showed a significant correlation concerning hypertensive patients.

Table 6 Correlation among the circulating vasculature-related transcripts under study and clinical data

As for correlations of the clinicopathological features among CAD patients shown in Table 7, age correlated significantly with gender, smoking, family history, diabetes, hypertension, and ischemic heart disease. Gender correlated significantly with age, smoking, family history, hypertension, and BSA. Family history correlated significantly with age, gender, ischemic heart disease, and WMSI. Dyslipidemia correlated significantly with BMI, obesity, obesity grade, and LVEF. BMI, obesity, and obesity grade correlated significantly with each other, dyslipidemia, DM, ischemic heart disease, BSA, and WMSI. Both LVEF and WMSI correlated significantly with each other and with ischemic heart disease.

Table 7 Correlation among the different clinicopathological features understudy in CAD

3.6 Disease and Prioritization Scores for the Circulating Vasculature-Related Transcripts Studied in Relation to CAD

Anatomical disease categorization revealed a total of 1215 diseases related to the seven circulating transcripts under study, with 120 diseases related to the cardiovascular system. The top scores are shown in Fig. 4A and are mostly related to CAD pathogenesis.

Fig. 4
figure 4

Disease and prioritization scores for the transcripts under study in relation to CAD. A Disease score for the transcripts among cardiovascular diseases. Scores were calculated using GeneAnalytics tool (https://geneanalytics.genecards.org). B Prioritization score of the gene variants of the seven circulating vasculature transcripts in relation to CAD. C Average disease-causing likelihood score for the seven circulating vasculature transcripts in relation to CAD. The color scale presented from red to blue where blue reflects lower scores, and red indicates higher ones. Scores were calculated using VarElect tool (https://ve.genecards.org/). CAD coronary artery disease, CVD cardiovascular disease, Familial AA familial aortic aneurysm, Lipoprotein QTL lipoprotein quantitative trait locus

Concerning the strength of the connection between each transcript under study and CAD, the results presented in Fig. 4B show the prioritization score of the gene variants among the seven circulating transcripts. This list is based on the association between CAD-shared pathways and these biomarkers, paralog relations, interaction networks, domain sharing, and joint publications. Furthermore, analyzing the average likelihood for these seven biomarkers to cause disease (shown in Fig. 4C) revealed that CDK4 and CDK6 had the highest scores, equivalent to 84.53% and 78.44%, respectively. E-cadherin scored 70.62%, followed by TNF-α with a score of 70.41%, and finally TGF-β with 42.49%. The score for ANRIL and miR-126-5p was not detected by the tool used. This score reflects the principle that a variant in the original genes of those transcripts with high mutation intolerance is more likely to cause CAD.

3.7 Enrichment Analysis of the Circulating Vasculature-Related Transcripts in CAD

All the functions and pathways targeted by the seven circulating vasculature-related transcripts under study in CAD were identified via enrichment analysis based on annotated gene targets in GO.

Our analysis found the biological processes previously known to be involved in CAD distinctly enriched. The top 10 enriched pathways were associated with response to organic substance regulation of cell population proliferation, positive regulation of fibroblast proliferation, positive regulation of mononuclear cell migration, regulation of gene expression, negative regulation of production of miRNAs involved in gene silencing by miRNA, negative regulation of cell–cell adhesion, positive regulation of protein kinase B signaling, negative regulation of myoblast differentiation, and positive regulation of ERK1 and ERK2 cascade as represented in Fig. 5A according to their matching scores.

Fig. 5
figure 5

Function and pathway enrichment analysis for the seven circulating vasculature-related transcripts under study according to matching score. A Top 10 biological processes. B Top 10 molecular functions. C Top 10 cellular components. D Top 10 pathways. The color scale presented from red to blue where blue reflects higher scores, and red indicates lower ones. All scores were calculated using GeneAnalytics tool (https://geneanalytics.genecards.org)

Considering molecular function enrichment analysis shown in Fig. 5B, the top ten functions were RNA polymerase II CTD heptapeptide repeat kinase activity, cyclin-dependent protein serine/threonine kinase activity, cyclin binding, FBXO family protein binding, type III transforming growth factor (TGF)-β receptor binding, cytokine activity, type II TGF-β receptor binding, type I TGF-β receptor binding, γ-catenin binding, and GTPase activating protein binding according to their matching scores.

Moreover, we revealed that the top ten cellular components for the seven circulating vasculature-related genes under study were cyclin-dependent protein kinase holoenzyme complex, mediator complex, cyclin D2-CDK4 complex, Flotillin complex, apical junction complex, phagocytic cup, catenin complex, cell surface, cytoplasmic side of the plasma membrane, and extracellular space according to their matching scores shown in Fig. 5C.

Finally, enrichment of specific pathways for the seven circulating vasculature-related transcripts under study revealed pathways relevant to mitotic G1-G1/S phases, developmental lineage pathway of dendritic cells, cyclins and cell cycle regulation, Sertoli-Sertoli cell junction dynamics, cell cycle role of SCF complex in cell cycle regulation, EV release from cardiac cells and their functional effects, T-cell antigen receptor (TCR) signaling pathway, RXR and RAR heterodimerization with other nuclear receptors, and ERK signaling according to their matching scores (Fig. 5D).

3.8 Regulatory Network Construction for the Vasculature-Related miRNA–mRNA Targets

Network analysis revealed a strong relationship between five of the circulating vasculature-related transcripts under study and their target miRNAs and miR-126-5p, but not for ANRIL. The miRNA-target gene network comprised the six circulating vasculature-related transcripts under study where it initially revealed 407 targets (Supplementary file 1, see electronic supplementary material) and then filtered to select only strongly validated targets that revealed a final set of 82 targets (11 mRNA and 71 miRNAs) using miRTargetLink 2.0 (Fig. 6). Also, we constructed a miRNA-target gene network comprising the four top biomarkers according to ROC curve analysis, which were CDK4, E-cadherin, miR126-5p, and ANRIL in descending order (Fig. 6). The network revealed 128 targets that were then filtered to select only strongly validated targets, with a final set of 50 targets (11 mRNA and 39 miRNA).

Fig. 6
figure 6

Vasculature-related miRNAs-target gene network analysis for the circulating vasculature-related transcripts under study in CAD. The targets used were only the strongly validated miRNAs in the literature using ‘miRTargetLink 2.0 (https://ccb-compute.cs.uni-saarland.de/mirtargetlink2)’. A The seven circulating vasculature transcripts’ miRNA-target gene network. B The top four circulating vasculature transcripts according to ROC curve analysis miRNA-target gene network. CAD coronary artery disease, ROC receiver operating characteristic

4 Discussion

Blood transcriptome analysis via investigating different types of RNA transcripts in CAD patients opens new frontiers for the non-invasive diagnosis of CAD. These RNA transcripts are seen as potential biomarkers for early detection of CAD in high-risk individuals or for improving the diagnosis of CAD in areas where invasive or advanced imaging methods are unavailable. They can also potentially serve as therapeutic targets and/or markers to monitor the suitability and efficacy of new or existing therapies [43].

Previously, lncRNA-ANRIL, miR-126-5p, CDK6, TNF-α, and TGF-β expression in CAD patients were studied either separately or in a maximum combination of two [44,45,46,47,48,49]. In the present study, the preceding transcripts, in addition to CDK4 and E-cadherin, are examined, aiming to elucidate their role as potential biomarkers for CAD.

The study results showed upregulation of ANRIL, miR-126-5p, CDK4, CDK6, E-cadherin, and TNF-α expression levels, while TGF-β expression was downregulated in the plasma of CAD patients. Upregulated expression of ANRIL, miR-126-5p, CDK6, TNF-α, and downregulation of TGF-β in CAD is in line with previous studies [44,45,46,47,48]. Nevertheless, downregulation of ANRIL in CAD patients was also documented in other studies [49,50,51]. This discrepancy could be attributed to different Chr9p21 locus genotypes as the Chr9p21 rs10757278/rs2383206-G risk allele associated with increased ANRIL expression is prevalent in Egyptian CAD patients [52, 53]. In fact, and to our knowledge, this study addressed for the first time the upregulated expression level of CDK4 and E-cadherin in CAD patients. All the transcripts showed a significant correlation with CAD. CDK4, E-cadherin, miR-126-5p, and ANRIL showed the highest predictive values for CAD, while the combined predictive values of TNF-αTGF-β and CDK4CDK6–E-cadherin showed 100% discrimination between CAD and controls.

Previous research stated that either the ncRNAs or the gene product of the transcripts under study participate in the pathogenesis of atherosclerosis, plaque formation, and stability [47, 54,55,56,57,58,59], mainly by inducing vascular endothelial cells (VEC) injury, and promotion of VSMC proliferation, migration, as well as the transformation from the contractile to the synthetic phenotype [60].

Increasing ANRIL expression in VECs participates in inflammation and accelerates endothelial injury via the TNF-α-NF-κB-ANRIL/YY1-IL6/8 signaling pathway [54]. ANRIL regulated the expression of VEGF, an important angiogenic factor reported to induce the migration and proliferation of endothelial cells, enhanced vascular permeability, and accelerated endothelial injury [61].

ANRIL facilitates the adhesion and proliferation of mononuclear cells to injured endothelial cells, thus promoting its infiltration into the endothelial layer and its differentiation to macrophages, indorsing extracellular matrix (ECM) degradation, and ultimately leading to superficial erosion or rupture of atheromatic plaques [52].

Moreover, ANRIL promotes VSMC proliferation, migration, and apoptosis via regulating CDK4 and CDK6 kinase activity. It also inhibits CDK4/6 inhibitors p16INK4a and p15INK4b from the CDKN2A/CDKN2b locus by recruiting PRC1/PRC2/EZH2 protein complex, and activation of DNA methyltransferase1 (DNMT1), resulting in hypermethylation and inactivation of CDKN2A/CDKN2b genes [62,63,64].

CDK4 and CDK6 are master regulators of the cell cycle. Their activity is restricted to the G1-S phase, controlled by the regulatory subunits D-type cyclins and CDK inhibitor p16INK4a [65, 66].

Inhibition of p16ink4a and p15INK4b leads to the assembly of the functioning D1-CDK4/6 complexes, which phosphorylates and inactivates the cell cycle suppressor proteins Rb (Retinoblastoma protein) family (p105Rb, p107, p130Rb2) [65]. The phosphorylated RB releases the transcription factor E2F1 from the Rb/E2F complexes, activating the transcription of genes whose products are required for DNA synthesis and S-phase progression, thereby allowing VSMC proliferation [67, 68].

MiR-126-5p, a flow-responsive miRNA, has a variable role in coronary atherogenesis depending on the stage of the disease. Upregulation of this miRNA may be athero-protective in the early phases of plaque development by promoting endothelial cell proliferation via targeting the negative regulator of EC proliferation, Dlk1 (delta-like 1 homolog) [69]. The high miR-126-5p expression could serve as a promoter of plaque destabilization later [45]. MiR-126-5p secreted by the EC has a paracrine effect on VSMC via modulating its gene expression and function toward atherogenic phenotype [70].

E-cadherin, a cell surface adhesion molecule, is implicated in mechanical adhesion, regulation of cell division, migration, differentiation, and apoptosis of VSMCs [71]. Increased expression of E-cadherin is coincident with VSMC proliferation. E-cadherin regulates cell proliferation by adjusting the expression of the cell cycle activator cyclin D1, mediated by the E-cadherin/catenin/TCF signaling pathway [57, 72].

TNF-α, a pro-atherosclerotic cytokine produced by several cells such as macrophages, T-lymphocytes, and dendritic cells, is known to be activated in atherogenesis. TNF-α boosts endothelial dysfunction, promoting VSMC proliferation and switching to a synthetic phenotype, which increases the production of ECM, and nucleic acid-enriched exosomes [73, 74].

Conversely, TGF-β inhibits migration and proliferation of VSMC by targeting signal transducer and activator of transcription 3 (STAT3) and NF-κB pathways [75].

TGF-β stimulates endothelial migration and proliferation at low concentrations but inhibits both at higher concentrations [76] by regulating the expression of cell cycle inhibitors p16INK4A and p15INK4B. These proteins are expressed at low levels in proliferating cells and are rapidly induced by TGF-β [77].

In the present study, upregulation and positive correlation between ANRIL and TNF-α agree with previously described research findings [49].

However, the expression pattern and correlations between the studied transcripts could be attributed to dyslipidemia and smoking, as nearly all RNAs, except for CDK6 and TGF-β, show significant correlations with both, creating a link between these risk factors and CAD via atherosclerosis. Smoking and chronic dyslipidemia led to enhanced oxidative stress, resulting in increased ox-LDL [70, 78]. Ox-LDL significantly promotes ANRIL expression in human aortic smooth muscle cells [44]. A high level of LDL cholesterol significantly increased miR-126-5p in patients with risk factors for CAD [79].

Ox-LDL stimulated monocyte/macrophage release of TNF-α through the NF-κB signaling pathway [80]. TNF-α induces the expression of ANRIL, CDK4, and CDK6 through the TNF-α/NF-kb signaling pathway [81, 82].

In addition, ox-LDL downregulates TGF-β expression by enhancing TGF‐β-producing-Treg (Regulatory T) cell apoptosis through the Fas/FasL pathway [83].

The TGF-β/SMADs pathway regulates the expression of E-cadherin through induction of the transcriptional repressors SNAIL, ZEB, and TWIST. These repressors bind to the promoter region of the E-cadherin gene and suppress its transcription [84].

Hence, it could be concluded that ox-LDL indirectly enhances the expression of E-cadherin by downregulating TGF-β.

Regarding other risk factors, ANRIL, CDK4, CDK6, and E-cadherin showed a positive relationship with age. ANRIL correlation with age is in line with the previous study [85]. Furthermore, ANRIL is positively correlated with CDK4, CDK6, and E-cadherin in agreement with a previous study [86], explaining the correlation of CDK4, CDK6, and E-cadherin with age.

On the other hand, a positive correlation between ANRIL and TGF-β could be related to a positive correlation between TNF-α and TGF-β, which could be attributed to the balanced expression of pro-and anti-inflammatory cytokines, characteristic of the chronic, low-degree inflammation in an atherosclerotic arterial wall [87].

According to Cesari et al., circulating levels of inflammatory markers could predict future cardiovascular events, both in the general population and in adults with a known CVD. The present study combined predictive ROC curve analysis of TNF-α and TGF-β, showing 100% prediction of CAD [88]. This result is supported by the analysis of connection strength between each transcript and CAD. TNF-α and TGF-β show the highest percentage of gene variants implicated in CAD, probably due to their roles in mediating the inflammatory processes related to atherosclerosis and CAD pathogenesis [87].

Anatomical disease categorization analysis showed that the transcripts studied are related to CAD, mainly myocardial infarction, aortic aneurysm, and essential hypertension, whereas CDK4, CDK6, and E-cadherin had the highest scores related to the likelihood of causing disease [89, 90], thus explaining the high combined predictive value of these transcripts for CAD.

Interestingly, the examined transcripts showed high categorization to lipoprotein quantitative trait locus. Further studies are needed to elucidate the association between them.

Function and pathway enrichment analysis of the circulating vasculature transcripts in the present study were associated with response to an organic substance, regulation of different cell population proliferation, regulation of gene expression, and mononuclear cell migration, consistent with many previous results [19, 20, 91,92,93,94,95].

Considering the analysis of cellular components and molecular function of the studied transcripts, it indicated their confinement to components and molecules mediating and executing cell proliferation, namely CDK4/6 cyclin-dependent protein kinase holoenzyme complex mediating RNA polymerase II CTD heptapeptide repeat kinase activity, mediator complex which interacts with and transmits signals from transcription factors to RNA polymerase II, cyclin binding and formation of cyclin D2-CDK4 complex arbitrating serine/threonine kinase activity, which phosphorylates and inactivates RBp, which is also supported by previous results [68, 96,97,98].

Finally, enrichment of specific pathways for the seven circulating vasculature-related genes revealed pathways relevant to mitotic G1-G1/S phases, cyclins and cell cycle regulation, and dendritic cell developmental lineage pathway TNF-α and TGF-β that were also previously reported [65, 99].

In the present study, we constructed a vasculature-related miRNA‐mRNA regulatory network which identified a strong relationship between miR-126-5p, CDK4, and E-cadherin, with 50 targets (11 mRNA and 39 miRNAs). Further research is needed to validate the role of the identified transcript targets as CAD biomarkers.

The primary limitation of the current study is that it was a mono-center study; further studies with a larger sample size from multicentric sources are advised to confirm the study results on a broader scale, however this study is considered a preliminary proof-of-concept study and further larger studies will be done. The scarcity of research studies investigating CDK4 and E-cadherin as biomarkers of CAD established an opportunity to highlight this literature gap and to emphasize the need for further research in this area. A further limitation is that not all the control subjects were assessed using echocardiography. Only subjects aged over 45 years were assessed to ensure they are free from any CAD signs. Furthermore, as the CAD patients in this study were under treatment, there is a possibility that these drugs may have interfered with the expression of the evaluated transcripts. Finally, it is vital that functional in vitro and in vivo studies are performed to validate the study results and emphasize these vasculature-related transcripts as CAD biomarkers.