Introduction

Myxomatous mitral valve disease (MMVD) is the most common heart disease in dogs. It is especially common in small size dog breeds, but the Cavalier King Charles Spaniel (CKCS) stands out as a breed with an extraordinarily high prevalence of this disease (Darke 1987; Häggström et al. 1992).

MMVD has been intensively studied over the last decades, and several excellent reviews have been published (e.g., Aupperle and Disatian 2012; Borgarelli and Buchanan 2012; Borgarelli and Haggstrom 2010; Burchell and Schoeman 2014; Fox 2012; O’Brien et al. 2021). The disease is characterized by morphological changes in the mitral valve leaflets, chordae tendineae, and chordal-papillary muscle. The changes include elongation and thickening of the valves in combination with derangement of connective tissue, reduction in connective tissue density, damage to the basement membrane and endothelium, damage to the valve collagen matrix and accumulation of immature collagens, laminin, proteoglycans, and glycosaminoglycans in the extracellular matrix (ECM) of the mitral valves (Aupperle et al. 2009a; Buchanan 1977; Corcoran et al. 2004; Falk et al. 2010; Hadian et al. 2010; Han et al. 2010, 2013a, 2013b). The morphological changes result in inadequate biomechanical properties of the heart valves, insufficient closure, and subsequent regurgitation of blood from the left ventricle to the left atrium (Sargent et al. 2015). In severe cases, consequences of MMVD include congestive heart failure (CHF) resulting in reduced expected lifespan and reduced life quality for both dog (Boswood et al. 2018; Haggstrom et al. 2013) and owner (Clements et al. 2003). Lifelong treatment is usually required (e.g., Keene et al. 2019). Alternatively, heart valve surgery is a possibility (e.g., Matsuura et al. 2022).

It has been demonstrated that genetics play an important role in MMVD (Olsen et al. 1999; Stern et al. 2015; Swenson et al. 1996), and selective breeding can reduce MMVD prevalence in CKCS (Birkegard et al. 2015). Numerous studies have been performed to identify and elucidate the genetic and molecular mechanisms underlying MMVD. This includes genome-wide association studies, whole genome-sequencing studies, proteome analysis, micro-RNA studies, gene expression/ transcriptome studies, and other genomic studies. While previous gene expression studies have included several dog breeds of different age and multiple stages of MMVD, the present study focuses exclusively on CKCS and, more specifically, on two large age-matched groups of CKCS where one group comprises dogs without CHF and another group of dogs have CHF caused by MMVD. Hence, the objective of the present study is to compare gene expression in mitral valves from ~ 10-year-old CKCS dogs with MMVD, which has progressed into CHF, to gene expression in other dogs of a similar age of the same breed.

Materials and methods

Animals and sample collection

Privately owned CKCS with no MMVD or different stages of MMVD were recruited at time for elective euthanasia at Department of Veterinary and Animal Sciences, University of Copenhagen. The dogs were collected from August 2008 to August 2020. The study was approved by the Danish Animal Experiments Inspectorate (licenses no. 2006/561-1145, 2011/561-71 and 2016-15-0201-01074). Dogs with cardiac disease other than MMVD were excluded. Some of the dogs have previously been included in other studies with other research aims (contact the corresponding author for a complete list of studies). Prior to euthanasia and upon written-owner consent, all dogs underwent a clinical examination using a standardized protocol including owner interview, physical examination, auscultation, and echocardiography as previously described (Reimann et al. 2021). Mitral regurgitation murmur intensity was graded on a scale of 1–6 (Gompf 1988). The diagnosis of MMVD was based on auscultation, echocardiography, and the presence of clinical signs of CHF due to MMVD, according to the American College of Veterinary Internal Medicine (ACVIM) MMVD classification guidelines (Keene et al. 2019). Dogs examined before 2019 were retrospectively classified on the basis of the measurements obtained at the time of their examination. Echocardiographic assessment and ACVIM staging were performed as described elsewhere (Reimann et al. 2021). All dogs, included in the study, were approximately 10 years old (Table 1, Fig. 1). The dogs were divided in two groups based on symptoms of CHF. One group contained one dog with no identifiable structural disorder of the heart (ACVIM stage A) plus dogs with MMVD at ACVIM stages B1 and B2. In this study, we will collectively term this group of dogs ‘10y-noCHF.’ The other group of dogs, which we in this study will term ‘10y-MMVD-CHF,’ were diagnosed with CHF due to MMVD (ACVIM stage C). CHF diagnosis was based on a history of MMVD, previous or current clinical signs of CHF (e.g., cough, dyspnea, tachypnea, nocturnal restlessness, and exercise intolerance), echocardiographic changes compatible with severe MMVD, and response to diuretic treatment. Dogs were euthanized using pentobarbitale (200–400 mg/kg or until effect, IV) after sedation and pain relief with butorphanol (0.1 mg/kg, IM) and dexmedetomidine (0.02 mg/kg, IM). The heart was collected, and the mitral valves were excised within 60 min after euthanasia and stored in RNAlater (Merck KGaA, Darmstadt, Germany).

Table 1 Descriptive statistics for dogs classified as 10y-noCHF (n = 19) and 10y-MMVD-CHF (n = 15)

Three dogs (two B2 and one C) underwent the full clinical examinations, auscultation, and echocardiography 4 months before euthanasia. None of the B2 stage dogs had developed clinical signs that indicated CHF on the day of euthanization. Tissue collection was delayed for one dog (in RNAlater 90 min after euthanasia). This had no observable effect on RNA quality or gene expression results.

RNA isolation and sequencing

RNA was isolated from mitral valves using the Qiagen RNeasy Fibrous Tissue Mini Kit (Qiagen, Hilden, Germany), according to the manufacturer’s instruction including DNase treatment. RNA quantity was measured using a NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific, Waltham, Massachusetts, USA) and RNA quality, expressed as an RNA Integrity Number (RIN), was determined for each sample using a 2100 Bioanalyzer (Agilent Technologies, Santa Clara, California, USA). Subsequently, mRNA was purified from total RNA using poly-T oligo-attached magnetic beads. Fragmentation was carried out using divalent cations under elevated temperature in First-Strand Synthesis Reaction Buffer (5X). First-strand cDNA was synthesized using random hexamer primer and M-MuLV Reverse Transcriptase, and then RNaseH was used to degrade the RNA. Second-strand cDNA synthesis was subsequently performed using DNA polymerase I and dNTP. Remaining overhangs were converted into blunt ends via exonuclease/polymerase activities. After adenylation of 3’ ends of DNA fragments, adaptors with hairpin loop structure were ligated to prepare for hybridization. In order to select cDNA fragments of preferentially 370 ~ 420 bp in length, the library fragments were purified with AMPure XP system (Beckman Coulter, Beverly, USA). After PCR amplification, the PCR product was purified by AMPure XP beads, and the sequencing library was finally obtained. Libraries were constructed at E-GENE, Shenzhen, China and sequenced by an Illumina NovaSeq 6000 platform at Novogene, Beijing, China. Paired-end sequencing reads of 150 bp were generated.

RNA sequence quality was ascertained using FastQC (Andrews 2010), and reads were mapped to the canFam4/UU_Cfam_GSD_1.0 dog genome assembly using the RNA sequence aligner STAR (Dobin et al. 2013). Subsequently, a feature annotation file was compiled based on the canFam4 NCBI-RefSeq annotation. This set of features was augmented by a set of putative genes extracted among canFam4 aligned human XenoRefSeq genes from the UCSC genome browser repository. This step captured the presence of protein-coding genes, which may not have been properly annotated in the canine genome. Mapped RNA sequence reads were concomitantly assigned to genomic features and counted using featureCounts (Liao et al. 2014). Genes with low expression, i.e., features with a sum of less than 100 counts across all samples, were filtered out and excluded from further analysis.

Analyses of gene expression

The BioLayout-3.4 network analysis tool (Theocharidis et al. 2009) was used to cluster samples based on similarity of overall gene expression pattern and visualize the results in a three-dimensional network where nodes represent samples and the distance in space between nodes represents the correlation in gene expression between two samples. When cases and controls are marked up in the network, any clustering according to CHF status will be observable. The tool was used to calculate Pearson correlation coefficients based on gene expression data, and the advanced graph layout Fruchterman–Rheingold algorithm was used to visualize the results. In order to minimize noise and to emphasize biologically meaningful results, very abundant features (i.e., essential household genes), with over 10,000 reads per feature per sample on average, were filtered out before the BioLayout analysis. A Pearson correlation coefficient cut-off of 0.93 was used for clustering. This was the highest cut-off that allowed a combined cluster of all samples.

An analysis of differences in gene expression between dogs classified as 10y-noCHF and dogs classified as 10y-MMVD-CHF was performed using the R package DESeq2 (Love et al. 2014). Analyses were performed on the raw un-normalized feature count data in accordance with the DESeq2 documentation and guidelines. Sex was included as a fixed effect. A volcano plot illustrating the results was created using the R package ggplot2 (Wickham 2016). A Benjamini–Hochberg false discovery rate (FDR) threshold of 0.1 was used to identify significant DE genes.

STRING network analysis

A STRING network and enrichment analysis was performed using Cytoscape (Jensen et al. 2009; Shannon et al. 2003). The analysis was performed using gene names for all DE genes with a FDR < 0.1. Two analyses were performed, one using the STRING default-confidence score of 0.4 and one using a stringent-confidence score of 0.9. Furthermore, up to 10 additional interactors were allowed in both analyses. The default-confidence network provided an overview of the functional networks of which the identified DE genes play a role. The stringent-confidence network enabled identification of the functions for which the dataset provided the strongest evidence. Inclusion of additional interactors allowed STRING to build a network even though specific components (genes) could be missing in the dataset.

Fig. 1
figure 1

Split-violin plots and box plots illustrating descriptive statistics and key diagnostic parameters for Cavalier King Charles Spaniel dogs without CHF (n = 19) and MMVD affected Cavalier King Charles Spaniel dogs with CHF (n = 15). LVIDDN = left ventricular end-diastolic diameter normalized for body weight. LAAo ratio = the ratio of diameters for the left atrium and aorta. Based on the box plots, two dogs without CHF were identified as outliers (gray dots): One with regard to body weight and another dog with regard to LAAo ratio and LVIDDN. Both were kept in the dataset based on an overall evaluation of their status

Results

Thirty-four dogs were included in the study. Nineteen dogs (10 females and 9 males) were classified as 10y-noCHF. These included one dog with ACVIM stage A, 10 dogs with ACVIM stage B1, and eight dogs with ACVIM stage B2. Furthermore, 15 dogs (6 females and 9 males) were diagnosed with CHF due to MMVD indicated by ACVIM stage C. Age and body weight did not differ significantly between the two groups of dogs (Table 1, Fig. 1).

RNA isolation resulted in 34 samples with a RIN number > 8.6 (average 9.23, sd = 0.37) and sequencing resulted in 22.4—35.6 M reads per sample with an average of 27.7 M reads per sample remaining after the quality check.

The result of the BioLayout gene expression correlation analysis is illustrated in a three-dimensional sample-to-sample-weighted network graph as shown in Fig. 2. Dogs in the 10y-noCHF group are marked with blue spheres, and dogs in the 10y-MMVD-CHF group are marked using red spheres. A clear clustering of 10y-MMVD-CHF dogs is observed. This indicates that the advanced stage of MMVD, at which CHF had developed, was associated with a distinct gene expression pattern. However, it is also evident that several dogs in the 10y-noCHF group had expression patterns that overall resembled the pattern observed in MMVD affected dogs with CHF. This reflects the heterogeneous nature of the 10y-noCHF group where several dogs have severe MMVD although they have not developed CHF.

Fig. 2
figure 2

Three-dimensional illustration of the results of the BioLayout gene expression correlation analysis. Blue = Dogs classified as 10y-noCHF; Red = Dogs classified as 10y-MMVD-CHF. The distances between dots corresponds to the correlations in overall gene expression between individuals

A more detailed gene-by-gene analysis of differences in gene expression between dogs classified as 10y-noCHF and dogs classified as 10y-MMVD-CHF was performed using the DESeq2 software package. We used a significance threshold of 0.1 for the p-adj value calculated by DESeq2. This value is a Benjamini–Hochberg FDR, which effectively corrects for multiple testing. The analysis revealed a set of 56 genes, which were significantly differentially expressed between the two groups (Table 2, Fig. 3).

Table 2 Differentially expressed genes and results of the DESeq2 analysis
Fig. 3
figure 3

Volcano plot of DESeq2 analysis results. The log2FoldChange is on the X-axis and the negative logarithm of the false discovery rate (FDR = p-adj) is on the Y-axis. The FDR threshold of 0.1 is indicated by a horizontal line. Vertical lines indicate log2FoldChanges of − 1 and + 1

Compared to dogs in the 10y-noCHF group, five genes were up-regulated with a log2FoldChange > 1 and fifteen were down-regulated with a log2FoldChange <  − 1 in CKCS with MMVD-associated CHF. Among these DE genes, two genes, MYH1 and LOC102724058, were down-regulated with a log2FoldChange <  − 3. The most up-regulated gene was CNTN3 with a log2FoldChange of 2.62.

A STRING network analysis for significant DE genes allowing for ten additional interactors and a stringent-confidence level of 0.9 revealed a functional network incorporating 20 genes (13 DE genes plus 7 additional interactors) (Fig. 4) and a small network of three genes (2 DE genes, ALDH1A2, RDH10 plus 1 additional interactor, CYP26A1). A functional enrichment analysis of the large network revealed a predominance of genes involved in TGFβ signaling and ECM organization, whereas the three genes in the small network all were involved in retinol metabolism. In addition, genes in the large network were involved in focal adhesion, laminin interaction, circulatory system development and more (see Supplementary Table S1). We, furthermore, used the full set of DE genes to build networks using the STRING default-confidence level of 0.4. This resulted in a network comprising 37 genes of which 27 were DE genes, and the remaining 10 genes were additional interactors (Fig. 5). A functional enrichment analysis on the default-confidence network revealed significant enrichment of genes involved in, e.g., ‘regulation of cell population proliferation,’ ‘estrogen-dependent nuclear events downstream to ESR-membrane signaling,’ and ‘fluid shear stress and atherosclerosis’ (Supplementary Table S2).

Fig. 4
figure 4

High confidence large functional network constructed using STRING. Marked with light blue: Genes involved in TGFβ signaling. Marked with dark blue: Genes involved in extracellular matrix organization

Fig. 5
figure 5

Default confidence large functional network constructed using STRING. Genes associated with the KEGG pathway “Fluid shear stress and atherosclerosis” are marked with pink circles

Four of the 13 DE genes in the high-confidence large network had an observed |Log2FoldChange|< − 1. Ten of the 27 DE genes in the default-confidence large network had a |Log2FoldChange|< − 1. It is worth noticing that the genes with the greatest difference in gene expression between CKCS in the 10y-noCHF and the 10y-MMVD-CHF group (|Log2FoldChange|< − 2) were not part of the identified networks.

Discussion

We here report the results of a mitral valve transcriptome analysis performed in a cohort of 34 age-matched CKCS dogs with or without CHF caused by their MMVD.

We identified 56 genes, which were differentially expressed between the two groups. Among these, we identified genes, which previously have been associated with heart disease, and gene networks, which previously have been associated with MMVD development. Furthermore, we identified a small number of genes of importance for sarcomere assembly and function, which we hypothesize, may have a close connection with the genetic causes of MMVD.

It should be noted that we did not strictly compare healthy and affected dogs. All dogs (except one) had MMVD, but for one group, the disease had progressed into CHF whereas for the other group, it had not, even though all dogs had approximately the same age. We classified these two groups as cases and controls, respectively. The majority of the control dogs (10y-noCHF) had a clear heart murmur due to mitral regurgitation, and they were classified as ACVIM stages B1 or B2. The cases (10y-MMVD-CHF), on the other hand, were classified as ACVIM stage C due to their symptoms of CHF. Age of dogs in both groups was around 10 years, with a slightly higher average age in the 10y-noCHF group. It would have been preferable to have a control group of CKCS dogs without any sign of MMVD. Unfortunately, as we aimed to perform this study on age-matched groups of dogs, such dogs were not available. The majority of elderly CKCS dogs have some degree of murmur due to regurgitation (Beardow and Buchanan 1993).

Previous MMVD-focused transcriptome studies have typically been performed on 3–6 MMVD cases and a similar number of controls. Most often there has been an uneven distribution of breeds between cases and controls and a significant difference in age between the two groups (Li et al. 2015; Lu et al. 2015; Markby et al. 2017, 2020a, 2020b; Oyama and Chittur 2006; Zheng et al. 2009). Significant strengths in the present study are that we included more individuals and that the two groups of dogs only differed in one parameter, i.e., their CHF status. Thus, the number of possible confounding factors was smaller compared to previous studies.

Compared to several of the above-mentioned transcriptome studies, the present study found a relatively low number of DE genes. This was probably due to the close similarity between the case and control groups and the absolute minimization of confounding factors in the present study. Furthermore, the larger sample size improved the power to avoid spurious differences in feature counts between the two groups, i.e., it reduced the risk of false-positive results in the DE analysis. The benefit of this is that the observed differences in gene expression can more reliably be assigned to case/control status.

In the functional enrichment analysis, we identified some of the same pathways as have been identified in previous studies. This included the TNFβ-signaling pathway, pathways related to ECM organization, and pathways related to vascular development, endothelium damage, and metallopeptidases (Aupperle et al. 2009b; Li et al. 2015; Lu et al. 2015; Markby et al. 2020a, 2020b; Moesgaard et al. 2014; Oyama and Chittur 2006; Zheng et al. 2009).

Among DE genes detected in the present study, CRIP1 and SERPINE1 were also identified as DE genes by Markby et al. (2020b). In a more detailed comparison of our results with this study, we noticed that many of the DE genes detected in the two studies were members of the same gene families. For example, several ADAMTS variants, collagen genes, laminins, and different myosin genes were detected in both studies.

Some of the DE genes identified in the present study have previously been directly linked to cardiac diseases including valve disease. ADAMTS19 has been associated with progressive non-syndromic heart valve disease (Wünnemann et al. 2020). ALDH1A2 has been linked to human congenital heart disease (Pavan et al. 2009). BMPER regulates cardiomyocyte size and changes in the mitral valve have been detected in BMPER knockout mice (Willis et al. 2013). COL17A1 has been linked to mitral regurgitation and mitral valve prolapse in humans (Uysal et al. 2022). CRIP1 expression has been associated with cardiac hypertrophy and an increased risk of stroke (Zeller et al. 2017). A partial deletion of CYP21A2 has been found in patients with mitral valve prolapse (Chen et al. 2009). MAOA can play a role in the pathogenesis of heart failure (Kaludercic et al. 2011). MYH1 is one of many genes important for the morphogenesis of the heart (England and Loughna 2013; Henderson et al. 2017), and MYHAS encodes an antisense RNA, which regulates expression of myosin heavy chain genes including MYH1 (Haddad et al. 2003).

We primarily found a number of genes down-regulated in dogs with MMVD-induced CHF and only a few genes up-regulated in this group. Many of the down-regulated genes play a central role in mechanisms, which can be considered beneficial for heart maintenance. Hence, a relevant question is, why these genes appear to be down-regulated in dogs that have developed heart failure and what role, if any, their down-regulation plays in the development of CHF?

Among the DE genes identified in the present study, ALDH1A2 and RDH10 formed a small high-confidence functional network together with the additional interactor CYP26A1. The network was closely connected with retinoic acid (RA) biosynthesis, i.e., the oxidization of retinol (vitamin A) to retinaldehyde and the subsequent irreversible conversion of retinaldehyde to RA. RDH10 is the primary enzyme responsible for the first step in this reaction (Farjo et al. 2011), while the later step is catalyzed by retinaldehyde dehydrogenases (RALDHs) among which, the aldehyde dehydrogenase 1A2 (ALDH1A2) is the major form involved in cardiac development (Moss et al. 1998; Niederreither et al. 1997). A number of variations in ALDH1A2 have been described in human patients with congenital heart disease (CHD), but none of them have been confirmed as significant modifiers of the risk of CHD in humans (Pavan et al. 2009). Genes involved in RA biosynthesis have previously been associated with MMVD in dogs. A retinoic acid receptor responder (RARRES3) was associated with MMVD in a microarray gene expression study performed in 10 CKCS dogs with MMVD, Whitney grade ≥ 3 and 6 dogs without signs of MMVD (Lu et al. 2015). Furthermore, RA signaling and the ALDH1A2 gene have been linked to cardiac repair mechanisms in mice (Da Silva et al. 2021). Hence, we suggest that the observed changes in expression of ALDH1A2 and RDH10 may be a compensatory reaction to MMVD rather than a cause of disease. Why some dogs had an appropriate up-regulation of these genes and why some did not, needs to be investigated further.

Overall, it must be expected that compensatory mechanisms to disease, including MMVD, are established and managed in an orchestrated way, which in a transcriptome analysis will appear as networks of functionally related genes expressed in a coordinated manner. Hence, it is not surprising that many of the genes in the identified larger networks are involved in mechanisms such as TGFβ-signaling and ECM organization. Abundance of myxomatous effector proteins has previously been shown to increase in response to increased tensile strain on the heart valves (Lacerda et al. 2012; Orton et al. 2012). TGFβ signaling and disturbances in ECM organization have also been suggested as primary causes of MMVD (reviewed by Tang et al. 2022). However, the present results, i.e., the apparent well-orchestrated expression of genes related to these pathways, encourage us to suggest that these pathways were up-regulated or uphold as a well-regulated compensatory mechanism to MMVD in ~ 10-year-old dogs without CHF.

On the other hand, the genes with the greatest difference in expression between 10y-noCHF and 10y-MMVD-CHF dogs were three genes, MYH1, LOC102724058, and CNTN3, which were not part of the identified gene networks, i.e., they were not part of an orchestrated response to disease.

Of these three genes, LOC102724058 is a human gene that aligns to a 17 kb region of canine chromosome 36 with 91% identity. It was one of the features that were identified by augmenting the canine reference annotation with homologous human genes (see Material & Methods section). The gene is a long non-coding RNA gene with unknown function. The other two genes, CNTN3 and MYH1, encodes contactin 3 and myosin heavy chain 1, respectively. Contactin 3 has among other things been associated with heart rate recovery after exercise (Verweij et al. 2018). MYH1 expression has been found to be significantly up-regulated in mice with cardiomyopathy (Szema et al. 2013). Both of these genes relate to the contractile activity of the heart, the coordination of this activity, and consequently the hemodynamics across the mitral valves. This may explain the observed DE of genes in the default-confidence network related to ‘fluid shear stress and vascular changes,’ which, furthermore, corroborate the previously observed arteriosclerotic changes in dogs with MMVD (Falk et al. 2006). Comparatively, a vast number of different human myopathies including cardiomyopathies are caused by mutations in one of the many cytoskeletal sarcomeric proteins, of which MYH1 is one (reviewed by Henderson et al. 2017). Mitral valve function relies on proper biomechanical performance of several structural components including the left atrial, ventricular, and papillary myocardium (Fox 2012; Richards et al. 2012; Schoen 2008). Hence, a disturbance in the heart’s contractile activity due to suboptimal coordination of sarcomere assembly and function may explain the compensatory responses illustrated by the TGFβ-signaling and ECM organization-related gene networks described here. Mitral regurgitation secondary to myocardial dysfunction is well known in human patients (Asgar et al. 2015). Interestingly, myocardial fibers are present in the proximal third of the mitral valve in dogs (Buchanan 1977; Fox 2012). Thus, it is possible that the changes in valvular gene expression, relevant for myocardial function, affect valvular performance and integrity. The observed degree of differential expression as well as the known function of MYH1 and CNTN3 may suggest that genes encoding heart muscle proteins play a role in MMVD and CHF. However, further investigations are needed to clarify this.