Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data

Thistlethwaite, Lillian R.; Li, Xiqi; Burrage, Lindsay C.; Riehle, Kevin; Hacia, Joseph G.; Braverman, Nancy; Wangler, Michael F.; Miller, Marcus J.; Elsea, Sarah H.; Milosavljevic, Aleksandar

doi:10.1038/s41598-022-10415-5

Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data

Article
Open access
Published: 21 April 2022

Volume 12, article number 6556, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data

Download PDF

Lillian R. Thistlethwaite^1,2^na1,
Xiqi Li²^na1,
Lindsay C. Burrage^2,5,
Kevin Riehle²,
Joseph G. Hacia³,
Nancy Braverman⁴,
Michael F. Wangler^2,5,6,
Marcus J. Miller⁷,
Sarah H. Elsea² &
…
Aleksandar Milosavljevic^1,2

3928 Accesses
17 Citations
7 Altmetric
Explore all metrics

Abstract

Untargeted metabolomics is a global molecular profiling technology that can be used to screen for inborn errors of metabolism (IEMs). Metabolite perturbations are evaluated based on current knowledge of specific metabolic pathway deficiencies, a manual diagnostic process that is qualitative, has limited scalability, and is not equipped to learn from accumulating clinical data. Our purpose was to improve upon manual diagnosis of IEMs in the clinic by developing novel computational methods for analyzing untargeted metabolomics data. We employed CTD, an automated computational diagnostic method that “connects the dots” between metabolite perturbations observed in individual metabolomics profiling data and modules identified in diseasespecific metabolite co-perturbation networks learned from prior profiling data. We also extended CTD to calculate distances between any two individuals (CTDncd) and between an individual and a disease state (CTDdm), to provide additional network-quantified predictors for use in diagnosis. We show that across 539 plasma samples, CTD-based network-quantified measures can reproduce accurate diagnosis of 16 different IEMs, including adenylosuccinase deficiency, argininemia, argininosuccinic aciduria, aromatic l-amino acid decarboxylase deficiency, cerebral creatine deficiency syndrome type 2, citrullinemia, cobalamin biosynthesis defect, GABA-transaminase deficiency, glutaric acidemia type 1, maple syrup urine disease, methylmalonic aciduria, ornithine transcarbamylase deficiency, phenylketonuria, propionic acidemia, rhizomelic chondrodysplasia punctata, and the Zellweger spectrum disorders. Our approach can be used to supplement information from biochemical pathways and has the potential to significantly enhance the interpretation of variants of uncertain significance uncovered by exome sequencing. CTD, CTDdm, and CTDncd can serve as an essential toolset for biological interpretation of untargeted metabolomics data that overcomes limitations associated with manual diagnosis to assist diagnosticians in clinical decision-making. By automating and quantifying the interpretation of perturbation patterns, CTD can improve the speed and confidence by which clinical laboratory directors make diagnostic and treatment decisions, while automatically improving performance with new case data.

metPropagate: network-guided propagation of metabolomic information for prioritization of metabolic disease genes

Article Open access 02 July 2020

Untargeted metabolomic analysis for the clinical screening of inborn errors of metabolism

Article Open access 15 April 2015

Clinical Validation of Targeted and Untargeted Metabolomics Testing for Genetic Disorders: A 3 Year Comparative Study

Article Open access 10 June 2020

Introduction

While the adoption of exome sequencing in the clinic brought major improvements in diagnostic accuracy and speed, it has also brought with it the challenge of interpreting numerous variants of uncertain significance (VUSs). A wide variety of diagnostic approaches have been developed to connect observed genomic alterations with observed clinical phenotypes. For example, initiatives such as the Matchmaker Exchange¹ use clinical descriptions and semantic similarity metrics to calculate similarity between individuals². A “match” between individuals with highly conspicuous clinical phenotypes can significantly improve the power to find causative genetic variants in cases of Mendelian disease. Unfortunately, many clinical phenotypes observed in individuals with inborn errors of metabolism (IEMs) are non-specific (e.g., seizures, intellectual disability, diarrhea, vomiting, and poor feeding), making diagnosis based solely on clinical descriptions difficult.

In contrast to clinical phenotypes, metabolic defects observed in many IEMs cause highly distinct metabolite perturbation patterns in plasma, represented by abnormal accumulation or depletion of essential metabolites stemming from an affected protein that has enzymatic, carrier, receptor, or structural roles in cellular metabolism. As a result, metabolomics can help bridge existing knowledge gaps between causal genetic variation and observed clinical phenotypes.

Functional evidence from patient-derived “omic” data (e.g., the transcriptome, proteome, and metabolome) is recognized as one of the key factors in resolving VUSs. The widely adopted American College of Medical Genetics and the Association for Molecular Pathology (ACMG/AMP) Guidelines³ define evidence category PS3, which provides means for formally incorporating functional evidence from “well-established” functional studies³. Untargeted clinical testing metabolomics⁴ is a functional diagnostic test which has allowed for the successful diagnosis of many cases of metabolic disorders that would be hard to diagnose using clinical phenotype descriptions and targeted tests alone^5,6,7,8,9,10. Nevertheless, wide application of this source of evidence requires a quantitative, transparent, and computationally efficient method for detecting and comparing disease-specific multi-metabolite perturbations. Various automated tools for predicting the pathogenicity of genetic variants have been developed (e.g., SIFT, PolyPhen2, CADD, DANN)^11,12,13,14, but none incorporate metabolomic profiling, nor other types of precise molecular phenotyping information.

Many computational methods^15,16,17,18 have been developed for the analysis of clinical research metabolomics data. Unlike clinical research metabolomics, which follows a case–control observational study design and relies on population-based statistical power, clinical metabolomics testing facilitates the interpretation of an individual (N-of-1) case in relation to a reference population of healthy controls^4,19. Other than the manual inspection of untargeted metabolomics data currently used to diagnose individual cases, few alternative methods are suitable for interpreting multi-metabolite perturbations observed in N-of-1 cases, and of these available alternatives, many rely on knowledge-driven modelling (e.g., pathway maps and biomarker lists) approaches²⁰.

The CTD method²¹ is a novel information-theoretic method that assigns statistical significance to sets of metabolites based on their connectedness in disease-specific metabolite “co-perturbation” networks derived from accumulating patient data. A network contains metabolite nodes, and the weighted edges connect metabolites that are co-perturbed in a specific disease. Gaussian graphical models are used to compute edge weights, which indicate the strength of positive or negative partial correlation between metabolites. Given a disease-specific network and a set of metabolites that are perturbed in a given individual, CTD identifies a subset of perturbed metabolites that are highly connected within the network. The CTD method uses an efficient algorithm that can handle highly dense (“hairball”) networks and outputs small p values for highly connected metabolite sets and large p values for sparsely connected metabolites sets. Unlike any other method of similar complexity, CTD does not require computationally costly permutation testing to establish p values of combinatorial patterns of multi-metabolite perturbations and can thus be used to interpret untargeted metabolomic profiles of individual patients in an efficient and rigorous way.

To interpret perturbations observed in N-of-1 metabolomics profiles without relying just on prior biochemical pathway knowledge, we applied the CTD method²¹ to existing and newly acquired datasets and introduce the CTDdm and CTDncd distance methods, both extensions of CTD, to serve as additional predictors for diagnosis. We assessed if CTD-based network-quantified measures could reproduce accurate diagnosis of IEMs and whether these measures hold long-term value to supplement existing information from biochemical pathways in order to assist in interpreting VUSs. We provide evidence that CTD-based metrics can indeed expedite the analysis of complex metabolomic datasets and increase the sensitivity of clinical diagnostic pipelines for clinical purposes that include identifying precision treatments for individuals with IEMs.

Methods

Data collection

Data used represent a meta-analysis of untargeted metabolomics plasma samples collected from previously reported studies^{5,6,7,8,9,10,22,23}, as well as previously unreported samples (Table 1, Fig. 1). All study procedures were approved by the Institutional Review Board (IRB) of the Baylor College of Medicine and complied with all relevant guidelines and regulations. For some of the studies, informed consent was obtained and for others, it was waived by the Baylor College of Medicine’s IRB-approved waiver of informed consent. All sample data were de-identified. While all sample data were processed similarly²⁴, some data differ in sample source (e.g., heparin vs. EDTA plasma) and platform specifications (e.g., mass analyzer) (Table 1). Sample data were generated by Baylor Genetics in collaboration with Metabolon, Inc. (Morrisville, NC) on referred clinical or research samples. Twenty-one research samples were collected at RhizoKids International family conference for people affected by rhizomelic chondrodysplasia punctata²⁵ (RhizoKids International, rhizokids.com). A total of 539 profiled plasma samples were included in this study, including 414 samples from existing studies and 125 previously unpublished samples (see Table 1). Across all 539 samples, there was a range of 376–684 named, z-scored compounds and a range of 0–261 unnamed (“unknown”), z-scored compounds included for each sample. Compounds rarely present in normal reference blood samples cannot be z-scored. However, argininosuccinate only presents in diseased samples and may serves as a strong predictor for disease and was thus encoded as a binary predictor (“1” for the presence or “0” for the absence). Unknown metabolites were excluded from the analysis in this work because the clinical utility of these compounds for diagnosis is limited.

Table 1 Description of data and data sources.

Full size table

Inference of disease-specific networks

Metabolites with z-scores in > 50% reference samples and > 50% disease samples were used for network learning. Missing z-scores were imputed using the minimum z-score of the analyte in a large reference population. In order to model the differences in perturbation signatures between disease cases and controls, two types of Gaussian graphical network models were then learned from the data: one from both disease and control samples (disease + control network), and a second from only control samples (control network)²¹. We used the Graphical Lasso algorithm implemented in the R package huge (v1.3.5) to estimate the precision matrix, where regularization parameter lambda is selected using criteria “stars”. For both graphs, edge weights are the estimated partial correlation between any two metabolites after conditioning on all other variables in the datasets. Next, edges found in the disease + control network that were also found in the control network were pruned²¹. This pruned, “disease-specific” network represents probability of any pairs of metabolites being co-perturbed together at the state of the disease and was used in downstream analysis (Fig. 1). Including both examples of disease and control profiles in the training data (“discriminative latent structure inference”) introduces a hidden variable representing the disease state associated with each sample, allowing the network to model the specific metabolomic differences between two conditions (disease vs. control). Disease-specific network models from five IEMs in Thistlethwaite et al.²¹ were included, as well as novel network models learned on 11 additional IEMs, totaling 16 IEM disease states (Table S2). All network structures used in this paper are accessible through the CTDext R package accessible via GitHub (https://github.com/BRL-BCM/CTDext), an extension of the CTD CRAN R package that also includes added functionalities and filesharing necessary to reproduce our results. Graphical model metrics including node count, edge count and graph density for all network structures are documented in Table S2.

The CTD method

The CTD method was described previously²¹. Briefly, the method takes a weighted, disease-specific “co-perturbation” network and a set of network nodes as inputs and identifies a subset of the input nodes that is highly connected within the network (Fig. 2A,B). The method also provides a p value corresponding to the level of connectedness of the input node subset within the network.

The CTDncd distance: a network-based patient-patient distance method

In the context of a network, two individuals’ sets of metabolite perturbations can be compared by calculating the distance between them. If two metabolite sets hit the same nodes or modules in the context of a network, they may be considered related; however, if two metabolite sets share no overlapping nodes, nor do they hit proximal parts of a network, they may be considered unrelated (Fig. 2C). To calculate the distance between two node subsets in a network, we use the Normalized Compression Distance (NCD) metric²⁶ which is based on normalized mutual information (Eq. 1).

$$CTDncd({S}_{1}, {S}_{2}) = \frac{max(I({S}_{1}, {S}_{2})-I({S}_{1}), I({S}_{1}, {S}_{2})-I({S}_{2}))}{max(I({S}_{1}), I({S}_{2}))}$$

(1)

The CTDdm distance: a network-based patient-disease distance method, a variation on CTDncd

CTDdm also uses Eq. (1) to calculate the distance between a set of metabolites perturbed in an individual and the metabolite set perturbed in individuals with a specific disease (Fig. 1). This set, referred to as the “main disease module”, is calculated by Algorithm 1 in Supplemental Text 1. Information provided by CTDdm was shown to reduce false positive rate and improve overall diagnostic accuracy when combined with the CTD method (Table S1). CTD + CTDdm metrics were therefore used for scoring sample profiles.

Analysis of exome sequencing data

The collection and processing of clinical exome sequencing data from 170 individuals is detailed in Alaimo et al.²². Briefly, data were acquired using protocols adapted for clinical testing, described previously²⁷. Variants were called using AtlasSNP2²⁸ (v. 1.4.3). Variants in intronic or intergenic regions were filtered out, as well as variants found in ESP5400 or 1000 Genomes²⁹ at frequencies greater than 0.05. The pathogenicity of each genetic variant was assessed according to the ACMG/AMP guidelines^3,30. For each disease gene associated with any of the 15 IEMs (X-linked OTC deficiency excluded) modeled in this paper, we assigned each individual to one of 5 classes (Table 2) based on the assumption of an autosomal recessive inheritance pattern, the ACMG/AMP pathogenicity category, and observed zygosity of the variants in individuals’ exomes.

Table 2 Categorization of individuals based on classification of genetic variants identified in personal genome data.

Full size table

The metabolomics data portal: a diagnostics tool for untargeted metabolomics data

To provide access to the data and to provide a prototype tool to aid in the clinical diagnostic process, we developed an R shiny application. This application visualizes individuals’ metabolomics data and implements the network-assisted diagnostic functionalities featured in this paper. A walkthrough of the features this application offers can be found in Supplemental Text 1. The full application can be accessed at https://genboree.org/genboreeKB/projects/metabolomics-data-portal.

Results

Data-driven network models show higher accuracy relative to rule-based biomarker models and eliminate the requirement for a priori biomarkers

Prior knowledge, including lists of metabolite biomarkers, have previously been integrated into rule-based models for diagnosis in metabolomics²⁰. Network-based modeling is less biased toward specific lists of biomarkers and models the co-variation between metabolites and is thus more suitable in the discovery and complex diagnostics contexts. We explored the possibility that network-based approaches may also show added accuracy associated with incorporating information from full untargeted metabolomics profiles compared to information based solely from known biomarkers of metabolic disease states.

Haijes et al.²⁰ reported models that rank diseases for each individual based on lists of known biomarkers for each IEM. The correct diagnosis ranked first in 37% of 115 validation plasma samples. In 72% of cases, the correct diagnosis could be found within a short “differential diagnosis” (DD) list of candidate diagnoses, returned by the rule-based algorithm. To compare this rule-based method to CTD, we assigned a DD to an individual based on whether combined network-quantified scores (CTD + CTDdm) match at a Bonferroni-corrected combined network p value < 0.05 in the corresponding disease-specific network model.

As shown in Table 3, CTD + CTDdm ranked the correct diagnosis (from 16 modeled IEMs) first in 70% of 154 samples. Moreover, 89% of samples had the correct diagnosis in their DD short list. When we omitted individuals with OTC deficiency (MIM:311250)—where 13 out of 17 were female—from consideration due to diagnostic difficulties associated with the possibility of skewed X-inactivation in females with this diagnosis^31,32, CTD + CTDdm ranked the correct diagnosis first in 79% of the remaining 137 samples with known diagnoses across the remaining 15 modeled IEMs (Table 3), and 94% of samples had the correct diagnosis in their DD short list. Prediction performance (sensitivity, specificity, accuracy) of all individual disease-specific models measured by CTD + CTDdm ranks is shown in Table S3. All disease rankings for each individual can be viewed using the Metabolomics Data Portal (see “Methods”), in the Network-assisted Diagnostics tab (Figure S2b).

Table 3 Accuracy of diagnostic rankings across 188 plasma samples with known disease.

Full size table

To determine whether the increased sensitivity comes at the cost of lower specificity, we also examined the median size of the DD short lists for 172 reference samples. Given that reference samples represent individuals without disease, a maximally specific method would assign 0 DDs to each of these individuals. In Haijes et al.²⁰, reference samples identified a median of 3 out of 58 DDs. In our network-based approach, we found the median size to be 1 out of 16 DDs, verifying that our approach shows comparable specificity.

Network approaches may be used for diagnosis in place of pathway-based approaches

While multi-metabolite perturbations are often interpreted in the context of well-curated pathway knowledge, such knowledge is not always available, does not include all measurable metabolites, and is not disease-specific. We therefore asked if data-driven network models may be used in lieu of pathway-based models. The possibility of using data-driven networks is particularly relevant for metabolic diseases whose affected pathways are not fully characterized, such as various peroxisomal, mitochondrial, and seizure disorders.

Toward this purpose, we defined a pathway-based diagnostic model to be one that considers only metabolites that are a priori known to be involved in a disease-relevant pathway, without information provided by the remaining untargeted metabolomics profile. In contrast, network-based diagnostic models and full-profile models were defined to be ones that considers all frequently detected metabolites in untargeted metabolomics profiles, without a priori information about which metabolites are most relevant for diagnosis. The former utilizes CTD-based metrics, whereas the latter includes all frequently detected metabolites as predictors.

To compare the approaches directly, we used untargeted metabolomics profiling data from individuals diagnosed with any of four genetically distinct urea cycle disorders⁷, where the disease mechanism is well-characterized by defects in enzymes and perturbations of metabolites in the urea cycle pathway (Fig. 3), as well as negative control (“reference”) profiles. We compared the performance of partial least squares regression models: two modeled the relative abundances of metabolites in the urea cycle; third modeled CTD- and CTDdm-quantified scores; fourth modeled all frequently detected metabolites in the untargeted metabolomics profiles. We note that argininosuccinate is rarely identified by our platforms due to its low concentration in normal plasma samples. As a consequence of its rare presentation in normal “reference” blood samples, z-scores for argininosuccinate levels cannot be generated when identified in an individual sample and thus, cannot be used as a quantitative predictor. Instead, we encoded argininosuccinate as a binary predictor (“1” for the presence or “0” for the absence of the metabolite). We, therefore, defined the “ASA-Arg-Orn-Cit” model to include quantitative variables, arginine, ornithine, and citrulline, and one binary variable, argininosuccinate. To further evaluate diagnostic accuracy, we defined the full “Pathway” model to include all metabolites found in both the urea cycle and the periphery of the urea cycle, as illustrated in Fig. 3.

As shown in Fig. 3, both network- and pathway-based models improve diagnostic power relative to the full-profile model. CTD performed competitively with pathway-based models of four major urea cycle disorders: argininosuccinic aciduria (MIM:207900), argininemia (MIM:207800), ornithine transcarbamylase (OTC) deficiency (MIM:311250), and citrullinemia (MIM:215700). In the case of argininosuccinic aciduria and particularly for OTC deficiency, the network-based model outperformed the ASA-Arg-Cit-Orn model and was competitive with the full Pathway model. Interestingly, model accuracies for OTC deficiency suffered from poorer discrimination compared to argininosuccinic aciduria, argininemia, and citrullinemia. This is partially due to the known phenotypic heterogeneity of effects associated with X-inactivation patterns observed in females with OTC deficiency^31,32.

Overall, this result suggests network-based and pathway-based modeling approaches have comparable accuracies. Thus, when biochemical pathway knowledge for a particular disease state is not available, data-driven network-based models may provide a valuable alternative.

Separating treatment-related effects from disease-related effects in metabolomics data

While the heterogeneity of effects associated with X-inactivation in OTC deficiency can explain why diagnostic accuracy of OTC deficiency was lower than many other IEMs, it is also likely that treatment-related effects may be confounding the raw metabolomics data and as a result, be affecting the ability of both pathway (knowledge-driven) and network-based (data-driven) models from performing well in diagnosis. To test for treatment confounding and whether it could be removed or ameliorated, we examined all OTC deficiency samples from Burrage et al.⁷ where treatment information is available. We found that 8/10 patients with OTC deficiency were taking citrulline supplements as part of a prescribed treatment regimen. To identify treatment-driven signatures that differentiated from disease-driven signatures, we constructed a citrulline supplement-specific network by contrasting eight OTC deficiency patients taking supplemental citrulline against the remaining 20 urea cycle disorder patients. A second network was learned from OTC deficiency patients not undergoing citrulline supplementation and edges found in both networks were pruned from the first network. Similar to “main disease module” identification (Algorithm 1), we determined the representative “main treatment module” for citrulline supplementation. When we compared the main disease module identified in the OTC disease-specific network to the main treatment module identified in the citrulline supplement-specific network, we found two out of four treatment-related compounds (e.g., carnitine and pyruvate) in the original OTC deficiency main disease module (Fig. 4A).

We then asked if treatment-related signatures contributed to false positives in patients with these diseases. Notably, the citrulline supplement-related treatment module was also found to overlap perturbations in other disease states such as cobalamin biosynthesis defect (carnitine, betaine) and argininosuccinate lyase deficiency (phenylalanine and pyruvate). As a consequence, we expected diagnostic accuracy to improve by omitting these treatment-related metabolites from the disease-specific OTC network folds. One case, for instance, is argininosuccinate lyase deficiency patient “EDTA-ASLD-7”, where OTC deficiency was falsely ranked first by CTD + CTDdm (combined, p = 0.00568), while the correct diagnosis fell out of the top three rankings (combined, p = 0.0385). With the updated network model, significance of OTC deficiency dropped to the 5th in the list of diagnoses (combined, p = 0.0716), and the correct diagnosis ascended to within top three. Additionally, removal of treatment signal also unflagged a cobalamin biosynthesis defect patient “EDTA-COB-6”, as the significance of OTC deficiency dropped an order of magnitude (combined, p = 0.00335). Removal of treatment-related compounds also increased model AUC, specificity, and overall accuracy (Fig. 4B).

This result suggests that pruning treatment-related nodes can improve the diagnosis of patients with other diseases where compounds to be removed are affected by the disease state. Nevertheless, the pruning of treatment-related nodes should be performed with caution, as treatment-related signatures that are particularly specific to a given diagnostic category can also improve diagnostic performance. Ideally, we would only collect metabolomic data prior to administering treatment, in order to train disease-specific network models for diagnostic purposes. However, many individuals that undergo untargeted metabolomic screening are already on a treatment regimen in order to manage their symptoms, and as a result, the confounding due to treatment is common. While we have shown that censoring treatment-related metabolites from our diagnostic OTC network models helped diagnostic accuracy, this represents only a first pass attempt to separate treatment effects from disease effects. Future research may be needed to explore new strategies that would improve diagnostic accuracy by removing treatment effects.

Dissecting the genetic etiology of peroxisome biogenesis disorders using untargeted metabolomics with network models

Peroxisome biogenesis disorders (PBDs) are autosomal recessive disorders that result from the impaired assembly and biological functioning of peroxisomes³³. They are composed of two known major classes: Zellweger spectrum disorders (ZSD, MIM:214100,601539) and rhizomelic chondrodysplasia punctata (RCDP, MIM:215100). As a whole, PBDs are complex since they are caused by deficiency of any single gene in a group of related genes and their severity is strongly influenced by the residual activity of downstream gene products. Furthermore, PBDs affect the functions of numerous organ systems, and their pathological mechanisms of disease are only partially understood. We therefore asked whether a data-driven network-based approach may help dissect their etiology and help provide diagnostic information.

Untargeted metabolomic profiling from Wangler et al.⁶ describe 18 individuals with ZSD originating from deleterious variants in PEX1 (MIM:602136). We collected an additional 21 samples from individuals diagnosed with a RCDP type 1 disorder with confirmed deleterious variants in PEX7 (MIM:601757). For all 39 samples, pairwise patient-patient distance calculations using CTDncd were estimated and plotted in lower dimensional space (Fig. 5). As shown, the distances accurately cluster individuals with ZSD separately from individuals with RCDP and from known reference samples. Interestingly, examination of the ZSD cluster highlighted less pronounced abnormalities in plasma metabolite levels in older individuals ($\ge$ 10 years) with a smaller centroid–centroid distance to the reference cluster (d = 0.625) than that of younger patients (< 10 years of age) (d = 0.949), in agreement with Wangler et al.⁶. Overall, k-means clustering generated a cluster purity score of 0.888, which indicates low false positive and false negative clustering results. These results suggest that CTDncd can accurately distinguish between groups of individuals with ZSD and RCDP, the two major subtypes of PBDs.

Evidence from the metabolome can help interpret variants of uncertain significance

Methods for using untargeted metabolomics data to resolve the pathogenicity of VUSs are currently manual, qualitative, and can be very laborious. We therefore asked if disease-specific networks and CTD-based metrics may help automate the interpretation of untargeted metabolomics profiling data to improve variant assessment. We recently reported that manual evaluation of untargeted metabolomics data aided in the diagnosis in the context of exome sequencing for several IEMs²². In our current analysis, we interpreted the same exome sequencing and respective metabolomics data in two stages. First, as in Alaimo et al.²², genetic variants identified in patient exome data were classified according to the ACMG/AMP guidelines. Individuals with variants found in at least one disease gene relevant to any of the 15 IEMs (X-linked OTC excluded) modeled in our analysis were categorized into the one of 5 groups (Table 2) based on the observed variants’ pathogenicity and zygosity. Second, select patient metabolomics data were then analyzed using CTD and CTDdm in the context of relevant IEM-specific networks.

For all individuals with variants categorized into groups 1–3 (Table 2) in disease-relevant genes for a given disease state (Table 1), individuals with a combined CTD + CTDdm disease-specific network p value < 0.05 were reported as a HIT, and the pathogenicity of the respective variants was re-interpreted towards a more pathogenic classification. Furthermore, individuals with combined network p values between 0.05 and 0.15 were reported as a BORDERLINE HIT. Of all 29 variant confirmations or re-interpretations reported by Alaimo et al.²², 10 had variants in disease genes relevant to the 15 IEMs (X-linked OTC excluded) modeled in this paper. Of all 10 individuals, 9/10 were classified as HIT and 1/10 was a BORDERLINE HIT (Table 4). These results suggest that the interpretation of untargeted metabolomic profiling data to improve variant assessment can be automated using CTD-based metrics.

Table 4 Variant re-interpretations based on evidence quantified from the metabolome.

Full size table

Of further relevance are the patterns detected by CTD + CTDdm that connect individuals to disease states and that escaped manual inspection in Alaimo et al.²² (Table 4). Patient 10 was a 21-year-old female who had a likely pathogenic homozygous variant (NM_004813.4:c.993_995del) in PEX16 (MIM:603360). Her metabolomic profile showed several metabolite perturbations consistent with a PBD. Out of 16 diagnoses, ZSD ranked 6th and RCDP ranked 2nd. CTD detected a module containing a very long-chain fatty acid that was positively perturbed (Table 4), a hallmark of ZSD, in the ZSD disease network. Review of clinical reports revealed that Patient 10 had an intellectual disability, spasticity, ataxia and structural brain abnormalities, phenotypes consistent with those observed in individuals with PEX16 pathogenic variants, as reported previously³⁴. We then confirmed that this individual was the same individual who was diagnosed with a PBD in a recent publication³⁵, a diagnosis that took over 18 years to establish. We previously reported that plasma disease signatures in individuals with a mild to intermediate ZSD are more pronounced in younger subjects, suggesting studies earlier in life reveal larger biochemical changes for a number of possible reasons⁶. CTD’s ability to detect ZSD-relevant disease signatures in Patient 10 (21-year-old), however, shows how CTD can assist clinicians in difficult diagnostic situations.

While the CTDdm score put Patient 10’s module moderately far away from the main disease module (63rd percentile), the ZSD disease network modeled perturbation patterns in individuals with PEX1 defects from a variety of different levels of severity. It is therefore plausible that a mild PEX16 disease signature, while highly connected in the PEX1 network, was more distal to several metabolites perturbed in individuals with PEX1-associated ZSD. To make the ZSD disease network more sensitive to disease signatures observed in each of the 14 PEX genes that cause ZSD, using profiling data from individuals with biallelic defects in any of several different PEX genes would be beneficial for network learning. While this diagnosis was further complicated by reduced plasma-derived perturbations with age observed in ZSDs, the fact that the diagnosis was missed by previous manual inspection of metabolomic data²², however, highlights the power of our data-driven network method to identify both the modeled (PEX1) and related disease gene’s (PEX16) effect on the metabolome.

Discussion

Several recent publications have called for systems biology solutions to shortcomings observed in current diagnostic methods for metabolic disorders^36,37,38. IEMs provide a useful context for testing novel computational approaches such as CTD because the genetic etiology of many IEMs is well-established. In this work, we have shown the accuracy of the CTD method using untargeted metabolomics on a variety of IEM data sets. Our results pave the way toward the dissection of the genetic etiology and precise diagnosis of more complex, metabolically heterogeneous diseases, such as diabetes and metabolic syndrome.

CTD characterizes individuals’ metabolomic likeness to a given disease state based on the connectedness of metabolite perturbations in a disease-specific network. Analogous to the background knowledge clinicians accumulate about metabolism by combining textbook knowledge of biochemistry and experience in the clinic, disease-specific networks are learned directly from representative profiling data and reflect information found in well-curated pathway knowledge³⁹. Network-based similarity metrics have been constructed previously for use in disease-disease similarity⁴⁰, functional protein similarity⁴¹, and similarity in clinical ontological terms⁴². We apply similar logic to quantify patient-patient (CTDncd) or patient-disease (CTDdm) distances using metabolomics data and use both metrics as additional network-quantified predictors of diagnosis.

CTD makes few assumptions about the nature of disease-associated perturbations and is thus well-suited for discovery and diagnosis of hard-to-diagnose cases. For example, CTD does not make any hard assumptions about the metabolites involved in a particular disease or the directionality (+ or −) of perturbations. While information about the directionality of a metabolite perturbation can be useful for diagnostic discrimination, there are several situations where this information can be disadvantageous. For example, related disease states and disease genes frequently affect the same molecular components but in different ways^40,43. Just as different variants identified in a single gene can lead to different levels of disease severity, modeling the effect that one disease gene has on a metabolic phenotype may not accurately predict the metabolic effect of another gene that is involved in the same pathway, particularly when it comes to the direction that specific metabolites are perturbed. By considering combinatorial patterns without regard to directionality, CTD-based metrics make fewer assumptions and are therefore more suitable for data-driven discovery and for resolving hard-to-diagnose cases. If routine discrimination and not discovery is the main goal, information about the directionality of metabolite perturbations can be combined with combinatorial information garnered in CTD-based metrics (see Supplemental Text 1).

As is the case with any data-driven modeling approach, the performance of CTD-based metrics is mostly determined by the amount and quality of data available. Based on our experience, we recommend a minimum of 5 disease profiles, a minimum of 25 reference profiles for each disease condition, and the use of surrogate disease and negative control profiles²¹ to learn stable disease-specific networks (Figure S5). If the samples represent a spectrum of disease severity or are the result of defects from more than one gene, we recommend gathering even more unique examples of disease prior to network learning.

The paucity of profiles available for rare disorders similarly affects both our network-based and rule-based approaches. There are many metabolic disease states including several IEMs where the paucity of cases has precluded data-driven classification. When a new case of such diseases shows a different metabolic perturbation signature that is contrary to existing biomarker-based knowledge, the case is very likely to be falsely omitted from the correct diagnosis by rule-based algorithms²⁰. This problem will only be alleviated with the accumulation of more case profiling data. One strength of the network-based strategy is that new data can be incorporated automatically by a structured network learning process. In contrast, for rule-based approaches, knowledge curation and optimization can often be laborious.

Network models may also be significantly affected by confounding factors. For example, networks learned from samples collected after treatment was initiated will inevitably connect both treatment-related and disease-related metabolites together. Treatment-related modules, if not pruned properly in the network pruning stage, can cause some individuals without the modeled disease to be falsely diagnosed. CTDdm is designed to identify false positive calls made by CTD alone (see Supplemental Text 1) but fails when the treatment-related module is also well-connected and/or proximal with the disease-related module in the network. Such a circumstance may arise when nearly all the samples used in network learning for a particular disease were on treatment at the time their blood was sampled. Similarly, some treatments (e.g., metabolite supplementation, enzyme replacement therapy) have shown to successfully normalize disease-relevant metabolite perturbations in targeted, disease pathways. Network models trained on metabolomic profiles without disease-relevant perturbations will not be informative. On the other hand, some treatment-related signatures—especially when the treatment is highly specific to a particular IEM—can improve diagnostic accuracy in some circumstances, as these signatures can provide indirect evidence for the presence of disease.

In summary, our work suggests that data-derived network models offer competitive diagnostic accuracy compared to rule-based biomarker modeling approaches, show improved performance as more data accumulates, and can replace pathway-based modeling approaches, particularly when the relevant pathway knowledge is unavailable.

Conclusions

The benefit of CTD-based metrics can be particularly powerful when applied to individuals who are undiagnosed by current methods. By quantifying the likeness of individuals’ metabolite perturbations with perturbation patterns observed in many diseases, candidate diagnoses can be ranked and possible diagnoses can be recommended. Furthermore, if genetic sequencing data are available for an individual exhibiting strong disease-specific metabolite perturbation patterns, VUSs can be re-interpreted given the functional evidence provided by untargeted metabolomics. While CTD-based metrics cannot eliminate manual review entirely, they can expedite it and increase the confidence by which clinical laboratory directors make diagnostic decisions. Finally, disease-specific network models can be automatically and continuously updated as new case profiling data accumulates, ensuring stronger network stability and improved diagnostic performance.

Data availability

Datasets related to this article are available within Supplemental Table 2 and accessible via the Metabolomics Data Portal Download Data tab (Figure S4).

References

Philippakis, A. A. et al. The matchmaker exchange: A platform for rare disease gene discovery. Hum. Mutat. 36(10), 915–921 (2015).
Article PubMed PubMed Central Google Scholar
Buske, O. J. et al. PhenomeCentral: A portal for phenotypic and genotypic matchmaking of patients with rare genetic diseases. Hum. Mutat. 36(10), 931–940 (2015).
Article PubMed PubMed Central Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17(5), 405–424 (2015).
Article PubMed PubMed Central Google Scholar
Kennedy, A. D. et al. Metabolomics in the clinic: a review of the shared and unique features of untargeted metabolomics for clinical research and clinical testing. J. Mass Spectrom. 53(11), 1143–1154 (2018).
Article CAS ADS PubMed Google Scholar
Miller, M. J. et al. Untargeted metabolomic analysis for the clinical screening of inborn errors of metabolism. J. Inherit. Metab. Dis. 38(6), 1029–1039 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wangler, M. F. et al. A metabolomic map of Zellweger spectrum disorders reveals novel disease biomarkers. Genet. Med. 20(10), 1274–1283 (2018).
Article CAS PubMed PubMed Central Google Scholar
Burrage, L. C. et al. Untargeted metabolomic profiling reveals multiple pathway perturbations and new clinical biomarkers in urea cycle disorders. Genet. Med. 21(9), 1977–1986 (2019).
Article PubMed PubMed Central Google Scholar
Donti, T. R. et al. Diagnosis of adenylosuccinate lyase deficiency by metabolomic profiling in plasma reveals a phenotypic spectrum. Mol. Genet. Metab. Rep. 8, 61–66 (2016).
Article CAS PubMed PubMed Central Google Scholar
Atwal, P. S. et al. Aromatic l-amino acid decarboxylase deficiency diagnosed by clinical metabolomic profiling of plasma. Mol. Genet. Metab. 115(2–3), 91–94 (2015).
Article CAS PubMed Google Scholar
Kennedy, A. D. et al. 2-pyrrolidinone and succinimide as clinical screening biomarkers for GABA-transaminase deficiency: Anti-seizure medications impact accurate diagnosis. Front. Neurosci. 13, 394 (2019).
Article PubMed PubMed Central Google Scholar
Vaser, R., Adusumalli, S., Leng, S. N., Sikic, M. & Ng, P. C. SIFT missense predictions for genomes. Nat. Protoc. 11(1), 1–9 (2016).
Article CAS PubMed Google Scholar
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7(4), 248–249 (2010).
Article CAS PubMed PubMed Central Google Scholar
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J. & Kircher, M. CADD: Predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47(D1), D886–D894 (2019).
Article CAS PubMed Google Scholar
Quang, D., Chen, Y. & Xie, X. DANN: A deep learning approach for annotating the pathogenicity of genetic variants. Bioinformatics 31(5), 761–763 (2015).
Article CAS PubMed Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102(43), 15545–15550 (2005).
Article CAS ADS PubMed PubMed Central Google Scholar
Gu, Z., Liu, J., Cao, K., Zhang, J. & Wang, J. Centrality-based pathway enrichment: A systematic approach for finding significant pathways dominated by key genes. BMC Syst. Biol. 6, 56 (2012).
Article PubMed PubMed Central Google Scholar
Jacob, L., Neuvial, P. & Dudoit, S. More power via graph-structured tests for differential expression of gene networks. Ann. Appl. Stat. 6(2), 561–600 (2012).
Article MathSciNet MATH Google Scholar
Do, K. T., Rasp, D. J. N., Kastenmüller, G., Suhre, K. & Krumsiek, J. MoDentify: Phenotype-driven module identification in metabolomics networks at different resolutions. Bioinformatics 35(3), 532–534 (2019).
Article CAS PubMed Google Scholar
Liu, N. et al. Comparison of untargeted metabolomic profiling vs traditional metabolic screening to identify inborn errors of metabolism. JAMA Netw. Open 4(7), e2114155 (2021).
Article PubMed PubMed Central Google Scholar
Haijes, H. A. et al. Untargeted metabolomics for metabolic diagnostic screening with automated data interpretation using a knowledge-based algorithm. Int. J. Mol. Sci. 21, 3 (2020).
Article CAS Google Scholar
Thistlethwaite, L. R. et al. CTD: An information-theoretic algorithm to interpret sets of metabolomic and transcriptomic perturbations in the context of graphical models. PLoS Comput. Biol. 17, 1 (2021).
Google Scholar
Alaimo, J. T. et al. Integrated analysis of metabolomic profiling and exome data supplements sequence variant interpretation, classification, and diagnosis. Genet. Med. 22(9), 1560–1566 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pappan, K. L. et al. Clinical metabolomics to segregate aromatic amino acid decarboxylase deficiency from drug-induced metabolite elevations. Pediatr. Neurol. 75, 66–72 (2017).
Article PubMed Google Scholar
Ford, L. et al. Precision of a clinical metabolomics profiling platform for use in the identification of inborn errors of metabolism. J. Appl. Lab. Med. 5(2), 342–356 (2020).
Article PubMed Google Scholar
Duker, A. L. et al. Growth charts for individuals with rhizomelic chondrodysplasia punctata. Am. J. Med. Genet. A. 173(1), 108–113 (2017).
Article CAS PubMed Google Scholar
Li, M., Chen, X., Li, X., Ma, B. & Vitanyi, P. M. B. The similarity metric. IEEE Trans. Inf. Theory 50(12), 3250–3264 (2004).
Article MathSciNet MATH Google Scholar
Yang, Y. et al. Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N. Engl. J. Med. 369(16), 1502–1511 (2013).
Article CAS PubMed PubMed Central Google Scholar
Challis, D. et al. An integrative variant analysis suite for whole exome next-generation sequencing data. BMC Bioinform. 13, 8 (2012).
Article Google Scholar
Auton, A. et al. A global reference for human genetic variation. Nature 526(7571), 68–74 (2015).
Article ADS CAS PubMed Google Scholar
Richards, C. S. et al. ACMG recommendations for standards for interpretation and reporting of sequence variations: Revisions 2007. Genet. Med. 10(4), 294–300 (2008).
Article CAS PubMed Google Scholar
Ricciuti, F. C., Gelehrter, T. D. & Rosenberg, L. E. X-chromosome inactivation in human liver: Confirmation of X-linkage of ornithine transcarbamylase. Am. J. Hum. Genet. 28(4), 332–338 (1976).
CAS PubMed PubMed Central Google Scholar
Yorifuji, T. et al. X-inactivation pattern in the liver of a manifesting female with ornithine transcarbamylase (OTC) deficiency. Clin. Genet. 54(4), 349–353 (1998).
Article CAS PubMed Google Scholar
Braverman, N. E. et al. Peroxisome biogenesis disorders in the Zellweger spectrum: An overview of current diagnosis, clinical manifestations, and treatment guidelines. Mol. Genet. Metab. 117(3), 313–321 (2016).
Article CAS PubMed Google Scholar
Ebberink, M. S. et al. Identification of an unusual variant peroxisome biogenesis disorder caused by mutations in the PEX16 gene. J. Med. Genet. 47(9), 608–615 (2010).
Article CAS PubMed Google Scholar
Bacino, C. et al. A homozygous mutation in PEX16 identified by whole-exome sequencing ending a diagnostic odyssey. Mol. Genet. Metab. Rep. 5, 15–18 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tebani, A., Abily-Donval, L., Afonso, C., Marret, S. & Bekri, S. Clinical metabolomics: The new metabolic window for inborn errors of metabolism investigations in the post-genomic era. Int. J. Mol. Sci. 17, 7 (2016).
Article Google Scholar
Argmann, C. A., Houten, S. M., Zhu, J. & Schadt, E. E. A next generation multiscale view of inborn errors of metabolism. Cell Metab. 23(1), 13–26 (2016).
Article CAS PubMed Google Scholar
Graham, E. et al. Integration of genomics and metabolomics for prioritization of rare disease variants: A 2018 literature review. J. Inherit. Metab. Dis. 41(3), 435–445 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krumsiek, J., Suhre, K., Illig, T., Adamski, J. & Theis, F. J. Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data. BMC Syst. Biol. 5, 21 (2011).
Article CAS PubMed PubMed Central Google Scholar
Suthram, S. et al. Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets. PLoS Comput. Biol. 6(2), e1000662 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Cao, M. et al. Going the distance for protein function prediction: A new distance metric for protein interaction networks. PLoS One 8(10), e76339 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Deng, Y., Gao, L., Wang, B. & Guo, X. HPOSim: An R package for phenotypic similarity measure and enrichment analysis based on the human phenotype ontology. PLoS One 10(2), e0115692 (2015).
Article CAS PubMed PubMed Central Google Scholar
Cho, D. Y., Kim, Y. A. & Przytycka, T. M. Chaptexr 5: Network biology approach to complex diseases. PLoS Comput. Biol. 8(12), e1002820 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

LRT was supported by a training fellowship from the Gulf Coast Consortia, NLM Biomedical Informatics Training Program (Grant No. T15 LM007093). LCB was supported by K08DK106453 and is the recipient of a Burroughs Wellcome Fund Career Award for Medical Scientists. MJM was supported by a NIH T32 GM07526-41 Medical Genetics Training Grant, and MFW and SHE received support from NIH R01NS107733. NB was supported by a Canadian Institute for Health Research (CIHR) Grant #144213. AM was supported in part by the Henry and Emma Meyer Professorship in Molecular Genetics and NIH NHGRI Clinical Genome Resource (ClinGen) Grant U24HG009649.

Author information

These authors contributed equally: Lillian R. Thistlethwaite and Xiqi Li.

Authors and Affiliations

Quantitative and Computational Biosciences Program, Baylor College of Medicine, One Baylor Plaza, 400D, Houston, TX, 77030, USA
Lillian R. Thistlethwaite & Aleksandar Milosavljevic
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Lillian R. Thistlethwaite, Xiqi Li, Lindsay C. Burrage, Kevin Riehle, Michael F. Wangler, Sarah H. Elsea & Aleksandar Milosavljevic
Department of Biochemistry and Molecular Medicine, Keck School of Medicine of the University of Southern California, Los Angeles, CA, USA
Joseph G. Hacia
Department of Pediatrics and Human Genetics, McGill University, Montreal, QC, Canada
Nancy Braverman
Texas Children’s Hospital, Houston, TX, USA
Lindsay C. Burrage & Michael F. Wangler
Jan and Dan Duncan Texas Children’s Hospital Neurological Research Institute, Houston, TX, USA
Michael F. Wangler
Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
Marcus J. Miller

Authors

Lillian R. Thistlethwaite
View author publications
You can also search for this author in PubMed Google Scholar
Xiqi Li
View author publications
You can also search for this author in PubMed Google Scholar
Lindsay C. Burrage
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Riehle
View author publications
You can also search for this author in PubMed Google Scholar
Joseph G. Hacia
View author publications
You can also search for this author in PubMed Google Scholar
Nancy Braverman
View author publications
You can also search for this author in PubMed Google Scholar
Michael F. Wangler
View author publications
You can also search for this author in PubMed Google Scholar
Marcus J. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Sarah H. Elsea
View author publications
You can also search for this author in PubMed Google Scholar
Aleksandar Milosavljevic
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.R.T, X.L. and A.M. wrote the main manuscript. L.R.T. and X.L. prepared all figures and tables. All authors reviewed the manuscript.

Corresponding author

Correspondence to Aleksandar Milosavljevic.

Ethics declarations

Competing interests

Several authors were, at the time of this study, employees of Baylor College of Medicine, which has a partnership with Baylor Genetics that derives revenue from the clinical testing referenced in this manuscript. L.R.T, X.L, L.C.B., K.R., M.F.W., N.B. and J.G.H. declare no other potential conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Thistlethwaite, L.R., Li, X., Burrage, L.C. et al. Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data. Sci Rep 12, 6556 (2022). https://doi.org/10.1038/s41598-022-10415-5

Download citation

Received: 05 June 2021
Accepted: 14 March 2022
Published: 21 April 2022
DOI: https://doi.org/10.1038/s41598-022-10415-5
Springer Nature Limited

This article is cited by

The expanding diagnostic toolbox for rare genetic diseases
- Kristin D. Kernohan
- Kym M. Boycott
Nature Reviews Genetics (2024)
Circulating N-formylmethionine and metabolic shift in critical illness: a multicohort metabolomics study
- Martin Ingi Sigurdsson
- Hirotada Kobayashi
- Kenneth B. Christopher
Critical Care (2022)

Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data

Abstract

Similar content being viewed by others

Introduction

Methods

Data collection

Inference of disease-specific networks

The CTD method

The CTDncd distance: a network-based patient-patient distance method

The CTDdm distance: a network-based patient-disease distance method, a variation on CTDncd

Analysis of exome sequencing data

The metabolomics data portal: a diagnostics tool for untargeted metabolomics data

Results

Data-driven network models show higher accuracy relative to rule-based biomarker models and eliminate the requirement for a priori biomarkers

Network approaches may be used for diagnosis in place of pathway-based approaches

Separating treatment-related effects from disease-related effects in metabolomics data

Dissecting the genetic etiology of peroxisome biogenesis disorders using untargeted metabolomics with network models

Evidence from the metabolome can help interpret variants of uncertain significance

Discussion

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation