In silico profiling of systemic effects of drugs to predict unexpected interactions

Yoo, Sunyong; Noh, Kyungrin; Shin, Moonshik; Park, Junseok; Lee, Kwang-Hyung; Nam, Hojung; Lee, Doheon

doi:10.1038/s41598-018-19614-5

In silico profiling of systemic effects of drugs to predict unexpected interactions

Article
Open access
Published: 25 January 2018

Volume 8, article number 1612, (2018)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

In silico profiling of systemic effects of drugs to predict unexpected interactions

Download PDF

4961 Accesses
12 Citations
Explore all metrics

Abstract

Identifying unexpected drug interactions is an essential step in drug development. Most studies focus on predicting whether a drug pair interacts or is effective on a certain disease without considering the mechanism of action (MoA). Here, we introduce a novel method to infer effects and interactions of drug pairs with MoA based on the profiling of systemic effects of drugs. By investigating propagated drug effects from the molecular and phenotypic networks, we constructed profiles of 5,441 approved and investigational drugs for 3,833 phenotypes. Our analysis indicates that highly connected phenotypes between drug profiles represent the potential effects of drug pairs and the drug pairs with strong potential effects are more likely to interact. When applied to drug interactions with verified effects, both therapeutic and adverse effects have been successfully identified with high specificity and sensitivity. Finally, tracing drug interactions in molecular and phenotypic networks allows us to understand the MoA.

A machine learning framework for predicting drug–drug interactions

Article Open access 02 September 2021

Investigating Side Effect Modules in the Interactome and Their Use in Drug Adverse Effect Discovery

Drug voyager: a computational platform for exploring unintended drug action

Article Open access 28 February 2017

Introduction

Most complex diseases are caused by a complicated interplay of various biological processes and dysfunctional systems. However, traditional drug discovery paradigm, ‘the single drug, single target’, has limitations in many aspects of the complex disease treatment. Single drug acting on a single target in a disease system may ignore complex interactions between drugs and their target proteins¹. Compared to single drug treatments, it is now evident that drug combinations have a number of advantages such as increase of therapeutic effects, reduction of drug dosage and decrease of toxicity and side effects². However, unexpected adverse effects may also occur due to drug-drug interactions (DDIs) resulting from various processes³. On this account, significant attention to overall phenotypic effects of drug interactions is necessary to discover drug combinations with increased therapeutic effects and reduced adverse effects. While there is a relatively large number of in vitro and in silico methods identifying phenotypic effects of a single drug^4,5, only a few methods attempt to investigate phenotypic effects of drug interactions^6,7.

Systematic in vitro approaches have been used to identify effective drug combinations by using a high-throughput screening technique^8,9,10. However, large-scale experiments are needed for each possible drug combination, which increases cost and time exponentially with the increase of the number of drugs. Therefore, systematic in silico approaches have been proposed to investigate potential drug combinations by calculating drug similarity based on various types of drug information, including chemical structure, target proteins, ATC code and side effects^11,12. Although these methods have good performance in predicting drug combinations, they require various types of annotated data and cannot predict phenotypic effects of drug combinations. Alternatively, some computational approaches have been developed to predict potential drug combinations based on network-based analysis from drug-induced gene expression profiles^13,14,15. These methods construct a backbone network, e.g., functional, protein interaction or drug-target interactions, and identify drug sets with similar response on the network. While these methods can estimate whether a drug combination is effective on a certain disease, it is difficult to predict the overall effects on the human body, since a large amount of gene expression data is required to consider many phenotypes. At the same time, several DDI prediction methods, such as similarity-based, knowledge-based or mechanism-based methods, have been proposed^16,17,18. Most of these methods only focus on predicting DDIs in given drug pairs without showing their potential phenotypic effects. Only a few knowledge-based approaches predicted adverse effects of drugs and DDIs based on adverse drug reaction (ADR) reports and Electronic Health Records (EHRs)^6,7. However, these methods suffer from several limitations in the use of spontaneous ADR reporting system, including sampling variance and reporting biases^6,19,20. More importantly, they seldom consider the complicated mechanism of action (MoA) of drug effects and interactions in biological systems.

Here, we present a phenotype-based in silico method to predict effects and interactions of drug pairs based on the profiling of systemic effects of drugs. Recent studies have demonstrated that phenotypic effects of drugs can be utilized in predicting drug interactions by using them to calculate the similarity between drug pairs or as a core feature in machine learning^11,21,22,23. We, therefore, hypothesized that drug pairs having similar phenotypic effects are more likely to interact with each other. To test this underlying hypothesis, we first generated profiles of systemic effects for each drug by investigating the propagated drug effects from a molecular network and mapped these results to a phenotypic network. Next, to identify the effects and interactions of drug pairs, the connectivity and closeness between phenotypes of drug profiles were calculated in the phenotypic network. By introducing systematic analysis on molecular and phenotypic networks, effects and interactions of drug pairs were identified together with the understanding of the underlying MoA. To investigate the coverage of our prediction, we applied our method to two sources for predicting both therapeutic indications and adverse effects of drug interactions. We found that predicted effects and interactions of drug pairs cover the large amounts of results which were reported in previous work.

To conclude, the novelty of our method is threefold: (i) to our knowledge, it is the first unbiased, phenotype-based in silico method which predicts drug effects and their interaction potential with quantitative assessment; (ii) by investigating systemic effects of drugs and their associations in molecular and phenotypic networks, it enables us to understand the MoA of drug interactions; and (iii) as a preliminary tool, our method can screen candidate drug combinations and notify us hazardous drug interactions.

Results

In silico analysis for identifying drug interactions and their phenotypic effects

We designed a novel algorithm to predict phenotypic effects of drug pairs and their interaction potential by investigating systemic effects of drugs from molecular to phenotypic networks. For a query drug pair, the algorithm works in five steps (Fig. 1): (i) constructing an inferred drug phenotype vector (DPV) by calculating systemic effects of a drug on the molecular network and filtering effective phenotype values; (ii) constructing a known DPV based on published databases; (iii) creating a combined DPV based on known and inferred DPVs; (iv) mapping phenotypes of the combined DPV on the phenotypic network; and (v) calculating phenotype-specific interaction scores (P-scores) and drug interaction scores (DI-scores) for a query drug pair based on their combined DPVs.

We generated DPVs which cover systemic effects, including both known and unexpected phenotypic effects, of individual drugs. Each DPV contains drug effects for 3,833 phenotypes defined by Medical Subject Headings (MeSH) and Online Mendelian Inheritance in Man (OMIM). For this, the inferred DPV was generated in three steps. In the first step, propagated drug effects were calculated by using the random walk with restart (RWR) algorithm on the molecular network (Fig. 1a). The effects of drugs are not limited to their direct targets, but they are further propagated to interacting proteins. Therefore, initial values of the molecular network were assigned to the drug targets, and their propagated effects were calculated by applying the RWR algorithm. In the second step, phenotype values were calculated by combining propagated drug effects based on gene-phenotype associations. Accordingly, phenotypes have high values when a drug directly binds to phenotype associated genes or when drug targets are closely located to phenotype associated genes. In the third step, an inferred DPV was constructed by filtering effective phenotypes from the inferred list of phenotypes, which were calculated from the second step (Fig. 1b). When there were a large number of drug targets or phenotype associated genes, the phenotype tended to get a high value. The result was caused by the imbalance of the prior knowledge of drug targets, protein interactions and phenotype associated genes. To show that prior knowledge has high variation, we calculated the coefficient of variation (CV) which is defined as the ratio of the standard deviation to the mean. The CV of genes per phenotype (CV = 3.58), node degree of the molecular network (CV = 2.68) and direct and indirect targets per drug (CV = 3.94 and 6.80, respectively) were considered to be high variances since those CV values were higher than one²⁴. To overcome this problem, we filtered effective phenotype values by comparing phenotype values with the randomly generated DPVs. The phenotype values were converted into Boolean values (one or zero) based on p-values, which were calculated from the distribution of phenotype values of random DPVs. The list of phenotype values of a drug, determined by combining the molecular and phenotypic networks, was defined as the inferred DPV. The average number of phenotypes per inferred DPV was 116.1. Next, we constructed the known DPV by collecting drug-phenotype associations from published databases (Fig. 1c). The average number of phenotypes per known DPV was 57.4. Finally, a combined DPV was generated by integrating both the known and inferred DPVs, which has a large coverage of phenotypic effects including verified and unexpected drug-phenotype associations (Fig. 1d). Through this process, the average number of phenotypes were increased (27.4%) for 2,434 drugs among a total of 5,441 drugs. Consequently, we generated combined DPVs for 5,441 approved and investigational drugs, which contain an average of 160.8 phenotypes. The distribution of the number of targets associated with drugs, genes associated with phenotypes and phenotypes associated with drugs is provided in Supplementary Fig. 1.

To investigate the relationship between drugs, we mapped phenotype values of combined DPVs on the phenotypic network (Fig. 1e). In the phenotypic network, phenotype nodes were connected when they shared some common properties or were related by definition²⁵. Therefore, similar phenotypes were closely linked with shared properties. The average distance of the random phenotype pair was 6.5 and the average distance of the phenotype pair of the known drug combination was 3.4 in the phenotypic network. This indicates that the drugs are more likely to have similar phenotypic effects when they interact with each other. For a given query drug pair, all phenotype terms in both combined DPVs were considered as candidate phenotypic effects of the drug pair. To filter out meaningless phenotypes, all candidates were ranked by P-scores, which were calculated by considering connectivity and closeness between phenotype pairs containing this specific phenotype among the whole pairs (Fig. 1f). To further predict whether the drug pair interacts, the interaction potential was quantified by aggregating all P-scores of the drug pair (Fig. 1g). If a drug pair shares a relatively higher number of similar phenotypes, the drug pair will have high scored phenotype terms, increasing the interaction potential. Finally, we calculated DI-scores which normalize the interaction potential by dividing it by the geometric mean of the values obtained by calculating each interaction potential against itself. We provide a web interface for all predicted drug interactions and their ranked phenotypic effects, which is available at http://biosoft.kaist.ac.kr/DPV.

Systemic effects of drugs are predicted by network propagation from molecule to phenotype

In this study, the prediction of systemic effects of drugs on the molecular and phenotypic networks is an essential step to identify phenotypic effects of drug pairs and their interaction potential. Therefore, we first evaluated whether DPVs can be used to predict phenotypic effects of drugs by comparing them with randomized target DPVs. Inferred DPVs without known DVPs were used for a fair comparison (Fig. 1b). Also, randomized target DPVs were constructed by applying the same development strategies used for the inferred DPV, where only the targets were randomly selected while fixing the number of targets. Three types of phenotypic effect information, including therapeutic effects, side effects and potential candidate effects, were used as a gold standard, respectively (Table 1). We collected verified phenotypic effects from DrugBank indications for therapeutic effects, SIDER for side effects and CTD for potential candidate effects and used them as a gold standard positive set. To evaluate the prediction results, precision (p) and recall (r) values were calculated. Out of 4,990 therapeutic effects of 828 drugs, inferred DPVs covered 1,692 phenotypes (p = 0.033 and r = 0.339), which represents better performance compared to randomized target DPVs (p = 3.04 × 10⁻⁴ and r = 0.010). Similarly, for side effect prediction, inferred DPVs covered 7,570 phenotypes (p = 0.079 and r = 0.256) among the total 29,574 phenotypes of 643 drugs, and in potential candidate effect prediction, inferred DPVs covered 168,360 phenotypes (p = 0.381 and r = 0.557) among the total 302,514 phenotypes of 1,585 drugs. Overall performance represents that the prediction of phenotypic effects with inferred DPVs shows better performance compared to the prediction with randomized target DPVs.

Table 1 Precision and recall values of inferred DPV and randomized target DPV in predicting therapeutic effects, side effects, off-label effects and potential candidate effects.

Full size table

Next, we investigated the relationship between precision and recall according to p-value threshold. The precision and recall performance of the inferred DPVs are influenced by the p-value threshold in the process of calculating phenotype values (Fig. 1b). Setting a high p-value increases the number of phenotype candidates of the drug, which causes the precision to decrease but the recall value to increase. Conversely, setting a low p-value decreases the number of phenotype candidates, which causes the precision to increase but the recall value to decrease. The relationship between precision, recall and F1 scores according to p-value can be seen in the Supplementary Table 1.

Unlike our method, knowledge-based approaches, including OFFSIDES, predict drug effects from ADRs based on statistical analysis⁶. However, they suffer from several limitations in the use of spontaneous ADR reporting system, including sampling variance and reporting biases^6,19,20. To demonstrate that our prediction is not biased to specific phenotype terms, we calculated phenotypic similarity for all drug pairs based on Jaccard index (JI). The Jaccard index of phenotypic effects between drugs is high when predictions are biased toward common phenotypic effects. From the result, average Jaccard index of inferred DPVs (JI = 2.2 × 10⁻²) was 2.95 times less than OFFSIDES (JI = 6.5 × 10⁻²). Overall, these findings indicate that inferred DPVs provide a high coverage and unbiased results in predicting phenotypic effects of drugs.

Highly connected phenotypes represent potential phenotypic effects of drug pairs

Our method predicts phenotypic effects of drug pairs by calculating the connectivity and closeness between phenotypes of each drug, which are potential therapeutic or adverse effects. Using known therapeutic and adverse effects collected from DCDB²⁶ and TWOSIDES⁶ as gold and silver standards respectively, we calculated the coverage of our predictions. We defined a set of candidate phenotypic effects of a drug pair as a union of phenotypes in combined DPVs. As a result, our predictions offered a large coverage in predicting therapeutic effects (63%) and adverse effects (41%) of drug pairs (Supplementary Fig. 2). However, for there were too many phenotype candidates, including false positives, we needed quantitative assessment to filter out meaningless phenotype candidates. To solve this problem, P-scores were calculated for all possible phenotypes in combined DPVs. Then we performed the fold-enrichment (FE) test to evaluate the correlation between the P-score and the likelihood of having known phenotypic effects. All candidate phenotypic effects were ranked by the P-score and binned into groups of 5,000 phenotypes. As a result, as P-scores increase, the number of known phenotypic effects has markedly increased, especially at top 10% region (Fig. 2a). We also performed kernel density estimation (KDE) to compare the density of distribution between verified and unverified phenotypic effect scores. Original distribution of verified and unverified phenotypic effect scores shows right-skewed distribution. Therefore, the log-transformed value of the P-score were used to better reveal the pattern of the data. As a result, the distribution of the verified phenotypic effect scores was more enriched on high score region than low score region, while the distribution of the unverified phenotypic effects showed opposite aspect (Fig. 2b). This indicates that verified phenotypic effects can be detected more readily if drug pairs have high P-scores. Also, the P-score shows a clear bimodal distribution, which can be used to filter out meaningless phenotypes. Next, average area under the curve (AUC) scores of receiver operating characteristic (ROC) and precision-recall (PR) curves for therapeutic effects of 1,093 drug combinations and adverse effects of 53,694 drug interactions were calculated respectively to examine the performance of the P-score (Fig. 2c). Additionally, in order to confirm whether the proposed method shows better performance when the amount of information is sufficient, 203 phenotypes with more than 20 related targets were selected from 3,833 phenotypes and average AUC scores of ROC and PR were calculated. As a result, we obtained the area under the receiver operating characteristic (AUROC) and area under the precision-recall curve (AUPR) scores for therapeutic effect (AUROC = 0.731 ± 0.021, AUPR = 0.624 ± 0.003) and adverse effect (AUROC = 0.734 ± 0.033, AUPR = 0.817 ± 0.015) predictions. In addition, when the genetic information associated with the phenotypes was sufficient, we were able to predict therapeutic (AUROC = 0.731 ± 0.021, AUPR = 0.624 ± 0.003) and adverse effects (AUROC = 0.734 ± 0.033, AUPR = 0.817 ± 0.015) with higher performance. Next, we calculated the ROC and PR performance for each phenotype and averaged them to normalize the occurrence of phenotypes (Fig. 2d). For this, we obtained phenotype ranking for each drug pair based on P-scores and then calculated the ROC and PR of the phenotype based on ranking in all drug pairs. This process was applied on all phenotypes, and the average values of AUROC and AUPR were calculated to normalize the phenotype occurrence. From the result, we confirmed that when phenotype occurrence is normalized, similar performance in predicting therapeutic (AUROC = 0.713 ± 0.032, AUPR = 0.707 ± 0.026) and adverse effects (AUROC = 0.752 ± 0.036, AUPR = 0.717 ± 0.024) is obtained. In addition, when genetic information related to the phenotype was sufficient, the therapeutic (AUROC = 0.825 ± 0.009, AUPR = 0.786 ± 0.008) and adverse effect (AUROC = 0.861 ± 0.012, AUPR = 0.852 ± 0.011) prediction performance was greatly increased. These results indicate that P-scores using combined DPVs are relevant for phenotypic effect prediction. Finally, we compared our predictions with Iyer study, one of the knowledge-based approaches predicting adverse effects of drug interactions⁷. The AUC scores of the adverse effect prediction of drug interactions were calculated by using TWOSIDES as a silver standard, yet there exists biases and errors toward ADR reports. To ameliorate this situation, we used a gold standard built by Iyer study which contains 1,692 DDIs for 14 diseases (Supplementary Data 1). Although this gold standard covers a relatively small number of drug interactions and diseases, we could evaluate the prediction performance in more strict condition. In comparison of the AUROC scores, our method showed better performance than Iyer study in predicting acute renal failure, hyperglycemia, hyperkalemia, hypokalemia, nephrotoxicity, pancytopenia, rhabdomyolysis and QT prolongation (Fig. 2e). Also, the lowest AUROC of our prediction result was 0.55 in hypoglycemia, while Iyer study did not perform well in predicting pancytopenia, acute renal failure, hyperglycemia, hypokalemia and nephrotoxicity, with AUROC scores less than or equal to 0.5. Moreover, our method has a definite advantage in offering insights on the MoA of drug interactions, which cannot be handled in knowledge-based approaches. Overall, our results indicate that the P-score can be used for identifying prioritized phenotypic effects of drug pairs.

Drug pairs having similar phenotypes are more likely to interact with each other

To demonstrate whether the quantified phenotypic effects of drug pairs can be used to predict drug interactions, we first calculated DI-scores for all possible drug pairs among the list of FDA-approved and investigational drugs (Supplementary Data 2). We then evaluated the performance in predicting drug interactions which have therapeutic effects or adverse effects. As gold standard datasets, 1,093 drug combinations from DCDB²⁶ and TTD²⁷ were used for therapeutic effects and 29,074 DDIs from DrugBank²⁸ and KEGG²⁹ were used for adverse effects.

We performed the FE test to identify the correlation between the DI-score and the likelihood that a drug combination occurs. All possible drug pairs were ranked by the DI-score and binned into groups of 50,000 drug pairs. As a result, as DI-scores increase, the number of drug combinations has markedly increased, especially at top 25% region (Fig. 3a). The result indicates that the drug pairs assigned with high DI-scores are more likely to have drug combinations. We also performed KDE to compare the density distributions of DI-scores between verified and unverified drug combinations. As a result, the distribution of verified drug combination scores was enriched on high score region compared to scores from unverified drug combinations (Fig. 3b). This indicates that verified and unverified drug combinations can be separately detected with the DI-scores. Next, we used AUC scores of ROC curves to examine the quantitative performance of the proposed method in predicting drug combinations. We compared our method with four different cases: (i) using known DPVs only; (ii) using inferred DPVs only; (iii) using target closeness to calculate DDI score by distance between each pair of drug targets on PPI network²¹; and (iv) using target effect overlap to predict DDIs based on similarity between random walk results on PPI network³⁰. Importantly, our method, which integrates the information from known and inferred drug-phenotype associations into combined DPVs, exhibited better performance (AUROC = 0.943) than those with single information (AUROC = 0.869~0.917). Also, our method showed better performance than target closeness (AUROC = 0.896) and target effect overlap (AUROC = 0.908) methods (Fig. 3c). However, ROC curves could not reflect the effects of change the proportion of positive to negative instances. In drug combination prediction, large class skew and large changes in class distributions are common, because the negative set of drug combinations is not available. Therefore, many studies have exempted gold standard positive set from all possible drug pairs and used them as the gold standard negative set^11,21,31. To see the effect due to the class skew, we calculated PR curves. When AUC values from PR curves were compared, our method (AUPR = 0.897 ± 0.004) showed the best performance among the other methods (Fig. 3d). Moreover, we calculated PR curves for different positive/negative ratios to evaluate the performance in the various skewness of dataset and compared it with previous methods (Supplementary Fig. 3). From the result, our method achieved the best performance even with the highly skewed dataset. Finally, we compared our DDI prediction with previous methods, including target closeness, target effects overlap and target connectivity in weighted PPI (Supplementary Fig. 4).

Drug pairs were assigned with high scores when their phenotypic effects were similar. Further analysis was done on whether the sets of drug pairs sharing similar DI-score patterns have similar phenotypes. In order to identify optimal subsets of drug pairs which have high relevance to each other, we performed the biclustering with DI-score matrix to take into account duality between drugs in subspace^32,33. The result showed that the drug pairs are divided into 11 clusters, including four high-scored and seven low-scored clusters (Fig. 3e). Next, to investigate the difference among high-scored, low-scored and random clusters, we calculated phenotypic similarity for each cluster based on Jaccard index. For a drug pair, a list of associated phenotypes of each drug was gathered from DrugBank and SIDER, and the number of common phenotypes was counted to calculate Jaccard index. From the result, the average Jaccard index of high-scored clusters (JI = 0.03) was markedly higher than the index of low-scored (JI = 5.2 × 10⁻⁴) and random clusters (JI = 9.3 × 10⁻³) (Fig. 3f). This indicates that high- and low-scored clusters differ strongly with respect to the number of shared phenotypes. Next, we identified major phenotypes of high-scored clusters by performing phenotype enrichment analysis based on Fisher’s exact test. To get Fisher’s test value of each phenotype, the number of drug pairs was counted based on whether they are included in the cluster and whether they have associations with the phenotype. From the result, we found that each cluster has representable phenotypes and that similar phenotypes are enriched to each cluster. For example, phenotypes enriched in cluster 1 (C1 in Fig. 3e) were related to blood clotting disorder or symptoms, including bleeding, thrombosis, pulmonary embolism and myocardial infarction. Whereas phenotypes enriched in cluster 2 (C2 in Fig. 3e) were related to mental and neurological disorders (Fig. 3g). Detailed information of enriched phenotypes of four high-scored clusters can be found in Supplementary Tables 2–5. These results show that sets of drug pairs sharing comparable DI-score patterns have similar phenotypes, providing that our method can be used as a tool to screen sets of drug pairs for specific phenotypes.

Our method predicts drug interactions based on the connectivity and closeness between phenotypes on the phenotypic network, which enables us to find previously undetected drug combinations by target distance-based methods^11,21. By assigning high scores on drug pairs which have distanced targets but have similar target phenotypes, we can improve prediction coverage of drug combinations (Table 2). For instance, ramipril and irbesartan are being currently prescribed as a drug combination for albuminuria treatment. When measuring the distance between the target genes of these two drugs in the molecular network (average shortest distance = 3.48), they are far away from each other, which is close to random (p-value < 0.05). Therefore it is hard to find that these two drugs can be used as a drug combination from target distance-based methods. However, in our method, ramipril and irbesartan receives a high score (DI-score = 0.908) and are proposed as a potential drug combination. Furthermore, we can predict additional phenotypic effects, such as cardiovascular disease, heart condition, hypertension and diabetes nephropathy, which have not been reported in DrugBank and DCDB^34,35,36.

Table 2 Top 10 drug combinations showing a large difference in rank between our proposed method and target distance-based similarity method.

Full size table

Analyzing mechanism of action of drug interactions via molecular and phenotypic networks

Our novel method, which predicts drug effects and interactions from the integrated molecular and phenotypic networks, provides opportunities for better understanding of the molecular mechanisms underlying drug interactions. As an example, an interaction between mirtazapine and pramipexole was predicted with a high score (DI-score = 0.91) of our method (Fig. 4). Mirtazapine is an antidepressant which refines the specificity of effects on the noradrenergic and serotonergic systems by blocking α₂ adrenergic autoreceptors and heteroreceptors. In addition, mirtazapine selectively antagonizes the 5-HT₂ and 5-HT₃ serotonin receptors in the central and peripheral nervous system, which are mainly targeted in depression treatment. It also being used as anxiolytic, hypnotic, antiemetic and appetite stimulant³⁷. Pramipexole is a non-ergoline dopamine agonist with selective actions at dopamine D₂ and preferential D₃ receptor, which are indicated targets for treating Parkinson’s disease and restless legs syndrome³⁸. The two drugs, mirtazapine and pramipexole, share 11 common targets. Although their interaction has not been reported in DrugBank and DCDB, recent studies have reported that this drug pair can be used as a drug combination to treat restless legs syndrome and Parkinson’s disease^39,40. Also, TWOSIDES has reported 99 adverse events including fainting, pain, bleeding and erythema, which cannot be clearly attributed to the individual drugs alone⁶. In our results, therapeutic targets or biomarkers of neurological disorder and mental disorder, such as DRD1, DRD2, DRD4, HTR2A, PRL and ADRA2A, were assigned with high scores in both mirtazapine and pramipexole (Fig. 4a and b). Based on these results, we can consider that neurological disease (P-score = 19.82, rank = 2), parkinsonism (P-score = 9.82, rank = 70) and restless legs syndrome (P-score = 8.19, rank = 140) are related with mirtazapine and pramipexole (Fig. 4c). Furthermore, our method predicted 62 adverse events over 99 reported adverse events, including pain, difficulty breathing, bleeding, erythema and coma with high P-scores (Supplementary Data 3). Next, to understand the MoA of a drug interaction, we investigated whether significantly enriched pathways in both sets of high-scored genes of mirtazapine and pramipexole are associated with our predicted phenotype. For this, genes with the top 10% of propagated drug effects were selected in each drug (Fig. 1a) and pathway enrichment test was performed based on the selected genes by using DAVID tool⁴¹. Based on these results, we found an intersection of significantly enriched pathways (p-value < 0.001) in both drugs. Next, we validated associations between pathways and our predicted phenotypes by PubMed manual curation (Supplementary Data 4). The result shows that a large number of observed pathways are associated with our predicted phenotypes. For example, ‘Neuroactive ligand-receptor interaction’, ‘Dopaminergic synapse’, ‘Serotonergic synapse’, ‘Cholinergic synapse’ and ‘Cocain addiction’ pathways were found to be associated with neurological disease and Parkinson’s disease^{42,43,44,45,46}. For inflammation, associated pathways, such as ‘Chemokine signaling pathway’ and ‘cAMP signaling pathway’, were found^47,48. As another example, an interaction about zoledronic acid and gemcitabine is provided in Supplementary Fig. 5. These case studies show that our method can investigate the MoA of drug interactions based on the propagated effects of drugs and their connectivity on molecular and phenotypic networks.

Predicted phenotypic effects and their interaction potentials are validated in external literature

To validate the reliability of our method, we confirmed whether the predicted phenotypic effects of drug pairs and their interactions were identified in external literature⁴⁹. We first ranked predicted phenotypic effects of the 894 approved drug combinations by P-scores, and made three independent sets by selecting top 5%, bottom 5% and random 5% phenotypes containing 34,352 drug pair-phenotypic effect associations respectively. For the selected drug pair - phenotypic effects, we counted co-occurrences (n_c) from PubMed abstracts, calculated the Jaccard index and conducted the Fisher’s exact test (n_f) (Table 3). The average number of co-occurrence of the high-scored set (n_c = 0.87) was 8.7 and 2.8 times larger than the average number of co-occurrence of the low-scored set (n_c = 0.10) and the random set (n_c = 0.31). Also, in order to correct the differences in the frequency of drug pairs and phenotypic effects, the co-occurrence value was normalized by the Jaccard index. From the result, the average value of the high-scored set (JI = 2.9 × 10⁻⁵) was 3.6 and 2.1 times higher than the values of the low-scored set (JI = 8.1 × 10⁻⁶) and the random set (JI = 1.4 × 10⁻⁵). Furthermore, we performed Fisher’s exact test to find the significant associations (p-value < 0.001), and the number of significance associations of the high-scored set (n_f = 674) was 7.1 and 3.1 times higher than that of the low-scored set (n_f = 94) and the random set (n_f = 216).

Table 3 Literature validation for drug pair-phenotypic effect and drug-drug interaction associations by comparing co-occurrence, Jaccard index and Fisher’s exact test value between high-, low-scored and random sets.

Full size table

Next, we applied above process to predicted drug interactions to validate the DI-score. We ranked all drug pairs by DI-scores, and made three independent sets by selecting top-ranked 5%, bottom-ranked 5% and random 5% drug pairs containing 739,975 drug interactions respectively. For selected drug interactions, the average number of co-occurrence of the high-scored set (n_c = 1.67) was significantly larger than the average number of co-occurrence of the low-scored set (n_c = 1.9 × 10⁻³) and the random set (n_c = 0.24). The average value of the Jaccard index for the high-scored set (JI = 1.9 × 10⁻⁴) was remarkably higher than the value of the low-scored set (JI = 3.3 × 10⁻⁷) and the random set (JI = 5.9 × 10⁻⁶). Finally, in the Fisher’s exact test, the high-scored set (n_f = 21,380) had markedly more significant associations than the low-scored set (n_f = 49) and the random set (n_f = 3,109). As the result shows, we can conclude that both P- and DI-scores can be used as metrics to identify the effects and interactions of drug pairs.

Discussion

Drug interactions often occur when drugs act on the same or interrelated pathways, resulting in the regulation of biological processes. In this process, unexpected effects may occur due to the complex molecular mechanism of drug interactions. Therefore, identifying phenotypic effects of drug interactions with mechanistic explanation is crucial to increase therapeutic effects while reducing adverse effects. Here, we introduce a phenotype-based approach to predict effects and interactions of drug pairs based on the profiling of systemic effects of drugs. Our analysis of drug effects and their interactions in the molecular and phenotypic networks offers the mechanistic explanation of why phenotypic effects occurred and why drug pair interacts.

In this study, systemic effects of drugs obtained from calculating propagated effects on molecular and phenotypic networks were used as core information in predicting phenotypic effects of drug pairs and their interaction potential. By comparing with random and OFFSIDES results, we confirmed that systemic effects of drugs were successfully predicted with high coverage, and that they were not biased with respect to the number of drug targets and disease associated genes. Based on these results, phenotypic effects of drug pairs were predicted by calculating P-scores considering the connectivity and closeness between phenotypes on the phenotypic network. Most previous methods have focused on predicting potential drug interactions or relations between a certain disease and drug interactions. Even though a few knowledge-based approaches predict adverse effects of drugs and DDIs, they seldom considered the complicated MoA in predicting phenotypic effects of drug interactions. Our predictions attained high coverage of therapeutic effects (63%) and adverse effects (41%) of drugs with the AUROC value of 0.731 and 0.734, respectively. Also, compared with the knowledge-based approach, our method showed better performance in 9 phenotypic effects among 14 phenotypic effects. We further quantified the interplay of drugs by aggregating the P-scores of the drug pair. Compared to the conventional method, our method showed better performance, with high rates of sensitivity and specificity in predicting drug combinations (AUROC = 0.943, AUPR = 0.897). Furthermore, these processes enabled us to find drug interactions previously undetected by target distance-based similarity methods, by allowing high scores on drug pairs which have similar target phenotypes even though their drug targets are distantly located. We also found that drug pairs sharing comparable patterns have similar phenotypes, by applying biclustering on the score matrix of drug interaction. Thus, the drug interaction prediction in this study can be used as a tool to find the sets of drug pairs that are associated with specific phenotypes. Finally, by analyzing candidate phenotypic effects of the previously unknown drug interactions at the molecular and phenotypic networks, possible MoA of those drug interactions could be explained.

However, the given method needs improvements to be directly used as a screening tool in clinical practice. First, the dosage is not taken into an account in the method, although the effects of drug interactions can be varied by different combinations of concentration. Until now, most studies on dose-response relationship have only considered individual drugs, where only a few computational methods have calculated the expected dose-response relationship for drug combinations². Second, limited by the current knowledge of drugs, diseases and interactions of molecule and phenotype, our prediction cannot provide detailed interaction types such as additive, synergistic and antagonistic interactions. Even though our study provides a list of prioritized effects and interactions of drug pairs, the absence of interaction type information is an obstacle to design precise clinical trials. Nevertheless, these limitations can be taken into an account for further experiments or improved computational methods. With these further improvements, our method can be used as a valuable resource in drug development and large-scale clinical trial design, serving as an in silico screening tool to provide a list of prioritized drug interactions with phenotypic effects in a cost-effective manner. We believe that the identification of drug interactions with phenotypic effects can be a key factor (i) to provide insights into the underlying molecular and phenotypic mechanisms of the drug interactions and (ii) to extend the combinatorial use of drugs, increasing therapeutic effects while reducing adverse effects.

Materials and Methods

Data set

Drug information was collected from DrugBank version 4.3²⁸. In this study, we mainly focused on 5,441 drugs including approved and investigational drugs which have at least one target information. Drug targets were collected from DrugBank, DCDB²⁶, CTD⁵⁰, MATADOR⁵¹, STITCH⁵² and TTD²⁷ databases, and gene-phenotype associations were collected from CTD database. Drug-phenotype associations were collected from DrugBank, CTD, ClinicalTrials.gov⁵³ and DCDB databases by exploiting the MetaMap tool to extract phenotype-related terms⁵⁴. With inputs like narrative text, MetaMap returns a ranked list of Metathesaurus concepts associated with each word of the input text. Among the Metathesaurus concepts categorized in semantic types, we used Metathesaurus concepts assigned to 20 semantic types out of 135 semantic types, which have related phenotypes such as ‘Disease or Syndrome’, ‘Sign or symptom’ and ‘Clinical attribute’ (Supplementary Table 6). Drug side effects were obtained from SIDER⁵⁵ database, and adverse effects of drug pairs were collected from TWOSIDES⁶ database.

A PPI network, including 19,093 nodes and 270,970 edges, was obtained from BioGrid version 3.4.136⁵⁶, and a phenotypic network was taken from Unified Medical Language System (UMLS) in the 2016AA version²⁵. UMLS provides integrated information of various terminologies pertaining to biomedicine. The Metathesaurus is the main component of the UMLS which is organized by biomedical concepts, where each distinct concept is assigned to a unique concept identifier (CUI). We collected CUIs with broader (RB), narrower (RN) and other-related (RO) relationships among 11 types of UMLS relationships from MRREL lists, resulting in total 220,104 CUIs and 663,018 relations. For systematic analysis, we integrated molecular and phenotypic networks based on the CODA system which handles various types of biological information⁵⁷.

Propagation of drug effects from molecules to phenotypes

We constructed a molecular network based on a PPI network and performed RWR algorithm to investigate the propagation of drug effects in the molecular layer. RWR simulates the random walker from its seed nodes and iteratively transmits the node values to the neighbor nodes with the probabilities proportional to the corresponding edge weights^58,59. First, we assigned initial values to seed nodes in molecule network based on drug-target associations to simulate RWR algorithm. Drug-target associations can be divided into two groups, direct and indirect associations. The biological activity of drugs cause changes in various biological systems by complex interactions with molecular components, and their exact MoA remains largely unknown. Therefore, to expand the coverage of drug effects, we used not only direct (binding) associations, but also indirect associations which can be caused by the changes in the expression of a protein, drug induced phosphorylation or active metabolites of the drug (Supplementary Fig. 1a). The initial values of a direct and indirect association were assigned as 1 and 0.3, respectively. Second, the transition probability from a node to the neighbor node was calculated. We assumed that the transition probability represents the propagated drug effects on the molecular network. The transition probability vector of each node at time step t + 1 is defined as equation (1).

$${p}_{t+1}=(1-r){W}^{T}{p}_{t}+r{p}_{0}$$

(1)

where r represents the restarting probability of the random walker at each time step, set to 0.7 in this study. W represents the normalized adjacency matrix of the molecular network, p_t is the probability vector of each node at time step t, and p₀ represents the initial probability vector. The RWR algorithm simulates the random walker until all nodes reach the steady state (p_t+1−p_t < 10⁻⁸). We then mapped RWR results to phenotypes based on the gene-phenotype associations. In this step, we found all genes which are therapeutic targets or biomarkers of certain phenotypes and mapped the sum of these gene values, which were obtained from RWR results, to the corresponding phenotypes. Through this process, we obtained a list of phenotype values for each drug.

Filter effective phenotypes from a large number of candidates

Although a phenotype value calculated from propagated drug effects is high, it may not mean that the drug is highly related to the phenotype. When there are many phenotype associated genes, or when a drug has a large number of target proteins, overall phenotype values get increased stochastically. To overcome this problem, we generated random DPVs and compared them with a list of inferred phenotype values to select phenotypes with significant values. A random DPV was generated by randomly selecting drug’s targets while fixing the number of target proteins. For each drug, 1,000 random DPVs were generated, and phenotypes with an empirical p-value lower than 0.01 were selected. The empirical p-value was calculated as equation (2).

$$p=(r+1)/(n+1)$$

(2)

where n is the number of random DPVs and r is the number of DPV values that are larger than the phenotype value, respectively⁶⁰. Value one was assigned on phenotypes with the p-value lower than 0.01, and zero on the others. From this process, inferred DPVs were generated with filtered effective phenotypes from a large number of inferred candidates of drug effects. Although phenotypic effects of drugs were extracted from the molecular network, there may have been an omission of some drug-phenotype associations due to the incomplete information such as pathophysiology information about diseases, protein interaction and molecular mechanisms of a drug. Therefore, the combined DPVs were generated by combining both known and inferred DPVs which cover the large amount of known and unexpected drug-phenotype associations (Fig. 1b).

Predicting phenotype-specific interaction of drug pairs

A set of candidate phenotypic effects of a drug pair was defined as a union of phenotypes in combined DPVs. There are hundreds of phenotypes in each combined DPV, so we prioritized phenotypes by P-scores to filter out meaningless phenotypes. For this, we mapped phenotypes of the combined DPVs to the phenotypic network. The P-score of a phenotype is calculated by considering phenotype pairs containing this specific phenotype among the whole pairs with the closeness in the phenotypic network as equation (3)^61,62.

$${\rm{P}}-{{\rm{score}}}_{{d}_{1}{d}_{2}}(c)=\sum _{p\in P}{e}^{-d(c,p)}({v}_{{1}_{c}}{v}_{{2}_{p}}+{v}_{{1}_{p}}{v}_{{2}_{c}})$$

(3)

where c is the phenotype of interest, and P is the set of phenotypes in the combined DPV. d(c,p) is the shortest path length between phenotype nodes c and p in the phenotypic network. Therefore, e^{−d(c, p)} is increased when two phenotypes are closely located in the phenotypic network. ${v}_{{i}_{j}}$ is the value of phenotype j in the combined DPV of drug i. If the phenotype belongs to both combined DPVs, and the phenotype is closely located to the phenotype set of the opposite combined DPV in the phenotypic network, then the phenotype is given a high value. Conversely, if the phenotype belongs to less than one combined DPV, and there is no connection between the phenotype and the phenotype set of the other combined DPV, then the phenotype is given a minimum value (zero).

Phenotype-based prediction of drug interactions

The interaction potential between each drug pair was quantified based on phenotypic effects of the drug pair. The underlying hypothesis of this study was that a drug pair can interact with each other if they have similar phenotypic effects. Therefore, we defined an interaction potential of a drug pair as a sum of P-scores of phenotype candidates as equation (4).

$${\rm{IP}}({d}_{1},{d}_{2})=\sum _{p\in P}{\rm{P}}-{{\rm{score}}}_{{d}_{1},{d}_{2}}(p)$$

(4)

If a drug pair has lots of similar phenotypic effects, the number of phenotypes with a high P-score will increase and the interaction potential will be given a high value. Conversely, if there are no shared phenotypes between a drug pair, the interaction potential will be given a low value. Next, the interaction potential was normalized by dividing it by the geometric mean of the interaction potential values, which were obtained by calculating each interaction potential against itself. Finally, the DI-score is calculated as equation (5)

$${\rm{DI}}-{\rm{score}}({d}_{1},{d}_{2})=\,\frac{IP({d}_{1},{d}_{2})}{\sqrt{IP({d}_{1},{d}_{1})\cdot IP({d}_{2},{d}_{2})}}$$

(5)

The value of the DI-score is zero (min) when combined DPVs have no shared phenotypes and phenotypes between combined DPVs are disconnected in the phenotypic network, and the value is one (max) when combined DPVs have the same phenotypes. DI-score has following characteristics: (i) the DI-score is related to the closeness between phenotype components; (ii) the DI-score is independent of the size of combined DPVs; and (iii) the maximum DI-score is assigned when combined DPVs are identical, no matter how many phenotypes they share (Supplementary Fig. 6).

Performance evaluation

To measure the performance of our method in predicting drug combinations and DDIs, we collected 1,093 known drug combinations from DCDB²⁶ and TTD²⁷ databases and 29,074 known DDIs from DrugBank²⁸ and KEGG²⁹, which were used as the gold standard positive set. Since the negative set of drug combinations and DDIs is not available, we exempted gold standard positive set from all possible drug pairs and used them as the gold standard negative set. These sets were used to calculate the ROC, FE and KDE. Moreover, unlike the ROC curve, the PR curve is very sensitive to the imbalance of positive/negative ratio⁶³. Therefore, we generated negative sets by considering the various negative/positive ratios in samples (1:10, 1:100, 1:500, 1:1000), highlighting the difference in performance measures which might be lost in the ROC curve analysis (Fig. 2d). To obtain robust AUC score estimates, we performed our method 10 times by randomly selecting different negative sets. We then averaged the resulting AUC scores, and benchmarked the AUC performance against the performance of previous methods. Next, we performed FE test to evaluate whether the drug pairs identified by the high similarity score are more likely to result in drug combinations. In the FE test, all possible drug pairs were ranked by the DI-score and binned into groups of 50,000 drug pairs. The FE is defined as equation (6).

$$Fold\,enrichment=\frac{m/n}{M/N}$$

(6)

where m is the number of known drug combinations in each bin among all known drug combinations M, and n is the number of drug pairs in each bin among the total possible drug pairs N. Linear regression model and generalized additive model were used to fit the distribution of FE values⁶⁴. In addition, we estimated probability density function of positive and negative set scores by KDE to compare the distribution of positive and negative sets⁶⁵.

In order to identify drug pairs which have a high relevance to each other, we performed the biclustering with DI-score matrix based on Plaid model in R⁶⁶. Biclustering is used to find a subset of columns and rows with a high similarity scores in a table, which is composed of different rows and columns. However, as in this paper, biclustering can be applied to symmetric matrices where column and row are the same. If a one-sided clustering algorithm is applied in our result, drugs will be clustered considering the similarity of interaction scores across all 6,499 drugs. However, groups of patterns found in drug pair matrix are not homogeneous across all the drug pairs. Rather, only a subset of the drug pairs possesses these groupings. Therefore, biclustering was performed to find an optimal co-cluster with a high interaction score between drugs in the subset of drugs.

The phenotype enrichment analysis of each cluster is performed by Fisher’s exact test, and two-sided p-value and odds ratio are used to evaluate the strength of the enrichment of phenotypes among the clustered and un-clustered drug pairs.

References

Lee, J. & Bogyo, M. Target deconvolution techniques in modern phenotypic profiling. Current opinion in chemical biology 17, 118–126 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fitzgerald, J. B., Schoeberl, B., Nielsen, U. B. & Sorger, P. K. Systems biology and combination therapy in the quest for clinical efficacy. Nat. Chem. Biol. 2, 458–466 (2006).
Article CAS PubMed Google Scholar
Chou, T.-C. Theoretical basis, experimental design, and computerized simulation of synergism and antagonism in drug combination studies. Pharmacol. Rev. 58, 621–681 (2006).
Article CAS PubMed Google Scholar
MacDonald, M. L. et al. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat. Chem. Biol. 2, 329–337 (2006).
Article MathSciNet CAS PubMed Google Scholar
Cheng, F. et al. Adverse drug events: database construction and in silico prediction. J. Chem. Inf. Model. 53, 744–752 (2013).
Article CAS PubMed Google Scholar
Tatonetti, N. P., Patrick, P. Y., Daneshjou, R. & Altman, R. B. Data-driven prediction of drug effects and interactions. Sci. Transl. Med. 4, 125ra131–125ra131 (2012).
Article Google Scholar
Iyer, S. V., Harpaz, R., LePendu, P., Bauer-Mehren, A. & Shah, N. H. Mining clinical text for signals of adverse drug-drug interactions. J. Am. Med. Inf. Assoc. 21, 353–362 (2014).
Article Google Scholar
Borisy, A. A. et al. Systematic discovery of multicomponent therapeutics. Proceedings of the National Academy of Sciences 100, 7977–7982 (2003).
Article ADS CAS Google Scholar
Lehár, J. et al. Synergistic drug combinations tend to improve therapeutically relevant selectivity. Nat. Biotechnol. 27, 659–666 (2009).
Article PubMed PubMed Central Google Scholar
Tan, X. et al. Systematic identification of synergistic drug pairs targeting HIV. Nat. Biotechnol. 30, 1125–1130 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, P. et al. Large-scale exploration and analysis of drug combinations. Bioinformatics 31, 2007–2016 (2015).
Article CAS PubMed Google Scholar
Zhao, X.-M. et al. Prediction of drug combinations by integrating molecular and pharmacological data. PLoS Comput. Biol. 7, e1002323 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lee, J.-H. et al. CDA: combinatorial drug discovery using transcriptional response modules. PLoS One 7, e42573 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Huang, L. et al. DrugComboRanker: drug combination discovery based on target network analysis. Bioinformatics 30, i228–i236 (2014).
Article CAS PubMed PubMed Central Google Scholar
Pang, K. et al. Combinatorial therapy discovery using mixed integer linear programming. Bioinformatics 30, 1456–1463 (2014).
Article CAS PubMed PubMed Central Google Scholar
Duke, J. D. et al. Literature based drug interaction prediction with clinical assessment using electronic medical records: novel myopathy associated drug interactions. PLoS Comput. Biol. 8, e1002614 (2012).
Article CAS PubMed PubMed Central Google Scholar
Vilar, S. et al. Drug—drug interaction through molecular structure similarity analysis. J. Am. Med. Inf. Assoc. 19, 1066–1074 (2012).
Article Google Scholar
Cheng, F. et al. Prediction of polypharmacological profiles of drugs by the integration of chemical, side effect, and therapeutic space. J. Chem. Inf. Model. 53, 753–762 (2013).
Article CAS PubMed Google Scholar
Bate, A. & Evans, S. Quantitative signal detection using spontaneous ADR reporting. Pharmacoepidemiol. Drug Saf. 18, 427–436 (2009).
Article CAS PubMed Google Scholar
DuMouchel, W. Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system. The American Statistician 53, 177–190 (1999).
Google Scholar
Gottlieb, A., Stein, G. Y., Oron, Y., Ruppin, E. & Sharan, R. INDI: a computational framework for inferring drug interactions and their associated recommendations. Mol. Syst. Biol. 8, 592 (2012).
Article PubMed PubMed Central Google Scholar
Gottlieb, A., Stein, G. Y., Ruppin, E. & Sharan, R. PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol. Syst. Biol. 7, 496 (2011).
Article PubMed PubMed Central Google Scholar
Cheng, F. & Zhao, Z. Machine learning-based prediction of drug–drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J. Am. Med. Inf. Assoc. 21, e278–e286 (2014).
Article Google Scholar
Everitt, B. S. The Cambridge dictionary of statistics. (Cambridge University Press, 2006).
Bodenreider, O. The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32, D267–D270 (2004).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. DCDB 2.0: a major update of the drug combination database. Database 2014, bau124 (2014).
Article PubMed PubMed Central Google Scholar
Zhu, F. et al. Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery. Nucleic Acids Res., gkr797 (2011).
Law, V. et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 42, D1091–D1097 (2014).
Article CAS PubMed Google Scholar
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res., gkv1070 (2015).
Park, K., Kim, D., Ha, S. & Lee, D. Predicting Pharmacodynamic Drug-Drug Interactions through Signaling Propagation Interference on Protein-Protein Interaction Networks. PLoS One 10, e0140816 (2015).
Article PubMed PubMed Central Google Scholar
Huang, J. et al. Systematic prediction of pharmacodynamic drug-drug interactions through protein-protein-interaction network. PLoS Comput. Biol. 9, e1002998 (2013).
Article CAS PubMed PubMed Central Google Scholar
Anagnostopoulos, A., Dasgupta, A. & Kumar, R. In Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 201–210 (ACM).
Madeira, S. C. & Oliveira, A. L. Biclustering algorithms for biological data analysis: a survey. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 1, 24–45 (2004).
Article CAS Google Scholar
Yip, G. W. et al. The Hong Kong diastolic heart failure study: a randomised controlled trial of diuretics, irbesartan and ramipril on quality of life, exercise capacity, left ventricular global and regional function in heart failure with a normal ejection fraction. Heart 94, 573–580 (2008).
Article CAS PubMed Google Scholar
Palatini, P. et al. Maintenance of blood-pressure-lowering effect following a missed dose of aliskiren, irbesartan or ramipril: results of a randomized, double-blind study. J. Hum. Hypertens. 24, 93–103 (2010).
Article CAS PubMed Google Scholar
Xu Teng, H. J., Dan, L., Xiao, G. & Xia, L. Combined Therapy of Ramipril and Irbesartan for Early Period Diabetic Nephropathy [J]. China Pharmacist 6, 045 (2010).
Google Scholar
Gillman, P. K. A systematic review of the serotonergic effects of mirtazapine in humans: implications for its dual action status. Hum. Psychopharmacol. Clin. Exp. 21, 117–125 (2006).
Article CAS Google Scholar
Mierau, J. et al. Pramipexole binding and activation of cloned and expressed dopamine D 2, D 3 and D 4 receptors. European Journal of Pharmacology: Molecular Pharmacology 290, 29–36 (1995).
Article CAS Google Scholar
Makiguchi, A. et al. Mirtazapine-induced restless legs syndrome treated with pramipexole. The Journal of neuropsychiatry and clinical neurosciences 27, e76–e76 (2014).
Article Google Scholar
Holtz, N. A., Tedford, S. E., Persons, A. & Napier, C. The effects of mirtazapine on pramipexole-induced riskiness in a rat model of parkinson’s disease. Drug Alcohol Depend. 156, e97 (2015).
Article Google Scholar
Jiao, X. et al. DAVID-WS: a stateful web service to facilitate gene/protein list analysis. Bioinformatics 28, 1805–1806 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kong, Y. et al. High throughput sequencing identifies MicroRNAs mediating α-synuclein toxicity by targeting neuroactive-ligand receptor interaction pathway in early stage of drosophila Parkinson’s disease model. PLoS One 10, e0137432 (2015).
Article PubMed PubMed Central Google Scholar
La Fuente‐Fernández, D. et al. Biochemical variations in the synaptic level of dopamine precede motor fluctuations in Parkinson’s disease: PET evidence of increased dopamine turnover. Ann. Neurol. 49, 298–303 (2001).
Article PubMed Google Scholar
Kim, S. E., Choi, J. Y., Choe, Y. S., Choi, Y. & Lee, W. Y. Serotonin transporters in the midbrain of Parkinson’s disease patients: a study with 123I-β-CIT SPECT. J. Nucl. Med. 44, 870–876 (2003).
CAS PubMed Google Scholar
Bartus, R. T. On neurodegenerative diseases, models, and treatment strategies: lessons learned and lessons forgotten a generation following the cholinergic hypothesis. Exp. Neurol. 163, 495–529 (2000).
Article CAS PubMed Google Scholar
Majewska, M. D. Cocaine addiction as a neurological disorder: implications for treatment. NIDA Res. Monogr. 163, 1–26 (1996).
CAS PubMed Google Scholar
Keane, M. P. & Strieter, R. M. Chemokine signaling in inflammation. Crit. Care Med. 28, N13–N26 (2000).
Article CAS PubMed Google Scholar
Schlegel, N. & Waschke, J. cAMP with other signaling cues converges on Rac1 to stabilize the endothelial barrier–a signaling pathway compromised in inflammation. Cell Tissue Res. 355, 587 (2014).
Article CAS PubMed Google Scholar
Matsuo, Y. & Ishizuka, M. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13, 157–169 (2004).
Article Google Scholar
Davis, A. P. et al. The comparative toxicogenomics database: update 2011. Nucleic Acids Res. 39, D1067–D1072 (2011).
Article ADS CAS PubMed Google Scholar
Günther, S. et al. SuperTarget and Matador: resources for exploring drug-target relationships. Nucleic Acids Res. 36, D919–D922 (2008).
Article PubMed Google Scholar
Kuhn, M. et al. STITCH 4: integration of protein–chemical interactions with user data. Nucleic Acids Res., gkt1207 (2013).
Zarin, D. A., Tse, T., Williams, R. J., Califf, R. M. & Ide, N. C. The ClinicalTrials. gov results database—update and key issues. New Engl. J. Med. 364, 852–860 (2011).
Article CAS PubMed PubMed Central Google Scholar
Aronson, A. R. & Lang, F.-M. An overview of MetaMap: historical perspective and recent advances. J. Am. Med. Inf. Assoc. 17, 229–236 (2010).
Article Google Scholar
Kuhn, M., Campillos, M., Letunic, I., Jensen, L. J. & Bork, P. A side effect resource to capture phenotypic effects of drugs. Mol. Syst. Biol. 6 (2010).
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2015 update. Nucleic Acids Res. 43, D470–D478 (2015).
Article CAS PubMed Google Scholar
Hwang, W., Hwang, Y., Lee, S. & Lee, D. In BMC Med. Inf. Decis. Making. S4 (BioMed Central Ltd).
Köhler, S., Bauer, S., Horn, D. & Robinson, P. N. Walking the interactome for prioritization of candidate disease genes. The American Journal of Human Genetics 82, 949–958 (2008).
Article PubMed Google Scholar
Li, Y. & Patra, J. C. Genome-wide inferring gene–phenotype relationship by walking on the heterogeneous network. Bioinformatics 26, 1219–1224 (2010).
Article CAS PubMed Google Scholar
Sham, P. C. & Purcell, S. M. Statistical power and significance testing in large-scale genetic studies. Nature Reviews Genetics 15, 335–346 (2014).
Article CAS PubMed Google Scholar
Pedersen, T., Pakhomov, S. V., Patwardhan, S. & Chute, C. G. Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inf. 40, 288–299 (2007).
Article Google Scholar
Nekola, J. C. & White, P. S. The distance decay of similarity in biogeography and ecology. J. Biogeogr. 26, 867–878 (1999).
Article Google Scholar
Davis, J. & Goadrich, M. In Proceedings of the 23rd international conference on Machine learning. 233–240 (ACM).
Hastie, T. J. & Tibshirani, R. J. Generalized additive models. Vol. 43 (CRC Press, 1990).
Duong, T. ks: Kernel density estimation and kernel discriminant analysis for multivariate data in R. Journal of Statistical Software 21, 1–16 (2007).
Article Google Scholar
Kaiser, S. & Leisch, F. A toolbox for bicluster analysis in R (2008).

Download references

Acknowledgements

This work was supported by the Bio-Synergy Research Project (NRF-2012M3A9C4048758) of the Ministry of Science, ICT and Future Planning through the National Research Foundation.

Author information

Authors and Affiliations

Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
Sunyong Yoo, Moonshik Shin, Junseok Park, Kwang-Hyung Lee & Doheon Lee
Bio-Synergy Research Center, Daejeon, 34141, Republic of Korea
Kyungrin Noh & Doheon Lee
School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005, Republic of Korea
Hojung Nam

Authors

Sunyong Yoo
View author publications
You can also search for this author in PubMed Google Scholar
Kyungrin Noh
View author publications
You can also search for this author in PubMed Google Scholar
Moonshik Shin
View author publications
You can also search for this author in PubMed Google Scholar
Junseok Park
View author publications
You can also search for this author in PubMed Google Scholar
Kwang-Hyung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Hojung Nam
View author publications
You can also search for this author in PubMed Google Scholar
Doheon Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.Y. and D.L. proposed the objective and motivation of this work and designed overall method. S.Y., K.N. and M.S. performed data-preprocessing. S.Y. identified effects and interactions of drug pairs together with the underlying mechanism of action. K.N. and M.S. helped to write the main manuscript text and provided comments that improve introduction and method parts. S.Y. performed evaluation process. K.N. and J.P. gathered and processed the data used in the evaluation part. J.P. performed literature validation and provided some ideas in discussion. D.L., K.L. and H.N. supervised this work.

Corresponding author

Correspondence to Doheon Lee.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yoo, S., Noh, K., Shin, M. et al. In silico profiling of systemic effects of drugs to predict unexpected interactions. Sci Rep 8, 1612 (2018). https://doi.org/10.1038/s41598-018-19614-5

Download citation

Received: 20 November 2017
Accepted: 03 January 2018
Published: 25 January 2018
DOI: https://doi.org/10.1038/s41598-018-19614-5
Springer Nature Limited

This article is cited by

Beta-2 adrenergic receptor agonism alters astrocyte phagocytic activity and has potential applications to psychiatric disease
- Ellen R. Bowen
- Phillip DiGiacomo
- Kevin V. Grimes
Discover Mental Health (2023)
Phenotype-oriented network analysis for discovering pharmacological effects of natural compounds
- Sunyong Yoo
- Hojung Nam
- Doheon Lee
Scientific Reports (2018)

In silico profiling of systemic effects of drugs to predict unexpected interactions

Abstract

Similar content being viewed by others

Introduction

Results

In silico analysis for identifying drug interactions and their phenotypic effects

Systemic effects of drugs are predicted by network propagation from molecule to phenotype

Highly connected phenotypes represent potential phenotypic effects of drug pairs

Drug pairs having similar phenotypes are more likely to interact with each other

Analyzing mechanism of action of drug interactions via molecular and phenotypic networks

Predicted phenotypic effects and their interaction potentials are validated in external literature

Discussion

Materials and Methods

Data set

Propagation of drug effects from molecules to phenotypes

Filter effective phenotypes from a large number of candidates

Predicting phenotype-specific interaction of drug pairs

Phenotype-based prediction of drug interactions

Performance evaluation

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation