Introduction

Breast cancer (BC) is the malignant growth that begins in the breast cells. BC is the widespread cancer in India (in women) which accounts for 14% of all the cancers in women1,2. In general, 1 in 28 women are prone to have BC during her life span. The frequency rates in India begin to rise in the early thirties and peak at the age of 50–64 years3. In urban areas, 1 in 22 women are likely to have BC during her life span as compared to rural areas where 1 in 60 women develop BC in her life span4. Duration of survival of cancer patients is an important sign for knowing the result of treatment in any study. Since the 1990s, due to regular efforts in the diagnosis and treatment, the overall survival time of patients has been enhanced5. In cancer multiple genes participate which gradually alters the normal healthy cells into cancerous cells. Cancers have the capacity to develop resistance to conventional chemotherapy6. Thus, it is essential to recognize new therapeutic agents or promising targets. In current cancer therapy, new generations of drugs have targeted cancer specific proteins that are expressed in different cancers. Target specific cancer therapy minimizes the side effect profiles of conventional cytotoxic drugs7. In targeted therapy attempts are being made to design ligands with maximum selectivity to act on specific drug targets. Network pharmacology (NP) is an emerging discipline useful in drug discovery, which combines genomic technologies and system biology through computational biological tool. Network pharmacology, is an approach capable of describing complex relationships among biological systems, drugs and diseases8. It also clarifies the possible mechanisms of complex bio-actives through large data set analysis and determines the synergistic effects in cancer treatment9. Traditional Chinese medicine (TCM) is combination of complex herbal formulation and has been used for more than thousands of years in the treatment of different diseases and disorders for the prolongation of life expectancy in different parts of the world10. The network-target-based network pharmacology is a promising approach for the next-generation mode of drug research and development for TCM herbs or herbal formulae. It provides a new logical guide and technical route for developing and understanding TCM drugs' mechanisms of action. NP encourages the discovery of successful molecules, recognizing their interrelationship, elucidating the relationship between TCM formulae and diseases or TCM syndrome, establishing logical TCM drugs, as well as directing integrated use of TCM and conventional drugs11,12,13. CP decoction or infusion traditionally used as purgative, tonic in the treatment of convulsions, intermittent fevers, lung and skin diseases, cure bad cough, breathing difficulty and chest pain, treats inflammations, earache, muscular and rheumatic pain and various cardiovascular disease, cancer and other chronic diseases14,15.

In our previous study we have tried to investigate the effect of ethyl acetate fraction of CP in breast cancer rats and possible molecular mechanism involved by the CP16. Therefore, this study extends to use NP approach to establish the effect of CP in BC and to predict core targets and biological functions, pathways and mechanism of action. Molecular docking was also performed on selected active components and the key targets to validate the NP results.

Results

Components in CP

According to literature and TCMID, TCM, TCMSP databases 61 active components were found in CP. The selected components were further subjected to evaluate pharmacokinetics parameters i.e. ADME criteria for OB ≥ 30% and DL ≥ 0.18 values are shown in Table 1.

Table 1 The specific information of anti-breast cancer components in Caesalpinia Pulcherima.

Screening of components of CP for breast cancer

Among the 61 components, only 29 components were found to relate with breast cancer as shown in Table 1. Using TCMSP the 29 components were subjected to OB and DL criteria filtering out with only four active components of CP based on the ADME criteria. The genes of each active component were fetched from Swiss target prediction whereas genes of the disease i.e., breast cancer were retrieved from GeneCards Human database.

Construction and analysis of target PPI network

In order to enhance the visualization and understand the mechanism of the targets, it is important to study the PPI of the target genes. The target genes of the corresponding components were subjected to STRING v_11 to visualize and construct the PPI network for the same. The high-confidence target protein interaction data was set with a score level greater than 0.9. The interactions between the target proteins is depicted in Fig. 1, which comprises of total 124 nodes, 233 edges; each edge represents PPIs. The other parameter is average node degree which values at 3.76 and local clustering co-efficient: 0.538 corresponds to the number of targets that are connected to the network. Degree plays an important role in showing the role of proteins interaction and nodes of network. The PPI network shows the targets involved in breast cancer are ESR-1, ESR-2, ESRRA, MET, VEGF, FGF, PI3K, PDK-1, MAPK, PLK-1, NEK-2, and GRK which are the major targets in breast cancer. Along with ESR-1; EGFR, MET, and VEGFR are located centrally in the network which indicates the role of proteins in the pathogenesis of breast cancer. Basically, ESR pathway is a forthcoming starting point to discover the mechanisms of breast cancer. The main factor in ESR pathway is estrogen which is found to be an integral component involved in the development and maturation of breast. The involvement of estrogen receptors are also seen in other pathological processes including breast cancer, endometrial cancer, and osteoporosis. So, PPI network and pathway analysis of novel genes were carried out for the recognition of critical genes related to the breast cancer.

Figure 1
figure 1

Protein–protein interaction network of CP in breast cancer (BC) targets obtained from STRING v_11.0 database (contributing co-author: Shweta A. More).

Arrangement and construction of disease-target network

To study the signaling pathway and function of the selected target genes, the data was imported to Cytoscape to construct compound-target network. In Fig. 2, the compound-target-disease interaction network was constructed which elucidate the mechanisms of drug action in the breast cancer treatment. It consists of 4 ingredients, and 150 interactive target proteins. In this network, we found that many targets were hit by multiple components. This fact inferred that the active biochemical of CP might influence these targets synergistically; it has therapeutic effects on other disease and disorders additionally to BC. The details of three topological parameters i.e. Betweenness Centrality, Closeness Centrality, and Degree are given in Table 2 which gives an important role of each target in the network structure.

Figure 2
figure 2

Compound-target-breast cancer network constructed by Cytoscape v_3.7.1 (contributing co-author: Shweta A. More).

Table 2 Important nodes with network analyzer results.

GO gene enrichment analysis and KEGG pathway annotation

GO enrichment analysis was carried out to analyse the target proteins. The setting for the ClueGO was set for three criteria to analyse the target genes for GO biological (Table 3), Go molecular (Table 4), and GO cellular (Table 5) and the most important parameter as KEGG pathway (Fig. 3A). The GO term fusion was restricted to pV ≤ 0.005 that is based on the false discovery rate (Benjamini-Hochberg). The Ras-Raf-MAPK signaling pathway is a key route for the ErbB family, as is the PI3K/AKT pathway, both of which results in alteration of cell proliferation and apoptosis. As far as breast cancer is considered, over-expression of ErbB receptor may lead to Ras activation. So to analyse the statement, GO and KEGG analysis was carried to determine the signaling pathway and following pathways were found to associate: PI3K-Akt, MAPK, ErbB, Ras, Chemokine, HIF-1, FoxO, sphingolipid, AMPK, VEGF, JAK-STAT, TBF, insulin, GnRH, estrogen signaling pathway, prolactin signaling pathway, thyroid hormone signaling pathway, and relaxin signaling pathway (Fig. 3B).

Table 3 GO biological process.
Table 4 GO molecular function.
Table 5 GO cellular component.
Figure 3
figure 3

(A) Kyoto encyclopaedia of genes and genomes pathway (Contributing co-author: Shweta A. More). (B) Gene ontology enrichment analysis (contributing co-author: Shweta A. More).

CP can be used for the treatment of other conditions also as hepatitis B, measles, human T-cell leukemia virus 1 infection, Kaposi sarcoma-associated herpes virus infection. As the constituents of CP was selected to examine the effects on breast cancer but via KEGG analysis (Table 6) it was found responsible in many cancers as colorectal cancer, renal cell carcinoma, pancreatic cancer, endometrial cancer, prostate cancer, melanoma, bladder cancer, small cell lung cancer, non-small cell lung cancer, hepatocellular carcinoma, and gastric cancer. Owing to the facts and visualization CP may be used as novel drug for the treatment of various diseases and disorder.

Table 6 KEGG pathways.

Molecular modelling: docking of the active components

After analysing the pathways and diseases and disorder related to the genes, it is important to study structure based design of the component, as well as its ability to predict the binding-conformation of small molecule ligands to the appropriate target binding site. The main reason behind selecting the proteins is that they played a major role in PPI, Compound-target network and in KEGG analysis; as well as these proteins are found to play a vital role in the mechanism of BC. Table 7 and Fig. 4A–E gives the detail about docking score and poses of active components i.e., cyanidin, catechin, ellagic acid, and quercetin against c-MET (PDB Id: 5EYD), EGFR (PDB Id: 4ZAU), PDGFR-α/β (PDB Id: 5GRN), VEGFR (PDB Id: 2OH4), and ERR (PDB Id: 1ERR-α).

Table 7 Docking score of tyrosine kinase (interacting amino acid).
Figure 4
figure 4

Docking analysis of components and targets. (A) Catechin and EGFR (GLU 762). (B) Catechin and PDGFR (PHE 837). (C) Cyanidin and 1ERR (LEU 346). (D) Cyanidin and VEGFR (LEU 838). (E) Quercetin and c-Met (ALA 1221) (contributing author: Santosh N Mokale).

In vitro experimental validation

The KEGG analysis indicated that PI3K-Akt, Ras-Raf-MAPK and estrogen signaling pathway are a key route associated with breast cancer treatment by CP. To verify the reliability of obtained targets in network pharmacology analysis, we have conducted western blot analysis for ER and EGFR protein expression to confirm signaling pathway in breast cancer effects of the CP. The expression of ER and EGFR were declined after 24 h in MCF-7 cells treated with EAFCP at dose level 200 µg/ml. The obtained results indicate that EAFCP may suppress estrogen regulated EGFR mediated signaling pathway (Fig. 5).

Figure 5
figure 5

Influence of EAFCP treatment on expression ER signaling proteins in MCF-7 cell. (A) Representative western blot images of ER-α and EGFR protein expression. β-actin was used as a control. (B) Relative protein concentration of ER- α and EGFR respect to β-Actin (contributing co-author: Nikhil S. Sakle).

In vivo experimental validation

In this part to reveal the molecular mechanism of EAFCP (ethyl acetate fraction of Caesalpinia pulcherima) in treating MNU (N-Methyl-N-nitrosourea) induced mammary carcinoma, the anti-tumour effect improved after treatment for 30 days. EAFCP 500 mg/kg and TAM (Tamoxifen) 2 mg/kg tumour-bearing rats were significantly more resistant to the development tumour than control rats and observed decreased tumour development (Fig. 6A–C). Consequently, we have examined the effect of treatment on the density of ER-α expression by immunohistochemistry (Fig. 7A–D). After EAFCP and TAM treatment, ER-α immunoreactivity inside the nucleus was reduced significantly. The results strongly established that the treatment worsen tumour development by interfering ER.

Figure 6
figure 6

In vivo experimental study. (A) Representative rats in MNU control and treated group. (B) Representative images of tumours obtained from MNU control and treated rats. (C) Weight of tumours (g) (mean ± SEM; ap < 0.001, bp < 0.01) (contributing co-author: Nikhil S. Sakle).

Figure 7
figure 7

Significant ER-α immunohistological changes in mammary gland of control and treated rats after 04 week of treatment. All the groups receive i.p. dose of MNU at 50 mg/kg, body weight. (A) MNU Control. (B) EAFCP 250 mg/kg. (C) EAFCP 500 mg/kg. (D) TAM 2 mg/kg (contributing co-author: Nikhil S. Sakle).

Discussion

As far as traditional approach is considered “one drug, one target” theory of drug design is used, in contradictory network pharmacology which aims to explore the correlation of drugs and diseases, based on the multi-targeted therapy17. Novelty of this approach includes the use of systems biology, network analysis, connectivity, and redundancy. NP studies was successfully used to identify the novel targets and to determine the unknown signaling pathways interact with compounds18,19. The NP approach provides new insights into the systemic connection between therapeutic targets, and a disease as a whole and provides a powerful and promising tool for the clarification of disease mechanisms at a systemic level and the discovery of potential bioactive ingredients20. In this context the present study generated a novel network which gives a general view of molecular mechanism of CP.

Active components found in CP are ellagic acid, gallic acid, cyanidine, catechin, quercetin, rutin, β-sitosterol, myricetin, flavonoids, homo-flavonoids, pulcherrimin, lupeol, glycosides and phenols. The components were screened for its DL and OB criteria and thus, four active components i.e. quercetin, ellagic acid, cyanidine, and catechin were found suitable for further studies. BC network constructed through the plant bioactive target followed by recognition of targets associated with BC pathway. The network reveals the potential of 4 CP bioactives to modulate the BC by the interactions of 150 proteins through multiple pathways. These components inhibit cells proliferation, induces cell cycle arrest and apoptosis in different cancer cell types21,22,23,24,25. Through PPI interaction of the genes we found 124 nodes and 233 edges. ESR-1, ESR-2, ESRR-A, MET, FGF, VEGF, PI3K, PDK-1, MAPK, PLK-1, NEK-2, and GRK were likely to be key genes in the development of BC. GO and KEGG analysis revealed several pathways as well as other disease and disorders for the selected genes. The GO enrichment analysis showed the direct involvement of bioactive in the regulation of BC. KEGG pathway analysis proved that estrogen signaling pathway and ErbB signaling pathway may be crucial signaling pathway in the selected network which helps to support that CP may be used for BC treatment. Other than estrogen signaling pathway and ErbB, Ras, Chemokine, HIF-1, FoxO, sphingolipid, PI3K-Akt, AMPK, VEGF, JAK-STAT, TBF, insulin signaling pathway, GnRH, estrogen signaling pathway, prolactin signaling pathway, thyroid hormone signaling pathway, and relaxin signaling pathway were also found, suggesting the use of CP in multi-targets. Adding on the evidence for CP it can be used in colorectal cancer, renal cell carcinoma, pancreatic cancer, endometrial cancer, prostate cancer, melanoma, bladder cancer, small cell lung cancer, non-small cell lung cancer, hepatocellular carcinoma and gastric cancer. In addition to this, the docking study was carried for the validation of targets. It also screens the affinity between the components and targets, which can directly clarify their structure–activity relationship. On the basis of the results obtained in network pharmacology, therapeutic effect of CP was investigated by western blotting, signifying that EAFCP treatment could regulates ER signaling pathway and suggesting that breast cancer can be treated through a complex system with multi-component target disease interaction. Our earlier preclinical study on the phytochemicals from CP decrease cell proliferation and induce apoptosis, significantly improved the pathological conditions of MNU induced breast cancer rat tissue suggesting their involvement in BC treatment16. It is not yet known whether giving CP alone or with chemotherapeutic agent will enhances the activity in treating patients with breast cancer which is future scope of the present study.

Conclusion

This study scientifically investigates the pharmacological mechanism of CP in the treatment of breast cancer through network pharmacology, docking analysis, western blotting and in vivo animal study. It is in addition worth mentioning that network pharmacology has great advantages in clearing up the mechanism of CP as TCM.

Materials and methods

The following parameters are important in order to construct a network: (1) identification and confirmation of compounds using chemical databases; (2) selection of compounds on the basis of pharmacokinetic parameter i.e. ADME (absorption, distribution, metabolism, and excretion) criteria; (3) selected compounds were further subjected to understand protein interaction and to obtain the relevant information by using publicly available database or tools; (4) genes related to target disease i.e., breast cancer were identified using human disease database and common genes of target and compounds were selected; (5) construction and analysis were carried out to understand the interaction and molecular mechanisms using visualization software; and (6) to perform docking study of the active actives.

Chemical databases

The chemical components of CP were identified through literature and the Traditional Chinese Medicine Ingredient Database26 (TCMID, https://www.megabionet.org/tcmid/); the TCM Database@Taiwan27 (https://tcm.cmu.edu.tw/), most comprehensive databases on global scale. The chemical components were subjected to database called Traditional Chinese Medicine Systems Pharmacology28 (TCMSP, https://lsp.nwu.edu.cn/tcmsp.php) to screen for breast cancer. TCMSP helps to promote integration of both traditional as well as modern medicines in order to accelerate the drug discovery which builds the framework for system pharmacology and covers the ADME information. Chemical structures, synonyms, molecular weight, canonical SMILES and physicochemical properties were collected with the help of ChEMBL29 (https://www.ebi.ac.uk) and Pubchem30 (https://pubchem.ncbi.nlm.nih.gov).

Evaluation of pharmacokinetics parameters

The selected components were further screened for oral bioavailability and drug-likeness pharmacokinetics parameters which include ADME. The ADME characteristics of the drug indicate the ratio of the oral drug to oral dosage of the blood circulatory system. The parameters to access the components druggability is analysed according to the set parameters as oral bioavailability (OB ≥ 30) value and drug-likeness (DL ≥ 0.18) indices recommended by TCMSP28. Among all the selected components, only the components which fit in the criteria were selected for construction of network.

Identification of target genes and construction of target PPI (protein–protein interaction) network

PPI is important aspect to study the involvement of proteins in various biochemical processes in order to understand the cellular organization, bioprocess and functions. This can be done by using the virtual screening database called STRING 11.0 (https://string-db.org/)31. The genes of the selected components were uploaded to STRING to get the information about PPIs. The setting for generating the PPI network was in accordance with ‘Homo sapiens’ and the confidence in the interaction among the target protein was set to the highest confidence data > 0.9. The network nodes represent proteins whereas the edge represents associated protein–protein.

Identification of disease target genes

In order to construct the compound-target network, it is important to identify the genes related to disease. The information related to breast cancer associated target genes was collected from GeneCards32 (https://www.genecards.org), human gene database which provides information related to all annotated and predicted human genes.

Construction of compound-target network

Once the protein–protein interaction was carried, the next step is to understand the molecular mechanism which is achieved by constructing the compound-target network using Cytoscape33 visualization software v_3.7.1. The compound-target network helps to understand and analyse the mechanism of the components with target as well as the pathway involved.

Gene Ontology (GO) gene enrichment analysis and Kyoto encyclopaedia of genes and genomes (KEGG) pathway annotation

GO and KEGG pathway annotation is carried on ClueGO, another Cytoscape plug-in which gives a network-based visualization to diminish redundancy of results from pathway enrichment analysis34. GO is carried out to analyse the gene cluster in the network to improvise the data prediction. GO provides a hierarchically organized set of thousands of standardized terms for biological processes, molecular functions and cellular components, with curated and predicted gene annotations based on these terms for multiple species. Biological process GO annotation is frequently used resource for pathway enrichment analysis. The study objective is to identify the biological process in order to layout the meaningful functional information. KEGG is used to study gene functions and the metabolic pathway of the inputted network of genes and molecules. It also helps us to find out the contributing pathway of the target associated with the disease.

Molecular modelling: docking of the active components

Docking study was carried out to find the affinity as well as orientation of the selected active component by docking them against the selected receptors as c-MET, EGFR, PDGFR-α/β, VEGF, and ERR using Glide v_7.6 program interfaced with Maestro v_11.3 of Schrödinger 2017 (Schrodinger, LLC, New York, NY, USA). The crystal structure for c-MET (PDB Id: 5EYD), EGFR (PDB Id: 4ZAU), PDGFR-α/β (PDB Id: 5GRN), VEGFR (PDB Id: 2OH4), and ERR (PDB Id: 1ERR-α) were taken from RCSB Protein Data Bank and prepared for docking using ‘protein preparation wizard’. The structures of compounds were built using Maestro build panel and optimized to lower energy conformers using Ligprep v_3.335.

Experimental validation

Western blot analysis

The cell lysates from EAFCP (200 µg/ml)16 treated MCF-7 cells were obtained and protein concentrations were measured using a Bradford protein assay kit (Bio-Rad, USA). Approximately, 50 μg lysate was resolved on 10% SDS-PAGE and transferred onto the PVDF membrane (Millipore, USA). The membrane was blocked and incubated with respective primary antibodies (ER and EGFR) at 4 °C overnight. Blots were washed, incubated with HRP-conjugated secondary antibodies, developed using chemiluminescent solution (Immobilon Western, Millipore, USA) and scanned by using gel documentation system (Bio-Rad)36,37.

Animals

All procedure involving animal experiments were reviewed and approved (CPCSEA/IAEC/Pharm.Chem.-31/2016-17/129) by the Institutional Animal Ethical Committee (IAEC) of Y. B. Chavan College of Pharmacy, Aurangabad, India. All experiments were performed in accordance with the regulations and guidelines issued by the Committee for the Purpose of Control and Supervision of Experiments on Animals (CPCSEA) India. Virgin female Sprague–Dawley rats (B.W. 150–210 g) were obtained from Wockhardt Research Centre Pvt. Ltd, Aurangabad, Maharashtra, India, and maintained in standard environment conditions (12:12 h light–dark cycle, 25 ± 2 °C; 55 ± 5% relative humidity) with free access to food pellets and water ad libitum. After adaptation for 7 days, rats were randomly divided into 5 groups of six rats each. On the 50th day of rat’s age, MNU (50 mg/kg) was injected intraperitoneal (i.p.) (Supplementary file S1). A fresh solution of MNU was prepared by dissolving immediately before use in 0.9% NaCl adjusted to pH 4 with acetic acid.

The normal control group I and MNU cancer-induced group II receive normal saline solution, MNU cancer-induced group III and group IV receive treatment EAFCP at 250 and 500 mg/kg, p.o. body weight, while cancer-induced Group V receive TAM 2 mg/kg, p.o. body weight) for 30 days16.

Immunohistochemical analysis

IHC analysis was performed on formalin-fixed, paraffin-embedded tissue sections using standard histologic procedures. The primary antibodies for ER-α was incubated at a dilution of 1:50 for one and half hour at room temperature or 16 h (overnight) at 4 °C. The antigen retrieval was performed with thermic treatment by microwave using 3 cycles of 5 min for ER-α with citrate buffer solution. The tissues were counterstained with hematoxylin. The immunoexpression of ER-α was evaluated as per the Allred score for ER nuclear positivity, the proportion score (PS) (0–5) and the % positive tumor cells are respectively, 0 (0%), 1 (< 1%), 2 (1–10%), 3 (11–33%), 4 (34–66%), and 5 (67–100%). The intensity of staining (IS) for the nuclear positivity of the cells graded as 0, 1, 2, and 3 was as none, mild, moderate, and strong, respectively. So the total scores for ER is given as TS = PS + IS. TS 0 and 2 are negative scores, and 3, 4, 5, 6, 7, and 8 are positive scores37,38.