Introduction

Being overweight increases the chances of developing serious complications such as diabetes mellitus (type II), cardiovascular disorders, osteoarthritis, some types of cancer, and stroke. Obesity reflects both genetic and acquired correlated factors and contributes to dyslipidemia (high LDL cholesterol, high triacylglyceride, and low HDL cholesterol levels), insulin resistance or impaired blood glucose, and borderline hypertension (Sarma et al. 2021). Glycolipid metabolism abnormalities can be said to affect about 12.5 to 31.4% of the population of most countries and are expected to keep rising as obesity rates increase. One billion people are expected to be obese worldwide by 2030 (Noubiap et al. 2022). Besides, higher medical treatment expenses would be incurred as many complications are present. A long-term lifestyle regimen begins with weight loss (low-fat diet) and modifying physical activity enhances metabolic symptoms but not to a sufficient degree. The complicated treatment plan involves several different medications which result in drug interactions, side effects, and an insufficient therapeutic response. Therefore, searching through food components and developing new medications with diverse activities is one prospective approach (Lillich et al. 2021).

In the modern era, computational drug design is one of the key mechanisms for pharmaceutical discovery. The rapid advancement of new drugs is significantly assisted by molecular docking studies. Additionally, molecular modeling projections validate the outcomes of molecular docking (Al-Karmalawy et al. 2021). Innovative in silico approaches, such as molecular docking, are utilized to highlight the interactive mechanism between targeted obesity-related enzymes and their potential inhibitors. Therefore, it provides an influential, coherent, and low-cost implementation for the screening of novel bioactive natural compounds (de Lima Barros et al. 2022). Nowadays, bioinformatics and combinatorial chemistry are the most dynamic approaches to find potent druggable candidates in the drug-development field (Swain and Hussain 2022).

Trifolium L. (clover or trefoil) is an important economic genus of the family Fabaceae (Leguminosae), consisting of approximately 300 species from the Mediterranean area, temperate, and somewhat subtropical regions of North and South America. Trifolium species are high-protein fodder plants eaten by cattle and wildlife, and indirectly improve human health (Kassem et al. 2017). In addition, clovers have a long history of being valued as remedial plants for curing various skin, respiratory, nervous, reproductive, and digestive complaints (Kolodziejczyk-Czepas 2016). In recent years, numerous therapeutic effects have been supported concerning their traditional uses as antioxidant, anti-inflammatory, antihyperglycemic, anticancer, antimicrobial, hepatoprotective, and healing effects (Malca-Garcia et al. 2022). Moreover, the estrogenic effect of clovers was proven to alleviate menopausal symptoms and serve as an alternative therapy for disorders associated with metabolic syndrome (Jungbauer and Medjakovic 2014). Lately, the ameliorated impact of Trifolium pratense L. (red clover) isoflavones extract against metabolic syndrome elements was proved by Yokoyama et al. (2020). Likewise, its extract significantly improved the plasma lipid profile after menopause, decreased plasma glucose levels, and increased insulin sensitivity in diabetic rats (Oza and Kulkarni 2020). Trifolium resupinatum L. (Persian or Reversed clover) is indigenous to the Mediterranean, southern Europe, and southwest Asia (Singh et al. 2019). It is one of the economically valuable pasture crops popularly used as fodder and hay for livestock. In Egypt, this species grows densely along the Nile River, irrigation canals, and drainage networks. It is considered an aggressive species because it quickly spreads to cover any exposed soil (Elkordy et al. 2019). Previous works on the aerial parts of T. resupinatum characterized the high levels of isoflavone and flavonol metabolites (Kassem et al. 2017).

In the present research, phenolic compounds were isolated from the aqueous methanol T. resupinatum leaf extract and used them as standards with other reference phenolics through the LC-ESI-MS/MS analysis. Global natural products social (GNPS)-based molecular networking was integrated. Two major isoflavone isolates along with the investigated extract were examined for their ability to suppress pancreatic lipase in vitro. The other isolates were virtually screened through molecular docking, which is conceivably the implicit mechanism of preventing obesity caused by the suppression of pancreatic lipase. In addition, the extract was checked for its effect on the development of some metabolic syndrome criteria in a high-fat diet-induced obese rat model. The isolated compounds were also assessed for absorption, distribution, metabolism, and excretion (ADME), as well as for drug-likeness.

Material and Methods

General Procedures

The ultraviolet analyses were recorded at Shimadzu UV-240 spectrophotometer, and NMR spectra were carried out using Jeol EX-500 spectrometer (1H NMR, 500 MHz). EI-MS data were carried out on a Finnigan-Mat SSQ 7000 spectrometer. A LC-ESI-MS/MS spectrometer was used for MS analysis (negative mode) with an Exion LC AC system for separation and SCIEX Triple Quad 5500 + MS/MS system equipped with electrospray ionization for detection. Polyamide 6S (Flucka, 5 × 140 cm) and Sephadex LH-20 (Pharmazia, Uppsala, Sweden, 2 × 60 cm) column chromatography (CC) and Whatman No. 1 and 3MM paper chromatography sheets (PC) (Whatman Ltd., England, 46 × 57 cm) were used for sample isolation. Methanol, distilled water, acetic acid, and n-butanol solvents (Adwic, Cairo, Egypt) with systems: 15% acetic acid (H2O: HOAc, 85:15) and BAW (n-butanol:acetic acid:H2O, 4:1:5, upper phase) were applied. Acid hydrolysis was carried out using 2 N HCl for 2 h at 100 °C. For sugar analysis, sugar reference samples (E. Merck, Darmstadt, Germany), and the upper phase of C6H6-n-BuOH-H2O-pyridine (1:5:3:3) solvent system were used, then the PC was sprayed with aniline phthalate reagent. Phenolic acids and flavonoid reference aglycones (isoflavones, flavones, and flavonols) (Fluka AG, Bucns SG, Switzerland) were used for co-PC and LC-MS/MS authentication. A rotary evaporator (HEIDOLPH 4000, Germany) and distiller (Hamilton, Westwood Industrial Estate) were used for laboratory work. Centrifuge (Sigma 2-15, Germany). The Egyptian Company for biotechnology (S.A.E.) supplied the biochemical spectrum kits, Obour City, Cairo, Egypt.

Plant Material

Trifolium resupinatum L., Fabaceae, was collected and authenticated by Prof. Dr. Ibrahim El Garf from the Botanical Garden of Cairo University. A plant voucher (accession No. 446) was kept at the National Research Centre Herbarium, Giza, Egypt. About 800 g of air-dried leaves powder was extracted by repeated percolation with 70% aqueous methanol (3 × 4 l) and approximately 120 g was yielded after evaporation of the solvent at 60 °C under reduced pressure. Part of the extract (10 g) was set aside to examine its pancreatic lipase inhibitory effect and in vivo investigations.

Chemical Profiling

Isolation and Structural Elucidation

The remaining residue (110 g) was defatted using chloroform to produce a defatted residue (90 g) which was loaded on a polyamide CC (450 g). Successive elution was carried out starting with water, then the polarity decreased by increasing the methanol in water to 100% methanol. Based on PC properties after spraying with AlCl3 and FeCl3, similar ones were combined, and four main fractions were obtained (A–D). Fraction A (H2O: MeOH; 3:2) (17.2 g) was applied on PC using BAW three times to produce compounds 1 (12 mg), 12 (12 mg), and 14 (18 mg). While fraction B (28.5 g) (H2O: MeOH; 1:1) was purified through 15% acetic acid (double solvent) on PC revealing two subfractions BI and BII. BI (13.8 g) was further chromatographed through Sephadex CC using H2O:MeOH (1:1) eluent and yielded compounds 2 (34 mg), 5 (49 mg), and 7 (8 mg). Likewise, compounds 3 (28 mg), 4 (10 mg), and 8 (10 mg) were isolated from BII (9.5 g). Furthermore, the isolates 6 (23 mg), 9 (10 mg), and 10 (6 mg) were obtained through PPC using BAW followed by 15% acetic acid from fraction C (H2O:MeOH; 3–4:6–7) (12.5 g). Finally, fraction D (H2O:MeOH; 1:4) (7 g) was subjected to PPC using BAW to give 11 (13 mg) and 13 (8 mg) which were eluted through Sephadex CC with H2O:MeOH (1:1) solvent. Repeated purification using Sephadex CC (MeOH) for each sample was achieved.

LC-ESI-MS/MS Analysis

The analysis was performed using C-18 Column (Ascentis® Express 90 Å, 2.1 × 150 mm, 2.7 µm). Two mobile phases gradient (A: ammonium formate; 5 mM, pH 8 and B: acetonitrile; HPLC grade) were designed as follows: 0–1 min (5% B), 1–20 min (5–100% B), 20.01–25 min (100% B), 25.01–30 min (5% A). The volume (5 µl) was injected and run with a flow rate (0.3 ml/min). The MS/MS negative ionization mode analysis was practical from 100 to 1000 Da with a scan (EMS-IDA-EPI) for MS1 using the ensuing criteria: 25 psi curtain gas, −4500 ion spray voltage, 500 °C source temperature, 45 psi ion source gas 1 and 2, and from 50 to 1000 Da for MS2 with −80 declustering potential, −35 collision energy, and 15 collision energy spread. Negative ionization mode is a useful tool for structural investigations of flavonoids, i.e., through the identification of fragments of interest and relative intensities of some diagnostic ions (Farid et al. 2022). The peaks and spectra were administered using the Analyst TF 1.7.1 and PeakView® 1.2 Softwares (SCIEX, Framingham, MA, USA). The isolated pure compounds were used as standard samples together with other authentic isoflavones and previously isolated phenolics (Marzouk et al. 2012; El Shabrawy et al. 2014; Hussein et al. 2017, 2018; Kassem et al. 2017; El-Sherei et al. 2018). Compounds annotation was assessed by comparing their mass spectrometry data and retention time of each and mapping them through molecular networking.

Molecular Networking

The molecular network was constructed on the GNPS website (http://gnps.ucsd.edu). Raw data was converted by MSConvert package (Proteowizard Software Foundation, Version 3.0.19330, USA) to an open-source format (.mzML). Using WinSCP (SFTP, FTP, WebDAV and SCP client), the transformed “.mzML” files were uploaded on the GNPS platform. For the comparison of T. resupinatum extract and the standard mixtures, the converted data of both were proposed with two groups in one job. GNPS parameters are minimum-matched fragments (four ions), fragment ion tolerance (0.5 u), minimum pairs cosine score (0.65), parent mass tolerance (2.0 u), library search minimum-matched peaks (4), and score threshold (0.65), and do not filter in all library filtering option and other advanced options are set as default. The designed negative molecular network can be retrieved via the following link: https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=9915465576554c3eb5116d062177ea58. The spectral network was visualized in conjunction with the help of Cytoscape 3.9.1. The molecular networks visualization characterizes each spectrum as a node, and spectrum-to-spectrum groupings as edges and associated by structural correspondence identified via the MS (Xu et al. 2021). Natural products dereplication databases were also searched such as ChemSpider (http://www.chemspider.com/) PubChem (https://pubchem.ncbi.nlm.nih.gov/) and Dictionary of Natural Products (http://dnp.chemnetbase.com/faces/chemical/Chemical Search.xhtml).

Biological Assessment

Pancreatic Lipase Inhibition

The porcine pancreatic lipase (PPL) inhibition potential of T. resupinatum leaf extract and the two major compounds (2 and 5) were established by the in vitro method suggested by Kim et al. (2010) using NPB (p-nitrophenyl butyrate) as a substrate. A potassium phosphate buffer (0.1 mM, pH 6) was used for the preparation of a lipase solution (100 µg/ml). Samples with different concentrations (7.81–1000 μg/ml) were pre-incubated with lipase solution (10 min, 37 °C) and then the reaction was taking place by addition NPB substrate (0.1 ml), and incubated at 37 °C for 15 min. Finally, a multiplate reader was used to measure the quantity of p-nitrophenol released during the reaction. Each experiment was achieved in triplicates. The percentage inhibition was calculated using the equation: % inhibition = (1 – Asample/Acontrol) × 100, where a sample is the absorbance of the tested samples and Acontrol is the absorbance of control. The IC50 value was specified as the concentration of lipase inhibitor to inhibit 50% of its activity under the assay requirements.

In Vivo Assay

Body Weight Assessment

Due to insufficient amounts of the two major isoflavones (2 and 5), the extract was only evaluated for the in vivo assessment. A diet-induced obese rat model was obtained from the animal house in the National Research Centre. Fifteen Wistar rats (140–150 g) were held with a 12:12-h light-dark cycle in a temperature-controlled room. The diets provide at least 3.85 kcal/g of energy. The high-fat diet (HFD) was achieved by modifying the percentage of fat higher than a normal diet. Fat content is 20% sheep tallow and 1% cholesterol. Nipples of stainless steel for feeders and drinking were supplied for each cage to allow recording the individual feed intake of each rat. They were classified into three groups (n = 5) as follows: control group, received a normal chow diet (ND) for 8 weeks; group 1, received a HFD for 8 weeks; and group 2, received HFD + T. resupinatum extract (25 mg/kg body weight suspended in 0.5 ml saline) daily for 8 weeks by oral gavage. Body weight measurements were recorded from all groups at 0 and after 8 weeks.

Blood Biochemical Analysis

Blood samples were taken by puncture of the rat orbital sinus. The samples were kept coagulating at room temperature for 20–30 min and then centrifuged at 1006 × g for 15 min. Sera were separated and stored at 8 °C for biochemical analysis. Blood glucose levels were measured using the hexokinase colorimetric method. Triacylglycerides and total cholesterol levels were determined by enzymatic methods in all groups.

Statistical Assessment

The data were produced as mean ± SD. Statistical assessment of the biochemical data was performed with SPSS software version 15. The comparison between group means was analyzed by one-way ANOVA.

Molecular Docking

Protein Structure Preparation

The prerequisite protein structure in the docking analyses was downloaded from PDB, http://www.rcsb.org.pdb (the Protein Data Bank). The crystal structure of the porcine lipase (PDB ID: 1ETH) was retrieved in PDB format. In the AutoDock tools, the structure was prepared by the exclusion of water molecules, and all heteroatom coordinates, added with missed atoms, polar hydrogens, and charges. It was made sure that no residues carry the non-integral charges which are a prerequisite for the receptor file in the software AutoDock 1.5.7 (Akshatha et al. 2021). Finally, the resulting protein file was saved as a PDBQT format.

Ligand Preparation

The chemical structures of isolated constituents were obtained from the PubChem compound database (https://pubchem.ncbi.nlm.nih.gov) and retrieved as Structure Data File (SDF) formats. Compound 4 is unavailable on PubChem database; thus, it was sketched using ChemDraw (Cambridge Soft, Waltham, MA, USA) and further optimized and converted using Avogadro 1.2.0 (http://avogadro.cc/) to pdb files. Using AutoDock Tools software (version 1.5.7), the pdb files of the isolated chemicals were transformed into pdbqt files format (Swargiary and Daimari 2020). Orlistat, a medication used to treat obesity, was used as a positive control for molecular docking assay, downloaded through https://www.drugbank.ca/ (the drug bank website) and employed as acquired.

Docking

After the preparation of ligands and protein, docking was accomplished in AutoDock Vina. We noted the x, y, z values from attributes of the SBD site sphere using Biovia Discovery Studio software (v21.1, 2021) (Suryanarayana et al. 2021). A Search Space performed approximately the same size as pancreatic lipase, 60 Å radius was created and centered at x = 71.52, y = 26.883, z = 187.849 (Urbizo-Reyes et al. 2022), therefore, the set exhaustiveness parameter is increased. Separately, the individual ligand was docked with the enzymes and the final binding affinity (kcal/mol) of a given ligand-enzyme interaction was obtained. The docking binding between docked agents and the targets was analyzed using Biovia Discovery Studio software (v21.1, 2021) (Swargiary and Daimari 2020).

ADME and Drug-Likeness Estimation

Website http://www.swissadme.ch was accessed in a web browser that displays the submission page of SwissADME and the canonical SMILES of the isolated phytoconstituents were input for estimation of ADME and drug likeness. The SwissADME server includes configurations which are selected for robustness, speed, and clear interpretation of the results. In addition, isolated compounds were subjected to molecular properties prediction for drug-likeness by MolSoft (MolSoft, 2007) software (Youssef et al. 2020).

Results and Discussion

Chemical Analysis

Isolation and Structure Elucidation

The applied chromatographic techniques managed the isolation of 14 phenolic compounds from T. resupinatum extract. Chemical and physical investigations were used to elucidate their structural composition. In addition, their structural data were compared with previously reported spectroscopic data for more confirmation. In addition to one phenolic acid (chlorogenic acid; 1), three isoflavone aglycones: formononetin (2), pseudobaptigenin (5), and genistein (7); five isoflavone 7-O-β-glucosides; ononin (3; formononetin 7-O-β-glucoside), rothindin (6; pseudobaptigenin 7-O-β-glucoside), genistin (8; genistein 7-O-β-glucoside), daidzin (9; daidzein 7-O-β-glucoside), and sissotrin (10; biochanin A 7-O-β-glucoside) as well as the acetyl derivative; 6''-O-acetyl ononin (4; formononetin 7-O-β-(6''-O-acetyl) glucoside) were established. Furthermore, flavonol aglycones and their 3-O-β-glucoside; kaempferol (11), astragalin (12; kaempferol 3-O-β-glucoside), quercetin (13), and isoquercetin (14; quercetin 3-O-β-glucoside) were isolated. Compounds 4, 8, 9, 10, and 12 were isolated for the first time from T. resupinatum but reported beforehand for other Trifolium species. The physical constants and NMR data were compared with published values for all isolated compounds (see, Supporting Information).

LC-MS Profiling

Analysis of the LC-ESI-MS base peak chromatogram of the 70% aqueous methanol leaf extract of T. resupinatum (Fig. S1) revealed the dominance of pseudobaptigenin (peak 72, isolate 5) and formononetin (peak 73, isolate 2), and their respective 7-O-glucosides (peaks 57 and 58, isolates 6 and 3, respectively). After ingestion, isoflavone glycosides are metabolized by the β-glucosidase enzyme in the small intestine to produce the more efficiently absorbed form of isoflavones with lower hydrophilic properties and smaller molecular weights (corresponding aglycons)(Kassem et al. 2017). Therefore, the two isoflavone aglycones 2 and 5 were selected for the in vitro pancreatic lipase assay along with the total extract.

figure b

Metabolite Annotation

In this context, detailed metabolite profiling of T. resupinatum leaf extract via HPLC-ESI-MS/MS was performed using a standard mixture including the isolated compounds (Fig. S1). For a more accurate representation of the species hydroalcoholic extract metabolome, molecular networking was created (Fig. 1). With the aid of GNPS libraries, possible correlations between each MS/MS spectrum were found, thereby enabling additional annotation of unknown but related molecules. About 81 metabolites were distinguished, 69 of which are species-first detection (Table S1). They belong to many chemical classes comprising fatty acids (four metabolites), amino acids (five metabolites), organic acids (two metabolites), and sugars (two metabolites) with phenolics as predominant class (68 compounds) (Fig. 2). The interpretated phenolics included 52 flavonoid compounds (23 isoflavonoids, 16 flavonols, seven flavones, five flavanones, and one flavanonol), along with 11 phenolic acids and aldehydes, as well as five coumarins (Table S1, Fig. 2).

Fig. 1
figure 1

The molecular network created using MS/MS data (negative ionization mode) of Trifolium resupinatum extract (yellow) and standards mixture (blue)

Fig. 2
figure 2

Classes of chemical metabolites identified in Trifolium resupinatum leaf extract with ratios of flavonoid subgroup structures in relation to the total of identified compounds

In the negative ionization mode of the constructed molecular network, 425 nodes were grouped in 34 clusters (two linked nodes at least) and 191 self-looped nodes, whereby the interesting clusters A, B, C, and D guided the characterization of various phenolic skeletons (Fig. 1). The yellow nodes represent the targeted extract sample while the blue ones corresponded to the standards mixture ions. The unmatched clusters were analyzed and verified manually. On the other hand, the remaining constituents were tentatively elucidated via their accurate molecular weights, retention time, MS2 fragments combined with GNPS libraries matches, and in parallel with the standards used or comparing their fragmentation picture with the previous reported data (Table S1).

Isoflavonoids are the most prominent constituents of the leguminous plant family (Fabaceae). They were frequently categorized into seven subclasses including isoflavones, isoflavans, isoflavanones, coumestans, rotenoids, pterocarpans, and coumaronochromones (Sohn et al. 2021). Members of the genus Trifolium, in particular red clover, contain significant concentrations of the variable structure patterns of the unique isoflavones in aglycone forms, daidzein (7,4′-dihydroxyisoflavone), genistein (4',5,7-trihydroxyisoflavone), formononetin (7-hydroxy-4´-methoxyisoflavone), biochanin A (5,7-dihydroxy-4′-methoxyisoflavone), and pseudobaptigenin (7-hydroxy-3',4'-methylenedioxyisoflavone), or some of their 7-glucoside forms, daidzin, genistin, ononin, sissotrin, and rothindin, respectively (Malca-Garcia et al. 2022). Less frequently occurring isoflavone aglycones such as prunetin (7-O-methyl-genistein), pratensein (3'-hydroxy biochanin A), and irilone (5,4'-dihydroxy-6,7-methylenedioxyisoflavone) have also been reported. In addition, minor acetylated and malonated glycosyl derivatives have been recorded (Das et al. 2020).

Cluster A of the created molecular network of T. resupinatum gathered 6 isoflavone aglycones encompassing nodes of formononetin (73, m/z 267.02 [M−H]) connected to biochanin A (34; m/z 283.01 [M−H]), and pseudobaptigenin (72; m/z 281.01 [M−H]) connected to irilone (66; m/z 297.01 [M−H]), both through the mass difference of 16 Da. Moreover, medicarpin (75; m/z 269.03 [M – H]) and daidzein (76; m/z 253.015 [M−H]) were grouped in the same cluster. Medicarpin is a benzopyran furanobenzene compound belonging to the pterocarpan phytoalexin group of the isoflavonoids family. It was formerly detected in several Trifolium species (Malca-Garcia et al. 2022) and various leguminous plants (Gampe et al. 2016; Cheng et al. 2019). Oxidation of pterocarpans yields coumestans which were also annotated in T. pratense and T. repens (Malca-Garcia et al. 2022). The most prevalent is coumestrol (54; m/z 267.02 [M−H]), which is perceived in T. resupinatum extract for the first time. It has an additional furan ring between B and C rings and dicatechol groups that resembles estradiol’s structure to which its positive effects are attributed (Ionescu et al. 2021). Another set of six isoflavone aglycones were observed as self-looped nodes; hydroxy biochanin A (35; m/z 299.01 [M−H]), genistein (44; m/z 268.97 [M−H]), pratensein (51; m/z 299.01[M−H]), and prunetin (64; m/z 283.01 [M−H]), in addition to pseudobaptigenin-O-sulfate (60; m/z 360.98 [M−H]) monitored by the daughter ion peak at m/z 281 [M−80] due to SO3 loss. These compounds were not found in Persian clover beforehand except for isoflavone aglycones 34, 44, 71, 73, and 76.

Besides the isoflavonoid aglycones, some isoflavone glycosides were pictured in clusters B and C (Fig. 1). Pseudobaptigenin 7-O-glucoside (57; m/z 488.912 [M–H + FA (formic acid)]) is directly attached by a thick edge to 71 (m/z 530.91 [M–H + FA]) with a mass shift of 42 Da as probable acetyl moiety (−COCH3), confirming the first identification of pseudobaptigenin 7-O-acetylglucoside in the genus Trifolium. On the other hand, both compounds appeared as formic acid adduct ions equivalent to [M−H + HCOOH] or [M + 45 Da], as compound 58 (formononetin 7-O-β-glucoside; m/z 474.93 [M–H + FA]). The isolated acetyl glucoside isomer of 58; acetylononin (45; m/z 471.03 [M−H]) was confirmed by the specific fragment at m/z 267 [M–H–acetyl−glucosyl]. The acetyl glycoside isoflavone conjugates of leguminous plants have already been mentioned (Gampe et al. 2016; Das et al. 2020); however, their presence in the studied species has not been previously reported. Furthermore, the characteristic ion of daidzin (25; m/z 414.96 [M−H]) was observed connected to that of genistin (33; m/z 430.94 [M−H]) in cluster C. Other glycosylated isoflavonoids were observed as self-looped nodes in the molecular network and could be annotated as coumestrol-O-hexoside (12; m/z 429.01 [M−H]), sissotrin (29; m/z 445.01 [M−H]), irilone-4′-O-glucoside (31; m/z 458.97 [M−H]), pratensein-O-glucoside (52; m/z 461.01 [M−H]), and dimethoxy isoflavone-O-hexoside (wistin) (63; m/z 505.01 [M–H + FA]) designated through the prominent peaks of their precursors’ fragment ions due to loss of the hexoside moiety [M–H–162]. Numerous peaks were verified by comparing their mass spectra and retention data with those of standards, including the isolated compounds (Table S1). Previous studies characterized these isoflavone glycosides combinations in T. pratense (Malca-Garcia et al. 2022) and other leguminous species (Gampe et al. 2016; Cheng et al. 2019). However, our study is the first to dereplicate them in T. resupinatum except for compounds 33, 57, and 58.

An additional sulfated compound was recognized as a flavone structure and recognized as apigenin-O-sulfate (18; m/z 348.99 [M−H]). Other flavone aglycones were reported before in some Trifolium species (Malca-Garcia et al. 2022) and identified as apigenin (46; m/z 269.02 [M−H]), tricin (49; m/z 329.01 [M−H]), and chrysoeriol (69; m/z 299.01 [M−H]) confirmed by standard samples. The substitution with methoxy groups was indicated by the (CH2)n subsequent loss (–14 Da)n (Hussein et al. 2018). Apigenin 7-O-glucoside (30; m/z 431.01 [M−H]), apiin (40; m/z 562.96 [M−H]), and tricin 7-O-glucoside (37; m/z 491.01 [M−H]) were eluted earlier than their corresponding aglycones. Their annotation was supported by the elimination of the hexose [M–H−162] or pentose [M–H−132] moieties to produce the analogous aglycone anions.

Another acetyl derivative was also detected in cluster C concerning a mono-glycosyl flavonol structure, kaempferol 3-O-(6''-acetyl) glucoside (42; m/z 488.94 [M−H]) which is directly linked to a node at m/z 446.93 with 42 Da mass difference, that displayed kaempferol 3-O-glucoside (26; m/z 446.93 [M−H]) based on the standard isolated compound (Fig. 1). Meanwhile, this node was directly connected with its congeners; quercetin 3-O-glucoside (22; m/z 462.96 [M−H]) and isorhamnetin 3-O-glucoside (28; m/z 476.93 [M−H]) (Fig. 1). Other peaks sharing the same parent ions of 26 and 28 corresponded to kaempferol 7-O-glucoside (36; m/z 446.93 [M−H]) and isorhamnetin 7-O-glucoside (61; m/z 477.01 [M−H]) relying on the base peak ions [Agl−H] at m/z 285 and 315, respectively. In contrast, these ions of the 3-O-isomers were [Agl−H2] at m/z 284 and 314 (Farid et al. 2022). The flavonol glycosides were also eluted first followed by their corresponding aglycones: kaempferol (50; m/z 285.01 [M−H]), quercetin (39; m/z 300.97 [M−H]), isorhamnetin (62; m/z 315.1 [M−H]), in addition to myricetin (67; m/z 317.02 [M−H]) that reported before in T. repens (Agraharam et al. 2022). Flavonols in the genus Trifolium are frequent, mainly kaempferol and quercetin (Malca-Garcia et al. 2022). They were stated earlier in the aerial parts of T. resupinatum (Kassem et al. 2017); however, the levels of different glycosylation and positional isomers are distinguished in the current study through the key fragmentation pathway approach. Two di-O-glycosyl flavonols appeared as a cluster (D) of two connected nodes representing quercetin 3-O-glucoside-7-O-rhamnoside (21; m/z 608.98 [M−H]) and kaempferol 3-O-glucoside-7-O-rhamnoside (32; m/z 593.02 [M−H]), further confirmed throughout the GNPS libraries. They produced diagnostic fragment ions at (m/z 463 [M–H–146] and 447 [M–H−162]) and (m/z 447 [M–H−146] and 431 [M–H−162]), respectively. The higher intensities for their glucosylated fragment ions [M–H+162] than those for the rhamnosylated ones [M–H + 146] suggested the substitution at 3-O-glucoside and 7-O-rhamnoside for both compounds (Farid et al. 2022). Further flavonols-O-diglycosides were observed as self-looped nodes and identified as 3-O-rutinoside of quercetin (24; m/z 609.01 [M−H]) and kaempferol (27; m/z 592.94 [M−H]). The last isomer was kaempferol 3,7 di-O-rhamnoside (38; m/z 577.01 [M−H]), which produced two diagnostic signals at m/z 431 [M–H−146] and 285 [M–H−292] compatible with the standard reference.

Limited amounts of other flavonoids rarely reported in the genus Trifolium, which correspond to flavanones and flavanonols, were obtained and observed in the molecular network as self-looped nodes. Two flavanone aglycones (47; m/z 255.01 [M−H]) and (53; m/z 271.01 [M−H]) suggested to be liquiritigenin and naringenin as described in Trifolium subterraneum L. and T. partense L., respectively (Prati et al. 2007), were also confirmed via GNPS libraries (Table S1). Their corresponding glycosylated derivatives, liquiritoside (19; m/z 416.96 [M−H]) and naringenin 7-O-glucoside (55; m/z 433.01 [M−H]), were evidenced by the product ion [M–H−162]. A further flavanone aglycone was detected as eriodyctiol (70; m/z 287.01 [M−H]). In addition to a single flavanonol aglycone, taxifolin (20; m/z 303.04 [M−H]) which was confirmed through a standard compound and previously reported in T. partense (Prati et al. 2007). Excepting rutin (24), quercetin (39), and kaempferol (50), all annotated flavonols, flavanones, and flavanonol are non-previously reported metabolites in T. resupinatum.

Additional minor phenolic derivatives were marked as self-looped nodes and newly characterized in T. resupinatum. They relied on their ion mass spectra, retention times, and mass fragmentation profile aided with GNPS, published literature, and available authentic references. Phenolic acids; caffeic (4; m/z 179.01 [M−H]), sinapic (7; m/z 223.02 [M−H]), coumaric (10 and 14; m/z 163.01 [M−H]), gentisic (17; m/z 153.02 [M−H]), vanillic (23; m/z 167.01 [M−H]), ferulic (41; m/z 193.01 [M−H]), and chlorogenic (80; m/z 353.09 [M−H]) acids, phenolic benzaldehydes; p-hydroxy benzaldehyde (13; m/z 121.01 [M−H]) and trimethoxy benzaldehyde (59; m/z 195.09 [M−H]) as well as dihydrocoumaroyl-O-glucoside (81; m/z 327.22 [M−H]) were demonstrated. Chlorogenic acid was isolated before from the investigated species by Kassem et al. (2017), while the other phenolic acids and aldehydes were reported previously in T. partense (Prati et al. 2007).

Likewise, coumarins are also reported to occur in forage plants represented herein as hydroxy methyl coumarin (15; m/z 175.01 [M−H]), dihydroxycoumarin (68; m/z 177.01 [M−H]), trihydroxy coumarin (56; m/z 193.08 [M−H]), and dihydroxycoumarin-O-hexoside (43; m/z 339.02 [M−H]). Compound 68 was identified as esculetin (6,7-dihydroxycoumarin) confirmed through the standard compound, and was characterized before in T. partense (Vlaisavljević et al. 2017). Successively, compound 43 was elucidated as esculetin-O-glucoside via mass difference of 162 Da.

Anti-obesity Potential

Pancreatic Lipase Inhibition

Triacylglycerides are hydrolyzed by the pancreatic lipase enzyme into glycerol and fatty acids, aiding in their absorption in the intestine. Lipase inhibition is an effective strategy for preventing triglyceride absorption and obesity. Orlistat inhibits this enzyme professionally but with various harmful effects on the gastric, renal, endocrine, and nervous systems, impairing the other medications’ efficacy (Ragheb et al. 2021). T. resupinatum extract exerted an in vitro inhibition impact on porcine pancreatic lipase inhibition with an IC50 of 471.32 ± 0.8 µg/ml. Comparing the selected compounds, formononetin (2) (IC50 of 47.2 ± 1.1 µg/ml) demonstrated potential enzyme inhibitory activity than pseudobaptigenin (5) (IC50 of 112.8 ± 1.23 µg/ml) considering the efficiency of orlistat (IC50 of 23.8 ± 0.64 µg/ml), as summarized in Table 1.

Table 1 Pancreatic lipase % inhibition of Trifolium resupinatum leaf extract and the major isolated isoflavone aglycones

Many studies have shown that naturally occurring phenolics demonstrated multifunctional properties that can effectually prevent obesity and related metabolic disorders. They are described to regulate digestion and fatty acid oxidation, as well as carbohydrate and lipid metabolism by modulating the activity of digestive enzymes (Oliveira et al. 2022). Pancreatic lipase inhibitory action of some prospective isoflavones has previously been evaluated. Cardullo et al. (2021) proved that modification of formononetin by hydroxylation or bromination improved the in vitro enzyme inhibitory properties. On the other hand, Deng et al. (2020) suggested the inhibitory effect of ethyl acetate fraction of the fungus Grifola frondose (Dicks.) Gray was correlated to its rich content of pseudobaptigenin.

In Vivo Evaluation

Data in Table 2 compared the final total body weight (g) among the control and other groups. They indicated that rats who received a high-fat diet (HFD) significantly increased their total body weight compared with those who received a normal diet (ND). There was significant weight loss in HFD rats treated with T. resupinatum extract (25 mg/kg body weight suspended in 0.5 ml saline) compared with those that received only HFD.Furthermore, the results in Table 2 demonstrated that glucose, triacylglycerides, and total cholesterol levels significantly increased in the sera of rats that received HFD compared with those in a control group. At the same time, these parameters significantly decreased in the group treated with T. resupinatum extract compared to the untreated one (HFD). This potential could be related to flavonoids, particularly isoflavones (formononetin and pseudobaptigenin) that represent major constituents in T. resupinatum leaf extract (Fig. 2) or related to the synergetic interactions of the natural compounds (Liu et al. 2022).

Table 2 Effect of Trifolium resupinatum leaf extract on body weight, serum glucose, serum triacylglycerides, and total cholesterol (mg/dl) in HFD-treated rats

The anti-obesity effects of flavonoids occur through the modulation of proteins, genes, and transcription factors that contribute to reduced lipogenesis, increased lipolysis, and energy expenditure, in addition to moderating inflammatory responses and suppressing oxidative stress (Oliveira et al. 2022). Due to various flavonoid compounds (64.2% of total identified compounds) and their structural variations (Fig. 2, Table S1), the effects varied considerably. Their anti-obesity properties include repressing adipogenesis and lipogenesis by lowering the inflammatory mediators, deactivating the immune cells, enhancing mitochondrial activity by promoting the antioxidant effects, decreasing lipid peroxidation, modifying fat metabolism, regulating apoptosis, and increasing energy consumption (Deng et al. 2020; Oza and Kulkarni 2020; Liu et al. 2022).

Molecular Docking

Molecular docking was carried out to confirm the pancreatic lipase activity of the significant isolates (2 and 5) and predict the affinity of the additional isolated compounds to this enzyme, to which the synergetic effect of the anti-obesity property is attributed. Binding affinity evaluation is used to determine the strength of biomolecular interactions, which is crucial in determining the possibility of an interaction arising in a cell (Odoemelam et al. 2022). Table S2 summarizes the interactions between the amino acids and the ligands at the porcine pancreatic lipase (PPL) binding sites. Docking analysis applied to the PPL protein revealed that all the isolated constituents have better or analogous binding free energies to the control drug (orlistat). The binding affinity of orlistat on pancreatic lipase enzyme is found to be −8.2 kcal/mol. The strongest binding affinity (−9.9 kcal/mol) is afforded by compounds 5, 7, 11, and 13, followed by compounds 2 and 4 (−9.8 kcal/mol), then compound 8 with a binding affinity −9.7 kcal/mol (Table S2). Our findings matched the docking results of Cardullo et al. (2021), who indicated the higher affinity of isoflavone class inhibitors to PPL active sites underlying their interactions. Also, it proved the better fitting of genistein (7) than formononetin (2) on the receptor due to the additional C-5 hydroxyl group which provides an additional hydrogen bond. Li et al. (2023) assumed the higher inhibitory effect of flavonols (quercetin) and isoflavones compared to flavanones and flavan-3-ols. Their inhibitory activity is enhanced by hydroxylation on rings A and B, whereas it is suppressed by glycosylation. In addition, Ahmed et al. (2022) showed a stronger affinity of chlorogenic acid to PPL active sites compared to orlistat.

Two- and three-dimensional illustrations of the interaction between ligand molecules and PPL protein are provided as supplementary material (Figs. S2 and S3). It was revealed from an analysis of docking interactions that all the isolated compounds bind to the same region as orlistat. Moreover, the observed interactions for the orlistat-PPL complex were close to those described in previous literature (de Lima Barros et al. 2022), which implies a good estimate of the present model.

The docking analysis displayed that His152 was involved in the hydrogen bonding interaction of orlistat, as compounds 1, 4, and 8 with the protein active site (Fig. S2 and S3). Additionally, it was found that both compound 11 and orlistat may stabilize the complex with the pancreatic lipase enzyme through a hydrogen bond with the same residue, Gly77 (Figs. S2 and S3). The similarity of the interacting pattern between these phytoconstituents and orlistat indicates that these isolated compounds probably inhibit pancreatic lipase in same pattern as orlistat. The top-scored inhibitors of phytoconstituents presented several π- interactions with Phe78, Tyr115, and Phe216 residues in a similar manner to the hydrophobic interaction of orlistat. A hydrophobic-interaction–based mechanism mainly triggers docking between compound 5 and PPL protein, where the pseudobaptigenin-PPL protein complex may be stabilized with 7 π- interactions. It was implied from previous literature that interaction with His264 is a key element since this residue constitutes the catalytic triad of the PPL active sites (de Lima Barros et al. 2022). Almost all the highly scored constituents formed a hydrophobic interaction with the key residue His264 (Fig. S2 and S3).

Drug-Likeness and Oral Bioavailability

A detailed analysis of the pharmacokinetic properties of the isolated metabolites was performed. Lipinski specified a rule of five (RO5) to predict compounds that could be good drug candidates or drug-like compounds. The RO5 specified that poor absorption or permeation is more probable when there are more than five H-bond donors (HBD), molecular weight (MW) is over 50, Log P is over 5, and the sum of N’s and O’s (HBA) is over 10 (Abdul-Hammed et al. 2021).

As presented in Table S3, the HBD, MW, HBA, and Log P values of most isolated compounds obey the specifications of the RO5. Compounds 1 and 8 have one acceptable violation, as an orally active drug has no more than one violation of the Lipinski criteria (Ivanović et al. 2020). On the other hand, compounds 12 and 14 showed two violations. The bioavailability score aims to predict the possibility of a compound having at least 10% oral bioavailability in rats. The predicted bioavailability of the isolated compounds (Table S3) revealed a bioavailability score of 0.11, 0.17, or 0.55, which means good pharmacokinetic properties and good drug profile (Swain and Hussain 2022).

Distinctive to SwissADME is the bioavailability radar that provides a graphical preliminary glimpse at the drug-likeness parameters of the studied compounds. Figure S4 shows the drug-likeness graph as a hexagon, with each vertex indicating a characteristic that defines a bioavailable drug. The pink zone within the radar hexagon represents the optimal physicochemical space for each property predicted to be orally bioavailable (lipophilicity, XLOGP3 between −0.7 and + 5.0; size, MW between 150 and 500 g/mol; polarity, TPSA between 20 and 130 Å2; solubility, log S not higher than 6; saturation, fraction of carbons in the sp3 hybridization not less than 0.25; and flexibility, no more than nine rotatable bonds) (Ndombera et al. 2019). Most of the constituents fall within the five parameters of the pink area and are considered drug-like.

According to the Molsoft database tool, the scale of a drug-likeness score of isolated compounds ranges from –0.15 to 0.79 (Fig. 3). The isolated compounds, which represent drug-likeness scores with positive values, are to be considered like drugs. On the contrary, compounds 4, 6, and 9 showed negative values, indicating low probabilities of being drugs (Gad et al. 2020).

Fig. 3
figure 3

Plotting of drug-likeness score of isolated compounds using MolSoft. Green-colored curve is non-drug–like behavior and the blue-colored curve is drug-like behavior. Compounds with zero or negative values seem not to be drug candidates. DS, drug-likeness score; (1) chlorogenic acid, (2) formononetin, (3) ononin, (4) 6''-O-acetyl ononin, (5) pseudobaptigenin, (6) rothindin, (7) genistein, (8) genistin, (9) daidzin, (10) sissotrin, (11) kaempferol, (12) astragalin, (13) quercetin, (14) isoquercetin

Conclusion

The chemical study of T. resupinatum leaf extract detected 81 metabolites, including 14 phenolic isolates, 69 of which were annotated for the first time in Persian clover. Two major isolated aglycones (formononetin and pseudobaptigenin) showed promising in vitro pancreatic lipase inhibitory actions, which might serve as a potential mechanism for anti-obesity. Preliminary in vivo screening on high-fat diet-induced obese rats showed that administration of the extract resulted in their weight loss and possible improvement of lipid serum triacylglycerides, total cholesterol, and glucose levels. The in silico investigation corroborated these findings and agreed to understand how these constituents interact with vital enzymes associated with obesity. It also granted them insights into the drug-likeness and ADME properties.