Introduction

Propolis, the resinous substance collected by honey bees (Apis mellifera) from buds and resins of various plant species, has been used in folk medicine for many years because of its beneficial effects on various symptoms such as wounds, sore throats, and stomach ulcers (Burdock 1998; Huang et al. 2014). Propolis is composed of various inorganic minerals and organic compounds, including vitamins, amino acids, lipids, organic acids, and flavonoids (Huang et al. 2014). Among the chemical components of propolis, phenolic compounds, including flavonoids, are the major constituents and contribute largely to the pharmacological effects of propolis (Banskota et al. 1998).

Propolis is generally classified as poplar- or Baccharis-type according to its botanical origin. The poplar-type propolis, which originates from Populus buds that are primarily found in East Asian countries such as China and Korea, is known to have high phenolic contents because the poplar-type plants have high phenolic contents (Ristivojevic et al. 2015). Baccharis-type propolis originates from Baccharis dracunculifolia, which inhabits South American countries (i.e., Brazil, Bolivia and Argentina) (Park et al. 2004). The Baccharis spp. is a significant source of tropical Brazilian green propolis, and cinnamic acid is one of the most significant constituents in the Baccharis plants. Typically, artepillin C, a cinnamic acid derivative, is the representative phenolic compound present in Brazilian propolis. In addition, low flavonoid and phenolic contents and a high content of volatile compounds are the relevant characteristics of Baccharis-type plants.

Because the compositional diversity of propolis depends on the habitats of the source plants, propolis samples collected from different origins exhibit different characteristics or biological activities such as antitumor, antibacterial, antiviral, antifungal, anti-inflammatory, and antioxidant activities (Burdock 1998). For pharmaceutical and food applications, knowledge of the chemical composition of the propolis raw materials is necessary; however, to date, the quality of propolis has been examined using a subset of the known phenolic constituents. Therefore, more detailed information is required to better understand the complicated substances in propolis and their related functions. Zhou and colleagues reported an HPLC-based determination of the geographical origins of various Chinese propolis (Zhou et al. 2008). Sawaya and coworkers also demonstrated a simple propolis fingerprinting method using easy ambient sonic-spray ionization mass spectrometry (EASI-MS) to characterize their geographical origin (Sawaya et al. 2010). However, these analytical techniques were not suitable for identifying unknown compounds due to its low resolving power and poor mass accuracy. The development of LC/MS-based analytical methods has enabled to more reliably and accurately identify complicated propolis compounds. Pietta et al. showed that the analytical platforms based on atmospheric pressure chemical ionization (APCI) MS and HPLC combined with photodiode array detection can be utilized for reliably identifying a large number of propolis components (Pietta et al. 2002). Gardana and coworkers also applied LC-tandem MS system in order to determine phenolic compounds in different source origins (Gardana et al. 2007).

More recently, high-resolution mass spectrometry has been used to characterize phenolic compounds in propolis with high accuracy. Among various types of high-resolution mass analyzers including Q-Tof and Orbitrap, Fourier transform ion cyclotron resonance (FT-ICR) is mostly considered as a powerful tool to interpret elemental compositions of compounds of interest (Choi et al. 2018). The indisputable resolving power (full width at half maximum, FWHM: > 800,000 at m/z 200) and mass accuracy (< 1 ppm) of FT-ICR enable to identify chemical compositions of extremely complicated mixtures without chromatographic separation. The isotopic fine structure obtained from FT-ICR MS can also be used to determine the elemental formula (Shi et al. 1998; Nakabayashi et al. 2013). da Costa and coworkers showed the direct infusion electrospray ionization (ESI) FT-ICR mass spectrometry-based evaluation of phenolic compounds in plant leaves (da Costa et al. 2016). Gardana and colleagues also used UPLC/MS/MS and high-resolution FT-ICR MS systems to reliably detect propolis allergens in raw propolis materials and commercial formulations (Gardana and Simonetti 2011).

In the present study, ethanol-extracted propolis (EEP) prepared from various propolis samples originating from different countries were analyzed using high-resolution 15 T FT-ICR mass spectrometry coupled with a reverse-phase ultra-high-performance liquid chromatography (RP-UPLC) system to determine the geographical origins of the propolis (Fig. 1). This analysis showed the characteristic features of the different propolis samples and figured out the key molecules discriminating those geographical origins. Furthermore, these results could be further utilized to assess the quality of commercial products.

Fig. 1
figure 1

Schematic workflow of UPLC/FT-ICR MS-based high-resolution analytical platform for determining the geographical origins of raw propolis samples

Materials and methods

Chemicals

Caffeic acid, p-coumaric acid, rutin, quercetin, cinnamic acid, kaempferol, chrysin, pinocembrin, caffeic acid phenethyl ester (CAPE), and artepillin C were purchased from Sigma (St. Louis, MO). All other reagents were of analytical grade.

Preparation of ethanol-extracted propolis

Each raw propolis material (10 g) was mixed with ethanol (30 mL) and then incubated for 48 h at room temperature with vigorous shaking. The resultant extract was obtained by filtration. The extracts were finally prepared as 1% of the total flavonoid content based on the quercetin, which was determined as described in the literature (Chang et al. 2002). The resultant EEP samples were stored at − 20 °C until analysis.

UPLC analysis

Chromatography was performed on an ACQUITY HSS T3 column (1.8 μm, 2.1 × 100 mm, Waters, Milford, MA) using an ACQUITY UPLC™ system (Waters) by injecting 2 μL of each EEP sample. The column was maintained at 40 °C. The gradient condition of UPLC started at 10% (v/v) acetonitrile (ACN)/water with 0.1% formic acid and was maintained for 5 min at a flow rate of 300 μL/min; the ACN content was linearly ramped to 45%, where it was maintained at this flow rate for 10 min, then ramped to 90%, where the flow rate was maintained for 3 min. The column was washed with 98% ACN for 4 min and re-equilibrated with 10% ACN for 4 min for the next run. The effluent was monitored at a wavelength of 280 nm.

UPLC/FT-ICR MS analysis

Mass spectrometric analysis of the extracts was performed using a 15 T FT-ICR mass spectrometer (solariX XRTM system, Bruker Daltonics, Billerica, MA) equipped with an electrospray ionization (ESI) source and an ACQUITY UPLC™ system. The UPLC conditions were the same as those previously described. The eluent was introduced into the mass spectrometer to acquire high-resolution MS spectra in positive ion mode within the mass range from m/z 150 to m/z 1000. The MS parameters of the positive ESI mass spectrometer were an ESI voltage of 4.5 kV, a drying gas flow rate of 8.0 L/min, a drying gas temperature of 220 °C, a skimmer voltage of 15 V, a collision gas energy of − 3.0 V, an accumulation time of 50 ms, a transient length of 1.398 s, an acquisition size of 4 MB, and a scan number of 1, with a sine-bell apodization window function applied in the time-domain signal. External calibration was performed with quadratic regression using a 100 μg/mL arginine solution. Data acquisition was controlled by the ftmsControl 2.1 and HyStar 4.1 software (Bruker Daltonics), and data processing for selection of molecular features was performed using the DataAnalysis 4.4 program (Bruker Daltonics). Bucketing of the molecular features was processed using ProfileAnalysis 2.2 software (Bruker Daltonics).

Multivariate analyses

The multivariate data analyses such as principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) with Pareto scaling were performed using the SIMCA-P+ 12.0 software (Umetrics, Umeå, Sweden) to discriminate the different origins. Molecular features containing the retention time and mass-to-charge ratio were extracted from the UPLC/FT-ICR MS raw spectra of four propolis samples and one mixture sample to determine key compounds among different propolis samples. The datasets including the UPLC/FT-ICR MS-based molecular features were used for the multivariate analysis. After the PLS-DA analysis, the interesting variables were selected using the variable importance in projection (VIP) scores over 4 for further identification of key propolis components reflecting the origin. The molecular formulae and corresponding compounds of the selected variables were identified via METLIN (Smith et al. 2005), HMDB (Wishart et al. 2007), and other publicly available database searches.

Results and discussion

Quantitation of ten phenolic components in various EEP samples by UPLC analysis

To investigate the chemical composition of propolis originating from different countries, we prepared EEP using raw materials from four different countries (i.e., Argentina, Brazil, China, and Korea). Four EEP samples were analyzed using UPLC with a UV detector (280 nm), and typical chromatograms were obtained. In general, a diode array detector (DAD) has been used to measure a variety of flavonoids or phenolic compounds at a range of wavelengths (i.e., 200–500 nm) because those molecules have typical UV absorption maximum (Harnly et al. 2007; Pellati et al. 2011; Zhang et al. 2013). Most flavonoids or phenols showed an absorption maximum at around 240–290 or 300–350 nm. However, those phenolic compounds were mostly been detected at a UV wavelength of 280 nm (Seal 2016; Yang et al. 2016), so the UPLC-based detection of the phenolic components in EEP samples was done at a wavelength of 280 nm in this study. As shown in Fig. 2, the chromatographic profiles of EEP samples from Asian countries (China and Korea) were similar (Fig. 2d, e), whereas the UPLC chromatograms of South American propolis (Argentina and Brazil) were apparently distinct from each other (Fig. 2b, c). The UPLC profile of Argentinean propolis appeared more similar to that of Korean propolis.

Fig. 2
figure 2

UPLC chromatograms of EEP samples from South America (b Argentina, c Brazil) and Asia (d China, e Korea). a The chromatogram obtained from polyphenol standards. The numbers and the corresponding dotted lines indicate each phenolic standard. 1 = cinnamic acid; 2 = coumaric acid; 3 = caffeic acid; 4 = chrysin; 5 = pinocembrin; 6 = CAPE; 7 = kaempferol; 8 = artepillin C; 9 = quercetin; 10 = rutin. f Bar graph showing the concentrations of ten phenolic compounds in various EEP samples of different origins

We selected ten phenolic compounds (i.e., coumaric acid, quercetin, cinnamic acid, kaempferol, chrysin, pinocembrin, CAPE, artepillin C, caffeic acid, and rutin), which are the most well-known constituents of propolis, and quantitated the compounds in the EEP samples using standards of these phenolic compounds. Each concentration of the key components in the EEP samples is shown in Fig. 2f. The major compound in Brazilian propolis was artepillin C, as previously determined, and the concentration of artepillin C in Brazilian propolis was sixfold higher than that in Chinese propolis; no artepillin C was observed in the Argentinean or Korean propolis. In addition to the comparison of the chromatographic profiles of phenolic compounds monitored at the wavelength of 280 nm, we also compared the EEP profiles using UPLC combined with high-resolution FT-ICR MS analysis.

In-depth analysis of EEP samples using high-resolution UPLC/FT-ICR MS

The investigation of key components contributing to the determination of the geographical origins of propolis is necessary for assessing the quality of propolis and for better use of propolis depending on the biological activities of the key components. Total UPLC/FT-ICR MS ion chromatograms were obtained for four EEP samples (Fig. 3a–d). To confirm the analysis of the EEP samples, the identification and intensity-based quantification were carried out from the raw data. The quantitative analysis results for the ten key phenolic species were compared with those obtained by UPLC analysis. The quantitative results via UPLC/FT-ICR MS agreed well with the UPLC data, supporting the reliability of the FT-ICR MS data. The comparative results are summarized in Table 1.

Fig. 3
figure 3

Total ion chromatograms of EEP samples from a Argentina, b Brazil, c China, and d Korea, as obtained by UPLC/FT-ICR MS. PLS-DA models obtained using UPLC and UPLC/FT-ICR MS data. Quantitation data of ten phenolic compounds using UPLC was used for model (e), whereas selected key molecular features (VIP > 4, and fold change ≥ 2 or ≤ − 2 or observed only in each sample) obtained by FT-ICR MS analysis were used to generate model (f)

Table 1 Comparisons of ten phenolic compounds identified from EEP samples using UPLC and UPLC/FT-ICR MS

To elucidate the key compounds determining the geographical origins of propolis, the molecular features obtained via UPLC/FT-ICR MS analysis were investigated. Approximately 8000 molecular features, including the retention time and m/z values, were extracted from the LC/FT-ICR MS raw data for the four EEP samples. PLS-DA plots of the four propolis samples showed that the UPLC/FT-ICR data well separate the propolis samples by their geographic origin, while the Chinese and Korean propolis were not well resolved by the UPLC-based PLS-DA model, indicating that the selected quantitative information for the ten phenolic compounds in EEP samples was not sufficient to differentiate the samples’ origins (Fig. 3e, f). Given these observations, we speculated that the analysis of a few selected constituents in propolis might be insufficient for the quality control or quality assurance of the propolis products and the accurate determination of their source origins. Interestingly, Argentinean propolis differed from the Brazilian propolis, although the two countries are both located in Latin America, whereas China and Korea are in Asia; the Argentinean propolis was not included for further identification of key compounds.

Investigation of key components discriminating Korean propolis from Brazilian and Chinese propolis

As previously mentioned, the chemical composition of South American propolis differs substantially from that of East Asian propolis because of their botanical origins (i.e., poplar- and Baccharis-type plants). As shown in Fig. 3f, we observed good discrimination of Brazilian and Korean propolis; we therefore decided to determine the key propolis components discriminating Korean and Brazilian propolis. In addition, the molecular features obtained from Korean and Chinese propolis samples were investigated to figure out the key molecules that differentiate these two Asian propolis varieties. To determine the key components distinguishing the different propolis samples among 8164 molecular features obtained from UPLC/FT-ICR MS analysis, approximately 100 features with VIP values greater than 4.0 and fold changes above or below 2 or fold changes observed only in each sample were selected for further investigation.

Finally, we obtained 35 and 43 key components discriminating Korean and Brazilian propolis and Korean and Chinese propolis, respectively. All the significant compounds contributing to the geographical determination of Korean and Brazilian propolis are summarized in Table 2, while the key compounds discriminating Korean and Chinese propolis are listed in Table 3. Identification of the chemical formulae was possible from the accurate molecular masses and their experimental isotopic fine structures (IFSs) of all metabolites. Prediction of the compound was made by searching METLIN and HMDB databases. The extracted ion chromatograms, the corresponding mass spectra, and IFSs of the proposed compounds were summarized in the Additional file 1. Then, the proposed key compounds could be divided into five subgroups based on their chemical class (i.e., flavonoids, phenols, terpenoids, fatty acids, and others). The pie charts of the chemical classes of the key compounds discriminating Korean and Brazilian propolis showed that the flavonoids were the most abundant (61%), followed by phenols (22%), terpenoids (6%), others (6%), and fatty acids (5%) in Korean propolis; meanwhile, the most frequently assigned class was flavonoids (29%) and others (24%), followed by phenols (23%), terpenoids (18%), and fatty acids (6%) in Brazilian propolis (Fig. 4a). Key propolis classes for differentiating Korean and Chinese propolis were divided by flavonoids (52%), terpenoids (24%), phenols (16%), and others (8%) in Korean propolis, whereas Brazil-specific compounds were classified by fatty acids (56%), others (22%), flavonoids (11%), and terpenoids (11%) (Fig. 4b).

Table 2 Key compounds discriminating Korean and Brazilian propolis. The key compounds were selected from key molecular features with VIP > 4, and a fold change ≥ 2 or ≤ − 2 or observed only in each sample
Table 3 Key compounds discriminating Korean and Chinese propolis. The key compounds were selected from key molecular features with VIP > 4, and a fold change ≥ 2 or ≤ − 2 or observed only in each sample
Fig. 4
figure 4

Pie charts showing compositional differences of key components discriminating a Korean and Brazilian propolis and b Korean and Chinese propolis

In particular, flavonoid species seemed a most significant contributor differentiating Korean propolis from Brazilian and Chinese propolis. In Brazilian propolis, di- and tri-terpenoids including Gibberrellins, a kind of plant hormones that affect plant growth and developments (Hedden and Sponsel 2015), were significantly observed. Those key components would strongly reflect their botanical differences. Between two Asian propolis origins (Korea versus China), flavonoids and terpenoids species were reliably identified as Korea-specific key molecules, but Chinese propolis appears to possess fatty acid class compounds. The unsaturated fatty acids including ricinoleic acid and eicosanedioic acid, distinctly found in the Chinese propolis could also be utilized as dietary sources as the unsaturated fatty acids present in propolis are considered to be a good source to the diet (Rebiai et al. 2017).

Conclusions

Here, a high-resolution 15 T FT-ICR MS equipped with UPLC system was introduced to investigate the key phenolic compounds responsible for determining the geographical origin of propolis (i.e., Korea versus Brazil and Korea versus China). We then proposed 16 flavonoids, 8 phenolic compounds, 4 terpenoids, 2 fatty acids, and 5 others in Korean and Brazilian propolis, while 15 flavonoids, 10 fatty acids, 8 terpenoids, 4 phenols, and 6 others were proposed in Korean and Chinese propolis. These key compounds can be used as chemical markers to classify and identify the geographical origins of propolis. In the pharmaceutical and food industries, those key propolis components could play a significant role in distinguishing high-quality propolis from inferior or fake propolis. Moreover, the information of the key propolis constituents can be utilized to verify the effects of propolis in the prevention and treatment of various symptoms and diseases. Further characterization and biological evidence of the key compounds could focus on the evaluation of the compounds for quality assessment of propolis products and for standardization of propolis.

The chemical composition of propolis is strongly linked to vegetation present in the collection area as well as collecting periods and climates (Bankova et al. 2000). However, changes in vegetation on the Chinese continent are as great as changes from southern Argentina to northern Brazil. Yang and coworkers showed that the compositions and concentrations of aroma-active components collected from 23 regions of Chinese continent were significantly different (Yang et al. 2010). Therefore, a more detailed investigation of plant species close to the production area of propolis even in the same country is needed to determine the source origin more accurately. Although the results were obtained with only four different propolis samples from each origin and the key propolis compounds were not confirmed by tandem MS analysis, this UPLC/FT-ICR MS-based high-resolution platform showed the potentials for the comprehensive analysis of highly complicated bioactive compounds.

Furthermore, it could also be used to investigate novel propolis compounds with biological activities and are helpful for the pharmaceutical and food industries, which require an understanding of the chemical composition, botanical origin, and biological properties of propolis.