Introduction

The utilization of recycled biomasses to generate bio-products is gaining momentum as it contributes to the advancement of the circular economy in numerous industries, including pharmaceuticals and agriculture (Barracosa et al. 2019; Duque-Acevedo et al. 2020). In particular, composting stands out as a notable biotechnology that uses a controlled aerobic microbial process to efficiently convert organic waste into sanitized and biochemically stable material (Ballardo et al. 2020; Monda et al. 2018). Significant quantities of residues and by-products are generated by horticultural crops and agro-industrial productions, which present valuable sources of organic matter. These resources can be effectively recycled through composting, resulting in the production of highly reactive humified material with versatile applications (Morales-Corts et al. 2018; Moretti et al. 2015; Verrillo et al. 2021a,b, 2022).

Apart from their conventional use as soil conditioners and fertilizers, composts have been found to contain valuable humic substances that can be extracted and used for remediating soils contaminated with organic pollutants (Conte et al. 2005; García-Díaz et al. 2015; Piccolo et al. 2021). Humic substances are organic colloidal components resulting from the physical, chemical, and microbiological transformation of plant and animal residues (Lichtfouse et al. 1998; Piccolo 2016; Dou et al. 2020). They are supramolecular associations of a wide diversity of heterogeneous molecules of relatively small mass held together by weak and dispersive forces (Piccolo 2002; Piccolo et al. 2019b). The intricate heterogeneity of humic substances grants them robust surfactant properties, which can be attributed to their composition of both hydrophobic and hydrophilic components, including fatty and phenolic acids. (Nebbioso and Piccolo 2011; Drosos et al. 2017; Vinci et al. 2022). With their abundant redox-active groups (i.e., carboxyl and hydroxyl functional groups), humic substances facilitate the formation of stable complexes with certain organic pollutants, metals, and radionuclides (Halim et al. 2003; Wang et al. 2023) and may serve as electron mediator during microbial respiration, promoting the biodegradation of organic pollutants and bio-reduction of heavy metals. As a result, humic substances also have the capability to effectively remove heavy metals from contaminated soils (Piccolo et al. 2019a, 2021; Bi et al. 2019). Indeed, when used alone or in combination with biomimetic catalysts, humic substances have demonstrated the ability to act as soil washing agents and facilitate the subsurface remediation of heavy metals and organic pollutants (Mao et al. 2015; Piccolo et al. 2019a; Tsang and Yip 2014; Yang and Hodson 2019).

A comprehensive understanding of the molecular characteristics of humic substances is necessary to fully exploit their chelating properties as biosurfactants. The lack of knowledge in this area can be attributed, in part, to the diverse molecular composition of humic substances obtained from various biomasses utilized in composting, as well as their varying levels of maturity. These factors play a substantial role in determining the effectiveness of humic substances in soil remediation procedures (Cozzolino et al. 2016; Pane et al. 2013; Piccolo et al. 2019a). A detailed characterization of humic substances samples extracted from various composted substrates is thus essential to elucidate the relationship between chemical composition and soil remediation activity. This characterization may help determine how different sources of compost affect the efficiency of humic substances in metal and radionuclides complexation and the removal of organic pollutants during soil washing (Piccolo et al. 2019a).

This study aims to probe the molecular composition of humic substances extracted from green composts made from residues of various agricultural biomasses (Fig. 1). Here, we used untargeted Orbitrap ultra-high-resolution mass spectrometry (UHRMS) analysis to measure with very high mass accuracy the mass-to-charge ratio of individual compounds forming the architecture of humic supra-structures, using both positive atmospheric pressure photoionization (APPI) and negative electrospray ionization (ESI) (Stenson et al. 2002; Marshall and Rodgers 2008; Nebbioso and Piccolo 2011; Maria et al. 2019; Solihat et al. 2019; Hawkes et al. 2020; Vinci et al. 2022). Additionally, we integrate UHRMS data with solid state 13C cross-polarization magic-angle-spinning nuclear magnetic resonance (13C-CPMAS-NMR) to provide a reliable description of the overall composition of carbon compounds in the complex humic matrices extracted from compost (Verrillo et al. 2021a,b, 2022). By combining these advanced analytical techniques, we aim to overcome the inherent limitations associated with individual approaches and gain a more accurate understanding of the molecular composition of humic substances.

Fig. 1
figure 1

Schematic representation of the workflow applied in this study. Organic composts were made with different mixes of organic waste. The humic substances extracted from the composts were analyzed by HRMS: high resolution mass spectrometry (Orbitrap Fusion Lumos Tribrid) and solid state 13C NMR: nuclear magnetic resonance spectroscopy (Bruker AV-300, equipped with a 4 mm MAS probe.)

Materials and methods

Green composts and humic substances extraction

Green composts were produced at the composting center of the Experimental Farm of the University of Naples Federico II at Castel Volturno, Italy (Verrillo et al. 2021a,b). In this work, seven different composts were prepared by mixing residues of different agricultural biomasses (viz. lemon, bell pepper, fennels, artichokes, coffee ground, coffee husks, and nursery residues, Figure. S1A). Extraction of humic substances has been carried out according to the procedure reported by Verrillo et al. 2022 (Figure. S1B). The humic substances solutions were freeze-dried before analysis.

13C Cross-polarization magic-angle-spinning NMR spectroscopy

Solid-state 13C Cross-Polarization Magic-Angle-Spinning (CPMAS) NMR Spectroscopy spectra of humic substances were performed on a Bruker AV-300, equipped with a 4 mm MAS probe. The acquisition parameters were set as following: 10,000 Hz rotor spin rate; 2 s recycle time; 1H-power for CP 92, 16 W; 1H 90º pulse 3 µs; 13C power for CP 92, 75W; 1 ms of contact time; 40 ms of acquisition time; 4000 scans. Samples were packed in 4 mm zirconium rotors with Kel-F caps. To account for the molecular overlapping of solid state 13C NMR spectra, the signal resonances are conventionally grouped into six main spectral regions representative of the different types of carbon functional groups.

Ultrahigh-resolution orbitrap fusion lumos tribrid mass spectrometer

ESI (-) and APPI ( +) ionization modes were used to ionize a wider range of compounds in humic substances samples. The freeze-dried humic substances were dissolved and sonicated (~ 1 h) in a mixture of methanol and water 50:50 (v:v) and added with an aliquot of 20 µL of ammonia to reach pH 10 for negative electrospray ionization (-) measurements. In the case of APPI ( +) analyses, the extracted humic substances were instead dissolved and sonicated (~ 1 h) in a mixture of methanol:water:acetone 5:5:1 (v:v:v). Both ion sources (ESI and APPI) were interfaced to an Orbitrap Fusion Lumos Tribrid 1 M mass spectrometer (Thermo Scientific, San Jose, CA, USA). Detailed Orbitrap Fusion Lumos Tribrid1M mass spectrometer settings are reported in supplementary information (SI1). All peaks detected with a signal to noise ratio greater than 3 were processed using the Composer Software (Sierra Analytics, USA) and the relative detailed settings are discussed in supplementary information (SI2).

Results and discussions

Solid-state 13C CPMAS-NMR characterization of humic substances from different green composts

Our results revealed that humic substances isolated from the seven distinct compost samples exhibited a predominant presence of O-alkyl carbons (60–110 ppm). These O-alkylated carbons are associated with mono-, oligo-, and poly- saccharides (Table 1, Figure S2). Furthermore, within the humic substances analyzed, the alkyl-C chemical range constituted a maximum of 24.8% of the total area in the fennel-derived samples. This was followed by nursery residues, coffee husk, and artichoke, which accounted for approximately 24.1%, 23.6%, and 15.4% of the total C signals, respectively (Table 1). In general, a contrasting trend was observed for the distribution of total aromatic components, albeit with less pronounced variations (Table 1). Humic substances derived from artichoke, lemon, and nursery residues exhibited aryl and O-aryl-C components that accounted for 24.9, 24.2, and 23.4% of the total signal area, respectively (Table 1). In contrast, a relatively lower content of 20.1% was observed in humic substances derived from fennel. (Table 1).

Table 1 Relative distribution (%) of signal areas in chemical shifts (ppm) intervals and structural indexes (SI3) of the different humic substances extracted from composts

The structural parameters calculated based on the relative abundance of various carbon groups (Table 1) can provide insights into the heterogeneous molecular characteristics of humic substances. These parameters can be utilized to establish correlations between the structural and conformational attributes of humic material and their bioactive properties, environmental reactivities, and metal binding capacity (Vaccaro et al. 2015; Savarese et al. 2022). We observed that the hydrophobic index values, which are close to unity for all the humic substance samples (Table 1) indicate a comparable contribution from polar/hydrophilic and apolar hydrophobic components. This finding supports the supramolecular pseudo-micellar conformation of humic materials in aqueous solutions, enabling them to effectively interact with both organic and inorganic contaminants (Mazzei and Piccolo 2015; Piccolo et al. 2021). Nonetheless, the O-alkyl ratio provides insights into the diversity of alkyl compounds with polar and apolar characteristics found in the humic extracts obtained from various green composts (Monda et al. 2018). The absence of NMR signals (Figure S2) around 84/89 ppm, which are typically associated with C4 carbons participating in glycosidic bonds, indicates a selective extraction of smaller carbohydrate components into the humic extracts from the green composts. Hence, our results suggest that the extracted humic substances primarily contain decomposed and fragmented carbohydrate compounds, rather than intact and rigid cellulose chains. The comparable aromatic index values, along with the overall decreasing trend in the Lignin ratio (Table 1), suggest the presence of polyphenolic and aromatic compounds derived from degraded lignin in the extracts. This finding aligns with previous studies by Savy et al. (2017).

Molecular chemodiversity of humic substances extracted from different composts

To our knowledge, this study represents the first instance of utilizing a combined approach involving negative electrospray ionization (-) and atmospheric pressure photoionization (APPI) ( +) Orbitrap UHRMS analysis for the examination of humic substances extracted from composts. By employing a combination of atmospheric pressure photoionization (APPI) in the positive ionization mode and negative electrospray ionization (−), we were able to amplify both the types and quantities of ions generated and detected. Our approach enabled a more comprehensive exploration of the compositional landscape within humic substances derived from composts, thereby facilitating a more detailed analysis. Hence, the simultaneous utilization of APPI ( +) and ESI (−) demonstrated significant disparities in the distribution and intensity of ions resolved within the mass range of 100 m/z to 800 m/z for ESI and 100 m/z to 1200 m/z for APPI. For example, Fig. 3A exhibits the full scan mass spectra obtained through negative electrospray ionization (ESI(-)) and atmospheric pressure photoionization (APPI ( +)) for the humic substances extracted from artichoke compost. In the ESI (−) spectrum, the highest intensity peaks were concentrated within the m/z 200–300 range. In contrast, the APPI ( +) analysis demonstrated a bimodal distribution, with the first local maximum centered around m/z 200 and a second maximum centered around m/z 350. Following data filtering (SI2), a substantial number of molecular formulae (N = 6356 individual formulae) were successfully assigned by integrating the datasets obtained from both negative electrospray ionization and positive atmospheric pressure photoionization. In total, there were 7413 distinct molecular formulae, with the highest molecular diversity observed in humic substances extracted from bell pepper composts. Conversely, we found that the lowest diversity was observed in humic substances derived from fennel composts, consisting of 5102 molecular formulae (Table S1).

Figure 2B displays the doughnut plots illustrating the relative abundance of different molecular series (compounds assigned as CcHhOoNnPpSs) detected in each ionization mode. In the ESI (−) spectrum of humic substances from the artichoke compost, approximately 44% of the identified molecular anions were CHO compounds, 43% were CHNO, 6% were CHNOS, and a small proportion consisted of phosphorous compounds (CHNOP and other P-containing compounds). Conversely, in the APPI ( +) analysis, 54% of the detected cations/radical species were CHNO compounds, followed by CHO (26%), CHN (8%), CH (3%), and CHOS (2.5%) in the humic substances from the artichoke compost. No phosphorous-containing compounds were detected (dataset provided in Table S2). All analyzed humic substances samples displayed a relatively high abundance of nitrogen-containing compounds (CHNO) and CHO compounds with oxygen content ranging from O1 to O8-O10 in nearly every sample (Fig. 2C). The prevalence of nitrogen-containing CHNO compounds over CHO compounds suggests the incorporation of nitrogen atoms into the macrostructure of humic substances, potentially facilitated by the inclusion of amino acids and protein residues within the supramolecular framework of humic matter (Piccolo et al. 2019b).

Fig. 2
figure 2

a Negative electrospray ionization [ESI (-) (red)] and atmospheric pressure photoionization [APPI ( +) (blue)] Fourier Transform—Orbitrap Fusion Lumos Tribrid 1 M full scan mass spectra of humic substances extracted from the artichoke compost; b doughnut plots featuring the relative abundance of each (color coded) molecular series detected in each ionization mode; c number of molecular formulae detected as a function of the number of oxygen atoms in both ESI (−) (top) and APPI ( +) (bottom) for each CHO, CHON, CHOS and CHNOS molecular series and d) summary of the relative abundances of compounds assigned to molecular families, combining both negative electrospray ESI (−) and positive photoionization APPI ( +). Molecular families containing C: carbon, H: hydrogen, O: oxygen, N: nitrogen, S: sulfur, P: phosphorus, Other: other compositions, CRAM: carboxyl-rich alicyclic molecules

The Van Krevelen diagram (Fig. 3) was employed to further analyze the detected compounds and their distribution among known biomolecular families, including lipid-like, protein-like, amino sugar-like, carbohydrate-like, condensed hydrocarbons, lignin-like, tannin-like, and aromatic structures (Gougeon et al. 2009; Roullier-Gall et al. 2014; Spano et al. 2021). In our APPI( +) analysis, all humic substances exhibited lower O/C and H/C ratios compared to the formulae analyzed using ESI(-) (Table S3). Additionally, higher aromaticity index and double bond equivalence (DBE) values were observed, consistent with previous research on humic substances extracted from soils (Vinci et al. 2022). Aromatic hydrocarbons generally have low H/C and O/C ratios, whereas aliphatic compounds exhibit high H/C and low O/C ratios. Hence, APPI ionization was more inclined to ionize less polar compounds with lower O/C ratios, increased aromaticity, higher N/C values, and a selective ionization of N-containing compounds. In contrast, negative electrospray ionization predominantly targeted more polar compounds with higher O/C and H/C ratios (Fig. 3).

Fig. 3
figure 3

Van Krevelen plot representing humic substances extracted from artichoke compost in both ionization techniques, negative electrospray ionization [ESI (−) left] and atmospheric pressure photoionization [APPI ( +) (right)]. Data color coded according to the molecular species CHO, CHON, CHOS and Others. Molecular families containing C: carbon, H: hydrogen, O: oxygen, N: nitrogen, S: sulfur, Other: other compositions. Dotted lines represent compound categorization based on aromaticity index (AI) and elemental H/C ratio. Zone α, condensed polycyclic aromatics (AI > 0.66): zone β, polyphenols (0.50 < AI < 0.66): zone χ, highly unsaturated and phenolic compounds (AI ≤ 0.50 and H/C < 1.5): and zone δ, aliphatic compounds (1.5 ≤ H/C ≤ 2.0)

The Van Krevelen diagrams (Fig. 3, Figure S3 and Table S1) demonstrate that the predominant components in the analyzed samples are lignin/CRAM molecules, accounting for 50% of the composition, followed by lipids and unsaturated hydrocarbons, each comprising 20% of the samples. The elemental ratios presented in Table S3 indicate that humic substances primarily consist of alicyclic and aromatic compounds characterized by high H/C ratios, low O/C ratios, low double bond equivalence (DBE), and AI indexes. These findings suggest the presence of carboxyl-rich alicyclic molecules (CRAM) within the humic substances, with carboxyl-C/aliphatic-C ratios ranging from 1:2 to 1:7. The presence of CRAM has been previously reported in dissolved organic matter (DOM) using NMR and ultra-high-resolution mass spectrometry (Hertkorn et al. 2013). CRAM compounds comprise highly carboxylated alicyclic structures that are associated with fused non-benzene rings, originating from both biotic and abiotic sources (Nebbioso et al. 2014; DiDonato and Hatcher 2017). Additionally, we observed abundant lipid compounds such as fatty acids, hydroxyacids, and alcohols, which result from the microbial degradation of biomasses during composting (Spaccini and Piccolo 2007). Furthermore, the elevated N/C ratio observed in coffee husk, coffee ground, and nursery residues indicates a higher concentration of nitrogen-containing compounds in the corresponding humic extracts.

Discussion on the molecular composition in relation to soil washing for remediation purposes

The molecular characterization of humic substances extracted from green compost for future soil washing experiments has not been previously explored. Previous studies have focused on examining humic substances derived from other sources, such as manure compost, solid waste, and sewage sludge, in order to assess their molecular composition (Kulikowska et al. 2015a, 2015b; Mao et al. 2015; Piccolo et al. 2019a; Tsang and Yip 2014; Yang and Hodson 2019). The hydrophobic and hydrophilic constituents of humic substances exhibit self-assembly properties, forming pseudo-micelles, which facilitate the redistribution of organo-metallic compounds. Additionally, the presence of carboxylic functional groups enhances the complexation and effective mobilization of organo-metals from soil environments (Ardakani and Stevenson 1972). The complexation abilities of humic molecules are influenced by their molecular composition, which is still not fully understood and can vary based on the source and type of compost (Monda et al. 2018). Hence, the effectiveness of humic substances in pollutant removal through soil washing may be influenced by the specific compost source employed. In this study, our objective was to examine the molecular composition of humic substances extracted from various types of green composts and compare them to those extracted from manure compost, which has previously shown successful soil washing results in contaminated sites (Piccolo et al. 2019a). Furthermore, the use of green compost offers several sustainability advantages over the application of humic substances derived from manure compost, solid waste, and sewage sludge. Green compost, comprising plant materials, significantly reduces environmental impact by minimizing methane and other potential greenhouse gas emissions. Additionally, it eliminates concerns associated with animal waste and its potential contribution to water pollution. The use of renewable and locally available resources in green compost production avoids the need for long-distance transportation. Moreover, carbon sequestration occurs through the capture and storage of atmospheric carbon dioxide into the soil, thereby assisting in mitigating climate change.

To accomplish this objective, we conducted comprehensive characterization using untargeted ultra-high-resolution mass spectrometry (UHRMS) in both atmospheric pressure photoionization (APPI) and negative electrospray ionization (ESI) modes, alongside 13C cross-polarization magic angle spinning nuclear magnetic resonance (13C-CPMAS-NMR) spectroscopy. Our findings revealed that humic substances extracted from green composts exhibited a notably high abundance of carboxyl-rich alicyclic molecules (CRAM). These CRAM compounds are recognized for their strong metal complexation properties, suggesting their potential effectiveness in soil washing for pollutant removal (Piccolo et al. 2019a). Through a comparison of the molecular composition of humic substances from green composts and manure composts (Piccolo et al. 2019a), we observed similarities in the content of carboxyl and aromatic carbon between the composts analyzed in this study.

Consequently, it can be anticipated that humic substances derived from green composts with higher levels of carboxyl and phenolic functional groups would exhibit superior abilities to complex heavy metals and aid in the separation of organo-metal complexes from contaminated soils. Our analysis further demonstrated the presence of carbohydrate components, particularly O-alkyl carbons associated with saccharides, in the humic substances from green composts. The relative proportions of alkyl and aromatic carbon components varied across different composts, indicating differences in the composition of humic substances. The coexistence of hydrophilic and hydrophobic constituents contributed to the formation of supramolecular pseudo-micellar structures within the humic substances.

Overall, our study provided valuable insights into the molecular composition of humic substances derived from diverse compost sources and their potential applicability in soil remediation. These findings underscore the significance of comprehending the compost materials and their influence on the efficacy of humic substances in pollutant removal from soils. Future research endeavors will focus on applying humic substances sourced from various green composts in soil washing experiments to determine the most effective remediation approach based on compost composition.

Conclusions

By employing advanced techniques such as ultra-high-resolution mass spectrometry (UHRMS) and nuclear magnetic resonance (NMR), this study unveiled the diverse molecular composition of humic substances extracted from different green composts derived from various agricultural biomasses. Our findings highlighted the presence of similar molecular components in these humic extracts, with a substantial proportion consisting of carboxyl-rich alicyclic molecules (CRAM). Alongside CRAM, fatty acids, and phenolic acids likely contribute to the stabilization of the supramolecular pseudo-micellar structure of humic substances, which, when in aqueous solutions, could potentially exhibit effective soil washing capabilities for the removal of both organic and inorganic contaminants. This research represents the first comprehensive characterization, to the best of our knowledge, of the molecular composition of humic substances derived from diverse green composts, employing high-resolution mass spectrometry with a mass error of less than 1 ppm, enabling the identification of 5000–7000 mass components. The significant presence of carboxyl-rich aliphatic and aromatic compounds, combined with their notable hydrophobicity, suggests their potential effectiveness as biosurfactants for soil washing applications. This detailed understanding of humus extracted from green composts represents a significant advancement in soil washing technologies, emphasizing the utilization of natural and sustainable recycled biomasses rather than non-renewable sources like peat, lignite, organic solvents, or synthetic surfactants, which can have detrimental effects on soil biological activity and biodiversity. In summary, green composting aligns with sustainable principles by harnessing renewable resources, reducing waste, promoting nutrient recycling, and supporting ecological balance in agriculture. The elucidation of the molecular composition of humic substances from green composts enhances our knowledge and paves the way for the development of soil washing techniques based on environmentally friendly and sustainable approaches.