Introduction

Proteins are complex molecules that can contain more than 50 amino acids. They can be roughly divided into insoluble fibroid proteins and soluble globular proteins. The fibroid proteins, e.g. in manure fibres, are more resistant to hydrolysis due to the lower solubility than globular proteins (Sanders 2001). Proteins are abundantly found in food processing industries, for example, dairy and meat, and therefore end up in waste and wastewaters of dairy, slaughterhouse, and fishery industries, as well as in food residuals (Braguglia et al. 2018; Wang et al. 2005). Being an important component of cells, proteins are also found in biomass, such as algae and spent biomass of biological treatment processes (Ganesh Saratale et al. 2018; Magdalena et al. 2018; Mata-Alvarez et al. 2014).

These protein-rich waste streams are usually high in organic load and represent a major environmental problem, especially regarding the growth of the global population and increasing demand for food protein (Bustillo-Lecompte and Mehrvar 2015; Hassan and Nelson 2012). Direct discharge of protein-rich waste streams into surface water can lead to severe damage to the water quality, i.e., causing eutrophication and aquatic life extinction (Baker et al. 2021). The stricter discharge standards and the decreasing availability of freshwater contribute to the requirement for both wastewater treatment and resources recycling (Bustillo-Lecompte and Mehrvar 2015). Due to the high organic load, the protein-rich waste streams are preferably treated by anaerobic digestion (AD), which can purify the wastewater by converting the organic matters to a gaseous energy carrier, i.e., biogas (Adhikari et al. 2018).

Anaerobic protein degradation generally consists of four steps, which are hydrolysis of protein to amino acids, deamination and acidogenesis of amino acids to ammonium and VFAs, acetogenesis of VFAs to acetic acid, and methanogenesis (Hassan and Nelson 2012). Previous studies found that protein hydrolysis is slower than that of carbohydrates, and its efficiency decreases with the increase in the availability of carbohydrates (Wang et al. 2014; Yu and Fang 2001). The mechanism of this impact is unclear. Although the Stickland reaction pathways and involved microorganisms were reported in the 1960s (Barker 1961), the conclusions were based on studies on culturable bacteria that account for a small fraction of the total diversity (Stewart 2012). Generally, it is believed that in an AD reactor, there is a specific group that converts proteins (Barker 1961), and this group is reported to have a slower growth rate than the carbohydrates fermenting bacteria, i.e., protein-fermenting microbiota μmax = 0.08—0.15 d−1, carbohydrates-fermenting microbiota μmax = 0.3—1.25 h−1 (Pavlostathis and Giraldo-Gomez 1991; Tang et al. 2005). There is a lack of information to identify and characterize the protein-fermenting microorganisms (hereafter referred to as protein degraders) and to link the microbial process performance with the microbial community during anaerobic protein conversion.

With the development of genetic technologies, it is possible to analyse the entire microbial community and identify the dominant microorganisms. In our present study, we aimed to identify the dominant microorganisms under three circumstances, i.e., protein as the sole substrate, carbohydrate as the sole substrate, and a mixture of protein and carbohydrate as the substrate. In the experiments, we used casein and lactose as substrates representing proteins and carbohydrates, respectively. By characterizing the dominating genera and their ability to ferment proteins/carbohydrates, we intended to reveal the mechanism of the carbohydrates’ impact on the anaerobic protein degradation in a complex microbiota. These results will lead to a better understanding of anaerobic protein degradation at a microbial level, and would support the future design and operations of anaerobic digestion reactors for the treatment of protein-rich waste streams.

Materials and methods

Materials

Complex microbiota from an anaerobic membrane bioreactor (AnMBR) fed with protein- and lipids-rich dairy wastewater was used as inoculum, the inoculum was taken at Biothane—Veolia Water Technologies Techno Center Netherlands B.V Research Facilities, in Delft, The Netherlands. The volatile suspended solids (VSS) content of the inoculum was 14.5 g VSS·kg−1 wet weight, and 3 g VSS was added to each reactor (working volume of 2 L) to achieve an initial microbial concentration of 1.5 g VSS·L−1.

Three substrates were used in the experiment: A) casein (Fisher Scientific), with casein as the sole carbon source, B) lactose (Sigma Aldrich), with lactose as the sole carbon source, and C) mixture of casein and lactose as carbon sources, further referred to as MIX, with a ratio of 50%: 50% in terms of chemical oxygen demand (COD). The addition of nutrients was based on the anaerobic fermentation process growth media protocol of Hendriks et al. (2018). The substrates contained 6000 ± 100 mg COD·L−1 and 800 ± 50 mg N·L−1.

Operation of acidification reactors

The reactors were mounted in parallel as shown in Fig. 1, each reactor set-up had a feed tank, a 3.5 L double layer (water jacketed) glass vessel, and an effluent tank. The feed tanks were stored in a fridge at 4 °C to minimize biological processes. The two vessels of 2 L working volume were operated as acidification reactors at 25 ± 3 °C and an HRT of 24 h. The temperature in the acidification reactors was controlled by using a recirculating water bath (Thermo Haake DC10), the pH in each tank was measured with a probe (Hamilton pH sensor), and a controller (Hach SC200 Universal Controller, USA) was used to control the pH in both reactors at 6 ± 0.2 by adding 0.5 mol·L−1 NaOH (Yu and Fang 2003). The volumetric loading rate (VLR) was 8 kg COD·m−3·d−1.

Fig. 1
figure 1

Schematic diagram of the reactor set-up

The operation of the reactors was divided into two stages, stage I lasted 78 days (days 7–84) and stage II lasted 48 days (days 85–132). In the first stage, the CAS reactor, fed with substrate A) casein, was acclimated to pure protein substrate, whereas the LAC reactor, fed with substrate B) lactose, was acclimated to pure carbohydrate substrate. At the beginning of stage II, the feed of both reactors (CAS and LAC) was switched to substrate C) the mixture of casein and lactose (MIX), and the microbiota was acclimated to the mixture of protein and carbohydrate. Magnetic stirrers were used to ensure the mixing of the feed, while two stirring blades connected to motors were used for mixing the reactors at 50 rpm. Four peristaltic pumps (Longer L100-18–2 with FG25-13 pump head, China) were used for influent feeding and effluent extraction. The operational conditions were constant during all stages, controlled by LabVIEW.

Effluent samples were taken twice a week, and the reactors were considered stable when the difference between daily feed COD and effluent COD was less than 2% and the variation in total VFA production and deamination was less than 10% in a week. The reactors were operated for 132 days. To acclimate the microbiota towards a specific substrate, stage I lasted 84 days, whereas stage II started from day 86 and lasted 46 days.

Deamination activity tests

To investigate the deamination activity of the microbiota, batch tests were carried out in duplicates with microbiota samples taken from the reactor at the end of each stage. Glass bottles with a total volume of 350 mL were used for the batch test. The microbiota concentration in each bottle was 1.5 g VSS·L−1. Substrate A) of 190 ± 0.2 mL and buffer solution (3.5 g NaHCO3·L−1) was added to achieve a total liquid volume of 200 ± 0.5 mL. All bottles were incubated at 25 °C, with manual shaking before sampling. Liquid samples of 2 mL were taken from each bottle with syringes at 0 h, 1 h, 4 h, 8 h, 12 h, 20 h, and 24 h, and COD, total Kjeldahl nitrogen (TKN), VFAs, and ammonium (NH4+) were measured.

Analytical methods

The solids contents (TSS and VSS), TKN, and NH4+ of the reactor effluent were analysed following standard methods (APHA 1998). The COD measurements were done with HACH-Lange kits (LCK014) without pre-treatment. For VFA measurement, samples were centrifuged at 13,500 × g for 5 min and filtered through 0.45 μm membrane filters (Whatman, Sigma Aldrich). VFAs, including acetic acid (C2), propionic acid (C3), iso-butyric acid (iC4), butyric acid (C4), iso-valeric acid (iC5), valeric acid (C5), and hexanoic acid (C6), were measured by gas chromatography (GC, Agilent Technologies 7820A) equipped with a CP 7614 column (WCOT Fused Silica 25 m × 0.55 mm, CP-wax 58 FFAP capillary, Agilent Technologies) and flame ionization detector. The injector temperature was 250 °C, and nitrogen gas (28.5 mL·min−1) with a split ratio of 10 was used as a carrier. The GC oven method sequence was started at 100 °C and held for 2 min, then increased to 140 °C and held for 6 min. The VFA production was calculated by first converting the measured VFAs (C2-C6) concentrations (mg·L−1) to COD (mgCOD·L−1), then divided by the COD of the effluent and times 100%.

The protease activity was measured with the Pierce™ Fluorescent Protease Activity kit (Thermo Scientific, USA), including contents of FTC-Casein, TPCK Trypsin, and BupH™ Tris-buffered saline. Measurement was carried out in triplicates, 100 μL sample taken directly from the reactors was mixed with 100 μL FTC-Casein working reagent (10 μg·mL−1 FTC-Casein in 25 mM Tris, 0.15 M NaCl, pH 7.2 assay buffer) in a 96-well plate (Pierce White Opaque 96-Well Plates). The plate was incubated for 2 h at 25 °C inside the fluorometer (FLUOstar, galaxy), the fluorescence was read every 120 s, and the fluorescein filter was set between 485 nm excitation wavelength and 520 nm emission wavelength. The reads of each sample were converted to BAEE (fluorescence unit, Na-Vebzoyl-L-Arginine Ethyl Ester, units·L−1), based on the calibration curve built with reference protease, Trypsin, and the conversion factor between its concentration and BAEE (14,000 BAEE units·L−1 equivalent can be produced by 50 mg·L−1 Trypsin). Final protease activity was calculated by linear fitting of the BAEE measurements and then normalized with the COD concentration of the sample.

Microbial community analysis

For microbial community analysis, samples were taken at the beginning and the end (stable operation) of each stage, as shown in Table 1. Aliquots of 2 mL liquid sample were centrifuged at 14,000 × g for 5 min, the supernatant was removed, and then the sample was stored at -20 °C before DNA extraction with FastDNA™ Spin Kit for Soil (MP BIOMEDICALS). The extracted DNA concentrations in all samples were checked by the Qubit 3 Fluorometer (Thermo Fisher Scientific, USA) to ensure a DNA concentration higher than 20 ng·μL−1. The DNA samples were then sent for amplicon sequencing by Illumina NovaSeq 6000 platform (Novogene, UK), using the primers 341F [(5′–3′) CCTAYGGGRBGCASCAG] and 806R [(5′–3′) GGACTACNNGGGTATCTAAT] for bacteria/archaea in the V3–V4 regions. Paired-end reads (2 × 250 bp) were assigned to samples based on their unique barcodes and truncated by cutting off the barcode and primer sequences. Paired-end reads were merged using FLASH (V1.2.7) (Magoč and Salzberg 2011), and the splicing sequences were called raw tags. High-quality clean tags (Bokulich et al. 2013) were obtained by quality filtering on the raw tags according to the QIIME (V1.7.0) (Caporaso et al. 2010). The clean tags were compared with the Gold database (see details http://drive5.com/uchime/uchime_download.html) using UCHIME algorithm (Edgar et al. 2011) to detect and remove chimera sequences (Haas et al. 2011) to obtain the effective tags. Sequences analysis were performed by Uparse (V7.0.1001) (Edgar 2013) using the effective tags. Sequences with ≥ 97% similarity were assigned to the same Operational Taxonomic Unit (OTU) and screened for further annotation by Mothur software against the SSUrRNA database of SILVA Database (Wang et al. 2007). Phylogenetic relationship of all OTUs was obtained by the MUSCLE (Version 3.8.31) (Edgar 2004). The sequences were deposited at DDBJ/EMBL/GenBank under the accession KFUH00000000.

Table 1 Microbiota samples taken at the beginning and end of each stage for microbial community analysis

OTUs abundance information was normalized using a standard of sequence number corresponding to the sample with the least sequences. Subsequent analysis of alpha diversity and beta diversity were all performed based on this output normalized data. Alpha diversity is applied in analysing biodiversity through observed species, Chao1, and phylogenetic diversity (PD). The indices in the samples were calculated with QIIME (V 1.7.0) and displayed with R software (V 2.15.3). Beta diversity analysis on weighted unifrac was calculated by QIIME (V 1.7.0). Principal Coordinate Analysis (PCoA) based on the weighted unifrac distance matrix of OTUs was performed and displayed by WGCNA package, stat packages and ggplot2 package in R software (V 2.15.3).

Results

COD balance and pH in the acidification reactors

The COD balance in the two acidification reactors during the operation period of 132 days is shown in Fig. 2A, the COD in the effluents was higher than 93% of the influent CODs of both the CAS and LAC reactors. The pH in the reactors was well regulated at 6 (Fig. 2B). Limited biogas gas production was expected at pH 6 (Holliger et al. 2016). Therefore, the COD in the two acidification reactors during the operation period was balanced with a maximum gap of 5%.

Fig. 2
figure 2

Average COD balance in the CAS and LAC reactors (A), where Eff, H2, and CH4 represent the output COD in the liquid effluent, and resulting biogas (H2 and CH4), the Gap represents the difference between the total output COD and input COD in the influent; pH in the two reactors recorded by the online pH probes (B)

Effect of the presence of lactose on deamination and VFA production

Deamination

Figure 3A shows the deamination efficiency of the two acidification reactors during the two stages. In stage I, casein was fed as the sole carbon and nitrogen source to the CAS reactor, and the deamination efficiency was calculated as the percentage of NH4+ released from the fed protein. The deamination efficiency increased from only 40% to almost 100% during the start-up period (day 0—20) and fluctuated between 60 and 80% during days 20—64. During days 70—84, a relatively stable period was observed, the average deamination degree was 77%, with a relative standard deviation of less than 5%. In the LAC reactor, lactose was fed as the carbon source and NH4+ was added as a nitrogen source, and therefore the deamination efficiency was 0%.

Fig. 3
figure 3

Deamination (A) and VFA production as % of effluent COD (B) in CAS reactor and LAC reactor during the operational period, VFA relative abundance in the CAS reactor (C), and VFA relative abundance in the LAC reactor (D)

In stage II, both reactors were fed with the MIX (substrate C). The deamination efficiency in the CAS reactor dropped to 15% at the beginning of stage II. Subsequently, an increasing trend with fluctuations (between 41 and 64%) was observed, and a similar deamination efficiency as in stage I was achieved at the end of stage II (80%). Similarly, the deamination efficiency of the LAC reactor also increased from 4 to 73% during days 86—120 but decreased to 35—55% during days 120—132.

VFA production

Figure 3B shows that the effluent VFA content of the CAS reactor fluctuated between 30 and 55% of the effluent COD during days 0 – 49 (VFA concentration data can be found in supplementary material Table S1). It increased steadily from 51 to 64% of the effluent COD during days 49 – 64 and stabilized at 75 ± 5% during days 64 – 84, Whereas in the LAC reactor, the VFA production was relatively stable between 20 and 39% of the effluent COD throughout stage I, with an average of 23 ± 6%. The difference in VFA production efficiency between the CAS and LAC reactor was 40% in stage I.

In stage II, VFAs production in the CAS reactor initially decreased to 34% of the effluent COD, followed by a gradual increase to 67% during days 120—132. The variation in VFAs production agreed with the variation in deamination efficiency, that is: a sudden decrease in protein degradation at the beginning of stage II. The VFAs production in the LAC reactor considerably increased from 37 to 76% of the effluent COD during days 86 – 114; however, it started dropping to 60 ± 2% by the end of stage II. The observed increased COD to VFA conversion efficiency could be attributed to the production of the more reduced VFAs iC5, C5, and C6 (Fig. 3D) coinciding with protein degradation. The presence of lactose reduced the degradation efficiency of casein, resulting in a decreased deamination efficiency and VFAs production in the CAS reactor in stage II.

Figure 3C presents the VFA composition as the percentage of total VFA-COD in the CAS reactor during stage I and stage II. During days 56 – 84, the VFA production and composition were considered stable in stage I. The relative abundance of VFAs was similar to the fermentation products of gelatine at pH 6 in the study of Breure and van Andel (1984). When the substrate was changed to the MIX in stage II, the composition of VFAs during stable operation performance (days 125 – 132) had changed compared to that in stage I. The C2 increased from 30% ± 9% to 50 ± 3%, and C4 increased from 14 ± 3% to 23 ± 5%.

As shown in Fig. 3D, the fluctuations in VFA composition in the LAC reactor mainly concerned C2 and C4 during stage I. A relatively stable VFA composition was observed from day 64 to day 84, where the most abundant VFAs were C2 (43 ± 2%) and C4 (44 ± 4%). In stage II, the VFA composition in the LAC reactor gradually changed with the increasing production of iC4, iC5 and C5, and C6. This trend was also observed in the CAS reactor, and the identified main VFA products during stage II were C2 (42 ± 5%) and C4 (28 ± 5%).

Effect of the presence of lactose on protease activity and deamination activity

Protease activity

At the end of experimental stage I (day 84), the protease activity of the samples from both reactors was analysed. As shown in Fig. 4, protease activity of 0.18 ± 0.01 BAEE unit·mgCOD−1·h−1 was observed in the CAS reactor, whereas no protease activity was detected in the LAC reactor sample. During stage II (day 91 and day 132), the CAS and LAC reactors were both found to have protease activity but at a level lower than 0.04 ± 0.01 BAEE unit·mgCOD−1·h−1.

Fig. 4
figure 4

Protease activity measured in stage I and stage II (the column represents the average protease activity in BAEE unit·mgCOD−1·h.−1, error bars represent the standard deviation)

The results showed that with the presence of lactose, the protease activity decreased to lower than 25% of that in stage I in the CAS reactor, indicating a negative effect of lactose on the protease activity, whereas in the LAC reactor, protease activity was slowly developed under the presence of protein.

Deamination activity

At the end of each stage, microbiota samples were taken from the reactor for assessing the deamination activity in batch tests. Figure 5A shows the ammonium production by the microbiota sample taken from the CAS and LAC reactors at the end of stage I. The deamination activity was calculated by linear fitting of the scattered plot of NH4+ concentrations as a function of time and divided by the microbiota concentration, 1.5 gVSS·L−1. Only suspended microbiota was found and collected from the CAS reactor (CAS microbiota) during this stage, whereas suspended microbiota (LAC suspended microbiota) and biofilm (LAC biofilm) were found and collected from the LAC reactor. The CAS microbiota had the highest deamination activity of 10.6 mg-N·gVSS−1·h−1. In contrast, no deamination activity of the LAC suspended microbiota and LAC biofilm was observed.

Fig. 5
figure 5

Ammonium production by microbiota sample taken at the end of stage I—84 days (A) and stage II – 132 days (B). The ammonium production rate was derived from the linear fitting of the plots; error bars represent the standard deviation

Figure 5B shows the ammonium production by the microbiota samples collected from both reactors at the end of stage II, including CAS suspended microbiota, CAS biofilm, LAC suspended microbiota and LAC biofilm. Compared to the deamination activity of CAS microbiota from stage I, the four microbiota samples from stage II showed a similarly low level of deamination activity, which was 1.2 mg-N·gVSS−1·h−1.

Effect of the presence of lactose on microbial community

Microbial community composition in stage I and stage II

Alpha diversity of the samples was compared using three indices, the observed species, chao1 and phylogenetic diversity indices. As can be seen in Fig. 6A, samples from the CAS reactor, especially during stage I, showed a higher diversity. In Fig. 6B, the PCoA analysis illustrated the similarity of the microbial communities in all samples that are indicated in Table 1. The distance between samples indicates their similarity in microbial community, the shorter the distance, the higher the similarity, and vice versa. As shown in the figure, samples C1.2 and C1.3 taken from the CAS reactor stage I, displayed the longest distance to the rest of the samples, indicating that the microbial community under casein-fed conditions had a very different composition than the lactose-fed or MIX-fed conditions. The samples L2.2 and L2.3, taken from the LAC reactor, displayed the shortest distance, indicating that the microbial community can be considered similar in the suspended microbiota and biofilm in this reactor by the end of stage II. Remarkably, suspended microbiota L1.3 has a shorter distance to stage I samples, whereas biofilm sample L1.4 was closer to stage II samples. This means that the microbial community composition in the biofilm samples of stage I became predominant in stage II in LAC reactor.

Fig. 6
figure 6figure 6

Alpha diversity of the microbial community in CAS and LAC reactors during stage I and stage II, average values of the observed species, Chao1, and phylogenetic diversity (PD) are presented and the error bars indicate the minimum and maximum values of the samples in each stage (A), Principal coordinates analysis (PCoA) ordination based on the weighted unifrac distance matrix of OTUs comparing all samples taken from CAS and LAC reactors during stage I and stage II. The PCoA is coloured by reactor: red symbols indicate CAS, the green ones indicate LAC. The percentage of the variation explained by the plotted principal coordinates is indicated on the axes (B), phylum relative abundance of microbiota in samples from CAS and LAC reactors during stage I and stage II (C), and top 15 dominant genera in CAS and LAC reactors during stage I and stage II (D)

Figure 6C shows the relative abundance at phylum level in each sample. When comparing the difference between stage I and stage II in the CAS reactor, the phyla Firmicutes and Proteobacteria were promoted by the presence of lactose and increased from 5.5% and 11.5% OTUs (average of C1.2 and C1.3) to 41% and 42% OTUs (C2.3), respectively. At the same time, the phyla Chloroflexi and Euryarchaeota (Archaea) were predominant in stage I but decreased in relative abundance in stage II from 23.5% and 18.5% OTUs (average of C1.2 and C1.3) to 1% and 4% OTUs (C2.3), respectively. Microorganisms belonging to these phyla had a slower growth rate than the acid producing bacteria at pH 6. The phylum Bacteroidetes exhibited first an increase from 14% OTUs (average of C1.2 and C1.3) to 24% OTUs (C2.2) and then decreased to 7% OTUs (C2.3). With the feed changed from casein to the MIX, the composition of the microbial community in the CAS reactor significantly changed at the phylum level, and the presence of lactose promoted the growth of bacteria affiliated to phyla Proteobacteria and Firmicutes.

When comparing the differences in microbial community composition between stage I and stage II in the LAC reactor, the phylum Firmicutes was found to be predominant in stage I, accounting for more than 60% OTUs in L1.1 and L1.2, 47% OTUs in L1.3, and 39% OTUs in L1.4. However, in stage II, Firmicutes decreased to 29% OTUs (L2.3). The phylum Proteobacteria were the second most abundant in L1.1 and L1.2 (16% OTUs) and gradually became more abundant than Firmicutes in L1.4, with a relative abundance of 47% OTUs, and was maintained above 40% OTUs during stage II. Bacteria from the phylum Bacteroidetes was also abundantly present in all the LAC samples, it accounted for 17% OTUs in L1.1 and 12% OTUs in L2.3, and the composition of the microbial community in the LAC reactor remained relatively stable during the two different stages.

According to the alpha and beta diversity analysis, the microbiota, that was acclimated to casein as the sole substrate, was significantly different from the microbial community acclimated to lactose as the sole substrate. The phyla Chloroflexi and Euryarchaeota (Archaea) were predominant in the casein-fed reactor, whereas Proteobacteria and Firmicutes were predominant when lactose was present in the reactor. In general, the microbial community had a higher similarity in stage II, when lactose was present.

Characterization of predominant genera

Figure 6D presents the top 15 predominant genera in stage I and stage II in CAS and LAC reactors. In the CAS reactor, it was found that genera Longilinea, Proteiniphilum, and midas_g_156 were predominant in stage I, whereas genera Stenotrophomonas, Comamonas, Clostridium, Acinetobacter, and Bacteroides were dominating in stage II. Among the dominating genera found in CAS reactor, the genera Longilinea, Proteiniphilum, Stenotrophomonas, Acinetobacter, and Clostridium sensu stricto are identified as facultative, i.e., being able to ferment both sugars and proteins/amino acids (see Table 2). Whereas genus Comamonas is identified as only able to ferment proteins/amino acids, genus Bacteroides is identified as only able to ferment sugars (Dueholm et al. 2021).

Table 2 Substrate-utilization of the identified top 33 predominant genera

During stage I of the LAC reactor, the dominating genera included Enterobacter, Clostridium, Bacteroides, Lactococcus, Ruminococcaceae UCG 009, Acinetobacter, Clostridium sensu stricto 1, and Clostridium sensu stricto 12. The dominating genera in stage II were Enterobacter, Pseudomonas, Clostridium sensu stricto 12, Bacteroides, Bifidobacterium, and Clostridium sensu stricto 1. As shown in Table 2, in addition to the genera mentioned in the previous paragraph, genera Proteiniphilum, Pseudomonas, Ruminococcaceae UCG 009, and Bifidobacterium are reported to be only able to ferment sugars. Whereas Lactococcus is identified as a facultative genus, which can ferment both proteins/amino acids and sugars.

Results in Table 2 show the 33 dominating genera identified in the collected samples, which has a relative abundance higher than 0.1% in at least one sample, and their ability to ferment sugars and proteins/amino acids is also presented. According to the abundance and substrate utilization information, the top 33 genera from all samples were categorized into 4 groups: 1) facultative bacteria, able to ferment both sugars and proteins; 2) sugar fermenting bacteria, able to ferment sugars but not proteins; 3) protein fermenting bacteria, able to ferment proteins but not sugars; and 4) not available, substrate utilization information was not available. We calculated the percentage of each group in the top 33 genera by counting the number of genera in each group and divided it by the total number of genera (33), and plotted the results as shown in Fig. 7. Half of the identified genera (49%) are reported to be facultative. Due to the limited availability of systematic genome analysis of some genera, the ability, to ferment proteins/amino acids, of some sugar-fermenting genera could not be fully confirmed, e.g. Bifidobacterium and Ruminococcaceae UCG 009, meaning that they could possibly be assigned to the facultative category as well.

Fig. 7
figure 7

Categorization of the top 33 genera in all samples according to substrate utilization characteristics, facultative indicates the percentage of genera that can ferment both proteins and sugars, sugar fermenting indicates the percentage of genera that can only ferment sugars, protein fermenting indicates the percentage of genera that can only ferment proteins, and not available indicates the percentage of genera that were not able to characterized

Discussion

As reported in previous studies, the presence of carbohydrates retards the production of protease, limits protein solubility and hydrolysis, and restricts the acidification of amino acids (Breure et al. 1986b; Glenn 1976; Yang et al. 2015). In this study, it was found that when the carbon source was changed from casein to the mixture of casein and lactose, the efficiency of deamination in the protein-fed reactor decreased from 77 to 15% and the VFA production decreased from 75 to 34% of the effluent COD (Fig. 3).

Additionally, a decrease in protease activity and deamination activity of the microbiota was also observed in the presence of lactose (Figs. 4 and 5), which has not been reported before. The deamination activity was 10.6 mg-N·gVSS−1·h−1 when the microbiota was acclimated to pure casein, it decreased to about 10% after the microbiota was acclimated to the MIX, and it took at least a month to acclimate the microbial community to the MIX to achieve a similar deamination efficiency as in stage I. Although the protein concentration in stage II was half that in stage I, it was not the reason for the lower deamination efficiency, as casein is completely deaminated within 20 h at both 6.0 g·L−1 and 3.0 g·L−1 (Deng et al. 2022). In the presence of lactose, the microbial community has a low protease activity and deamination activity, and only degraded casein to a limited extent. These results are in agreement with the study carried out by Breure et al. (1986a), which shows that gelatine degradation is retarded when glucose is present as a second carbon source.

Although the retardation of protein degradation by the presence of carbohydrates was frequently observed, the mechanism of the phenomenon has not been revealed. Our study investigated the effect of lactose on the microbial community and found that when lactose was present, the microbial community, previously acclimated to casein, became similar to the microbial community acclimated to lactose (Fig. 6A). The results indicated a selection of microorganisms by the presence of lactose. Generally, Firmicutes are the most abundant under various environmental conditions in anaerobic reactors (Pasalari et al. 2021), Chloroflexi and Firmicutes are dominant in protein-fed reactors (Kovács et al. 2013; Palatsi et al. 2011). In our study, the phyla Chloroflexi, Proteobacteria, and Bacteroidetes were predominant in the casein-fed reactor, whereas Proteobacteria and Firmicutes were predominant when lactose was present in the reactor. The observed retardation of protein degradation can be correlated to the shift in the microbial community in the presence of lactose.

Furthermore, among the identified microorganisms, genera that can ferment both proteins/amino acids and carbohydrates were the predominant population in the microbial community (Table 2). Thermodynamically, sugar fermentation yields more energy than proteins/amino acids: cells gain 1–2 mol ATP·mol−1 glucose (Zhou et al. 2018) compared to 0.5 mol ATP·mol−1 amino acids (Barker 1981). The bacteria that can ferment sugars, including the facultative bacteria and the sugar fermenters, have a higher growth rate and biomass yield than the bacteria that are only fermenting proteins/amino acids (Pavlostathis and Giraldo-Gomez 1991; Tang et al. 2005), and it is, therefore, expected that the former can outgrow the latter during the mixture-feeding conditions at a specified solids retention time. Consequently, lactose was degraded prior to casein.

To confirm that the predominant facultative microorganisms are the active protein degraders, further research on the functional analysis of the microbial community is recommended. In addition, microbial community analysis has been focused on the methane-producing archaea and the granulation-involving filamentous (De Vrieze and Verstraete 2016), and less attention has been given to the protein degraders in a complex microbiota. To better manage and control the AD process for the treatment of protein-rich streams, an in-depth understanding of the protein degraders in complex microbiota is required.

In conclusion, the retardation of casein degradation by the presence of lactose was due to 1) the substrate-preference of the dominant bacteria, including the facultative bacteria and the sugar fermenters, and 2) the higher growth rate and yield of the dominant bacteria. Overall, this study offers a better understanding of the mechanisms behind the retardation of protein degradation by the presence of carbohydrates, which can provide insights into the design of the anaerobic digestion process of protein-rich waste streams to improve protein degradation.