Two novel cyanobacterial α-dioxygenases for the biosynthesis of fatty aldehydes

α-Dioxygenases (α-DOXs) are known as plant enzymes involved in the α-oxidation of fatty acids through which fatty aldehydes, with a high commercial value as flavor and fragrance compounds, are synthesized as products. Currently, little is known about α-DOXs from non-plant organisms. The phylogenic analysis reported here identified a substantial number of α-DOX enzymes across various taxa. Here, we report the functional characterization and Escherichia coli whole-cell application of two novel α-DOXs identified from cyanobacteria: CalDOX from Calothrix parietina and LepDOX from Leptolyngbya sp. The catalytic behavior of the recombinantly expressed CalDOX and LepDOX revealed that they are heme-dependent like plant α-DOXs but exhibit activities toward medium carbon fatty acids ranging from C10 to C14 unlike plant α-DOXs. The in-depth molecular investigation of cyanobacterial α-DOXs and their application in an E. coli whole system employed in this study is useful not only for the understanding of the molecular function of α-DOXs, but also for their industrial utilization in fatty aldehyde biosynthesis. Key points • Two novel α-dioxygenases from Cyanobacteria are reported • Both enzymes prefer medium-chain fatty acids • Both enzymes are useful for fatty aldehyde biosynthesis


Introduction
Fatty aldehydes represent a structurally diverse group of aliphatic compounds derived from the corresponding fatty acids. They occur in a wide range of living organisms including plants, insects, and mammals as metabolic intermediates of lipids (Foster and Anderson 2019;Gilbertson et al. 1972;Liu et al. 2012). Many types of fatty aldehydes provide pleasant and diverse odor impressions depending on their chemical structure determined by chain length, the presence/ number/position of double bonds, and further substitutions (Hammer et al. 2020). Thus, they are widely applied in the food, perfume, cosmetic, and detergent industries as important flavor and fragrance compounds.
Extraction from natural sources, such as plants and fungi, has been traditionally achieved to obtain fatty aldehydes, and the in-depth analysis of these extracts has served as a useful basis for the identification of valuable compounds (Hammer et al. 2021). Currently, most of commercial flavors and fragrances are chemically synthesized using either fossilbased starting materials or renewable lipid sources (Gaylord 1957). However, this often requires hazardous chemicals and energy-demanding high-temperature and high-pressure conditions. In contrast, biotechnological routes can be a more effective way to accomplish "greener" and practical production of fatty aldehydes. More importantly, they can meet the consumer's preference for naturally produced products (Kunjapur and Prather 2015).
To date, most of the biosynthetic pathways for fatty aldehydes -either as the final product or as precursors to industrially relevant compounds -have been devised by introducing fatty acid acyl-CoA/ACP reductases (FARs) (Foo et al. 2020;Lehtinen et al. 2018Lehtinen et al. , 2017Reiser and Somerville 1997) or carboxylic acid reductases (CARs) (Akhtar et al. 2013;Maurer et al. 2019;Wu et al. 2021). Many attempts were made for the de novo synthesis of fatty aldehydes by introducing FAR in various organisms such as Escherichia coli, Saccharomyces cerevisiae, or Acinetobacter baylyi (Foo et al. 2020;Lehtinen et al. 2018Lehtinen et al. , 2017Reiser and Somerville 1997). For these approaches, however, activated fatty acid (AFA) precursors, typically CoAderivatives, are required as substrate, and the FAR enzyme activity is often too low to generate industrially sufficient yields. The utilization of free fatty acids (FFAs) instead of AFAs as the substrate possesses advantages since FFAs are more abundant in biological systems and can be readily supplied from cheap renewable resources (Hu et al. 2020). The alternative strategy most known to complement the FAR-based pathway is based on CARs (Akhtar et al. 2013;Hu et al. 2020;Maurer et al. 2019;Wu et al. 2021). CARs are not only capable of directly acting on FFA but also are versatile enzymes with broad substrate spectra. However, issues related to technical complexity and expensive process cost cannot be avoided when using this enzyme class as they require the cofactors ATP, NADPH, and Mg 2+ and the additional expression of a partner enzyme, a phosphopantetheinyl transferase (PPTase) (Akhtar et al. 2013;Maurer et al. 2019;Wu et al. 2021). In contrast, α-dioxygenases (α-DOXs), a family of heme-dependent fatty acid dioxygenases that convert FFAs into fatty aldehydes, can be potent candidates to tackle these problems as they require only molecular oxygen, but no expensive cofactors (Goulah et al. 2013;Hamberg et al. 2002;Hammer et al. 2020;Kaehne et al. 2011;Koeduka et al. 2005).
Biologically, α-DOXs catalyze the initial step of fatty acid α-oxidation in plants for the synthesis of oxylipin and, hence, are of physiological significance (Mosblech et al. 2009). The expression of α-DOX has been upregulated as a means of a defense mechanisms against various kinds of biotic and abiotic stress factors including pathogens, oxidative and cold stress, and osmosis, for example (Koeduka et al. 2005). In the α-DOX-mediated α-oxidation of a fatty acid (C n ), molecular oxygen is accepted as the cosubstrate and inserted at an α-methylene carbon to produce (R)-2-hydroperoxy fatty acids (C n ) as intermediate, which spontaneously decarboxylates, yielding the fatty aldehydes (C n-1 ) and also the (R)-2-hydroxy fatty acid (C n ) (Hamberg et al. 2002;Mukherjee et al. 2011).
Although this fatty acid α-oxidation mechanism is wellknown in plants and, accordingly, catalytic functions of various α-DOXs from plants have been thoroughly elucidated, there is only one functional study of an α-DOX from a non-plant organism, the CsDOX from the cyanobacterium Crocosphaera subtropica (Hammer et al. 2020). CsDOX was revealed to have a catalytically equivalent function to α-DOX from plants but shows a different substrate specificity. This report motivated us to explore the functional diversity of α-DOXs from non-plant organism and to evaluate their catalytic potentials in whole-cell systems. In this study, we identified two novel α-DOXs from the cyanobacteria Calothrix parietina (CalDOX) and Leptolyngbya sp. (LepDOX) via InterPro-based domain search (Hunter et al. 2009), and we demonstrate the in-depth molecular characterization of these enzymes. Furthermore, using an E. coli whole-cell system, the CalDOX-or LepDOX-mediated microbial synthesis of fatty aldehydes from readily available cheap substrates was achieved. The results obtained from this study will provide not only new insights into the molecular function of non-plant α-DOXs but also enable the biotechnological production of fatty aldehydes and/or fatty aldehyde-derived biochemicals using these enzymes.

Phylogenetic tree analysis
An InterPro-based search using AtDOX1 from Arabidopsis thaliana as the query (GenBank accession no. AAK68727.1) identified approximately 1400 known or putative α-DOX family members from genome-sequenced organisms (Apweiler et al. 2001). Sequences with > 1000 amino acids in length as well as redundant (CD-HIT, a cutoff with 0.99) (Huang et al. 2010) and obviously faulty sequences were discarded. Among the sequences originating from various organisms, one from each genus was selected for the construction of a phylogenetic tree. The resulting amino acid sequences were aligned using MUSCLE (Edgar 2004), after which the alignment was trimmed by removing N′-(0-368) and C′-(980-1288) extensions. After two rounds of alignment optimization, the final alignment was obtained. The best model was determined through ModelFinder (WAG + R8) (Kalyaanamoorthy et al. 2017), subsequently applied for tree building using IQTree (http:// www. iqtree. org/), and finally visualized by FigTree (http:// tree. bio. ed. ac. uk/ softw are/ figtr ee/).

Expression and purification
Codon-optimized genes of CalDOX (GenBank accession no. MZ522724) and LepDOX (GenBank accession no. MZ522725) were synthesized (BioCat, Heidelberg, Germany) (Supplemental Fig. S1) and were cloned into the expression vector pET28a (Novagen, Darmstadt, Germany). The recombinant genes for putative α-DOXs were transformed into competent E. coli BL21 (DE3) cells. E. coli cells containing α-DOX genes were cultivated in LB (Lysogeny Broth) medium containing 50 μg/mL kanamycin. The induction of protein expression was performed by adding 0.5-mM isopropyl β-D-1-thiogalactopyranoside (IPTG). Cells were further incubated at 18 °C for 18 h and were harvested by centrifugation at 4500 × g and 4 °C for 20 min. Then, cell pellets were resuspended in buffer A (50-mM Tris-HCl, 200-mM NaCl, and 20-mM imidazole; pH 8.0) and disrupted by an ultrasonicator. Only during cell lysis for α-DOX, 1% (v/v) of Triton X-100 was supplemented to facilitate extraction. The wholecell lysate was centrifuged twice at 10,000 × g and 4 °C for 30 min. The supernatant containing soluble crude extract was loaded onto a column containing Ni-agarose resin (Carl Roth, Karlsruhe, Germany) and purified by immobilized metal ion affinity chromatography (IMAC). The His-tagged proteins were eluted by the addition of buffer B (50-mM Tris-HCl, 200-mM NaCl, and 300-mM imidazole; pH 8.0). The proteins were analyzed to check expression and purity by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). The protein concentrations were quantified by Bradford assay using bovine serum albumin (BSA) as the standard (Carl Roth). For further purification and the analysis of oligomeric state of CalDOX and LepDOX, size-exclusion chromatography (SEC) was performed with a HiPrep 16/60 Sephacryl S-200 HR column (Cytiva, Marlborough, MA, USA) using a buffer containing 10-mM Tris-HCl and 200-mM NaCl (pH 8.0) as the mobile phase.

Oxygen depletion assay
Oxygenase activities of α-DOX enzymes were measured by an oxygen consumption assay. Unless otherwise mentioned, the enzyme reaction was conducted as follows using the fluorescence-based 96-well oxygen sensor, OxoPlate OP96U (PreSens, Regensburg, Germany): The enzyme reaction containing 2-mM myristic acid in 0.1-M Tris-HCl (pH 7.0) with 1% (v/v) Triton X-100 in a total volume of 270 μL was initiated by the addition of 3 μL purified CalDOX or LepDOX (0.07 to 1 μg). After sealing with transparent adhesive foil, the oxygen sensor plate was immediately put into a Varioskan™ LUX multimode plate reader (Thermo Fisher Scientific, Darmstadt, Germany). According to the manufacturer's instructions, oxygen consumption was monitored by fluorescence measurement at 25 °C with dual kinetic and bottom reading modes using two different filter pairs for the fluorescence detection from indicator and reference dyes. For both dyes, 540 nm was used for excitation and fluorescence emission measured at 650 and 590 nm for indicator and reference dyes, respectively. A two-point calibration was constructed using air-saturated and air-free water standards.
Based on the equation provided by the manufacturer, the dissolved oxygen level in the reaction mixture was calculated, which was then subtracted from the saturated oxygen level to obtain the amount of oxygen consumed by enzyme reactions. One unit was defined as the amount of enzyme that consumes 1 μmole of oxygen per minute under the specified condition.

Effects of temperature, pH, and metal ions
The effect of temperature on enzyme activity was investigated by incubating 3.3 μg of purified CalDOX with 2-mM myristic acid. Enzymatic reactions were performed under various temperature conditions ranging from 20 to 40 °C with an interval of 5 °C at pH 7 (0.1-M Tris-HCl) and 1000 rpm for 20 min in a total volume of 300 μL in the absence of Triton X-100. Then, 300 μL of hexane was added to the reaction mixture for the extraction of fatty aldehydes, followed by centrifugation at 10,000 × g for 1 min to separate the two phases. After taking the organic phase, enzyme activity was determined by quantification of tridecanal produced from the reaction via GC-FID analysis.

Whole-cell biotransformation
Expression of recombinant CalDOX and LepDOX enzyme was induced in E. coli cells as described above. After harvesting cells by centrifugation at 9600 × g and 4 °C for 10 min, cells were washed with a resting cell medium (RCM) containing 200-mM potassium phosphate and 50-mM NaCl (pH 7.4), followed by centrifugation under the same conditions. After decanting the supernatant, cells were resuspended in the RCM.
Whole-cell biotransformations were conducted with E. coli resting cells at 12.5 g wet weight per L RCM at 35 °C and 1000 rpm with a reaction volume of 300 μL using a ThermoMixer (Eppendorf, Hamburg, Germany). The reaction was initiated by adding 15 μL of 100-mM fatty acid substrates (C10:0 or C14:0) from a DMSO stock solution to 285 μL of the resting cell suspension. After incubating for the indicated times, the biotransformations were stopped by the addition of 30-μL 2-M HCl. The samples were extracted with an equal volume of ethyl acetate three times and dried over anhydrous Na 2 SO 4 . The collected organic phases were subjected to GC-FID analysis.

Gas chromatography analysis
Sample analysis was performed through a gas chromatography (GC) system (Shimadzu, Nakagyo-ku, Kyoto, Japan) with a flame ionization detector (FID) using a ZB-5MSi capillary column (30 m × 0.25 mm, 0.25-µm film thickness; Phenomenex, Torrance, CA, USA) as the stationary phase and H 2 as the carrier gas. The linear velocity of the mobile phase was set to 42.2 cm/s. A 1 µL sample was injected into the GC inlet in split mode with a ratio of 10, and the injection port temperature was set at 300 °C. The temperature program of the oven started from 80 °C for 5 min and then increased to 240 °C with a rate of 8 °C/min, and this was held for 8 min. For identification and quantification of fatty acids and fatty aldehydes, the corresponding standards at different concentrations were prepared, and calibration curves were constructed.

Structural modeling and sequence alignment
The amino acid sequences of CalDOX and LepDOX were analyzed by alignment with two structurally and functionally known α-DOXs from A. thaliana (AthDOX1; Gen-Bank accession no. AAK68727.1 and PDB code, 4KVK) and Oryza sativa (OsaDOX; GenBank accession no. ABA98060.2 and PDB code, 4HHS). Sequence alignment was performed using Clustal Omega (Sievers and Higgins 2014) and ESPript (Gouet et al. 1999). Homology model structures of CalDOX and LepDOX were constructed using the crystal structure of wild-type OsaDOX (PDB code: 4HHS) based on SWISS-MODEL (http:// swiss model. expasy. org/). The illustration of structures was made by PyMOL (http:// www. pymol. org).

Phylogenetic tree of α-DOX family
The phylogenetic analysis revealed that disparate clusters of α-DOX sequences distributed over a range of bacteria, fungi, and metazoa in addition to plants, none of which has so far been reported to include α-DOX (Fig. 1). In five bacterial phyla including Cyanobacteria, Proteobacteria, Actinobacteria, Acidobacteria, and Bacteroidetes, α-DOXs were found, in which α-DOXs from Cyanobacteria and Actinobacteria were clustered together, while those from Proteobacteria were dispersed throughout the tree. α-DOX-possessing fungi were restricted to the phylum Ascomycota. In case of plants, α-DOXs were found not only in higher plants but also in primitive land plants, such as moss (Bryophyta).
The amino acid sequence analysis shows that all sequences contain the conserved catalytic tyrosine, which has an essential role in the oxygen insertion into the fatty acid. Regarding the two heme ligands playing a role in generating the radical of catalytic tyrosine, the proximal histidine residue was also conserved, whereas the distal histidine was partially conserved.
Among these newly identified α-DOX sequences, Cal-DOX and LepDOX from non-pathogenic cyanobacteria Calothrix parietina and Leptolyngbya sp., respectively, were chosen for biochemical investigation in this work.

CalDOX and LepDOX are hemoproteins
Plant α-DOXs are known to be hemeproteins (Goulah et al. 2013;Liu et al. 2004;Mukherjee et al. 2011;Zhu et al. 2013), which utilize heme as the indispensable constituent to perform their oxygenase activity. In our study, the cell pellets of E. coli with recombinantly expressed α-DOX enzymes as well as their purified forms exhibited a red color, indicating that α-DOX derived from cyanobacteria also possesses a heme (Fig. 2). To further elucidate this, the absorbance spectra of the purified enzymes were scanned using UV-Vis spectroscopy. As a result, CalDOX and LepDOX showed Soret peaks at 412 nm, which is the typical property of plant α-DOXs (Fig. 2). From these results, the cyanobacterial α-DOXs newly identified in our study were found to be functionally equivalent to plant α-DOXs in terms of their heme-dependency, which was not elucidated for the previously described cyanobacterial CsDOX (Hammer et al. 2020).
Despite the obvious identification of CalDOX and Lep-DOX as heme proteins, high variation in Soret peak intensities were observed for every cultivation batch, with A 412 / A 280 ratios ranging from 0.03 to 0.2, meaning a high batchto-batch variation of heme incorporation into the enzymes. In every cultivation, samples had different proportion of heme-containing active proteins. In an effort to saturate heme incorporation, growth media were supplemented with 5-aminolevulinic acid (5-ALA), FeCl 3 , 5-ALA and FeCl 3 , or hemin during the synthesis of cyanobacterial α-DOXs. However, the supplementation did not improve the heme occupancy significantly (data not shown). Next, when hemin was added to purified CalDOX, a slightly higher intensity of the Soret peak was observed, indicating that heme can indeed be incorporated even into the already formed Cal-DOX protein. Still, the heme incorporation appears to be insufficient to achieve full saturation (Supplemental Fig. S2).

Oligomeric states of CalDOX and LepDOX
Although plant α-DOXs are functionally known as monomers (Goulah et al. 2013;Zhu et al. 2013), oligomeric states of AtDOX can highly vary with the expression and detergent systems from 1 to 10 subunits (Liu et al. 2006). Here, the absorbances at 280 and 412 nm were used for the monitoring of the total protein content and heme-bound protein, respectively, during the size-exclusion chromatographic run. Also, R z values, the ratio of absorbance at the Soret peak to the absorbance at 280 nm (Colas and De Montellano 2004), for monomeric and aggregated fractions of Cal-DOX and LepDOX pooled from SEC were determined (Table S2). Two major peaks corresponding to the aggregated and monomer forms were revealed for both CalDOX and LepDOX (Fig. 3a). Each form had the same monomer molecular mass of around 67 kDa on SDS-PAGE. In both aggregate and monomer forms, heme-association with the proteins was identified with CalDOX with similar values of R z (Fig. 3a and Table S2). In contrast, whereas heme-binding was clearly observed in the monomer form in LepDOX with similar R z values to that of aggregates and monomers of CalDOX, the occupancy of the heme in the aggregated form of LepDOX was significantly lower, as evidenced by the 2.8-fold lower Soret peak intensity compared to that of the monomeric form at the same protein amount (gram-based), that is, a lower R z value ( Fig. 3a and Table S2). In case of CalDOX, both aggregate and monomer fractions showed oxygen-consuming activity with myristic acid, indicating that the aggregate maintains an intact catalytic machinery of CalDOX ( Fig. 3b; left). However, LepDOX in aggregate form shows little activity, whereas its monomer shows oxygenase activity ( Fig. 3b; right). This indicates that aggregate formation led to disruption of the catalytic machinery for LepDOX, possibly by affecting the enzyme's heme environment. Thus, only the monomeric form of LepDOX was used for the enzyme assay. These results provide evidence Fig. 2 Identification of cyanobacterial α-DOXs as hemeproteins. a Cell resuspension for CalDOX and LepDOX and their purified forms show red colors. b Identification of the Soret peak with purified CalDOX and LepDOX by UV-Vis spectroscopy that the presence or absence of heme in the aggregated form of cyanobacterial α-DOXs could be highly associated with their catalytic activity. However, the possibility that structural configurations of aggregates also can affect their catalytic machinery cannot be excluded.
To better understand the status of oligomeric states of the enzymes, native gel analyses were performed (Supplemental Fig. S3). Although it is difficult to estimate the exact molecular mass of oligomers on native gels, a rough information on the native states of the enzymes could be obtained. The aggregate fractions in both enzymes were found to be heterogeneous mixtures, which was not obviously from SEC analyses. For the monomer fractions, in contrast to the native CalDOX showing only one major band, LepDOX consisted Oxygen consumption activities of the aggregated and monomeric forms of CalDOX and LepDOX. Oxygen consumption activities were determined by incubating 2-mM myristic acid with the aggregated and monomeric forms of enzymes (0.07 to 1 μg) at 25 °C and pH 7 of two major bands. The existence of the LepDOX monomer as two different species can be explained by the possibility that non-heme-bound and heme-bound species in the LepDOX monomer fraction may have different charges that lead to distinct migration rates according to native PAGE.

Effect of Triton X-100
In this study, Triton X-100 has been used for cyanobacterial α-DOX enzymes during cell lysis. The extraction yields (mg protein/L cell culture) for CalDOX and LepDOX were improved approximately 2.5-and 1.1-fold, respectively when 1% (v/v) of Triton was added during cell disruption. Interestingly, both purified enzymes from the cells treated with Triton resulted in a stronger (~ 1.6-fold) and blue-shifted Soret peak at the same protein concentration (Table S1). With a higher and shifted Soret peak, the enzymes showed a 1.2-fold higher oxygenase activity (Table S1). This may be explained by the fact that the detergent can behave as a membrane-like environment, maintaining the conformation of membrane proteins and its heme environment similar to the native state (Noordermeer et al. 2001).

Substrate spectra of CalDOX and LepDOX
For the investigation of the substrate spectra, CalDOX was pre-examined for saturated fatty acids (SFAs) ranging from caproic acid (C6:0) to stearic acid (C18:0) in the absence or presence of 1% (v/v) Triton X-100 (Supplemental Fig. S4). Detergents have been often used to increase the solubility of hydrophobic compounds (Hammer et al. 2020;Kaehne et al. 2011) and are helpful to offset the solubility effect of substrates with different hydrophobicities. With Triton X-100, the oxygenase activity for all the tested substrates substantially increased, and this effect became more prominent with substrates with higher carbon-chain length. Besides the activity, substrate preference of the enzyme was changed with the supplementation of the detergent as described below.
Next, the substrate spectra of CalDOX and LepDOX were evaluated in the presence of Triton X-100, further including the valuable unsaturated fatty acids (USFAs), such as palmitoleic acid (C16:1 (9Z)), elaidic acid (C18:1 (9E)), linoleic acid (C18:2 (9Z, 12Z)), and α-linolenic acid (C18:3 (9Z, 12Z, 15Z)) (Fig. 4). Relative activities (RA; %) for each substrate represent the activities relative to the maximal values, set to 100%, which correspond to the values with C12:0 for both enzymes. The behavior of the two α-DOXs shown for SFAs was generally similar in that both enzymes exhibited the maximal and near-maximal activities for lauric acid (C12:0), myristic acid (C14:0), and palmitic acid (C16:0) (> 80% of RA). For shorter-chain SFAs, only CalDOX showed a good preference for C10:0 (100% of RA) and observable activities for C6:0 and C8:0 (7.4 and 18.8% of RA, respectively), whereas LepDOX showed a moderate activity for C10:0 (40.6% of RA) and no detectable activities for C6:0 and C8:0. This implies that CalDOX better accepts shorter substrates and has a broader substrate scope. When examined for USFAs, the enzyme activities were lower than those of SFAs with the same chain length (i.e., C16:0 and C18:0). One plausible reason is that USFAs are less solubilized by Triton X-100 than SFAs (Ahyayauch et al. 2006). Furthermore, there was no big difference in the activity among the USFAs. Whereas the introduction of a double bond reduced activities, the number or configuration of double bonds in USFAs may not be crucial for the activity of CalDOX and LepDOX.

Effects of temperature, pH, and metal ions
The temperature optimum was observed by measuring fatty aldehydes produced from enzymatic reactions based on GC-FID analysis. Since Triton X-100 may interfere with the extraction of products, it was not included in these enzymatic reactions. In this study, the temperature optimum was investigated only for CalDOX since the activity of LepDOX was too low to obtain reliable GC peaks for quantification in the absence of detergents (Supplemental Fig. S5b). The maximum activity of CalDOX was found at 30 and 35 °C with no significant difference (p > 0.05) (Fig. 5a). The enzyme activity sharply decreased at temperatures lower than 30 and above 40 °C, showing less than 60% of RA. The original organism of CalDOX, C. parietina, was isolated from lake water, and the genus Calothrix is known to grow in the range of 25 and 40 °C (Issa 1999), which is in accordance with our results.
The effect of pH was studied at the pH range of 4-10 ( Fig. 5b). Four buffering systems (0.1 M) were used: sodium acetate, sodium phosphate, Tris-HCl, and glycine-NaOH. Both enzymes showed a broad plateau of maximum and near-maximum activity (> 80% of RA) under slightly acidic to strongly alkaline conditions (i.e., pH 6 to 9). Little activity was retained at very acidic conditions, such as pH 4 and 5. The activities of CalDOX and LepDOX were dramatically lower in sodium phosphate buffers at pH 6 or 7 compared to the same pHs in acetate or Tris buffers. One possible reason might be that a buffer component (i.e., phosphate) acted as the inhibitor for the enzyme. Although also a universal buffer system using Davies buffer (Davies 1959) was carried out to avoid the effect of an individual buffer component, activity was shown only at pH 7, which was, however, tenfold lower than the activity at pH 7 with Tris-HCl (Supplemental Fig. S6). Furthermore, PIPES buffer (pH 6 and 7) also strongly inhibited the CalDOX activity (data not shown). Previous literature about AtDOX also revealed that its activity was sensitive to the buffering components including PIPES, HEPES, and MOPS (Liu et al. 2006). Therefore, Tris buffer at pH 7 was used for the routine in vitro enzyme assays in this study.
Previously, metal ions were reported as inhibitors for plant α-DOXs. The oxygenase activities of CalDOX and LepDOX were also studied in the presence of 1-mM of various metal ions during fatty acid conversion ( Table 1). All of the tested metal ions except for Mg 2+ significantly reduced the relative activities of both enzymes.

Whole-cell biotransformations for the synthesis of fatty aldehydes
In this study, the catalytic potential of whole E. coli resting cells expressing CalDOX or LepDOX was evaluated by exogenous supplementation of two different fatty acids, capric and myristic acids in 0.2-M potassium phosphate buffer with 50-mM NaCl (pH 7.4). Both enzymes were able to produce the expected C n-1 aldehydes, nonanal and tridecanal, respectively, as the major products by conversion of capric and myristic acids in E. coli resting cells (Fig. 6). In contrast, no corresponding product formation was observed with control E. coli cells containing an empty vector (data not shown). Meanwhile, neither C n-1 alcohols nor C n-1 FAs, produced by over-reduction or re-oxidation of fatty aldehydes by endogenous fatty aldehyde reductases or oxidases, respectively, were detected in this study although they were b Reactions were performed with 2-mM myristic acid with Triton X-100 (1%, v/v), initiated by purified enzymes (0.1 or 1 μg) at 25 °C under various pH values: 0.1-M sodium acetate (pH 4 to 6), 0.1-M sodium phosphate (pH 6 to 7), 0.1-M Tris-HCl (pH 7 to 9), and 0.1-M glycine-NaOH (pH 9 to 10). Activity was determined by the oxygen depletion assay at the initial time phase often produced by whole cells containing fatty aldehyde-producing α-DOX or CAR in previous studies (Foo et al. 2017;Maurer et al. 2019). The catalytic performance of CalDOX was found to be superior to that of LepDOX in the wholecell E. coli system ( Fig. 6 and Supplemental Fig. S7). For both substrates, CalDOX achieved full conversion in 20 or 40 min from 5-mM of capric or myristic acid, respectively. Both, conversion rate and yield for LepDOX-catalyzed reactions were much lower compared to CalDOX. The higher catalytic performance of CalDOX can be explained, for example, by the more intense red color of CalDOX-harboring cells, indicating a higher heme incorporation when expressed in E. coli (Fig. 2). The catalytic performance of LepDOX was particularly poor for the shorter capric acid showing only conversion of 4.2-8.6% (mol/mol). However, the enzyme displayed a substantially higher conversion for myristic acid (i.e., ~ 50%, mol/mol). These observations are in line with our in vitro substrate spectrum results, which revealed CalDOX had activities toward C10:0 and C14:0 with comparably high levels, whereas the acceptance of C10:0 by LepDOX was reduced (Fig. 4). In case of CalDOX, the amount of fatty aldehyde already synthesized decreased with the incubation time after achieving the full conversion (Supplemental Fig. S7). As no detectable peak was observed in chromatograms that could arise from fatty aldehyde degradation in the aqueous system or from cellular metabolism, evaporation of the fatty aldehydes might have contributed to the loss in mass.

Discussion
Although most α-DOXs studied so far are from plants (Kaehne et al. 2011;Koeduka et al. 2005;Liu et al. 2006), the existence of α-DOX enzymes in non-plant organisms should not be excluded considering the recent discovery of a cyanobacterial α-DOX (Hammer et al. 2020). However, the complex phylogeny across taxa had not been examined Table 1 Effect of metal ions a Control represents the enzyme reaction in the absence of metal ions b Enzyme activities were determined by the oxygen depletion assay with 2-mM myristic acid in a buffer containing 1% (v/v) Triton X-100 by incubating with purified CalDOX or LepDOX (0.1 to 1 μg) at 25 °C and pH 7 (0.1 M Tris-HCl) in the presence of 1-mM of various metal ions 76.0 ± 6.3 67.7 ± 3.3 After incubating for the indicated times, the biotransformations were stopped by the addition of 2-M HCl (30 μL), and the samples were extracted three times with an equal volume of ethyl acetate yet. Our phylogenetic analysis revealed that α-DOXs are widely distributed among the various organisms other than plants. Such an unexpected diversity suggests the possible existence of the equivalent α-oxidation mechanism in nonplant organisms to that in plants, providing a novel hint for the occurrence of α-oxidation systems in other taxa. This is meaningful since identical fatty acid metabolisms have not been reported in non-plant taxa to date. Furthermore, the distribution of α-DOX in primitive bacteria as well as complex plants indicates an evolutionary origin back to the period prior to the appearance of plants.
For plant α-DOXs, heme is the key constituent involved in their oxygenase activity. The recombinant CalDOX and LepDOX from cyanobacteria expressed in E. coli were also found to have heme-dependent oxygenase activity in this study. Our homology model analysis revealed that the key residues involved in the oxygenase activity of plant α-DOXs are also obviously conserved in CalDOX and LepDOX (Fig. 7a, b and Supplemental Fig. S8), supporting this phenomenon. To be specific, in the α-DOX-catalyzed reactions, the radical form of tyrosine (Y379 of OsaDOX and Y386 of AthDOX), which is generated by the activated heme moiety, is responsible for the oxygen incorporation to the α-carbon of the fatty acid (Mukherjee et al. 2011). This catalytic tyrosine is stabilized by π-π stacking interactions by phenylalanine residues (F375 and F552 for OsaDOX and F382 and F559 for AthDOX) (Supplemental Fig. S8) (Goulah et al. 2013;Zhu et al. 2013). In OsaDOX, the iron atom in the heme molecule has a distance from the catalytic tyrosine residue of ~ 13.0 Å, and the H157 and H382 residues as the distal and proximal ligands are coordinated with the iron atom (Supplemental Fig. S8a). The catalytic tyrosine and two phenylalanine residues in our model structures of Cal-DOX and LepDOX were revealed sequentially and positionally equivalent to plant DOXs ( Fig. 7 and Supplemental S8). Furthermore, the two histidine residues (H131 and H361 for CalDOX, and H132 and H364 for LepDOX) were also at structurally identical positions of OsaDOX with respect to the heme ligands ( Fig. 7a and b).  (H131 and H361 in CalDOX and H132 and H364 in LepDOX), catalytic residues (Y358 in CalDOX and Y361 in Lep-DOX), and some fatty acid-interacting sites residing close to the catalytic residue (H290,T295,R529,F354,and F522 in CalDOX and H290,T295,R528,F521,and F357 in LepDOX) are shown as sticks.
The surface representation of c CalDOX and d LepDOX homology models reveal 13 potential substrate-binding recognition sites in the binding cleft (V18, R22, H290, W294, T295, T304, A310, F519, F522,F354,I523,A526,and R529 in CalDOX and L18,R22,H290,W294,T295,T304,V310,F357,F518,F521,I523,A525,and R528 in LepDOX) of the enzymes. e The model structure of CalDOX in complex with stearic acid. The catalytic tyrosine is shown in violet Native plant α-DOXs are considered to be monotopic membrane proteins (Goulah et al. 2013), and the recombinant α-DOXs expressed in E. coli are often treated with several kinds of detergents in cell lysis or purification (Hammer et al. 2020;Liu et al. 2006). In our study with cyanobacterial α-DOXs, the treatment of Triton X-100 during cell lysis was effective for enhancing the extraction yield and purified enzyme's heme content and activity, indicating recombinant cyanobacterial α-DOXs also possibly interact with the lipid membranes of E. coli. N-terminal amphipathic α-helices of plant α-DOXs are known to be associated with membrane lipid droplets in plants (Bannenberg et al. 2009;Goulah et al. 2013). These α-helices play a role in the connection of the amphipathic membrane and catalytic site residing in the aqueous cellular compartment. Localization of α-DOXs close to fatty substrates can make it easier to extract lipid molecules, which are then guided through hydrophobic channels to the active site. In spite of lacking information on cyanobacterial fatty acid α-oxidation systems, it should be mentioned that cyanobacterial cells also contain lipid droplets (Lundquist et al. 2020), which suggests an idea for the possible localization of cyanobacterial α-DOXs in a native environment.
The substrate scope of most plant α-DOXs has been only investigated with fatty acids longer than 14 carbon atoms, and the comparative information on shorter substrates is lacking (Koszelak-Rosenblum et al. 2008;Liu et al. 2006). The previously known α-DOX from cyanobacteria, CsDOX, was revealed to have a different substrate spectrum from rice OsaDOX, in which CsDOX showed a better performance on shorter-chain substrates (< C16) than OsaDOX (Hammer et al. 2020). Particularly, CsDOX preferred C12:0 and C14:0, and then, activity was gradually reduced with a decreasing carbon length of substrate, which was similar to the behavior of LepDOX. In contrast, CalDOX even accepted C10:0 as the best substrate in addition to C12:0 and C14:0. Our whole-cell analysis also proved the superior capability of CalDOX for C10:0. Collectively, cyanobacterial α-DOXs were found to have a better acceptance of shorter substrates and a broader substrate scope than plant α-DOXs. Keeping in mind that fatty aldehydes with chain length from 8 to 13 are often used as flavor and fragrance compounds, such properties of cyanobacterial α-DOXs are advantageous when applied for industrial applications. In the complex structure of OsaDOX with palmitic acid (PA), 13 residues are positioned in the fatty acid binding channel, which stretches ~ 20 Å from the substrate entrance to the catalytic tyrosine. These were revealed as the substrate recognition sites (L53, R57, H311, W315, T316, M325, A331, F379, F549, F552, I553, A556, and R559) ). Among them, H311 and R559 are close to the catalytic tyrosine, and they have ionic interactions with the carboxylate group of fatty acid, and T316 plays a role in α-carbon positioning (Liu et al. 2006). These three residues are conserved in CalDOX and LepDOX ( Fig. 7 and Supplemental S8). The model structures of CalDOX and LepDOX have a similar depth of substrate binding channel compared to plant α-DOXs (~ 20 Å) (Fig. 7e). However, the hydrophobic residues aligning in the substrate binding channel were not fully conserved among the analyzed α-DOXs (Supplemental Fig. S8c). For example, M325 of OsaDOX interacting with the C8 atom of PA was replaced by T304 in CalDOX and LepDOX. L53 of OsaDOX interacting with C12 atom of PA was replaced by V18 in CalDOX. Accordingly, the width and shape of the entrance site of the substrate binding channel were different in the structure of the CalDOX and LepDOX models ( Fig. 7c and d). Such a variation in some substrateinteracting residues may well explain the different substrate spectrum of α-DOXs (Hammer et al. 2020;Koszelak-Rosenblum et al. 2008;Liu et al. 2006). The docking model of Cal-DOX with stearic acid reveals a substrate binding channel of CalDOX with a depth of ~ 20 Å, which is able to accept the SFA with 18 carbons at most (Fig. 7e). This implies the region beyond C18 in aliphatic fatty acids might not have a direct effect on substrate binding and selectivity of α-DOXs.
It should be noted that biochemical factors such as temperature, pH, and metal ions investigated here highly affect the nature of the fatty acids as well as of the enzyme. For example, fatty acid solubility and structure can be influenced by the pH condition. Under alkaline conditions, a fatty acid is more solubilized and forms micelle structures. In contrast, fatty acids are less solubilized and form crystal structures under acidic condition. Accordingly, the pH profiles obtained here not only attribute to the effect of pH on the biochemical properties of enzyme, but also to those of the substrates. Moreover, metal ions can form an insoluble complex with fatty acids via their carboxyl group (Chen et al. 2011), reducing the amount of available substrate for enzymes. In this context, the lower activities of α-DOXs shown by metal ions in this study may not be solely the consequence of the direct effect of metals on the enzyme itself. Therefore, it would be also of interest in future studies to evaluate the direct effect of metal ions on the enzyme's properties without fatty acid substrate.
While the activities of purified enzymes were highly inhibited at high concentrations (0.1 M) of phosphate buffer at pH 7, whole-cell catalysis was done using 0.2-M potassium phosphate (pH 7.4) as the buffering agent in this study. It is well-known phenomenon that in vitro and in vivo systems differently affect the enzyme's behaviors (de Carvalho 2017;Kratzer et al. 2011Kratzer et al. , 2015. In in vivo systems, intracellular enzymes are protected from the external environment by the cellular membrane and strategies to keep the intracellular environment homeostatic, whereas isolated free enzymes are susceptible to inactivation (Baker-Austin and Dopson 2007; de Carvalho 2017). Accordingly, there are frequent cases that the behavior of enzymes present within the cell is not directly affected by the extracellular condition (e.g., intracellular enzymes from extremophiles like halophiles and acidophiles) (de Carvalho 2017;Oren 2008). Similarly, inorganic phosphate is the essential compound for life including E. coli as the component of lipid membrane, building block of DNA or RNA, and energy source in the form of ATP. Accordingly, intracellular phosphate concentration is maintained at optimal levels in the range of 1 and 10-mM (that is, homeostasis) for E. coli by sophisticated uptake by transporters and control of metabolic reactions (Shulman et al. 1979;Xavier et al. 1995). This may well explain why CalDOX and LepDOX performed well in the cellular environment with phosphate buffer. There are several advantages when applying α-DOX in whole-cell biotransformations. As a potentially membrane-associated protein, α-DOXs may exhibit their catalytic activities better in a living cell with an intact membrane. Additionally, unlike CARs, demanding expensive cofactors, such as NADPH and ATP, and an additional of enzymes for posttranslational modification (i.e., phosphopantetheinyl transferases) (Akhtar et al. 2013;Wu et al. 2021), only aeration is sufficient to supply of oxygen when utilizing α-DOX in a whole-cell system. Although both enzymes produced the expected C n-1 aldehydes as the major products, C n-1 alcohols or C n-1 fatty acids, the possible side products or new substrates for α-DOX, respectively, were not detected in this study. The aldehyde-reducing capability in whole cells is dependent on the cell physiology associated with the intracellular redox state (Maurer et al. 2019). In a previous report, compared to CAR-expressing cells with a higher NADH/NAD + ratio reflecting more reductive environment and reduced growth rate, α-DOX-expressing cells had a significantly lower amount of alcohol side product (Maurer et al. 2019). This may be the reason for the lack of observance of fatty alcohols in this study. The limited substrate specificity of fatty aldehyde dehydrogenases can be another reason. No detection of fatty acids with reduced carbon number could be due to their low yields as shown in the previous work in which the re-oxidized fatty acids accounted for only around 5% of the total products (Maurer et al. 2019).
It has been revealed throughout this study that heme-occupancy was the critical determinant for the catalytic performance of CalDOX and LepDOX both in vitro and in vivo. The different heme-incorporating capacity is probably because α-DOXs have different heme-binding environment. In Osa-DOX, the heme molecule interacting with the H157 and H382 residues is further stabilized mainly by hydrophobic interaction from 23 amino acids (A149, I152, Q153, V156, H157, M160, D161, H162, N260, W262, T376, Y379, R380, M381, H382, I416, F453, L456, I470, L472, L475, R479, and R483) within 4 Å (Supplemental Fig. S8c). Model structures and sequence alignments show that these residues in the vicinity of the heme molecule are partially conserved among α-DOXs (Supplemental Fig. S8c), suggesting that α-DOXs may have different heme-binding affinities or selectivities (Liu et al. 2004). Another possible causal for incomplete incorporation of heme into CalDOX and LepDOX could be because the heme synthesized in E. coli is unnatural for cyanobacterial α-DOXs (Liu et al. 2004), which, therefore, may be enhanced by expressing the enzymes in different hosts.
In conclusion, we show that cyanobacterial α-DOXs are catalytically equivalent to plant α-DOXs but exhibit different substrate scope. Our in-depth molecular study of α-DOXs from non-plant organisms not only provides us with deeper understanding of this enzyme class but also facilitates the biosynthesis of fatty aldehyde-based aroma compounds in industry. To increase the industrial applicability of cyanobacterial α-DOXs, future studies need to be directed toward investigating their whole-cell catalytic performances at significantly higher substrate concentration than 5-mM.