Introduction

Photoreceptors detect light signals, a critical environmental stimulus for living organisms, and convert them into biological signals1,2. This enables the organisms to adapt to and respond to changes in their light environment2. The LOV superfamily acts as photosensory units in various enzymes and signal transduction proteins with flavin mononucleotide (FMN) as their chromophores1,3,4,5. The core structure of LOV domain consists of five central β-sheets (Aβ, Bβ, Gβ, Hβ, and Iβ) and four α-helices (Cα, Dα, Eα, and Fα), which are highly conserved throughout blue-light receptors1,6 (Fig. 1). Upon irradiation, LOV domains utilize flavin mononucleotide (FMN) chromophores to sense blue light, converting such stimuli into structural signals through conformational changes in the protein moiety1,6,7,8.

Fig. 1: Structure and photocycle of AsLOV2.
figure 1

a Domain architecture of Avena sativa phototropin 1 (AsPhot1). AsPhot1 consists of three functional domains, which are organized into LOV1 (yellow), LOV2 (yellow), and STK (green). LOV2 is flanked by A’α (red) and Jα (blue) helices. b The three-dimensional structure of AsPhot1–LOV2 (AsLOV2, A’α + LOV2 domain + Jα) and the light-induced photoadduct (Cys–FMN) formation of FMN in AsLOV2. In this experiment, we used an AsLOV2 construct where the A’α (red) and Jα (blue) helices are coupled to the LOV2 domain (yellow). Upon blue-light irradiation, the photoreaction is initiated by the formation of a photoadduct (orange square dashed box) between the FMN chromophore and a cysteine residue. c Multiple sequence alignment of the LOV domains: AsPhot1–LOV2 (AsLOV2), Arabidopsis thaliana phototropin 1 LOV2 (AtPhot1–LOV2, AtLOV2), Phaeodactylum tricornutum Aureochrome 1a LOV (PtAureo1a–LOV), Erythrobacter litoralis 222 (EL222), and Neurospora crassa VVD (NcVVD). The sequences of each species’ LOV domain corresponding to A’α (red), Jα (blue) helices and the LOV core domain (yellow) are highlighted in colored boxes.

Various studies have been conducted on the photoresponse of these LOV domains1,9. These studies have proposed that, upon activation by blue light, a photoadduct is formed through a covalent bond between FMN and cysteine located within the Eα of these domains, followed by subsequent structural changes in the protein moiety of LOV domains10,11,12,13. Especially, diverse photoreceptors that possess LOV domains as light-sensing regions undergo dimerization during their light-induced signal transduction14,15,16,17,18,19,20,21,22. These structural changes play a crucial role in modulating downstream biological processes.

Although spectroscopy studies have suggested that various LOV domains may form transient dimers12,13,14,16,17,18,19,20,21,22, the dimerization observed in these studies was dependent on concentration, even in the absence of light14,19,23,24,25,26,27,28. Moreover, while ultrafast studies effectively probe the electronic structure of the chromophore, they do not directly detect global structural changes of the proteins involved in photo-induced dimerization. Notably, as summarized in Supplementary Fig. 1, several structural studies on the proteins have reported the dimer conformations of LOV domains, suggesting that the proteins can form dimer conformations with various dimeric interactions24,27,28,29,30,31,32,33. However, they do not offer detailed structural insights into the photo-induced dimerization. These limitations have made it challenging to obtain specific structural information about photo-induced dimerization, leaving the structural characteristics of LOV domain dimers and their dimerization process elusive.

In this work, to overcome these challenges, we apply time-resolved X-ray liquidography (TRXL) to capture the photoinduced structural dynamics in AsLOV2, which does not show concentration dependence on dimerization in the dark state (Fig. 1 and Supplementary Fig. 2a, b). AsLOV2 (residue 413-520) belongs to the LOV2 domain of Avena sativa phototropin 1 (AsPhot1), sharing the same structural framework as canonical LOV domains34. We used a construct that incorporates the A’α helix (residue 404-412) connecting to the LOV1 domain on one side and the Jα helix (residue 521-546), which links to the serine/threonine kinase (STK) domain on the other side of AsLOV234. Due to the presence of the A’α and Jα helices, this construct maintains a monomeric state in the dark, making it a suitable model system for observing transient dimerization upon irradiation. In this work, we refer to this construct including the A’α and Jα helices (residue 404-546) as AsLOV2. TRXL, also known as time-resolved X-ray solution scattering (TRXSS)15,35,36,37,38,39,40,41,42,43,44,45,46,47, is one of the sensitive experimental methods for detecting global conformational changes in solution induced by a trigger such as optical excitation and temperature jump. When TRXL is applied to macromolecules, it is also referred to as TR small-angle/wide-angle X-ray scattering (TR-SAXS/TR-WAXS). Kinetic analysis of the TRXL data from both wild type (WT) and I427V mutant48, the latter of which exhibits a relatively faster dark recovery compared to WT, identified three distinct intermediates connected by two time constants on the microseconds and milliseconds timescales in the light-induced structural transition of LOV2. Additionally, structural analysis aided by molecular dynamics (MD) simulations revealed that LOV2 undergoes extensive conformational changes related to the changes in oligomeric state (dimerization) through two monomeric intermediate states. Regarding the dimerization, we demonstrated that the formation of the dimeric interface for AsLOV2 is associated with the inter-monomeric interaction between the β-scaffolds, as evidenced by size exclusion chromatography (SEC) experiments on three AsLOV2 constructs: a construct with the Jα helix deleted (ΔJα), a construct with the A’α helix deleted (ΔA’α), and a construct with both Jα and A’α helices deleted (ΔJα/ΔA’α). Moreover, we observed that the SEC profiles of K413E and E475K mutants, designed to disrupt the electrostatic interaction between AsLOV2 monomers, exhibit a decreased dimer fraction compared to that of the WT. This proposes that the PPIs, including not only the interaction between the β-scaffolds but also a local electrostatic interaction, contribute to the formation of the dimeric interface. The findings of our study provide direct insights into the structural mechanism of photo-induced dimerization in AsLOV2, as well as a structural template for understanding the process of protein-protein association driven by PPIs49,50.

Results

TRXL data and kinetic analysis

To investigate the light-induced structural changes of AsLOV2, we measured TRXL data from microseconds (μs) to hundreds of milliseconds (ms) using the laser pump-X-ray probe scheme36,37,38,39 (details in Methods). TRXL experiments were conducted for both WT and I427V mutant. The I427V mutant48, which exhibits a faster photocycle than WT, was deliberately chosen to efficiently collect TRXL data with a higher signal-to-noise ratio (SNR), as the slow recovery of WT required the lowering of the repetition rate for measuring data at long time delays. The TRXL data were collected from 5.62 μs to 316 ms for WT and from 5.62 μs to 100 ms for the I427V mutant (Supplementary Fig. 3). The difference scattering curves, ΔS(q, t), of the I427V mutant, show a positive peak with marginal scattering intensities in the small-angle region (q < 0.2 Å−1) on the timescale from several μs to tens of μs (Fig. 2a). Within the timescale from tens of μs to hundreds of ms, the difference curves show the formation and increase of a significant negative peak in the small-angle region (Fig. 2a). During this relatively late timescale, the oscillatory features are also observed in the wide-angle region (0.2 < q < 0.6 Å−1) (Fig. 2a). To extract kinetic and structural information from the difference scattering curve, we conducted a global kinetic analysis using singular value decomposition (SVD) and kinetics-constrained analysis (KCA) methods for ΔS(q, t) in the q range of 0.03 to 1.0 Å−1 by following a well-established data analysis protocol (details in Methods)39. In SVD, the experimental data set of I427V was decomposed into time-independent q-spectra (left singular vectors, LSVs), the weight of the singular vector (singular values, S), and time-dependent amplitude changes of the RSVs (right singular vectors, RSVs).

Fig. 2: Global kinetic analysis of the TRXL data of AsLOV2 I427V.
figure 2

a Experimental (black) and calculated (red) difference scattering curves. b The top panel represents the sequential kinetic model of AsLOV2 determined in this study. The lower panel shows the SADSs obtained through KCA analysis. SADS1 (black), SADS2 (red), and SADS3 (blue) correspond to I1, I2, and P (photoproduct) of the sequential kinetic model, respectively. The static difference curve (qΔSstatic, light state – dark state, gray), normalized to SADS3, is also shown. c Time-resolved changes in the population of transient species (I1 (black), I2 (red), and P (blue)). Since the protein exhibits dimer conformation in the P state, the maximum population of P is 0.5. This value is half those of the other states (G, I1, and I2), which exhibit monomer conformations.

From SVD analysis, the three structurally different components were identified. The two different time constants of 130 ± 16 μs and 3.4 ± 0.7 ms were obtained by fitting the first three RSVs with the common time constants (Supplementary Figs. 4 and 5). In the subsequent kinetic analysis using KCA, we considered two kinetic models ((i) a sequential kinetic model and (ii) a parallel kinetic model) including three transient components and related two-time constants of 130 μs and 3.4 ms obtained from the SVD analysis, and extracted species-associated difference scattering curves (SADSs) containing direct structural information of the transient species (Fig. 2 and Supplementary Fig. 6). The theoretical time-resolved difference scattering curves for each kinetic model were generated by a linear combination of SADSs. The theoretical difference curves for the sequential kinetic model show good agreement with experimental data in both small- and wide-angle regions, while those for the parallel kinetic model show notable discrepancies from the experimental data in the small-angle region (Supplementary Fig. 6). This suggests that the sequential kinetic model is more suitable for describing the light-induced structural dynamics of AsLOV2 (Fig. 2). In the sequential kinetic model, the first intermediate (I1) is formed within 5.62 μs, which is the first time delay of our TRXL measurement. Subsequently, the second intermediate (I2), formed with a time constant of 130 μs is converted into the third intermediate (I3) with a time constant of 3.4 ms (details of SVD and KCA analysis are described in Methods).

Among the three SADSs extracted from the sequential kinetic model (Fig. 2b), the first SADS (SADS1), corresponding to I1, shows a positive peak in the small-angle region. The scattering intensity of SADS1 is marginal compared to the other two SADSs, indicating that the formation of I1 involves relatively minor structural changes compared to the formation of the other species. In contrast, the second SADS (SADS2), corresponding to I2, exhibits a negative peak in the small-angle region, and its intensity is more than five times larger than that of SADS1, indicating that the formation of I2 encompasses more substantial structural changes than that of I1. The third SADS (SADS3) exhibits a prominent negative peak in the small-angle region and has significantly larger intensity than SADS1 and SADS2, implying that the formation of I3 involves relatively larger structural changes than the formations of I1 and I2, and major structural changes occur in the I2 → I3 transition. The time-dependent population of SADSs shows that the population of I2 increases as that of I1 decreases with a time constant of 130 μs, followed by a decrease of I2 as the population of I3 simultaneously increases with a time constant of 3.4 ms (Fig. 2c).

Considering that I3 was the intermediate formed last by the light activation in the sequential kinetic model, I3 was predicted to correspond to the photoproduct (P). To validate this, we conducted static X-ray scattering (SAXS) experiments on both the ground (G) and light states of AsLOV2 (See Methods and Supplementary Table 2 for details). A noticeable difference in the SAXS patterns emerged at the small-angle region before and after irradiation of the LED light (Supplementary Fig. 7). Notably, the static difference curve (qΔSstatic, light state – G state) exhibits good agreement with SADS3, indicating that I3 observed in the TRXL measurement corresponds to P (Fig. 2b).

We collected the TRXL data on WT as well, which has a slower photocycle than I427V48. The small-angle region (q < 0.2 Å−1) of the difference scattering curves shows a rise starting from 100 μs, with its magnitude more amplified after 17.8 ms. The wide-angle region (0.2 < q < 0.6 Å−1) also shows a noticeable difference from 316 μs onwards (Supplementary Figs. 3 and 8). The difference scattering curves of WT are similar to those of I427V, except that the changes of WT occur on a slower timescale than I427V (Supplementary Fig. 3). We performed global kinetic analysis using SVD and KCA for the WT, following the same approach used for the I427V data, to investigate the kinetics and structural changes (Supplementary Fig. 8). From the SVD analysis, a sequential kinetic model, similar to that of I427V, with three transient species and two-time constants (682 ± 118 μs and 10.6 ± 2.89 ms) was applied to WT. The kinetic model of WT exhibits relatively slower kinetics compared to that of I427V (Supplementary Figs. 8b–e). Various kinetic models consisting of three significant transient species and two-time constants were considered, and among them, the sequential kinetic model in which the three species are connected in series with two-time constants fits well with the experimental data. Therefore, we can deduce that WT has the reaction pathway of I1 → I2 → P, as in I427V (Supplementary Fig. 8). Furthermore, the features and scattering amplitudes of the WT’s SADSs obtained through KCA analysis are identical to those of I427V’s SADSs within the experimental signal-to-noise ratio (SNR), indicating that the nature of structural changes involved in the light-induced transition of both constructs is the same (Supplementary Fig. 8b). In addition, there is no significant difference in the circular dichroism (CD) spectra between I427V and WT in the G state (Supplementary Fig. 2c), which indicates I427V mutation does not induce noticeable perturbation in the secondary structure of AsLOV2 in the G state. The results from global kinetic analysis and CD data demonstrate that although the I427V mutation accelerates the photocycle of AsLOV248, the mutation does not significantly affect the structural changes associated with the photocycle of AsLOV2. These findings suggest that the light-induced transitions of both WT and I427V involve nearly identical structural changes. For this reason, we conducted structural analysis only on the I427V data, which has a relatively higher SNR compared to that of WT, and all the results of the TRXL-based structural analysis discussed in this study were derived from the I427V data.

MD simulation-aided structural analysis of TRXL data

Previous studies on AsLOV2 have suggested that the light-induced transition of this protein can be mediated by structural changes in the Jα helix11,34,51,52,53,54,55. Considering this structural information, two frames were taken into account for AsLOV2, classifying its structure based on the structural characteristics of its Jα helix11,34,51,52,53,54,55: (i) a structure with a folded Jα helix, and (ii) a structure with an unfolded Jα helix. To extract the structural information from the three SADSs (SADS1, SADS2, and SADS3) obtained from the experimental data, the structural frames of AsLOV2 were simulated to generate various candidate structures as follows. For case (ii), to verify whether there is a structural preference due to the unfolding of the Jα helix, we performed non-equilibrium MD (NEMD) simulations for all physically allowed directions (±x, ±y, and +z) through which the Jα helix could unfold (Supplementary Fig. 9). Through this process, we generated candidate structures with the Jα helix unfolded in various axial directions (details in Methods). In both simulations (i) and (ii), the crystal structure of G (PDB ID: 2V1A)34 was used as the initial protein conformation. The MD-sampled structures obtained from these simulations were used to calculate theoretical X-ray static scattering curves corresponding to individual protein conformations. The theoretical difference scattering signal was generated by subtracting the static scattering curve of G from each theoretical static scattering signal. The theoretical scattering curve of G was selected from various theoretical curves obtained through MD simulations, based on the lowest χ2 value compared to the G state’s SAXS curve. Subsequently, by fitting to minimize the χ2 values between theoretical and experimental difference curves, we selected the 10 best structures that best matched SADS1 (or SADS2 or SADS3) (Supplementary Fig. 10). For each light-induced state (I1, I2, or P), the optimal structure that yielded the lowest χ2 value for the experimental data (SADS1, SADS2 or SADS3) among the 10 best structures was used to illustrate the representative structural characteristics of the protein in each state. (details in Methods and Supplementary Fig. 10).

The best-fitted curves for SADS1 and SADS2 showed excellent agreement with the corresponding SADSs (Fig. 3a, b). A comparison between the optimal structure of I1 and the G structure showed local structural differences between the two states (Fig. 3c). Only minor structural differences primarily in the Iβ and Jα were observed in I1, as further indicated by the 2D difference distance map calculated between I1 and G (Fig. 3c, e). On the other hand, the structural analysis of SADS2 revealed that the optimal structure of I2, and the 2D difference distance map between I2 and G represent a fully unfolded Jα helix (Fig. 3d, f). Moreover, the structural analysis suggests that the theoretical difference curves better describe the features of SADS2 when the Jα helix unfolds in the +y and +z directions, compared to other directions (±x and –y) (Supplementary Fig. 9). In addition, relatively minor structural differences around the A’α helix for these two states were also observed in the 2D difference distance map (Fig. 3f). To further confirm that the formation of I2 is primarily mediated by the unfolding of the A’α and Jα helices, we conducted an additional control analysis. AsLOV2 has a core structure comprising four α-helices, along with the A’α and Jα helices (Fig. 1b). Notably, the Fα helix is relatively similar in size to the Jα helix among these core helices. During the control analysis, we investigated the impact of the Fα helix unfolding on the formation I2, with details described in Supplementary Note 1. The theoretical difference curves from the unfolded Fα helix structures failed to describe SADS2, indicating that the Fα helix unfolding does not significantly contribute to the formation of I2 (Supplementary Note 1 and Supplementary Fig. 11). This supports that the unfolding of the A’α and Jα helices is the primary structural change driving the formation of I2.

Fig. 3: Structural analysis of SADS1 (corresponding to I1) and SADS2 (corresponding to I2).
figure 3

a Comparison of SADS1 (black) and representative theoretical difference curves for I1 (orange). b Comparison of SADS2 (black) and representative theoretical difference curves for I2 (green). c Optimal structure of I1 (orange) that best describes SADS1. d Optimal structures of I2 (green) that best describe SADS2. In (c) and (d), the G state structure (gray) is transparently displayed for comparison. e, f Difference distance maps of (e) I1 and (f) I2 with respect to the G state. For each state, the distance map was plotted by calculating the difference of the corresponding Cα-Cα distances between the optimal structures and the G state structure. The regions where prominent distance changes are observed in the difference distance map are marked with square dashed boxes: orange in (e) and green in (f).

Conversely, the structural analysis of SADS3 could not yield satisfactory fits to SADS3. To obtain a clue on the structure of P, we reconstructed molecular shapes based on static SAXS patterns of G and P (See Supplementary Methods and Supplementary Fig. 7b for details). A comparison of these two shapes indicates that a substantial global structural change is involved in the transition from G to P, as evidenced by the observation that the shape of G shows a radius of gyration (Rg) value of ~14.75 Å, whereas that of P shows an Rg value of ~22.72 Å. Notably, the Rg value of P is significantly larger than those of the optimal structures of I1 and I2 obtained from TRXL-based structural analysis (Supplementary Fig. 7c). Considering the prior structural studies of proteins sharing the LOV domain that have suggested the proteins may form a dimer in the light state in the light state14,19,23,24,25,26,27,28, the significantly larger Rg of P observed in our static SAXS and TRXL experiments compared to the three states observed (G, I1, and I2) indicates that AsLOV2 can adopt a dimeric conformation when activated by light. Subsequently, we investigated whether the optimal structures of G, I1, I2, and P could accurately describe the static scattering curve of the light state. The linear combination of the theoretical curves, calculated using the optimal structures, showed good agreement with the experimental static curve in the small-angle region (See Supplementary Note 2 and Supplementary Fig. 12 for details).

In addition to structural analysis, the SEC results also support dimerization conformation (See Supplementary Note 3, Fig. 4d, and Supplementary Fig. 13a for details). In the dark state, the SEC profile of WT only shows the monomer fraction, while in the light state, approximately 15% of the monomer fraction shifts to the elution volume corresponding to a dimer conformation. This shift confirms that the light-activated AsLOV2 undergoes a structural transition from a monomer to a dimer, consistent with the findings from the scattering experiments.

Fig. 4: Structural analysis of SADS3.
figure 4

a Schematic diagram showing six representative AsLOV2 dimer conformation candidates and the corresponding residuals between SADS3 and the best-fit scattering curve determined from the structural analysis. In the upper panel, the results of structural analysis for three hetero L/G dimers are shown, and in the lower panel, those for three homo L/L dimers are shown. In the schematic of the frames for the homo- and hetero- dimers, the light state LOV2 core domain (L) is depicted in bright yellow, and the non-activated LOV2 core domain (G) is depicted in dark yellow. Jα helix is shown in blue, and A’α is shown in red. The best-fit curve with the smallest residual and χ2 value is highlighted with the red box. The docking score (DS) and confidence score (CS) from molecular docking simulations, which assess the plausibility of dimer formation, are shown for each conformation. b Comparison of SADS3 (black), static difference scattering curves (qΔSstatic, light state − dark state, gray), and the best-fit curve of the optimal structure of P (red). c Comparison of low-resolution structures reconstructed from static SAXS and the optimal structure of P obtained from TRXL. d Representation of helices and residues associated with dimerization in representative protein conformations for P. Jα and A’α are shown in blue and red, and, K413 (yellow) and E475 (green) which form a salt bridge are represented as spheres. e Dimer fractions obtained from SEC for WT (black), C450A (light gray), K413E (orange), E475K (green), ΔJα (blue), ΔA’α (red), and ΔJα/ΔA’α (brown). The SEC results for each construct in the dark state are represented as triangles, while those in the light state are represented as circles.

For the structural analysis of SADS3, we generated candidate structures with various dimeric conformations using MD simulations (Fig. 4a and Supplementary Fig. 14). To achieve this, we classified the previously reported dimer structure of LOV2 into three types of dimer conformations based on their dimeric interface27,30,56: (a) Structures where the A’α helices of two monomers (A’α–A’α) interact to form a coiled-coil dimerization, (b) structures where the β-scaffolds interact with each other (β-scaffold–β-scaffold), and (c) structures where A’α–A’α helices and Jα–Jα helices of two monomers interact to form coiled-coil dimerization. For each type of dimer conformation, we considered the following two monomer-excitation configurations: (i) homo L/L dimer (light state monomer + light state monomer) (a, b, and c frames) which consists of two light-activated monomers, and (ii) hetero L/G dimer (light state monomer + G state monomer) (a’, b’, and c’ frames) which consists of one light-activated monomer and one non-activated G state monomer. Consequently, we generated candidate structures by considering a total of six frame conformations through MD simulations (details in Methods).

The MD-sampled structures with the dimer conformations were used to calculate theoretical static scattering curves, which were then utilized in the structural analysis of P based on the approach used for I1 and I2. The residuals between each of the six conformations and SADS3 show that the three homo L/L dimer configurations appear to better describe SADS3 than those with hetero L/G dimer configurations (Fig. 4a and Supplementary Fig. 14). The optimal model with the lowest residual, obtained for the b-frame structure, well matches the molecular shape of the P obtained from static SAXS, supporting the validity of the structural analysis against SADS3 (Fig. 4b). In the optimal structure of P, the Jα and A’α helices of each monomer subunit are unfolded, exposing the β-scaffolds, and the dimeric interface between the two subunits is formed on the exposed β-scaffolds (Fig. 4c).

To obtain detailed spatiotemporal aspects of the light-induced dimerization, we inspected the locations and trajectories of the intermediates with respect to the ground state across various parameter landscapes (Fig. 5). First, we examined the Rg value as a function of the maximum particle dimension (Dmax) based on the structural pool and optimal structures of each state (Fig. 5a). The trajectory in this parameter landscape (termed as L1) shows the gradual increases in the Rg and Dmax values of the protein during the transition from G to P via I1 and I2 (Fig. 5a). Specifically, subtle changes are observed in the G → I1 transition, while substantial changes occur in the I1 → I2 → P transitions, suggesting that the formation of I1 involves minor structural changes, whereas the formations of I2 and P are associated with more extensive structural changes. Furthermore, we considered the following three parameter landscapes (L2, L3, and L4): (L2) the distance between Ala523 and Ala543 (D1) as a function of distance between Phe403 and Glu412 (D2) (Fig. 5b), (L3) the solvent accessible surface area (SASA) among the β-scaffold, A’α and Jα helices (area 1 in Fig. 5d) as a function of root mean square deviation (RMSD) of the β-scaffold (Fig. 5c), and (L4) the SASA of area 1 as a function of the SASA among the β-scaffold and the remaining structural components of the protein, excluding the A’α, Jα helices, and β-scaffold (area 2) (Fig. 5d). Specifically, for P, from each dimer, two locations were extracted from two monomer structures constituting the dimeric conformation of P (monomer A or monomer B; as depicted in Fig. 6), to facilitate the comparison of the landscape distributions of P to those of G, I1, and I2. Phe403 and Glu412 are located at the N-terminal and C-terminal of the A’α helix, respectively, while Ala523 and Ala543 are located at the N-terminal and C-terminal of the Jα helix, respectively. Therefore, in L2, the changes in D1 and D2 reflect the structural alterations in the A’α and Jα helices, respectively. L2 shows that the G → I1 transition is characterized by an increase in D1, while the I1 → I2 transition demonstrates a decrease in D1 and an increase in D2 (Fig. 5b). This indicates that the formation of I1 involves the structural extension of the A’α helix and that of I2 includes both rearrangement of the A’α helix and unfolding of Jα helix. The optimal structure of I2 shows that the rearrangement of the A’α helix corresponds to the unfolding of the helix. Specifically, the optimal structure shows that the unfolded A’α helix wound itself, and this is observed in the decrease of D1 as well. Additionally, the I2 → P transition shows a slight increase in D1 and a broadening of the D2 distribution, suggesting that this transition involves rearrangements in both helices (Fig. 5a). In L3, during the G → I1 transition, a substantial increase in the RMSD of the β-scaffold is observed, accompanied by a slight increase in SASA of area 1, implying that the transition is predominantly characterized by a structural change within the β-scaffold (Fig. 5c). For the subsequent I1 → I2 → P transitions, L3 demonstrates a significant rise in SASA of area 1 but not in the RMSD of the scaffold. In contrast to the notable changes in area 1 observed in L3, only a subtle change of SASA of area 2 is observed in L4 (Fig. 5d). These features indicate that the transitions involve the exposure of the β-scaffold to the external environment in the direction of the two helices (area 1).

Fig. 5: Structural landscapes of the AsLOV2 photocycle from the TRXL study.
figure 5

a The landscape of Rg as a function of Dmax. b The landscape of distance between Ala523 and Ala543 (D1) as a function of distance between Phe403 and Glu412 (D2). Phe403 and Glu412 are located at the N- and C-terminals of the A’α helix, respectively, while Ala523 and Ala543 are located at the N- and C-terminals of the Jα helix, respectively. These distances track the light-induced structural changes of the A’α and Jα helices, respectively. c The landscape of solvent accessible surface area (SASA) among the A’α, Jα helices, and β-scaffold, (area 1 in the inset of Fig. 5d) as a function of root mean square deviation (RMSD) of the β-scaffold. The RMSD and SASA profiles unveil the β-scaffold’s structural alterations and exposure of the β-scaffold to the external environment in the direction of the two helices, respectively. d The landscape of SASA of area 1 as a function of the SASA among the β-scaffold and the remaining structural components of AsLOV2 excluding the A’α, Jα helices, and β-scaffold (area 2 in the inset). In the structure (inset), A’α, Jα helices, area 1 and area 2 are denoted by red, blue, yellow and purple, respectively. The SASA profile of area 2 demonstrates changes in the exposure of the β-scaffold to the external environment in the direction of the remaining structural components. For each case of I1 and I2, the parameters were determined from the 10 best structures obtained in the structural analysis. For P, from each dimer, two parameters were calculated from two monomer structures constituting the dimeric conformation (monomer A or monomer B in Fig. 6) based on the 10 best structures of P. The values were calculated from the optimal structure of the G (black dots), and these values were compared with those of I1, I2, and P. The parameters for I1, I2, monomer A, and monomer B are denoted by green, blue, red, and magenta dots, respectively. The parameters calculated from the entire structural pool used in the structural analysis are represented in the landscapes (gray dots).

Fig. 6: Structural dynamics of the AsLOV2 photocycle revealed by TRXL study.
figure 6

The photocycle of AsLOV2 includes the G state, three intermediates (I1, I2, and P), and related time constants (WT: 682 μs and 10.6 ms, and I427V: 130 μs and 3.4 ms), as determined from the kinetic analysis of the scattering data. For the TRXL measurements of WT and I427V samples, each sample (WT or I427V) was prepared at a concentration of 1.5 mM in 20 mM Tris pH 7.0, 200 mM NaCl buffer. The optimal structure of G was determined from the structural analysis on the static solution scattering data, while the optimal structures of three intermediates were extracted through the structural analysis using the TRXL data. The optimal structures (I1 and I2) indicate that the structural changes within the A’α and Jα helices allow the exposure of the β-scaffold to the external environment. Subsequently, AsLOV2 undergoes dimerization (P), utilizing the dimeric interface formed between their β-scaffolds. For each state, the optimal structure is depicted alongside the 9 best structures, with the optimal structure highlighted and the remaining structures shown semi-transparently. The A’α and Jα helices are marked in red and blue, respectively, and the remaining structures are marked in gray.

Discussion

The kinetic and structural analysis of the TRXL data show that the photoreaction of the AsLOV2 domain has the kinetic model shown in Fig. 6, where it transitions from the G state to intermediate states I1 and I2, eventually leading to the formation of the photoproduct P. In this kinetic model, I1, the first intermediate corresponding to SADS1, is generated from the G state upon light activation. After that, I1 is transitioned to I2, the second intermediate corresponding to SADS2, with a time constant of 682 μs for WT and 130 μs for I427V, and I2 is finally transformed into P with a time constant of 10.6 ms for WT and 3.4 ms for I427V (Fig. 6). We compared the time constants observed in our kinetic analysis with those determined in spectroscopic studies (Tables 1 and 2). Time-resolved spectroscopic studies reported that in the photoresponse of AsLOV2, FMN undergoes singlet-to-triplet conversion in the sub-nanosecond time regime, followed by the formation of cysteinyl–flavin (Cys–FMN) photoadduct of the protein in the time range of several μs to tens of μs10,52,53,54,55,57,58,59,60,61,62 (Table 1). These primary events occur in a time range shorter than the temporal resolution of our study, which indicates that the transition from G to I1 should involve the formation of the photoadduct. Regarding this, the absence of the SEC profile for a dimer conformation in the light state of the C450A mutant, in which the formation of the Cys–FMN photoadduct10,63 is inhibited, confirms that the formation of the photoadduct is necessary for the light-induced dimerization (See Supplementary Note 3, for details). For the G → I1 transition, we observe the increases in both D1 and the RMSD of the β-scaffold (Fig. 5b, c). Furthermore, the 2D distance difference map between the optimal structures of I1 and G state, shown in Fig. 3e, also reveals that I1 has two prominent structural differences in the Iβ and A’α helix compared to the G state. Considering the formation of the photoadduct, occurring within a time scale shorter than that of our TRXL measurement10,52,53,54,55,57,58,59,60,61,62, these results suggest that the formation of the photoadduct mediates the subsequent structural changes of the β-scaffold and A’α helix during the transition. Notably, the optimal structure of I1 and the increase of D1 indicate that the formation of I1 involves the structural extension of A’α helix64 (Fig. 3c, 5b). Meanwhile, a previous structural study reported that the light-activated LOV domain from Bacillus subtilis (BsYtvA) forms I1 with a time constant of ~ 2 μs47, which is similar to that observed for I1 formation of AsLOV2 in our study. Considering that BsYtvA is a type of LOV protein without the Jα helix, this similarity suggests that the Jα helix is not significantly involved in the formation of I1 during the light-induced transition of LOV proteins. This is consistent with our structural analysis, which shows that structural changes in the A’α helix and β scaffold are involved in the formation of I1.

Table 1 Comparison of time constants of AsLOV2 observed in spectroscopic studies and TRXL
Table 2 Comparison of time constants associated with LOV dimerization in various photoreceptor proteins

The transition from I1 to I2 occurs with a time constant of 682 μs for WT, consistent with previous spectroscopy results that suggested a transition on the timescale of tens to hundreds of μs52,53,54,55 (Table 1). The I1 → I2 transition accompanies the changes in D1 and D2, indicating that it involves the structural rearrangement of A’α and Jα helices. The optimal structure of I2, where the A’α and Jα helices are unfolded, shows that the transition accompanies the unfolding of the helices (Figs. 3b, d, f, 5b, c). Specifically, our structural analysis shows that structures with unfolding Jα helix along the +y and +z directions better describe SADS2 compared to those with unfolding along the other two directions ( ± x and –y), suggesting the existence of preferred directions of structural changes during the Jα helix unfolding in the transition (Supplementary Fig. 9). The unfolding of the Jα helix along the +y and +z directions increases the distance between the β-scaffold and the helix, leading to a greater separation between the Jα helix and the β-scaffold in the optimal structure of I2 compared to G and I1 (Supplementary Fig. 9). Considering that the SASA of the β-scaffold increases within area 1, not area 2, during the formation of I2 (Fig. 5c, d), these structural characteristics suggest that the β-scaffold in I2, which is shielded from the external environment by its interaction with the A’α and Jα helices in G and I1, is no longer protected. This is consistent with previous studies that suggested disruption of the interaction between the β-scaffold and the Jα helix upon unfolding11,51,65,66. Furthermore, for the interaction between the helices and β-scaffold, the SEC profiles from constructs with the Jα helix deleted (ΔJα), the A’α helix deleted (ΔA’α), and both Jα and A’α helices deleted (ΔJα/ΔA’α) show the elution volume for the dimer conformation even in the dark state (Supplementary Figs. 13d–f), consistent with previous studies from several LOV domains from other species with truncated A’α or Jα helices18,64,67. The docking simulations of the constructs demonstrate that the dimerization is more favorable in ΔJα/ΔA’α compared to those where only a single helix (ΔJα or ΔA’α) was removed (See Supplementary Note 3, Supplementary Methods, and Supplementary Table 1 for details). These results demonstrate that the intra-monomer interactions between the helices and β-scaffold hinder AsLOV2 dimerization, supporting that the disruption of this interaction, as identified in our structural analysis, must precede AsLOV2 dimerization.

Subsequently, I2 converts into P with a time constant of 10.6 ms for WT. The structural analysis of P shows that P has a dimer conformation, indicating that the I2 → P transition accompanies the association of two I2 monomers to form a dimer. In this regard, the low-resolution structure of P obtained from our static SAXS data shows good agreement with the structure of P obtained from the structural analysis of SADS3 (Fig. 4c), further confirming that AsLOV2 dimerization is associated with the I2 → P transition. In particular, in each monomer that constitutes the optimal structure, the Jα and A’α helices are unfolded, and the β-scaffolds form a dimeric interface (Fig. 4). Regarding this, in our structural analysis, candidates with the homo L/L dimer conformations exhibit a better fit to SADS3 when compared to those with hetero L/G dimer conformations. In the homo L/L dimer conformation, the unfolded A’α and Jα helices of both monomers in the light state allow the β-scaffolds to freely interact with the external environment (Fig. 4a). In the hetero L/G dimer conformation, the monomer in the light state permits β-scaffold interaction with the external environment as the A’α and Jα helices unfold, whereas the monomer in the G state maintains intramolecular interaction between the β-scaffold and the two helices, thereby shielding the β-scaffold from exposure to the external environment (Fig. 4a). These structural differences indicate that the formation of dimeric interaction between the β-scaffolds of two different AsLOV2 monomers is more favorable in the homo L/L dimer conformation compared to the hetero L/G dimer conformation. Considering that candidates with the homo L/L dimer conformation provide a better representation of SADS3 than those with the hetero L/G dimer conformation, these structural differences between homo- and heterodimers demonstrate the necessity of freely interacting β-scaffolds for the formation of transient dimers in the light-induced transition of AsLOV2. This key feature illustrates the critical role of interactions between the β-scaffold and the two helices in forming the dimer structure of P27. For the dimer conformations, docking simulations also indicate that the formation of dimers is more favorable in the homo conformations than in the hetero conformations (Fig. 4a and Supplementary Table 1), aligning with the results of our structural analysis (See Supplementary Note 3, for details). These structural characteristics suggest that the formation of I2 should involve the unfolding of the two helices, exposing the β-scaffold to the external environment, which can lead to AsLOV2 dimerization through the dimeric interaction formed between the β-scaffolds of two different monomers.

Additionally, the I2 → P transition displays the change in the D1 and D2 landscape (L2), implying that the unfolded A’α and Jα helices undergo further structural arrangements during the formation of P (Fig. 5a). Regarding this, the increases in the SASA of the β-scaffold within area 1 during the transition indicate that these arrangements lead to a more pronounced exposure of the β-scaffold to the external environment (Fig. 5c, d). Considering that the dimeric interface of P is formed between the β-scaffolds, these features suggest that the further arrangements of the unfolded helices facilitate intermolecular interactions between the β-scaffolds, thereby allowing for the more efficient formation of the dimeric interface in P.

For the light-induced dimerization, the structural landscapes also suggest that the light-induced dimerization of AsLOV2 involves a gradual increase in the structural flexibility of the protein. L2 and L3 reveal gradual increases in the fluctuations of D1, D2, and the RMSD of the β-scaffold, indicating increasing structural flexibility of the Jα and A’α helices, as well as the β-scaffold, as the reaction proceeds. Notably, the fluctuation in the RMSD of the β-scaffold was smaller than those of D1 and D2, suggesting that the flexibility of the helices increases significantly more than that of the β-scaffold. Regarding this, the comparison of the 10 best structures of P shows differences in the overall structure, with significant structural variations in the Jα and A’α helices (Supplementary Fig. 10f). Considering that the helices are located on the surface of AsLOV2, it is suggested that the global structure of AsLOV2 would be flexible in the P state due to the structural flexibility of the helices. This global structural flexibility is also observed by L1, which shows the gradual increases in the fluctuations of Rg and Dmax (Fig. 5a).

Moreover, it was predicted that electrostatic interaction between K413 near A’α helix and E475 in the β-scaffold is also related to the formation of the dimeric interface from the theoretical approach based on PDBePISA50, an effective tool for identifying interactions at protein interfaces (See Fig. 4d and Supplementary Methods for details). Regarding this interaction, in the light state, the SEC profiles of K413E and E475K mutants, constructed to disrupt the electrostatic interaction, show a decreased dimer fraction compared to that of the WT (Fig. 4e and Supplementary Fig. 13b, c), confirming that not only the interaction between the β-scaffolds but also such electrostatic interaction plays a role in the formation of the dimeric interface (See Supplementary Note 4 for details).

Some spectroscopy studies reported that LOV domains found in other photoreceptor proteins, such as phototropin LOV domain (phot–LOV), Erythrobacter litoralis 222 (EL222), and aureochrome LOV domains (Aureo–LOV), form dimers on a relatively broad timescale ranging from ms to tens of seconds upon activation14,16,17,18,19,21,22 (Table 2). In particular, the dimerization of phot–LOV proteins, such as the Arabidopsis thaliana phot1-LOV2 (Atphot1-LOV2) and Chlamydomonas reinhardtii phot-LOV1 (Crphot-LOV1) have been observed on a timescale of tens of ms, which is similar to the dimerization constant of 10.6 ms observed in this TRXL study18,19,20. Considering that AsLOV2 is a type of phot-LOV, this similarity suggests that the dimerization of the LOV protein moiety involved in the light-induced transition of the phototropin superfamily shares similar kinetic properties18,19,20 (Table 2).

For LOV proteins without an effector domain (short LOV domains), previous structural studies have shown that LOV domains within these proteins exhibit various conformational changes, including dimerization and dissociation, during light-induced transitions23,32. These structural changes enable the LOV proteins to interact with their partner proteins or molecules and mediate signal transduction processes. In contrast, LOV domains in LOV proteins with an effector domain exhibit structural changes that are synchronized with the modulation of the domains14,28. For several LOV proteins with an effector domain, it has been observed that the β-scaffolds in their LOV domains form the primary dimerization interface, implying a shared structural characteristic among LOV domains concerning their dimeric interfaces14,16,17,21,22,27,67,68. Furthermore, previous studies on LOV domains of phototropin proteins, a type of LOV domains found in LOV proteins with an effector domain, have suggested that the light-activated LOV proteins exhibit structural changes in their β- scaffolds18,19,45, which strengthen their dimeric interactions. These structural changes have implied that the enhanced dimeric interactions, in turn, facilitate the activation of downstream serine/threonine kinase domains by promoting structural rearrangement of the downstream moieties in phototropin proteins45,69,70. Considering these findings and the fact that the effector domain of AsPhot1 is connected to the C-terminus of its LOV2 domain (AsLOV2), the structural mechanism proposed in our study (Fig. 6) suggests that the light-induced dimerization of AsLOV2 influences downstream reactions by reorganizing the structural arrangement of the effector domains and enhancing the interaction between them in AsPhot1 (Figs. 1a and 6). Our study provides a structural interpretation of AsLOV2’s light-induced dimerization, as summarized in Fig. 6, which offers a conformational template for understanding how LOV proteins with an effector domain utilize light to regulate the structural and functional properties of their effector domains. The structural mechanism of the dimerization for AsLOV2 suggests that protein oligomerization can be mediated by oligomeric interfaces facilitated by structural changes in subunits.

In this study, we elucidated the structural changes associated with AsLOV2’s light-induced dimerization using TRXL-based structural investigations, considering both local and global structural aspects. Kinetic analysis of TRXL data revealed that AsLOV2 undergoes the formation of a photoproduct through two different structural intermediates induced by light activation (Fig. 2). Furthermore, our MD simulation-aided structural analysis demonstrated that light-activated AsLOV2 converts to the photoproduct of transient dimer by generating the dimeric interface by the exposure of the β-scaffold due to the unfolding of the A’α and Jα helices (Fig. 6). The results of this structural analysis are also supported by additional SEC experiments and docking simulations, which show that the dimeric interaction of the protein is strengthened in the helix-deleted constructs (ΔJα, ΔA’α, and ΔJα/ΔA’α). Particularly, inspired by the structural information on AsLOV2’s light-induced dimerization transition, we propose that structural changes in the protein can mediate the formation of the oligomeric interface, ultimately leading to protein association. These findings will provide not only structural information involved in the light-induced dimerization of proteins sharing the LOV domain but also mechanistic insights into how proteins regulate their oligomeric states through protein association. Although our experimental and theoretical approaches successfully captured the structural changes associated with the formation of the dimer conformation for AsLOV2, additional studies with improved time resolution and SNR, such as those employing serial crystallography, are necessary to elucidate the local structural changes involved in the photoadduct formation between the chromophore and the core domain of the protein, as these changes are considered to be crucial structural components for the light-induced structural transitions. Furthermore, in the LOV-based optogenetic field, the unfolding phenomenon of Jα has been primarily utilized to develop it as an optogenetic tool71,72. Our observation suggests the possibility of regulating dimerization more tightly and expanding the scope of dimeric optogenetic tools.

Methods

Preparation of Avena sativa LOV2 (AsLOV2) samples

The wild-type (WT) Avena sativa LOV2 (AsLOV2) protein was prepared following a reported protocol11. The coding sequence of Avena sativa phototropin 1 LOV2 domain (residue 404-546) was cloned into a 2B-T vector using the LIC (ligation independent cloning) method. The Jα helix deletion construct (ΔJα, residue 404-520), A’α helix deletion construct (ΔA’α, residue 413-546), and A’α and Jα deletion construct (ΔJα/ΔA’α, residue 413-520) were also cloned in the same manner. The mutants of AsLOV2 (I427V, K413E, E475K, and C450A) were generated using a mutagenesis kit (EZchange Site-directed Mutagenesis kit, Enzynomics, South Korea). The primer sequences used for constructing AsLOV2 variants are provided in Supplementary Data 1. AsLOV2 WT, mutants, and variants were over-expressed in Escherichia coli BL21(DE3) induced with 0.4 mM IPTG and purified using Ni-affinity chromatography (Ni-Sepharose 6 Fast Flow, Cytiva, USA) and an anion exchange chromatography system (Q-Sepharose Fast Flow, Cytiva, USA). The purified proteins were dialyzed against 20 mM Tris pH 7.0, and 200 mM NaCl. The concentrations of AsLOV2 samples were determined by a UV-Vis spectrometer (UV-2550, Shimadzu, Japan).

Time-resolved X-ray liquidography experiment and data processing

Time-resolved X-ray liquidography (TRXL) experiment of AsLOV2 wild type (WT) was performed at the BioCARS 14-ID-B beamline of Advanced Photon Source (APS) (Argonne, USA) following the well-established experimental protocol15,35,36,37,38,39,40,41,42,43,44,45,46,47. The X-ray probe contained 24 single bunches of polychromatic 12 keV X-rays (standard top-up mode of APS), with a focusing size of 35 μm × 35 μm. After the irradiation of the protein sample by a circularly polarized nanosecond (ns) laser pulse (pump) with a fluence of 1.0 mJ/mm2 at 450 nm, the X-ray probe pulse was delivered to the protein sample contained in a capillary flow cell to observe the light-induced structural changes recorded in small-angle X-ray scattering (SAXS) and wide-angle X-ray scattering (WAXS) regions. The X-ray scattering patterns were collected using a two-dimensional detector (MX340-HS, Rayonix, USA) for 20 and 18 time delays for WT and the I427V mutant, respectively. For the measurements, the WT and I427V samples (1.5 mM concentration) dissolved in 20 mM Tris pH 7.0, and 200 mM NaCl buffer were used. In order to prevent irradiation damage to the protein samples from X-ray and laser exposures, the sample solution was passed through a capillary flow cell using a syringe pump (LEGATO® 111, KD Scientific, USA), and a fresh sample was provided for every exposure of the pump and probe pulses. The measurements covered a time range from 5.62 μs to 316 ms for WT and from 5.62 μs to 100 ms for I427V. The scattering pattern at a negative time delay (−5 μs) is also collected and used as a reference for calculating the differences in the scattering curves. The scattering patterns at positive time delays contain structural information contributed by the initial state, intermediates, and photoproduct, while the scattering pattern at the negative time delay contains structural information of the initial state. From the scattering curves, the difference scattering curves, ∆S(q, t) = S(q, t) − S(q, −5 μs), were calculated by subtracting the scattering curve at negative time delay (−5 μs) from the curves at positive time delays. To better describe scattering features in the WAXS region (q > 0.2 Å−1), q∆S(q, t) was generated by multiplying q with ∆S(q, t). For each time delay, more than 100 scattering curves were averaged to achieve a high signal-to-noise ratio. During the measurements, the temperature of the samples was kept at 293 K using a cold nitrogen stream (Oxford Cryostream, UK). From the TRXL data, the contribution from the heating of the solvent due to energy transfer by excited protein molecules was removed using a well-established protocol39 (See Supplementary Methods and Supplementary Fig. 15 for the details).

Global kinetic analysis for TRXL data (SVD and KCA analysis)

To identify the kinetics of the structural transition in AsLOV2, we performed singular value decomposition (SVD) analysis on the TRXL data (difference scattering curves)37,39. SVD analysis decomposes the data matrix, A, of the difference scattering curve into three matrices, U, S, and V, satisfying the following equation (A = USVT). U is a matrix called left singular vectors (LSVs), which is composed of time-independent spectra for each q. S is a diagonal matrix that represents the weight of each singular vector. The right singular vectors (RSVs) referred to as V are composed of time-dependent profiles for each LSV. LSVs offer a basis for the space spanned by species-associated difference scattering curves (SADSs), and RSVs present information on the time-dependent concentration changes of intermediates. Therefore, SVD analysis provides the number of structurally distinguishable intermediates and the dynamics of each species, regardless of the kinetic model. Based on the singular values and autocorrelation factors of the corresponding singular vectors from SVD analysis, we identified three significant singular vectors (Supplementary Fig. 4). To determine the number of time constants involved in the light-induced structural transition, we simultaneously fitted the first three RSVs using: (i) exponential sharing one common time constant, (ii) exponentials sharing two common time constants, and (iii) exponentials sharing three common time constants (Supplementary Fig. 5). For both WT and I427V, the analysis using (ii) effectively described the RSVs features, while the analysis using (i) was insufficient, and the analysis using (iii) resulted in overfitting, confirming the involvement of two common time constants in the light-induced structural transition of WT and I427V. For the WT, the time constants of 682 μs and 10.6 ms were obtained within the time range of 5.62 μs to 316 ms. For I427V, time constants of 130 μs and 3.4 ms were obtained within the time range of 5.62 μs to 100 ms (Supplementary Figs. 3 and 8).

For each protein construct (WT or I427V), we employed kinetics-constrained analysis (KCA) to extract three species-associated difference scattering curves (SADSs) corresponding to the three species from the experimental scattering curves (Fig. 2b and Supplementary Fig. 8). In the KCA, we extracted SADSs from the kinetic components, specifically the major RSVs and LSVs, obtained from the SVD analysis rather than from the raw experimental scattering curves by using well-established method73. During the KCA the experimental scattering curves were decomposed into three SADSs corresponding to the three species as follows.

$${\Delta S}_{{theory}}\left({{{{\rm{q}}}}}_{{{{\rm{i}}}}},{t}_{j}\right)={\sum}_{k=1}^{3}\left[{C}_{k}\left({t}_{j}\right)\right]{\Delta S}_{{C}_{k}}\left({{{{\rm{q}}}}}_{{{{\rm{i}}}}}\right)$$
(1)

where ∆Stheory(qi, tj) is the theoretical difference scattering curve at given q and t values, ∆SCk(qi) is the SADS containing structural information for the formation of the kth intermediate species at a given q value, and Ck(tj) is the instantaneous population of the kth intermediate at a given t value. The population can be calculated using the time constants obtained from the SVD analysis. Subsequently, we minimized the discrepancy between the theoretical and experimental curves by applying the Nelder-Mead simplex algorithm.

MD simulations-aided structural analysis of TRXL data

We conducted molecular dynamics (MD) simulation-aided structural analysis on the three SADSs to extract the three-dimensional protein conformations involved in the light-induced transition of AsLOV2. To this purpose, we classified the structural information of monomer conformations of AsLOV2 into the following two representative conformational frames based on the structural characteristics of their Jα helix11,34,51,52,53,54,55: (i) a monomer structure with a folded Jα helix, and (ii) a monomer structure with an unfolded Jα helix. Subsequently, we performed the MD simulations considering the two representative conformational frames ((i) and (ii)) to generate candidate structures for describing SADS1, SADS2 and SADS3.

To generate the equilibrium structures for (i), we performed the MD simulation using a crystal structure with monomer conformation (PDB ID: 2V1A)34 as the initial structure. The simulations were performed using GROMACS 2019.274 with the Charmm36 force field75, in combination with the TIP3P water model. To describe the cysteinyl–flavin (Cys–FMN) photoadduct condition of the chromophores, the force field was modified according to a previous study76. The charge of the system was neutralized by adding sodium ions. The system was equilibrated under NVT condition for 100 ps with a velocity-rescale thermostat (τT = 2 fs and T = 300 K) and was subsequently equilibrated under NPT condition for 100 ps with a velocity-rescale thermostat (τT = 2 fs and T = 300 K) and a Parrinello-Rahman barostat (τT = 2 fs and T = 300 K). After equilibration, 100 ns production simulations were performed on the equilibrated structure. After the simulation, 10,000 candidate structures with (i) were sampled from the MD trajectory at 10-ps intervals. From the MD trajectories, the representative structure for (i) was selected by using the clustering method based on the GROMOS algorithm77 and then used as the starting structure for the MD simulation of (ii).

To generate protein conformations with the geometry of (ii), we used the representative structure for (i) as an initial structure and conducted a non-equilibrium molecular dynamics (NEMD) simulation employing the pulling algorithm of GROMACS. In the pulling simulation, we defined two residues, Gln502 (Q1) and Leu546 (Q2), representing the N and C-terminus of the Jα helix. Afterward, we set the direction of the external pulling force to cause the Q2 position to move away from Q1, thereby inducing the unfolding of the Jα helix. Considering the structural flexibility of the unfolded Jα helix in (ii), we applied the force to pull the Jα helix along the directions of ±x, ±y, and z (Supplementary Fig. 9). In pulling simulations, the pulling force strength was applied as 1000 kJ/mol·nm2 between two positions. All NEMD simulations were conducted using the pulling algorithm along the hypothetical reaction coordinates until the Jα helix completely unfolded. From each NEMD trajectory, 5000 candidate structures for (ii) were sampled at 10-ps intervals.

After the simulations, we aimed to find the optimal protein conformations for I1, I2, and P by comparing SADSs with the difference theoretical scattering curves from the MD-generated structures. We calculated the theoretical scattering curves of the candidate structures using the CRYSOL78 with all default parameters. To account for the effect of the polychromatic X-ray beam, a well-established protocol was used to incorporate the energy spectrum of the X-ray pulse into the theoretical scattering curve39. Subsequently, the theoretical difference scattering curves were obtained by subtracting the theoretical static scattering curve corresponding to the ground (G) state from the theoretical static scattering curves corresponding to I1, I2, and P. After generating the candidate pools of theoretical difference curves, the theoretical difference scattering curves were compared with three SADSs to identify the optimal structures for I1, I2 and P. The optimal structures were assessed by calculating the reduced χ2 using the following equation, which quantified the discrepancy between the theoretical and experimental difference scattering curves.

$${\chi }_{{red}}^{2}=\frac{1}{N-1}{\sum}_{i=1}^{N}{\left[\frac{\mu \cdot \varDelta {S}_{{theory}}\left({q}_{i}\right)-\varDelta {S}_{\exp }\left({q}_{i}\right)}{\sigma ({q}_{i})}\right]}^{2}$$
(2)

Stheory(q) represents the theoretical difference scattering curve, ∆Sexp(q) corresponds to the experimental TRXL data, µ is the scaling factor between ∆Stheory(q) and ∆Sexp(q), N is the number of q points in the experiment, and σ(q) represents the experimental standard deviation. µ comprises two scaling factors: (i) the relative ratio between the experimental and theoretical scattering curves, and (ii) the practical photoconversion yield of AsLOV2 in the TRXL experiment. In principle, a χ2red value close to 1 indicates the most optimal solution within the experimental error. Based on the possible structural frames, for each state, the 10 best-fitted theoretical difference curves with the lowest χ2red values were selected as the best solutions. The 10 best protein structures were then selected from the MD-generated structures, which were used to generate the 10 best-fitted theoretical curves. These 10 best structures were subsequently used to generate the structural landscapes (Fig. 5), providing detailed structural information about the light-induced transition of AsLOV2. Furthermore, for each light-induced state (I1, I2, or P), the optimal structure with the smallest χ2red value among the 10 best structures was used to illustrate the representative structural characteristics of the protein in each state (Figs. 3, 4, and Supplementary Fig. 10).

The optimal solutions for SADS1 and SADS2 demonstrated excellent agreement with the corresponding experimental data (Fig. 3), whereas the solution for SADS3 did not. Regarding this, the low-resolution structure of P obtained from the ab initio shape reconstruction of the SAXS profile for the light state suggested that AsLOV2 can potentially adopt a dimeric conformation when activated by light. Thus, we additionally performed MD simulations considering three representative conformational frames to generate candidate structures with various dimer conformations for describing SADS327,30,56: (a) Dimer structures where A’α helices of two monomers (A’α–A’α) interact to form a coiled-coil dimerization, (b) Dimer structures where β-scaffolds interact with each other (β-scaffold–β-scaffold), and (c) Dimer structures where A’α–A’α helices and Jα–Jα helices of two monomers interact to form coiled-coil dimerization. For each type of dimer conformation, we considered the following two monomer-excitation configurations: (L1) homo L/L dimer (light state monomer + light state monomer) (a, b, and c frames) which consists of two light-activated monomers, and (L2) hetero L/G dimer (light state monomer + G monomer) (a’, b’, and c’ frames) which consists of one light-activated monomer and one non-activated G monomer. By considering the three dimer conformations and two monomer-excitation configurations, a total of six MD simulations were conducted to generate candidate structures with dimer conformations. For MD simulations of (L1), we used two optimal structures of I2 as two monomers and aligned them according to the dimer geometries of (a), (b), or (c), thereby obtaining an initial structure for each dimer conformation (a, b or c) (Supplementary Fig. 14). For MD simulations of (L2), we generated an initial structure for each dimer conformation (a’, b’, or c’) by using one optimal structure of I2 as a monomer unit and one G structure as the other monomer unit and aligning them according to the dimer geometry of (a’), (b’), or (c’) conformation (Supplementary Fig. 14). The initial structures were used for MD simulations, and these MD simulations were carried out in the same manner as the simulation applied for (i). During these MD simulations, production simulations were conducted over a period of 300 ns. 30,000 candidate structures with dimer conformations were sampled from each MD trajectory at 10-ps intervals. These structures were employed in the structural analysis for SADS3 using the same approach as applied to the candidate structures with the monomer conformations, and the 10 best structures of P were obtained. The initial structures and optimal structures for each state from the MD simulations-aided structural analysis are provided in Supplementary Data 2.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.