Introduction

Homochirality of biological molecules remains a mystery (Blackmond 2011; Perez-Garcia and Amabilino 2007; Buhse et al. 2021; Sallembien et al. 2022). Thus, chiral proteinogenic amino acids are primarily left-handed (laevo-L) and nucleic acids are primarily right-handed (dextro-D). There is consensus that homochirality was accomplished in two steps: production of a seed enantiomeric excess followed by amplification of this seed EE to full homochirality (Blackmond 2011; Breslow and Cheng 2009). It is believed that homochirality is a requirement for life although it has also been proposed that homochirality could be a consequence of life itself (Wu et al. 2012).

Several sources for creating the initial seed EE have been discussed in the literature, among them the possibility that seed EE of L-amino acids and D-sugars was transported to earth on meteorites from distant reaches of our galaxy. Indeed, a class of meteorites does show an excess of L-amino acids and D-sugars although for a large majority of meteorites, population of organic molecules is racemic (Elsila et al. 2016). Several other concerns have also been raised for the EE on meteorites, among them a possibility of contamination (Burton et al. 2014) with biomolecules from earth’s biosphere and questions about the stability of organic molecules under high temperatures encountered as the meteorites travelled through earth’s atmosphere (Glavin and Bada 2001). Concerns about racemization of amino acids over their lifetime on meteorites have also been raised (Cohen and Chyba 2000). Magneto-chiral effect was shown to produce EE among some molecules although its applicability for life-molecules remains to be demonstrated (Hattori et al. 2017). An EE of 10–18 by parity violating energy difference (PVED) has been calculated for amino acids (Bada 1995). Recently, we showed that although Archean earth’s surface atmosphere was anoxic, there was enough oxygen in stratosphere that could have produced magnetic circular dichroism (MCD) in paramagnetic O2 molecules (Sharma 2023). MCD in stratospheric O2 resulted in relative differential circular polarization of 10–10 for UVC light in 200–250 nm region and an asymmetric photolysis of amino acid molecules produced in volcanic plumes, to give an EE of 10–12. This EE is several orders of magnitude higher than PVED and, at the same time does not require the extra-terrestrial source for seed EE.

The most promising mechanism for amplification of seed EE to full homochirality is believed to involve preferential crystallization (PC) of a conglomerate enantiomer from solution (Elsner et al. 2005; Rougeot and Hein 2015), or one of the several variations of this mechanism like Viedma ripening (VR) (Viedma et al. 2008; Sogutoglu et al. 2015; Belletti et al. 2022). However, EE produced by PC is transient and it invariably goes to the equilibrium value of zero in both solution and solid form. VR does produce large EE in crystalline form but requires an essential mechanism of racemization in solution. No such racemization mechanism is known for amino acids that would be viable under prebiotic conditions. These restrictions of PC and the requirements of VR are absent in the conglomerate racemic compound system described here and has the potential for amplifying seed EE of some amino acids to large permanent EE in solution. Mechanism for generating large solution-EE (50–99%) at the eutectic point of racemic compound amino acids has also been discussed and demonstrated (Morowitz 1969; Klussmann et al. 2006a, b; Klussmann et al. 2006a, b; Breslow and Levine 2006) but it requires a small starting EE of around 1% or greater. No such requirement is there with the system discussed here, and seed solution-EE can be amplified sequentially from near-zero value in conglomerate to 50% or greater at the eutectic point of the racemic compound. Such prebiotic scenarios might be commonly around as is explained by a simple model of water with dissolved amino acids seeping over porous bed (CS) and collecting into a deeper pool (RCS). The requirements for such a sequentially-coupled system and possible end result of large permanent EE (~ 50%) in solution are explained with the help of ternary phase diagrams.

Theory

Most proteinogenic amino acids crystallize from solution as racemic compounds (Klussmann et al. 2006a, b) whereas chiral resolution by PC or VR requires crystallization as conglomerates (Rougeot and Hein 2015; Viedma et al. 2008). The mode of crystallization depends on thermodynamic as well as kinetic considerations and it has been proposed that compared to their normal RC behavior in bulk solutions, amino acids could crystallize differently for high degrees of supersaturation due to rapid evaporation in porous media (Viedma 2001). Indeed, two very common amino acids, aspartic and glutamic acids normally crystallize as racemic compounds, but results in crystallization as conglomerates under rapid evaporation of aqueous solutions and high supersaturation (Viedma 2001). Both, glutamic and aspartic acids are present in Ferredoxin (Eck 1966), known to be one of the first proteins involved in the origin of life. As shown here, this dual behavior of some amino acids as conglomerates and racemic compounds might have been exploited under widely prevalent prebiotic circumstances for amplification of their solution enantiomeric excess.

Sequentially linked CS-RCS system is shown in a schematic diagram (Fig. 1a) where aqueous solution of amino acid switches from CS to RCS as the ambient condition changes from one of rapid evaporation and high supersaturation in a porous medium to normal evaporation in a bulk medium. Physical realization of such a schematic (Fig. 1b) shows aqueous solution of amino acid slowly seeping over a porous bed (CS) (Viedma 2001) and collecting in a deeper pool of water (RCS).

Fig. 1
figure 1

a: Schematic for dual behavior where amino acid solution produces conglomerate system (CS) under high evaporation and racemic compound system (RCS) otherwise. b: Physical realization of schematic. Amino acid solution seeping through porous bed results in high evaporation (CS) and collects in a deeper pool (RCS)

Figures 2 shows the ternary phase diagrams for the same amino acid behaving as CS and RCS under different ambient conditions. Phase diagrams in Fig. 2 are based on data for aspartic acid (Cao et al. 2022) which shows unusual behavior even in bulk aqueous solutions. Thus, aqueous solution with equal proportion of L and D aspartic acid results in a conglomerate upon crystallization. This lasts for several hours before aspartic acid reverts to its normal crystallization as a racemic compound (Lee and Lin 2010). Bulk solution of racemic compound aspartic acid always results in a racemic compound upon crystallization (Viedma 2001). This unusual behavior of aspartic acid makes it possible to measure its solubility both as a conglomerate and a racemic compound (Lee and Lin 2010).

Fig. 2
figure 2

Ternary phase diagrams of a CS (1a) and RCS (1b) of the same compound (example, aspartic acid). Trajectory from supersaturated solution A → B is due to PC in CS. CS switches to RCS at point B (b)

Phase diagram switches from 2(a) to 2(b) as the amino acid behavior changes from conglomerate to racemic compound. In above phase diagrams, vertices W, L, D correspond to pure water, L-amino acid and D-amino acid respectively. Points C and F give solubility of pure L and D enantiomer respectively. G is the eutectic point (EE = 0) for conglomerate amino acid and points H, E are eutectic points (EE ≠ 0) for racemic compound amino acid. Plausible path ABG (Fig. 2a) is shown for preferential crystallization of D-amino acid as a conglomerate and also the path shift from BG to BH (Fig. 2b) following a transition of amino acid from conglomerate to racemic compound at point B.

Mass fraction units are used here. Lower case letters l, d and w (l + d + w = 1) will be used throughout for mass-fraction of L-amino acid, D-amino acid and water respectively. As an example for aspartic acid (Cao et al. 2022; Lee and Lin 2010; Dalton and Schmidt 1933), for point C on the ternary diagram, lC = 0.004, dC = 0, wC = 0.996; and for point F, lF = 0, dF = 0.004, wF = 0.996. For some other points (aspartic acid, 25 oC):

  • Eutectic point G of conglomerate: lG = 0.004, dG = 0.004, wG = 0.992.

  • Eutectic point H of racemic compound: lH = 0.004, dH = 0.0014, wH = 0.9946.

  • Eutectic point E of racemic compound: lE = 0.0014, dE = 0.004, wE = 0.9946.

  • Vertex L: lL = 1, dL = 0, wL = 0; Vertex D: lD = 0, dD = 1, wD = 0; Vertex W: lW = 0, dW = 0, wW = 1.

  • Racemic point R on the L-D base: lR = 0.5, dR = 0.5, wR = 0.

To understand the transition from conglomerate form of aspartic acid to its racemic compound form, an overlap of Fig. 2(a) and (b) is shown in Fig. 3. Some additional lines are also drawn in Fig. 3. These include straight lines (DTP, DAM) for constant concentrations of L-amino acid.

Fig. 3
figure 3

Combined ternary phase diagram for CS and RCS

The behavior of amino acid as a conglomerate is described by the ternary phase diagram and solubility isotherm CGF in Fig. 2(a). For such supersaturated, almost racemic solution of an amino acid described by point A (EE ≈ 0, dA > lA) in Figs. 2(a) and 3, seeds of D-enantiomer are more likely to form spontaneously, triggering preferential crystallization of D-enantiomer of amino acid with the solution acquiring a greater L-concentration than D (solution EEL > 0). Eventually, seeds of L-enantiomer form and trigger crystallization of L-enantiomer with EEL going to zero for both the crystallized solid and in solution. Path ABG (Figs. 2a and 3) is one of many along which the solution can evolve, beginning with the onset of preferential crystallization at A and terminating at the eutectic point (G) of the conglomerate solubility isotherm. If however, the conglomerate system switches (at point B) to racemic compound before point G is reached, it is possible that the solution will terminate at point H, the eutectic point of the racemic compound solubility isotherm. For this to happen, the path ABG must intersect the tie line HR of the racemic compound system. For given initial supersaturation (point A), preferential crystallization of D-enantiomer triggers the evolution of the conglomerate system along any path within the triangular region AMG; AM representing path of constant L-concentration for crystallization of D-enantiomer and MG is the path of constant D-concentration for eventual crystallization of L-enantiomer. Clearly, point T represents the minimum initial supersaturation (Smin ≡ ST) necessary for the system to evolve to eutectic point H following a switch from conglomerate to the racemic compound, i.e. for S < ST, the path of preferential crystallization will be confined to triangular region TPG and will not intersect the tie line HR. It can be shown (supplementary information) that Smin depends on the enantiomeric mass fraction at eutectic points for conglomerate (G) and racemic compound (H). Thus,

$${S}_{min}\cong \frac{{l}_{H}+{l}_{G}-{d}_{H}}{{l}_{G}}$$
(1a)

For several amino acids (Klussmann et al. 2006a, b), \({l}_{H},{l}_{G}= {l}_{C}\) (Figs. 2a and 3), giving,

$${S}_{min}\cong 2-\frac{{d}_{H}}{{l}_{H}}$$
(1b)

For aspartic acid solution at 25 °C, lH = 0.004, dH = 0.0014, lG = 0.004, giving Smin = 1.65. Higher initial supersaturation (S > Smin) shown by point A, results in increasing triangular phase space area PMX (left of the tie line HR, in the tri-phasic space of racemic compound) within which transition from conglomerate → racemic compound will take the solution from point B to the eutectic point H of the racemic compound (EEL ≈ 0.48 at H, for aspartic acid). If conglomerate → racemic compound switch takes place right of the tie line HR, resultant equilibrium solubility and solution EEL is described by a point on segment HO of the racemic compound isotherm. This is explained later in more detail. Ratio of phase space areas of triangle PMX to triangle GMA (Fig. 4) increases with (S—Smin) (supplementary information) and is given by,

Fig. 4
figure 4

Ratio of triangular areas PMX/GMA in ternary diagram is a measure of likelihood that CS → RCS transition will result in the solution EE at the eutectic point of RCS

$$\frac{Area(PMX)}{Area(GMA)}\cong {[\frac{S-{S}_{min}}{S-1}]}^{2}$$
(2)

This ratio of areas can be looked upon as a measure of the likelihood that conglomerate → racemic compound transition will amplify the solution EEL from near zero at point A of conglomerate to its equilibrium value, EEL = (lHdH)/(lH + dH) at eutectic point H of the racemic compound, which is 0.48 for aspartic acid. This is illustrated in Fig. 4. Viability of supersaturation as large as 10 or even greater, is discussed later (Section 3).

If conglomerate → racemic compound transition point B is to the right of tie line HR, the resultant equilibrium solubility of the racemic compound is given by the point of intersection between lines BR and section HO of the solubility isotherm, with excess solute crystallizing out as a racemic compound at point R. In this case, the solution EEL will have a value between 0 and EEL,max of 0.48 for aspartic acid. In the biphasic domain (HOR) of the racemic compound, the difference of enantiomeric concentration (l-d)/w remains constant along line BR and can be correlated to the resultant equilibrium solution-EEL on isotherm HO (supplementary information). To the left of tie line HR, the concentration-difference in the tri-phasic domain HLR is in general greater than in the biphasic domain HOR (supplementary information) and conglomerate → racemic compound transition in this region results in solution EEL at point H (Fig. 5).

Fig. 5
figure 5

Solution-EE of the RCS as a function of the enantiomeric concentration difference at the CS → RCS transition point B

Discussion

One theory (Klussmann et al. 2006a, b) for prebiotic chiral resolution rests on the observation that most proteinogenic amino acids are racemic compounds and for any supersaturated state (for example, point B in Figs. 2b and 3) in the triphasic domain HLR, the equilibrium solution EEL will be given by the solubility of enantiomers at the eutectic point (H). This is independent of the overall EEL for the amino acid or EEL(B) at point B. As an example, for solubility in water at 25 °C, the solution EEL(H) for alanine and serine at eutectic point H is 0.59 and 0.99 respectively, independent of EEL(B) which could be just 1%. This is relevant since it is understood that most prebiotic biochemical reactions must have taken place in aqueous medium. However, starting from an almost racemic, supersaturated state A (EEL(A) > 0, Fig. 3), the racemic compound amino acid solution cannot evolve on its own to state B within the triphasic domain HLR. If the amino acid is a conglomerate, PC can result in evolution of solution from A to B with solution EEL(B) of few percent. But PC trajectory eventually terminates at the equilibrium eutectic point G of the conglomerate, with EEL(G) = 0. An ideal situation would require an amino acid that is initially a conglomerate and PC increases the solution EEL from EEL(A)≈0 to EEL(B)≈few percent. It then switches to racemic compound, resulting in the final amplified EEL(H) at the eutectic point H. This is exactly the case described here. Aqueous solution of amino acid seeping over a porous bed is subject to high evaporation rate and large supersaturation. Aspartic acid and glutamic acids are known to behave like conglomerates in this situation. Amino acid solution collecting in a deeper pool and subject to lower evaporation rates then switches to a racemic compound in a time of few hours.

It is understood that in an almost-racemic sample of molecules, minimum seed-EE that can be amplified must exceed the mean amplitude (δ│EE│) of its statistical fluctuation (Lente 2012), i.e. δ│EE│ ≈ 1/√N < EEseed. Here N is the total number of enantiomers in the given sample. This is seen experimentally in Soai reaction (Soai 2022) where seed EE ≈ 5 × 10–7 can force the reaction in the direction of predetermined chirality (Soai et al. 2014). For a racemic solution of an amino acid like aspartic acid, solubility is around 10 mg/g H2O (Cao et al. 2022). For a small pool of volume Vm3, number of amino acid molecules in a saturated solution is N = (104 V/M)NA, where M is the molecular weight of amino acid (M = 133 g/mol for aspartic acid) and NA is the Avogadro number. For the pool volume V = 1m3, δ│EE│ ≈ 1/√N = 1.5 × 10–13 for aspartic acid. Similar reasoning has been used to propose (Kondepudi and Nelson 1985) that even seed EE of 10–18 due to PVED can be amplified by autocatalytic reactions (Blackmond 2020) to homochirality.

In a recent work (Sharma 2023) we showed that in terrestrial magnetic field, magnetic circular dichroism by stratospheric oxygen creates a net circular polarization of 10–10 in UVC light from sun. This in turn results in asymmetric photodissociation of amino acid enantiomers produced in volcanic eruptions to give seed EE of 10–12 and with a sign that remains constant over a large area of earth. This seed EE could potentially be amplified by the mechanism described in this report. Other possible sources include seed EE delivered on meteorites (Elsila et al. 2016). As mentioned earlier, while a small class of meteorites have shown an excess of L-amino acids, majority of them show a racemic population. Several questions remain unanswered including lack of knowledge of an estimate for average seed EE delivered on meteorites over large area of earth.

Figure 1 illustrates a realistic scenario on primitive earth for amplification of seed EE of some amino acids like aspartic and glutamic acids which behave like conglomerates under condition of rapid evaporation and high supersaturation in porous medium but are racemic compounds otherwise in conventional bulk solution form. Both of these amino acids are present on Ferrodoxin, one of the oldest known proteins.

Supersaturation values around 2.5 are seen in some experiments involving bulk solution of DL-aspartic acid (racemic compound). Rapid evaporation of such solution in porous medium (Fig. 1) results in higher supersaturation before onset of precipitation of conglomerate crystals. In general, supersaturation that can be achieved (before nucleation) increases with evaporation rates. For low initial concentrations, supersaturation as high as 10 has been observed for glycine following rapid evaporation in solution (He et al. 2006). Assuming there is a slight excess of D-enantiomer in the almost-racemic seed-EE, the scenario of Fig. 1 can result in preferential crystallization of D-enantiomer initially and a slight excess of L-enantiomer in the conglomerate solution of amino acid, seeping over the porous substrate and collecting in the deeper pool. The conventional body of bulk solution in the pool, with relatively low evaporation rates and lower supersaturation, favors amino acids in the racemic compound form. As seen experimentally, for aspartic acid already present in the pool in conglomerate form, it can take between 16 h (at 25 C) and 6 h (at 45C) for conversion to racemic compound solution (Lee and Lin 2010).

As mentioned earlier, for eutectic point G of aspartic acid conglomerate, lG = 0.004, dG = 0.004, wG = 0.992 with racemate solubility in water of 8.06 mg/g. As an example, for large supersaturation induced by rapid evaporation, S = 10 and metastable solubility at point A (Figs. 2a and 3) is \(\frac{{l}_{A}+{d}_{A}}{{w}_{A}}=80.6 mg/g\), giving lA = 0.039, dA = 0.039, wA = 0.922. With PC of only the D enantiomer in the beginning, solution EEL increases along path AX. PC can evolve along various paths ABG within the triangular phase space AMG. Trajectory ABG must intersect section PX of tie line to reach solution EEL of eutectic point H. Starting from EEL(A) = 0 at point A with S > Smin (Smin = 1.65 for aspartic acid) intersection with PX at point X results in minimum increase of solution EEL,min(X) due to PC. As shown in the supplementary section,

$${EE}_{L, min}(X)\cong \frac{{l}_{H}-{d}_{H}}{{2.S.l}_{G}-{l}_{H}+{d}_{H}}$$
(3)

This gives \({EE}_{L, min}\left(X\right)=0.194, 0.070, 0.049, 0.033\) for S = 2, 5, 7, 10 respectively. This is the smallest EEL which must be produced by PC for trajectory ABG to cross over into the triphasic domain at point X before transition to racemic compound. For increasing supersaturation (S), EEL,min(X) is inversely proportional to S and the sequential mechanism in the CS-RCS described here for amplification of solution EEL becomes increasingly viable. In reality, nucleation of the other (D) enantiomer as well as heterogeneous nucleation will drive PC along trajectory ABG, intersecting tie line PX at a point above X. Thus, the smallest EEL which must be produced by PC will be greater than \({EE}_{L, min}\left(X\right).\) For aspartic acid, conglomerate → racemic compound transition in the pool will result in the equilibrium EEL(H) of 0.48 at the racemic compound’s eutectic point H, provided the transition point B is in the triphasic domain PMX. For such aspartic solution in the pool, conglomerate → racemic compound phase transition thus results in maximum amplification of EEL(X) → EEL(H) by a factor of 2.5, 6.9, 9.8 and 14.5 for S = 2, 5, 7, 10 respectively. For an initial supersaturation of just 1.06, solution EEL of 0.04 has been demonstrated in PC experiments with conglomerate amino acid threonine (Elsner et al. 2005). Clearly, higher supersaturation increases the likelihood that PC trajectory will cross over into the triphasic domain PMX. Likelihood that PC will result in a state within the triphasic domain PMX is also measured by the ratio of area of phase space PMX to the total available phase space GMA (Eq. 2) available under PC. As seen from Fig. 4, this ratio increases with supersaturation and is 0.12, 0.70, 0.80, 0.86 for S = 2, 5, 7, 10 respectively.

To summarize, starting from an almost racemic (EEL < 0), highly supersaturated conglomerate solution represented by point A on ternary phase diagram (Fig. 3), equilibrium enantiomeric excess (EEL) of the racemic compound solution will depend on the position of point B (Fig. 3) up to which preferential crystallization occurs in conglomerate solution flowing over shallow porous bed, followed by conglomerate → racemic compound transition in the pool. As given in Fig. 5, if point B is in the tri-phasic area of racemic compound (triangle PMX of Fig. 3), the equilibrium solution EEL of the pool will be given by its maximum value at the eutectic point, EEL,max(H) = (lHdH)/(lH + dH). For aspartic acid, EEL,max(H) is 0.48. If point B is in the bi-phasic area of racemic compound (quadrilateral AXPG of Fig. 3), the equilibrium solution EEL of the pool is given by a value between 0 and EEL,max(H) (Fig. 5).

Conclusion

While it is known that some amino acids can behave either as conglomerates or as racemic compounds depending on the prevailing ambient, we have shown how this dual behavior could have combined sequentially under viable prebiotic conditions to produce near homochirality in solution. Starting with CS in near racemic condition, PC produces a solution-EE of ~ 1%. With change of ambient conditions, the CS switches to RCS, resulting in solution-EE at the eutectic point of RCS. For aspartic acid as a model system, this solution eutectic-EE is ~ 50%. For sequential amplification to be effective, we have derived an equation that can be used to calculate the required threshold supersaturation of CS and is 1.65 for aspartic acid at 25 C. For given supersaturation, we also derive an equation to calculate the minimum required solution-EE in CS before switching to RCS can be effective for further sequential amplification to its eutectic EE value. While the analysis is based on available data for aspartic and glutamic acids which show such dual behavior, unusual crystallization behavior under rapid evaporation is not uncommon in other materials (Hilden et al. 2003). In such conditions of rapid evaporation and highly saturated metastable solubility, crystallization is dominated by kinetic (Ostwald’s step rule) rather than thermodynamic considerations (van Santen 1984). Such dual behavior might be common in hitherto unexplored crystallization of other amino acids as well.