In this section we review the body of work that assumes that a deep seated solar cycle dynamo generates a strong toroidal magnetic field at the base of the convection zone and studies the rise of active region scale magnetic flux tubes from the bottom of the convection zone to the surface to form the observed solar active regions.
Results from the thin flux tube simulations
Beginning with the seminal work of Moreno-Insertis (1986) and Choudhuri and Gilman (1987), a large body of numerical simulations solving the thin flux tube dynamic equations (1), (2), (6), (7), and (8)—or various simplified versions of them—have been carried out to model the evolution of emerging magnetic flux tubes in the solar convective envelope (see Choudhuri 1989; D’Silva and Choudhuri 1993; Fan et al. 1993, 1994; Schüssler et al. 1994; Caligari et al. 1995; Fan and Fisher 1996; Caligari et al. 1998; Fan and Gong 2000; Weber et al. 2011, 2013). The results of these numerical calculations have provided possible explanations for some of the basic observed properties of solar active regions and put constraints on the field strength of the toroidal magnetic fields if they originate from the base of the solar convection zone.
A set of the earlier calculations (see Choudhuri and Gilman 1987; Choudhuri 1989; D’Silva and Choudhuri 1993; Fan et al. 1993, 1994) considered initially buoyant toroidal flux tubes by assuming that they are in temperature equilibrium with the external plasma. Various types of initial undulatory displacements are imposed on the buoyant tube so that portions of the tube will remain anchored within the stably stratified overshoot layer and other portions of the tube are displaced into the unstable convection zone which subsequently develop into emerging \(\varOmega \)-shaped loops.
Later calculations (see Schüssler et al. 1994; Caligari et al. 1995, 1998; Fan and Gong 2000) considered more physically self-consistent initial conditions where the initial toroidal flux ring is in the state of mechanical equilibrium. In this state the buoyancy force is zero (neutrally buoyant) and the magnetic curvature force is balanced by the Coriolis force resulting from a prograde toroidal motion of the tube plasma. It is argued that this mechanical equilibrium state is the preferred state for the long-term storage of a toroidal magnetic field in the stably stratified overshoot region (Sect. 3.1). In these simulations, the development of the emerging \(\varOmega \)-loops is obtained naturally by the non-linear, adiabatic growth of the undulatory buoyancy instability associated with the initial equilibrium toroidal flux rings (Sect. 4.1). As a result there is far less degree of freedom in specifying the initial perturbations. The eruption pattern needs not be prescribed in an ad hoc fashion but is self-consistently determined by the growth of the instability once the initial field strength, latitude, and the subadiabaticity at the depth of the tube are given. For example Caligari et al. (1995) modeled emerging loops developed due to the undulatory buoyancy instability of initial toroidal flux tubes located at different depths near the base of their model solar convection zone which includes a consistently calculated overshoot layer according to the non-local mixing-length treatment. They choose values of initial field strengths and latitudes that lie along the contours of constant instability growth times of 100 days and 300 days in the instability diagrams (see Fig. 6), given the subadiabaticity at the depth of the initial tubes. The tubes are then perturbed with a small undulatory displacement which consists of a random superposition of Fourier modes with azimuthal order ranging from \(m=1\) through \(m=5\), and the resulting eruption pattern is determined naturally by the growth of the instability.
On the other hand, non-adiabatic effects may also be important in the destabilization process. It has been discussed in Sect. 3.2 that isolated magnetic flux tubes with internally suppressed convective transport experience a net heating due to the non-zero divergence of radiative heat flux in the weakly subadiabatically stratified overshoot region and also in the lower solar convection zone. The radiative heating causes a quasi-static upward drift of the toroidal flux tube with a drift velocity \(\sim 10^{-3} |\delta |^{-1} \mathrm {\ cm\ s}^{-1}\). Thus the time scale for a toroidal flux tube to drift out of the stable overshoot region may not be long compared to the growth time of its undulatory buoyancy instability. For example if the subadiabaticity \(\delta \) is \(\sim - 10^{-6}\), the time scale for the flux tube to drift across the depth of the overshoot region is about 20 days, smaller than the growth times (\(\sim \) 100–300 days) of the most unstable modes for tubes of a \(\sim 10^5 {\mathrm {\ G}}\) field as shown in Fig. 6. Therefore radiative heating may play an important role in destabilizing the toroidal flux tubes. The quasi-static upward drift due to radiative heating can speed-up the development of emerging \(\varOmega \)-loops (especially for weaker flux tubes) by bringing the tube out of the inner part of the overshoot region of stronger subadiabaticity, where the tube is stable or the instability growth is very slow, to the outer overshoot region of weaker subadiabaticity or even into the convection zone, where the growth of the undulatory buoyancy instability occurs at a much shorter time scale.
A possible scenario in which the effect of radiative heating helps to induce the formation of \(\varOmega \)-shaped emerging loops has been investigated by Fan and Fisher (1996). In this scenario, the initial neutrally buoyant toroidal flux tube is not exactly uniform, and lies at non-uniform depths with some portions of the tube lying at slightly shallower depths in the overshoot region. Radiative heating and quasi-static upward drift of this non-uniform flux tube bring the upward protruding portions of the tube first into the unstably stratified convection zone. These portions can become buoyantly unstable (if the growth of buoyancy overcomes the growth of tension) and rise dynamically as emerging loops. In this case the non-uniform flux tube remains close to a mechanical equilibrium state during the initial quasi-static rise through the overshoot region. The emerging loop develops gradually as a result of radiative heating and the subsequent buoyancy instability of the outer portion of the tube entering the convection zone.
All of the aforementioned thin flux tube simulations have ignored the influence of the convective flows on the rising flux tube. The latest calculations by Weber et al. (2011, 2013) have incorporated the influence of the giant-cell convection and the mean flows on the motion of the thin flux tube by using a time-dependent external velocity field, computed separately by a 3D global convection simulation of the rotating solar convective envelope, for the drag force term of the thin flux tube equation of motion. Specifically, this time-dependent external velocity field is computed from the 3D global convection simulation using the anelastic spherical harmonic (ASH) code, as described in Miesch et al. (2006). It captures giant-cell convection and the associated mean flows with a solar-like differential rotation, and has a moderate Reynolds number of about 100 in the middle of the convection zone (Miesch et al. 2006).
In the following subsections we review the major findings and conclusions that have been drawn from the various thin flux tube simulations of emerging flux loops.
Latitude of flux emergence and rise time
As a buoyant flux tube rises, the Coriolis force acting on the radial outward motion of the flux tube (or the tendency for the rising tube to conserve angular momentum) drives a retrograde motion of the tube plasma. This retrograde motion then induces a Coriolis force directed towards the Sun’s rotation axis which acts to deflect the trajectory of the rising tube poleward. The amount of poleward deflection by the Coriolis force depends on the initial field strength of the emerging tube, being larger for flux tubes with weaker initial field as was first found by Choudhuri and Gilman (1987). Simulations by Caligari et al. (1995) of the \(\varOmega \)-shaped emerging loops that develop due to the undulatory buoyancy instability of the initial toroidal flux tubes at the bottom of the convection zone show that, for tubes with initial field strength \(\gtrsim 10^5 {\mathrm {\ G}}\), the trajectories of the emerging loops are primarily radial with poleward deflection no greater than \(3^{\circ }\). For tubes with initial field strength exceeding \(4 \times 10^4 {\mathrm {\ G}}\), the poleward deflection of the emerging loops remain reasonably small (no greater than about \(6^\circ \)). However, for a tube with equipartition field strength of \(10^4 {\mathrm {\ G}}\), the rising trajectory of the emerging loop is deflected poleward by about \(20^\circ \). Such an amount of poleward deflection is too great to explain the observed low latitudes of active region emergence. Furthermore, it is found that with such a weak initial field the field strength of the emerging loop falls below equipartition with convection throughout most of the convection zone. Such emerging loops are expected to be subjected to strong deformation by turbulent convection and may not be consistent with the observed well defined order of solar active regions.
Simulations that incorporate the effect of giant-cell convection on the emerging loops developed due to the undulatory buoyancy instability of the initial toroidal flux tubes at the bottom of the convection zone (Weber et al. 2013) show that including convection produces a scatter in the emerging latitude but also systematically reduces the poleward deflection of the mean latitude of emergence. It is also found that including convection significantly reduces the rise time of the emerging loops, especially for weaker initial tube field strengths. Without convection, the rise time increases with decreasing initial field strength of the toroidal flux tube at the bottom of the convection zone, ranging from about 3 month for \(10^5\) G flux tubes to more than 3 years for \(1.5 \times 10^4\) G flux tubes. The large rise time for the weak initial field strength is due to the slow growth of the undulatory buoyancy instability at that field strength. With the inclusion of convection, the evolution of the emerging loops becomes dominated by the drag force from the convective motion compared to the magnetic buoyancy for flux tubes with initial field strength below about \(3 \times 10^4\) G (Weber et al. 2013, see also Sect. 5.6.1), and as a result the mean rise times of the emerging loops are significantly reduced. It is found that with convection (Weber et al. 2013), the mean rise times of emerging loops with initial field strengths ranging from \(1.5 \times 10^4\) G to \(10^5\) G are all below 0.7 years, with the mid-field strength (about \(4 \times 10^4\) G) flux tubes having the longest rise time, and with the weakest and the strongest field strengths resulting in the lowest rise times (about 1 to 3 months).
Active region tilts
A well-known property of the solar active regions is the so called Joy’s law of active region tilts. The averaged orientation of bipolar active regions on the solar surface is not exactly toroidal but is slightly tilted away from the east-west direction, with the leading polarity (the polarity leading in the direction of rotation) being slightly closer to the equator than the following polarity. The mean tilt angle is a function of latitude, being approximately \(\propto \sin (\mathrm {latitude})\) (Wang and Sheeley Jr 1989, 1991; Howard 1991b, a; Fisher et al. 1995; Kosovichev and Stenflo 2008; Stenflo and Kosovichev 2012).
Using thin flux tube simulations of the rise of buoyant \(\varOmega \)-loops in a rotating solar convective envelope, D’Silva and Choudhuri (1993) were the first to show that the active region tilts as described by Joy’s law can be explained by the Coriolis force acting on the flux loops. As the emerging loop rises, there is a relative expanding motion of the mass elements at the summit of the loop. The Coriolis force induced by this diverging, expanding motion at the summit is to tilt the summit clockwise (counter-clockwise) for loops in the northern (southern) hemisphere as viewed from the top, so that the leading side from the summit is tilted equatorward relative to the following side. Since the component of the Coriolis force that drives this tilting has a \(\sin (\mathrm {latitude})\) dependence, the resulting tilt angle at the apex is approximately \(\propto \sin (\mathrm {latitude})\).
Caligari et al. (1995) studied tilt angles of emerging loops developed self-consistently due to the undulatory buoyancy instability of toroidal flux tubes located at the bottom as well as just above the top of their model overshoot region, with selected values of initial field strengths and latitudes lying along contours of constant instability growth times (100 days and 300 days). The resulting tilt angles at the apex of the emerging loops (see Fig. 12) produced by these sets of unstable tubes (whose field strengths are within the range of \(4 \times 10^4 {\mathrm {\ G}}\) to \(1.5 \times 10^5 {\mathrm {\ G}}\)) show good agreement with the observed tilt angles for sunspot groups measured by Howard (1991a).
They also found that loops formed from toroidal flux tubes with an equipartition field strength of \(10^4 {\mathrm {\ G}}\) develop a tilt angle of the wrong sign at the loop apex.
Later simulations by Weber et al. (2011, 2013) incorporated the influence of the giant-cell convection on the rise of buoyantly unstable thin flux tubes from the bottom of the solar convection zone, and studied the resulting tilt angles at the apex of the emerging loops. For each initial field strength and flux for the toroidal flux tube, a large ensemble of simulations are carried out where the emerging loops are buffeted by the different spatial parts and temporal ranges of the convective flow. The resulting tilt angles vs. emerging latitudes from the simulations are shown in Fig. 13, where the different panels show the results for different ranges of fluxes and initial field strengths, with panel (c) showing the results for all of the field strengths ranging from 15 kG to 100 kG and all of the fluxes (\(10^{20}\) Mx, \(10^{21}\) Mx, and \(10^{22}\) Mx). Linear least-squares fits of the form \(\alpha = A \lambda \) (the red line) and \(\alpha = B \sin (\lambda )\) (the cyan line), to the simulated tilt angles (\(\alpha \)) as a function of emerging latitudes (\(\lambda \)) of the emerging loops, are done for the various flux and magnetic field combinations shown in the different panels.
It is found that buffeting by the giant-cell convection systematically increases the mean tilt angle of the emerging loops in the Joy’s law direction due to the mean kinetic helicity of the upflows of the convection. As a result, emerging loops with weaker initial field strengths (\( \lesssim 30 \) kG) that exhibit tilts of the wrong sign without the influence of convection, now also show a mean tilt consistent with the Joy’s law sign with convection. Generally for all values of the magnetic flux and magnetic field strength, the slopes of the linear fits (the red and cyan lines) are systematically greater than the corresponding observed Joy’s law slope obtained from the sunspot group data by Dasi-Espuig et al. (2010) (the blue line) and significantly smaller than that obtained from the magnetogram data by Stenflo and Kosovichev (2012) (the green line). Mid-field strengths of 40 kG–50 kG produce the largest best-fit slopes (A and B in Fig. 13d), but the value of slope \(B = 26^{\circ } \pm 2^{\circ }\) still falls short of the \(32.1^{\circ } \pm 0.7\) value found by Stenflo and Kosovichev (2012) from MDI magnetogram data (the green line).
The influence of convection also produces a scatter of the tilt angles about the mean tilt behavior described by the Joy’s law linear-fit. The root-mean-square (rms) scatter tends to increase with decreasing field strength and decreasing flux (see Table 3 in Weber et al. 2013). Using Mount Wilson sunspot group data, Fisher et al. (1995) found that the rms scatter of the sunspot group tilts from the Joy’s law is \(< 40^{\circ }\). Weber et al. (2013) found that emerging loops with field strengths less than about 40 kG produce a scatter that is too large to be consistent with the above observation result. Overall, Weber et al. (2013) suggest that the initial field strength of active region progenitor flux tubes needs to be sufficiently large, probably \(\gtrsim 40\) kG, in order for them to satisfy the Joy’s Law trend for the mean tilt angle as well as the observed amount of scatter of the tilt angles about the mean Joy’s Law behavior.
Using a series of 96 minute cadence magnetograms from SOHO MDI and analyzing 715 bipolar magnetic regions which emerged within \(30^{\circ }\) from the central meridian and outside already existing active regions, Kosovichev and Stenflo (2008) investigated how the active region tilt angle evolves during flux emergence and how it correlates with other properties of the emerging region. The study shows that at the beginning of emergence the tilt angles are random, and the mean tilt angle is about zero (see Fig. 14a). However by the middle of the emergence period (flux growth period), the tilt angles clearly show a systematic mean as a function of latitude that follows Joy’s law (Fig. 14b). At the end of the emergence period, the Joy’s law dependence has become more pronounced as the scatter from the systematic mean tilt decreases (Fig. 14c). The above result that the systematic mean tilt following Joy’s law is established during the flux emergence period (flux growth period) suggests that the tilt of the emerging flux tube has developed in the interior before reaching the surface. This is consistent with the above model of rising flux tubes where the tilt angle is caused by the effect of the Coriolis force during the rise.
Kosovichev and Stenflo (2008) found that the mean tilt angle does not show a systematic dependence on the flux of the active region in contradiction to the result from the thin flux tube calculation of rising flux tubes in the absence of convection (e.g., Fisher et al. 1995). However with the influence of convection included in the thin flux tube simulations, the best-fit slopes for the mean tilt no longer show a significant systematic variation with the flux above the uncertainties, changing from the conclusion without convection (compare Table 1 and Table 2 in Weber et al. 2013).
Furthermore, Kosovichev and Stenflo (2008) found that there is no tendency for the active region mean tilt to relax towards the east-west direction after the emergence has ceased and the driving Coriolis force has vanished, at which time the tension of the flux tube is expected to act to restore the original toroidal orientation of the tube at the base of the solar convection zone. The latter result may be understood if the active region magnetic fields on the photosphere become dynamically disconnected from the interior flux tubes soon after emergence (e.g Fan et al. 1994; Schüssler and Rempel 2005). On the other hand, as suggested in Kosovichev and Stenflo (2008), it may be that Joy’s law of solar active regions reflects not the Coriolis effect of the rising flux tubes but the spiral orientation of the nearly toroidal magnetic field lines in the interior generated by the latitudinal differential rotation (Babcock 1961).
Non-linear simulations of the two-dimensional MHD tachocline (Cally et al. 2003) show that bands of toroidal magnetic fields in the solar tachocline may become tipped relative to the azimuthal direction by an amount that is within \(+/- 10^\circ \) at sunspot latitudes due to the non-linear evolution of the 2D global joint instability of differential rotation and toroidal magnetic fields. This tipping may either enhance or reduce the observed tilt in bipolar active regions depending on from which part of the tipped band the emerging loops develop. Thus the basic consequence of the possible tipping of the toroidal magnetic fields in the tachocline is to contribute to the spread of the tilts of bipolar active regions.
Using the line-of-sight magnetograms from the Helioseismic and Magnetic Imager (HMI) on the Solar Dynamic Observatory (SDO), Schunker et al. (2020) have measured the evolution of the tilt angles of 153 emerging active regions. It is found that for the beginning phase (Phase 1) of the flux emergence (during which the polarity separation speed is increasing, lasting about 0.5 days after the time of emergence), the active region polarities are on average east-west aligned with a zero mean tilt angle. The systematic mean tilt of the active regions following Joy’s law is established during the Phase 2 of the emergence when the polarity separation speed is decreasing. These observation results are in general agreement with those of Kosovichev and Stenflo (2008). However, Schunker et al. (2020) found it is still surprising that the active regions emerge with an east-west alignment since the thin flux tube model predicts a systematic tilt angle has developed at the apex of the emerging loop. With a simple model calculation (Apendix C Schunker et al. 2020), they found that the observed average north-south separation motion of the polarities during the emergence is not consistent with an emerging loop with an initial constant tilt at the apex. They concluded that Joy’s law is caused by an inherent north-south separation speed present when the flux first reaches the surface and the polarities move to lie over their foot points anchored at some depth below the surface. Progress to understanding the observed tilt angle evolution during active region flux emergence can be obtained by analyzing the resultant surface flux evolution from the sophisticated radiation MHD simulations of near surface layer active region flux emergence (such as Stein and Nordlund 2012; Birch et al. 2016; Chen et al. 2017), considering different rising structures from the deep interior.
Morphological asymmetries of active regions
An intriguing property of solar active regions is the asymmetry in morphology between the leading and following polarities. The leading polarity of an active region tends to be in the form of large sunspots, whereas the following polarity tends to appear more dispersed and fragmented; moreover, the leading spots often form earlier and tend to be longer lived than the following. Fan et al. (1993) offered an explanation for the origin of this asymmetry. In their thin flux tube simulations of the non-axisymmetric eruption of buoyant \(\varOmega \)-loops through a rotating model solar convective envelope, they found that an asymmetry in the magnetic field strength develops between the leading and following legs of an emerging loop, with the field strength of the leading leg being about 2 times that of the following leg. The field strength asymmetry develops because the Coriolis force, or the tendency for the tube plasma to conserve angular momentum, drives a counter-rotating flow of plasma along the emerging loop, which, in conjunction with the diverging flow of plasma from the apex to the troughs, gives rise to an effective asymmetric stretching of the two legs of the loop with a greater stretching and hence a stronger field strength in the leading leg. Fan et al. (1993) argued that the stronger field along the leading leg of the emerging loop makes it less subject to deformation by the turbulent convection and therefore explains the more coherent and less fragmented appearance of the leading polarity of an active region.
However, subsequent simulations using the more physical mechanical equilibrium initial state (e.g., Caligari et al. 1995, 1998; Fan and Fisher 1996) show that the field strength asymmetry between the leading and following legs of the emerging loop depends on the initial field strength of the toroidal flux tube. Fan and Fisher (1996) (see Fig. 15) found consistently stronger fields along the leading leg compared to the following for tubes with weaker initial fields (\(B < 60\) kG), but nearly equal field strengths of the two legs or even the reverse in the upper convection zone for stronger initial fields. The result is similar when buffeting by giant-cell convection is included in the thin flux tube model (Weber et al. 2011, 2013). Weber et al. (2011) computed dB/ds, the derivative of the field strength along the arc-length s in the direction of solar rotation, at the apex of the emerging loop. It is found that dB/ds tends to be systematically greater (less) than zero, i.e. the leading (following) side having a stronger field, for loops with an initial field strength \(\le 50\) kG. But the systematic trend is reversed for greater initial field strengths (60 kG to 100 kG).
On the other hand, later 3D MHD simulations of emerging flux produced by the convective dynamo coupled to near-surface layer 3D MHD simulations of active region formation (Chen et al. 2017) show that the effect of the giant-cell convective flow can produce the observed earlier formation and the more coherent leading polarity sunspots of solar active regions (see Sect. 9).
Geometrical asymmetry of emerging loops and the asymmetric proper motions of active regions
Another asymmetry in the emerging loop generated by the effect of the Coriolis force is the asymmetry in the east-west inclinations of the two sides of the loop. This asymmetry is first shown in the thin flux tube calculations of Moreno-Insertis et al. (1994) and Caligari et al. (1995) who modeled emerging loops that develop self-consistently as a result of the buoyancy instability of toroidal magnetic flux tubes initially in mechanical equilibrium. Moreno-Insertis et al. (1994) and Caligari et al. (1995) found that as the emerging loop rises, the Coriolis force, or the tendency for the tube to conserve angular momentum, drives a counter-rotating motion of the tube plasma, which causes the summit of the loop to move retrograde relative to the valleys, resulting in an asymmetry in the inclinations of the two legs of the loop with the leading leg being inclined more horizontally with respect to the surface than the following leg. This asymmetry in inclination can be clearly seen in Fig. 16, which shows a view from the north pole of an asymmetric emerging loop obtained from a simulation by Caligari et al. (1995).
When buffeting by giant-cell convection is included in the thin flux tube model (Weber et al. 2011), the systemmatic trend of the geometrical asymmetry remains a robust result for emerging loops that originate from initial flux tubes in mechanical equilibrium. Emerging loops with initial field strength ranging from 15 kG to 100 kG all show this asymmetry.
The observational consequences of this geometric asymmetry are discussed in Moreno-Insertis et al. (1994) and Caligari et al. (1995). The emergence of such an eastward inclined loop is expected to produce apparent asymmetric east-west proper motions of the two polarities of the emerging region, with a more rapid motion of the leading polarity spots away from the emerging region compared to the motion of the following polarity spots. Such asymmetric proper motions are observed in young active regions and sunspot groups (see Chou and Wang 1987; van Driel-Gesztelyi and Petrovay 1990; Petrovay et al. 1990). Furthermore, the asymmetry in the inclination of the emerging loop may also explain the observation that the magnetic inversion line in bipolar regions is statistically nearer to the main following spot than to the main proceeding one (van Driel-Gesztelyi and Petrovay 1990; Petrovay et al. 1990).
Weber et al. (2013) computed an apparent rotation rate of an emerging active region by evaluating the apparent zonal motion of the center point between the leading and the following intersections of the emerging loop with the constant-r surface of 0.95 \(R_{\odot }\), when the emerging loop approaches the top boundary of the thin flux tube simulation. It is found that only for emerging loops with initial field strengths \(\ge 60\) kG, do the apparent rotation rates approach the local rotation rate at \(r = 0.95 \, R_{\odot }\), being faster than the solar surface plasma rotation rate, consistent with the observed sunspot rotation rate (e.g., Gilman and Howard 1985). Although the above geometrically determined apparent rotation speed can approach the local rotation rate at 0.95 \(R_{\odot }\), the velocity of the tube plasma near the apex is retrograde with respect to the local rotation rate because of the conservation of angular momentum of the rising tube as it moves away from the rotation axis (e.g., Weber et al. 2011). This retrograde motion of the tube plasma with respect to the local rotation rate is not detected in helioseismic studies of emerging active regions (see Sect. 9.2), and therefore argues against active region and sunspot fields as buoyantly rising flux tubes from the bottom of the solar convection zone.
Hemisphere trend of the twist in solar active regions
Vector magnetic field observations of active regions on the photosphere have revealed that on average the solar active regions have a small but statistically significant mean twist that is left-handed in the northern hemisphere and right-handed in the southern hemisphere (see Pevtsov et al. 1995, 2001, 2003). What is being measured is the quantity \(\alpha \equiv \langle J_z/B_z \rangle \), the ratio of the vertical electric current over the vertical magnetic field averaged over the active region. When plotted as a function of latitude, the measured \(\alpha \) for individual solar active regions show considerable scatter, but there is clearly a statistically significant trend for negative (positive) \(\alpha \) in the northern (southern) hemisphere (see Fig. 17).
A linear least squares fit to the data of \(\alpha \) as a function of latitude (Fig. 17a) found that \(\alpha = -2.7 \times 10^{-10} \, \theta _{\mathrm {deg}}\mathrm {\ m}^{-1}\), where \(\theta _{\mathrm {deg}}\) is latitude in degrees, and that the r.m.s. scatter of \(\alpha \) from the linear fit is \(\varDelta \alpha = 1.28 \times 10^{-8} \mathrm {\ m}^{-1}\) (Longcope et al. 1998). The observed systematic \(\alpha \) in solar active regions may reflect a systematic field line twist in the subsurface emerging flux tubes.
If the measured \(\alpha \) values are a direct consequence of the emergence of twisted magnetic flux tubes from the interior, then it would imply subsurface emerging tubes with a field line twist of \(q = \alpha /2 \) (Longcope and Klapper 1997), where q denotes the angular rate of field line rotation about the axis over a unit axial distance along the tube. Several subsurface mechanisms for producing twist in emerging flux tubes have been proposed (see, e.g., the review by Petrovay et al. 2006). The twist may be due to the current helicity in the dynamo generated toroidal magnetic field, from which buoyant flux tubes form at the base of the convection zone (Gilman and Charbonneau 1999), or it may be acquired during the rise of the flux tubes through the solar convection zone (Longcope et al. 1998; Choudhuri 2003; Choudhuri et al. 2004; Chatterjee et al. 2006).
Longcope et al. (1998) explain the origin of the observed twist in emerging active region flux tubes as a result of buffeting by the helical turbulence in the solar convection zone during the rise of the tubes. Applying the dynamic model of a weakly twisted thin flux tube (Sect. 2.1), Longcope et al. (1998) modeled the rise of a nearly straight, initially untwisted tube, buffeted by a random velocity field representative of the turbulent convection in the solar convection zone, which has a nonzero kinetic helicity due to the effect of solar rotation. The kinetic helicity causes helical distortion of the tube axis, which in turn leads to a net twist of the field lines about the axis in the opposite sense within the tube as a consequence of the conservation of magnetic helicity. This process is termed the \(\varSigma \)-effect by Longcope et al. (1998). Quantitative model calculations of Longcope et al. (1998) show that the \(\varSigma \)-effect can explain the hemispheric sign, magnitude, latitude variation, and the r.m.s. dispersion of the observed \(\alpha \) of solar active regions. Furthermore, the model predicts that the mean twist scales inversely with the flux \(\varPhi \) of the active region as: \(\sim \varPhi ^{-0.69}\). This is roughly because larger flux tubes rise more rapidly, providing less time for actions of the turbulence.
As discussed in Sect. 5.1.2, \(\varOmega \)-shaped emerging loops are themselves acted upon by the Coriolis force, developing a “tilt” of the loop. This helical deformation of the tube axis will then also induce a twist of the field lines of the opposite sense within the tube as a consequence of conservation of magnetic helicity. Calculations have been done based on the weakly twisted thin flux tube model (see Eqs. 9 and 10), which describes the evolution of the twist in response to the motion of the tube, taking into account helicity conservation. It is found that this twist generated by the large scale tilting (writhing) of the emerging \(\varOmega \)-loop resulting from the Coriolis force has the right hemispheric sign and latitude dependence, but is of too small a magnitude to account for the observed twist in solar active regions (Longcope and Klapper 1997; Fan and Gong 2000).
Another interesting and natural explanation for the origin of twist in emerging flux tubes is the accretion of the background mean poloidal field onto the rising flux tube as it traverse through the solar convection zone (Choudhuri 2003; Choudhuri et al. 2004; Chatterjee et al. 2006). In a Babcock–Leighton type dynamo (e.g. section 5 in Charbonneau 2020), the dispersal of solar active regions with a slight mean tilt angle at the surface generates a mean poloidal magnetic field. The mean tilt angle of solar active regions is produced by the Coriolis force acting on the rising flux tubes (see Sect. 5.1.2). In the northern hemisphere, when a toroidal flux tube rises into a poloidal field that has been created due to the tilt of the same type of flux tubes emerged earlier, the poloidal field that gets wrapped around the flux tube by that mechanism will produce a left-handed twist for the tube. This is illustrated in Fig. 18.
Using a circulation-dominated (or flux-transport) Babcock–Leighton type mean-field dynamo model, Choudhuri et al. (2004) did a rough estimate of the twist acquired by an emerging flux tube rising through the solar convection zone. Fig. 19 shows the resulting butterfly diagram indicating the sign of \(\alpha \) of the emerging regions as a function of latitude and time. It is found that at the beginning of a solar cycle, there is a short duration where the sign of \(\alpha \) is opposite to the preferred sign for the hemisphere. This is because of the phase relation between the toroidal and poloidal magnetic fields produced by this types of solar dynamo models. At the beginning of a cycle, the mean poloidal magnetic field in the convection zone is still dominated by that generated by the emerging flux tubes of the previous cycle, and toroidal flux tubes of the new cycle emerging into this poloidal field gives rise to a right-handed (left-handed) twist of the tube in the northern (southern) hemisphere. However, for the rest of the cycle starting from the solar maximum, the poloidal magnetic field changes sign and the twist for the emerging tubes becomes consistent with the hemispheric preference. For the whole cycle, it is found that about 67% of the emerging regions have a sign of \(\alpha \) consistent with the hemispheric rule. The rough estimate also shows that the magnitude of the \(\alpha \) values produced by poloidal flux accretion is consistent with the observed values, and that there is an \(a^{-2}\) dependence on the radius a of the emerging tube, i.e., smaller sunspots should have greater \(\alpha \) values. This prediction is also made by the \(\varSigma \)-effect mechanism.
Given the frozen-in condition of the magnetic field, it is expected that the accreted poloidal flux would be confined in a sheath at the outer periphery of the rising tube, and that in order to produce a twist within the tube, some form of turbulent diffusion needs to be invoked (Chatterjee et al. 2006). By solving the induction equation in a co-moving Lagrangian frame following the rising flux tube and using several simplifying assumptions, Chatterjee et al. (2006) modeled the evolution of the magnetic field in the rising tube cross-section as a result of poloidal flux accretion and penetration due to a field strength dependent turbulent diffusivity. They found that with plausible choices of assumptions and parameter values an \(\alpha \) value comparable to the observations is obtained.
When a buoyant magnetic flux tube is formed at the base of the solar convection zone from the dynamo generated (predominantly) toroidal magnetic field, it should already obtain an initial twist due to the weak poloidal mean field contained in the magnetic layer. This initial twist will then be further augmented or altered due to poloidal flux accretion and also due to the \(\varSigma \)-effect as the tube rises through the solar convection zone. MHD simulations of the formation and rise of buoyant magnetic flux tubes directly incorporating the mean field profiles from dynamo models (for both the fields at the base and in the bulk of the convection zone) as the initial state are necessary to quantify the initial twist and contribution from poloidal field accretion. Such simulations should be done for dynamo mean fields at different phases of the cycle to access the cycle variation of the twist in the emerging flux tubes.
In a set of 2D MHD simulations of the buoyant rise of a twisted horizontal magnetic flux tube, Manek et al. (2018) show that the presence of a weak background horizontal field can severely affect the dynamic rise of the tube. It is found that tubes with a twist where the local tube azimuthal field at the bottom of the tube aligns with the background field are more likely to rise than those with the opposite twist, because the rise of tubes with the latter alignment is suppressed by a relatively weaker background field. Manek et al. (2018) therefore suggest that given the orientation of the poloidal field that produces the cycle’s toroidal field via the latitudinal differential rotation, a left-hand (right-hand) twisted toroidal flux tube would more likely survive the rise in the northern (southern) hemisphere, consistent with the observed hermisphere preference of the twist of solar active regions. However, this model assumes that the poloidal and toroidal fields switch polarity exactly in phase, and does not take into account the solar cycle large-scale mean field phase relation (between the poloidal and the toroidal field) from observation and the flux transport dynamo models (e.g., Choudhuri et al. 2004).
Observationally, looking for any systematic variations of the \(\alpha \) value (or twist) in solar active regions with the solar cycle phase is helpful for identifying the main mechanisms for the origin of the twist. Using high-quality vector magnetograms taken with the Spectro-polarimeter (SP) on board the Hinode satellite, Hao and Zhang (2011) have examined the hemisphere twist sign rule for active regions (ARs) in the descending phase of solar cycle 23 and the ascending phase of solar cycle 24. They found that the ARs in the ascending phase of solar cycle 24 follow the usual hermisphere trend for the sign of twist, while the ARs in the descending phase of solar cycle 23 do not show the trend. This result appears opposite to the cycle variation predicted by the model of Choudhuri et al. (2004). On the other hand, Hao and Zhang (2011) also found that on average the sunspot umbra and penumbra show opposite signs of the \(\alpha \) value (or the current helicity), with the whole AR dominated by the sign in the penumbra, which tends to conform to the hemisphere preferred sign of the twist. This result seems consistent with the model prediction by Chatterjee et al. (2006), which considered the development of the twist of an emerging flux tube by poloidal field accretion and turbulent diffusion, and predicted the existence of a ring of reverse current helicity on the periphery of active regions. The above observational result that weak and inclined fields tend to conform to the hermisphere sign rule of twist and strong and vertical fields tend to violate it has also been found in the study by Otsuji et al. (2015), who also used observations of solar active regions by the Hinode/SP.
Furthermore, observational studies of the correlation between active region twist (as measured by \(\alpha \)) and tilt angles have revealed interesting results (Holder et al. 2004; Tian et al. 2005; Nandy 2006). The \(\varSigma \)-effect predicts that the twist being generated in the tube is uncorrelated to the local tilt of the tube at the apex (Longcope et al. 1998). However, due to the Coriolis force, active region \(\varOmega \)-tubes acquire a mean tilt that has a well defined latitudinal dependence as described by the Joy’s law. The mean twist generated by the \(\varSigma \)-effect also has a latitudinal dependence that is consistent with the observed hemispheric rule. Thus, due to the mutual dependence of their mean values on latitude there should be a correlation between the tilt angle and twist of solar active regions and the correlation is expected to be positive if one assigns negative (positive) sign to a clockwise (counter-clockwise) tilt. However, Holder et al. (2004) found a statistically significant negative correlation between the twist and tilt for the 368 bipolar active regions studied, opposite to that expected from the mutual dependence on latitude of mean twist and mean tilt of active regions. Removing the effect of the mutual dependence of the mean tilt and mean twist on latitude (by either determining the correlation at a fixed latitude, or by subtracting off the fitted mean tilt and mean twist at the corresponding latitude), they found that the negative correlation is enhanced. Furthermore, it is found that the negative correlation is mainly contributed by those active regions (174 out of 368 regions) that deviate significantly from Joy’s law (by \(> 6 \sigma \)), while regions that obey Joy’s law to within \(6 \sigma \) show no significant correlation between their twist and tilt. A separate study by Tian et al. (2005) found that a sample of 104 complex \(\delta \)-configuration active regions, more than half of which have tilts that are opposite to the direction prescribed by Joy’s law, show a significant negative correlation between their twist and tilt [after correcting for their definition of the sign of tilt which is opposite to the definition used in Holder et al. (2004)]. The results of Holder et al. (2004) and Tian et al. (2005) both indicate that there is a significant population (about one half in the case of Holder et al. (2004)) of solar active regions whose twist/tilt properties cannot be explained by the \(\varSigma \)-effect together with the effect of the Coriolis force alone. These active regions are consistent with the situation where the buoyant flux tube that forms at the base of the solar convection zone has acquired an initial twist, such that as it rises upward due to buoyancy into an \(\varOmega \)-tube, it develops a writhe that is of the same sense as the initial twist of the tube (see Sect. 5.4). In the extreme case, the twist can be so large that the flux tube becomes kink unstable (see Sect. 5.5). The resulting tilt at the apex due to the writhe has the negative correlation with the twist as described in the above observations.
On the minimum twist needed for maintaining cohesion of rising flux tubes in the solar convection zone
As described in Sect. 5.1, simulations based on the thin flux tube approximation have revealed many interesting results with regard to the global-scale dynamics of active region emerging flux loops in the solar convective envelope, which provide explanations for several basic observed properties of solar active regions. However, one major question ignored by the thin flux tube model is how a flux tube remains a discrete and cohesive object as it moves in the solar convection zone. The manner in which solar active regions emerge on the photosphere suggests that they are coherent flux bundles rising through the solar convection zone and reaching the photosphere in a reasonably cohesive fashion. To address this question, 3D MHD models that fully resolve the rising flux tubes are needed.
As a natural first step, 2D MHD simulations were carried out to model buoyantly rising, infinitely long horizontal magnetic flux tubes in a stratified layer representing the solar convection zone, focusing on the dynamic evolution of the tube cross-section. The first of such calculations was done in fact much earlier by Schüssler (1979) and later, simulations of higher numerical resolutions have been performed (Moreno-Insertis and Emonet 1996; Longcope et al. 1996; Fan et al. 1998a; Emonet and Moreno-Insertis 1998). The basic result from these 2D models of buoyant horizontal flux tubes is that due to the vorticity generation by the buoyancy gradient across the flux tube cross-section, if the tube is untwisted, it quickly splits into a pair of vortex tubes of opposite circulations, which move apart horizontally and cease to rise. If on the other hand, the flux tube is sufficiently twisted such that the magnetic tension of the azimuthal field can effectively suppress the vorticity generation by the buoyancy force, then most of the flux in the initial tube is found to rise in the form of a rigid body whose rise velocity follows the prediction by the thin flux tube approximation. The result described above is illustrated in Fig. 20 which shows a comparison of the evolution of the tube cross-section between the case where the buoyant horizontal tube is untwisted (upper panels) and a case where the twist of the tube is just above the minimum value needed for the tube to rise cohesively (lower panels).
This minimum twist needed for tube cohesion can be estimated by considering a balance between the magnetic tension force from the azimuthal field and the magnetic buoyancy force. For a flux tube near thermal equilibrium whose buoyancy \(| \varDelta \rho / \rho | \sim 1/\beta \), where \(\varDelta \rho \equiv \rho - \rho _{\mathrm {e}}\) denotes the density difference between the inside and the outside of the tube and \(\beta \equiv p / (B^2/ 8 \pi )\) denotes the ratio of the gas pressure over the magnetic pressure, such an estimate (Moreno-Insertis and Emonet 1996) yields the condition that the pitch angle \(\varPsi \) of the tube field lines on average needs to reach a value of order
$$\begin{aligned} \tan {\varPsi } \equiv \frac{B_{\phi }}{B_{z}} \gtrsim \left( \frac{a}{H_p}\right) ^{1/2}. \end{aligned}$$
(30)
In Eq. (30), \(B_z\) and \(B_{\phi }\) denote the axial and azimuthal field of the horizontal tube respectively, a is the characteristic radius of the tube, and \(H_p\) is the local pressure scale height. The above result on the minimum twist can also be expressed in terms of the rate of field line rotation about the axis per unit length along the tube q. For a uniformly twisted flux tube, \(B_{\phi } = q r B_z\), where r is the radial distance to the tube axis. Then q needs to reach a value of order (Longcope et al. 1999)
$$\begin{aligned} q \gtrsim \left( \frac{1}{H_p a}\right) ^{1/2} \end{aligned}$$
(31)
for the flux tube to maintain cohesion during its rise. Note that the conditions given by Eqs. (30) and (31) and also the 2D simulations described in this section all assume buoyant flux tubes with initial buoyancy \(| \varDelta \rho / \rho | \sim 1/\beta \). For tubes with lower level of buoyancy, the necessary twist is smaller with \(\tan \varPsi \) and q both \(\propto |\varDelta \rho / \rho |^{1/2} \) (see Emonet and Moreno-Insertis 1998).
Longcope et al. (1999) pointed out that the amount of twist given by Eq. (31) is about an order of magnitude too big compared to the twist deduced from vector magnetic field observations of solar active regions on the photosphere. They assumed that the averaged \(\alpha \equiv J_z / B_z\) (the ratio of the vertical electric current over the vertical magnetic field) measured in an active region on the photosphere directly reflects the twist in the subsurface emerging tube, i.e., \(q = \alpha / 2\) (Longcope and Klapper 1997; Longcope et al. 1998). If this is true then it seems that the measured twists in solar active regions directly contradict the condition for the cohesive rise of a horizontal flux tube with buoyancy as large as \( | \varDelta \rho / \rho | \sim 1/\beta \). However it should be noted that the above estimate by Longcope et al. (1999) is largely based on the weakly twisted thin flux tube model. In reality the fragmentation of the emerging flux tube by 3D convection, especially in the top layer of the convection zone where the active region flux tubes clearly can no longer be considered thin, makes the connection of the observed \(\alpha \) in solar active regions to the twist q of the rising flux tubes in the deep solar convection zone very uncertain.
Later, 3D simulations of \(\varOmega \)-shaped arched flux tubes have been carried out (Abbett et al. 2000, 2001; Fan 2001). Fan (2001) performed 3D simulations of arched flux tubes which form from an initially neutrally buoyant horizontal magnetic layer as a result of its undulatory buoyancy instability (see Sect. 4.2 and Fig. 7). It is found that without any initial twist the flux tubes that form rise through a distance of about one density scale height included in the simulation domain without breaking up. This significantly improved cohesion of the 3D arched flux tubes compared to the previous 2D models of buoyant horizontal tubes is not only due to the additional tension force made available by the 3D nature of the arched flux tubes, but also due largely to the absence of an initial buoyancy and a slower initial rise (Fan 2001). With a neutrally buoyant initial state, both the buoyancy force and the magnetic tension force grow self-consistently from zero as the flux tube arches. The vorticity source term produced by the growing magnetic tension as a result of bending and braiding the field lines is found to be able to effectively counteract the vorticity generation by the growing buoyancy force in the apex cross-section, preventing it from breaking up into two vortex rolls. The 2D models (Moreno-Insertis and Emonet 1996; Longcope et al. 1996; Fan et al. 1998a; Emonet and Moreno-Insertis 1998) on the other hand considered an initially buoyant flux tube for which there is an impulsive initial generation of vorticity by the buoyancy force. A significant initial twist is thus required to suppress this initial vorticity generation. Therefore the absence of an initial vorticity generation by buoyancy, and the subsequent magnetic tension force resulting from bending and braiding the field lines allow the arched tube with no net twist in Fan (2001) to rise over a significantly greater distance without disruption.
Abbett et al. (2000) performed 3D simulations where an initial horizontal flux tube is prescribed with a non-uniform buoyancy distribution along the tube such that it rises into an \(\varOmega \)-shaped loop. As discussed above, due to the prescribed buoyancy in the initial horizontal tube, there is an impulsive initial generation of vorticity by the buoyancy force which breaks up the apex of the rising \(\varOmega \)-loop if there is no initial twist. However the separation of the two vortex fragments at the apex is reduced due to the three-dimensional effect (Abbett et al. 2000). By further including the effect of solar rotation using a local f-plane approximation, Abbett et al. (2001) found that the influence of the Coriolis force significantly suppresses the degree of fragmentation at the apex of the \(\varOmega \)-loop (Fig. 21).
They also found that the Coriolis force causes the emerging loop to become asymmetric about the apex, with the leading side (leading in the direction of rotation) having a shallower angle with respect to the horizontal direction compared to the following (Fig. 21), consistent with the geometric asymmetry found in the thin flux tube calculations (Sect. 5.1.4).
Another interesting possibility is suggested by the 3D simulations of Dorch and Nordlund (1998), who showed that a random or chaotic twist with an amplitude similar to that given by Eqs. (30) or (31) in the flux tube can ensure that the tube rises cohesively. Such a random twist may not be detected in the photosphere measurement of active region twists which is determined by taking some forms of average of the quantity \(\alpha = J_z / B_z\) over the active region.
Martínez-Sykora et al. (2015) carried out a comprehensive multi-parametric study of the buoyant rise of \(\varOmega \)-shaped magnetic flux tubes in an adiabatically stratified layer with 3D MHD simulations that achieved a hitherto significantly higher spatial resolution and hence significantly higher Reynolds numbers through the use of Adaptive Mesh Refinement (AMR) of the numerical code. They examined the dependence of the tube evolution on the field line twist and on the curvature of the tube axis. They found that the results are quite different for the low-diffusion regime with the Reynolds number for the rising flux tube \(R_e \sim O(100)\) compared to those in the high-diffusion regime with \(R_e \lesssim O(10)\), which characterized the earlier 3D simulations. In the high-diffusion regime, the amount of longitudinal flux retained in the rising head of the flux tube increases with the curvature of the flux tube axis (see Fig. 22), consistent with the earlier 3D simulations of Abbett et al. (2000). But when the low-diffusion regime is reached (with the use of AMR), a smaller magnetic twist (below the critical value given in equation 31) is able to prevent the splitting of the flux tube into two vortex tubes, and the loop curvature does not play a significant role in the cohesion of the rising tube (see Fig. 23).
Results from rotating spherical-shell simulations
Fan (2008) has carried out a set of 3D anelastic MHD simulations of the buoyant rise of twisted magnetic flux tubes in an adiabatically stratified model solar convection zone in a rotating spherical shell geometry but without the presence of convection (see Fig. 24 and the associated video for one example simulation). These simulations have considered twisted, buoyant toroidal flux tubes at the base of the solar convection zone with an initial field strength of \(10^5 {\mathrm {\ G}}\), being \(\sim \) 10 times the equipartition field strength, and thus have neglected the effect of convection. Although it should be noted that the thin flux tube simulations incorporating the influence of the giant-cell convection (Weber et al. 2011, 2013) show that even with such high field strengths, the evolution of the rising flux tube is still significantly impacted in the regions of the strongest downflows. The main finding from the 3D simulations of Fan (2008) is that the twist of the tube induces a tilt at the apex of the rising \(\varOmega \)-tube that is opposite to the direction of the observed mean tilt of solar active regions, if the sign of the twist follows the observed hemispheric preference. It is found that in order for the tilt driven by the Coriolis force to dominate, such that the emerging \(\varOmega \)-tube shows a tilt consistent with Joy’s law of active region mean tilt, the initial twist rate of the flux tube needs to be smaller than about a half of that required for the tube to rise cohesively. Under such conditions, the buoyant flux tube is found to undergo severe flux loss during its rise, with less than 50% of the initial flux remaining in the final \(\varOmega \)-tube that rises to the surface (see Fig. 24). However the severe flux loss may be a result of the high level of diffusion due to the limited spatial resolution of the simulations as shown in Martínez-Sykora et al. (2015). By carrying out 3D MHD simulations with AMR in a local Cartesian domain that achieved a significantly higher spatial resolution and hence significantly higher Reynolds numbers, Martínez-Sykora et al. (2015) found that the tube cohesion for a moderately twisted buoyant flux tube can be significantly improved when the high Reynolds number regime is reached through the use of AMR (see end of Sect. 5.3). Thus simulations in a global rotating spherical shell geometry as Fan (2008) but reaching a significantly higher spatial resolution as that in Martínez-Sykora et al. (2015) are needed.
Furthermore, Fan (2008) found that the Coriolis force drives a retrograde flow along the apex portion of the rising tube, resulting in a relatively greater stretching of the field lines and hence stronger field strength in the leading leg of the tube. With a greater field strength, the leading leg is more buoyant with a greater rise velocity, and remains more cohesive compared to the following leg (see Figs. 25a,b). Figure 25c shows selected field lines threading through the coherent apex cross-section of the final \(\varOmega \)-tube, resulting from the simulation of a weakly twisted buoyant tube described in Fan (2008, see the LNT run in that paper). It can be seen that the field lines in the leading side are winding about each other smoothly in a coherent fashion, while the field lines in the following side are significantly more frayed.
The 3D simulation of Fan (2008) found a retrograde flow of \(\sim 100\) m/s for the tube plasma at the apex of the emerging tube as it reaches about 30 Mm below the surface. This is consistent with the result from the thin flux tube simulations (e.g., Caligari et al. 1995; Weber et al. 2011) and it is due to the tendency for the rising flux tube to conserve the angular momentum as it rises from the bottom of the solar convection zone. This retrograde flow at the rising flux tube’s apex was examined by Birch et al. (2010) for its detectability by the time-distance helioseismology, as a possible subsurface pre-emergence signature for emerging active regions. It was found that a statistical approach of averaging over a large number (about 150) of emerging active regions is needed to reach the sufficient signal to noise ratio for detecting the travel time perturbation signature produced by the flow. Such a statistical study of a large sample of emerging active regions was conducted by Birch et al. (2013), and it puts strong constraints on models of active region flux emergence (Sect. 9.2).
Jouve and Brun (2007) have also carried out anelastic MHD simulations in a rotating spherical shell geometry to study the buoyant rise of an axisymmetric toroidal flux ring in an isentropically stratified (non-convecting) envelope. They have considered a even greater initial field strength of \(1.8 \times 10^5 {\mathrm {\ G}}\) for the initial toroidal flux ring. As was discussed in Fan (2008), the poleward deflection of the rise trajectory of the tube due to the Coriolis force is far more severe for an axisymmetric toroidal ring (where the whole ring is moving away from the rotation axis of the Sun) than for a localized 3D \(\varOmega \)-shaped tube (see Sect. 3.1 in Fan 2008). Thus an initial field strength of \(\gtrsim 1.8 \times 10^5 {\mathrm {\ G}}\) is needed for an axisymmetric toroidal ring to rise nearly radially (Jouve and Brun 2007). The simulations of Jouve and Brun (2007) also recovered the previous results from Cartesian simulations that if the flux tube is not twisted, it splits into two counter rotating vortices before reaching the top of the envelope.
Fournier et al. (2017) presented highly-resolved 3D compressible MHD simulations of the buoyant rise of non-axisymmetric magnetic flux tubes in an adiabatically stratified rotating stellar interior envelope with varying rotation rates, without the presence of convection. The simulations considered flux tubes that are sufficiently twisted (satisfying the minimum twist requirement eq. 30) to ensure a coherent rise. With the use of AMR, the simulations well resolve the rising flux tube, with the initial tube diameter near the bottom of the domain being resolved by at least 50 grid points. They found that the compressible simulations show results that are consistent with the previous thin flux tube and anelastic simulations in regard to the rising trajectories and rise times. Through parameter studies, they derived a control parameter that determines the rise time and the regime of the rise (buoyancy-dominated or rotation-dominated), in terms of the stellar rotation rate, the magnetic field strength of the flux tube, and the azimuthal wavenumber of the rise pattern.
The rise of highly twisted, kink unstable magnetic flux tubes as a possible origin of \(\delta \)-sunspots
As discussed in Sect. 5.2, most of the solar active regions are observed to have very small twists, with an averaged value of \(\alpha \equiv \langle J_z/B_z \rangle \) (the averaged ratio of the vertical electric current over the vertical magnetic field) measured to be on the order of \(0.01 \mathrm {\ Mm}^{-1}\) (e.g. Pevtsov et al. 1995, 2001). However there is a small but important subset of active regions, called the \(\delta \)-sunspots, which are observed to be highly twisted with \(\alpha \) reaching a few times \(0.1 \mathrm {\ Mm}^{-1}\) (Leka et al. 1996), and to have unusual polarity orientations that are sometimes reversed from Hale’s polarity rule (see Zirin and Tanaka 1973; Zirin 1988; Tanaka 1991). These \(\delta \)-sunspots are compact structures where umbrae of opposite polarity are contained within a common penumbra. They are found to be the most flare productive active regions (see Zirin 1988). Through careful analysis of the evolution of flare-active \(\delta \)-sunspot groups, Tanaka (1991) proposed a model of an emerging twisted flux rope with kinked or knotted geometry to explain the observed evolution of these regions.
Motivated by the observations of \(\delta \)-sunspots, MHD calculations of the evolution of highly twisted, kink unstable magnetic flux tubes in the solar convection zone have been carried out (Linton et al. 1996, 1998, 1999; Fan et al. 1998b, 1999). For an infinitely long twisted cylindrical flux tube with axial field \(B_z (r)\), azimuthal field \(B_{\theta } (r) = q\,(r)\,r B_z (r)\), and plasma pressure \(p\,(r)\) in hydrostatic equilibrium where \(dp/dr = -(B_{\theta }^2 / 4 \pi r) - d(B_z^2+B_{\theta }^2) / dr\), a sufficient condition for the flux tube to be kink unstable is (see Freidberg 1987)
$$\begin{aligned} \frac{r}{4} \left( \frac{q'}{q} \right) ^2 + \frac{8 \pi p'}{B_z^2} < 0 \end{aligned}$$
(32)
to be true somewhere in the flux tube. In Eq. (32) the superscript ’ denotes the derivative with respect to r. This is known as Suydam’s criterion. Note that condition (32) is sufficient but not necessary for the onset of the kink instability and hence there can be cases which are kink unstable but do not satisfy condition (32). One such example are the force-free twisted flux tubes which are shown to be always kink unstable without line-tying (i.e., infinitely long) (Anzer 1968), but for which \(p' = 0\). Force-free fields are the preferred state for coronal magnetic fields under low plasma-\(\beta \) conditions and are not a likely state for magnetic fields in the high-\(\beta \) plasma of the solar interior. Linton et al. (1996) considered the linear kink instability of uniformly twisted cylindrical flux tubes with \(q = B_{\theta } / r B_z\) being constant, confined in a high \(\beta \) plasma. They found that the equilibrium is kink unstable if q exceeds a critical value \(q_\mathrm {cr} = a^{-1}\), where \(a^{-2}\) is the coefficient for the \(r^2\) term in the Taylor series expansion of the equilibrium axial magnetic field \(B_z\) about the tube axis: \(B_z(r) = B_0 (1 - a^{-2} r^2 + \cdots ) \). This result is consistent with Suydam’s criterion. They further argued that an emerging, twisted magnetic flux loop will tend to have a nearly uniform q along its length since the rise speed through most of the solar convection zone is sub-Alfvénic and torsional forces propagating at the Alfvén speed will equilibrate quasi-statically. Meanwhile expansion of the tube radius at the apex as it rises will result in a decrease in the critical twist \(q_\mathrm {cr} = a^{-1}\) necessary for the instability. This implies that as a twisted flux tube rises through the solar interior, a tube that is initially stable to kinking may become unstable as it rises, and that the apex of the flux loop will become kink unstable first because of the expanded tube cross-section there (Parker 1979; Linton et al. 1996).
The non-linear evolution of the kink instability of twisted magnetic flux tubes in a high-\(\beta \) plasma has been investigated by 3D compressible MHD simulations (Linton et al. 1998, 1999) and 3D anelastic MHD simulations (Fan et al. 1998b, 1999). Fan et al. (1998b, 1999) modeled the rise of a kink unstable flux tube through an adiabatically stratified model solar convection zone.
In the case where the initial twist of the tube is significantly supercritical such that the e-folding growth time of the most unstable kink mode is smaller than the rise time scale, Fan et al. (1999) found sharp bending of the flux tube as a result of the non-linear evolution of the kink instability. During the onset of the kink instability, the magnetic energy decreases while the magnetic helicity is approximately conserved. The writhing of the flux tube also significantly increases the axial field strength and hence enhances the buoyancy of the flux tube. The flux tube rises and arches upward at the portion where the kink concentrates, with a rotation of the tube orientation at the apex that exceeds \(90^{\circ }\) (see Fig. 26). Based on the orientation of the magnetic bipoles seen in a horizontal cross-section (Fig. 27) taken near the top of the kinked loop approaching the top boundary, it is conjectured that the emergence of this kinked flux tube may give rise to a compact magnetic bipole with polarity order inverted from the Hale polarity rule (Fig. 27) as often seen in \(\delta \)-sunspots.
However the aforementioned simulations (Linton et al. 1998, 1999; Fan et al. 1998b, 1999) are limited to modeling the rise of the kink unstable flux tubes within an adiabatically stratified model convection zone, and thus it is not shown whether the emergence of the kinked tubes from the convection zone into the solar atmosphere can produce the observed characteristics of the \(\delta \)-sunspots on the photosphere. To address this question, Takasao et al. (2015)(see also review by Toriumi and Wang (2019)) carried out a 3D compressible MHD simulation of a buoyant, kink-unstable flux tube that becomes kinked in the top layer of an adiabatically stratified model convection zone and subsequently emerges into the solar atmosphere. It is found that due to the development of the kink instability before the emergence, the magnetic twist at the apex of the kinked tube is greatly reduced as a result of conversion of twist into writhe, although the two legs of the kinked tube are still strongly twisted. Instead of forming a bipolar structure, the emergence of the subsurface kinked flux tube produces at the photosphere a complex quadrupole structure (see e.g. Fig. 35 and associated movie in Toriumi and Wang 2019), containing a narrow elongated bipolar pair sandwiched between a pair of coherent twisted bipolar spots. The central narrow bipolar channel is formed due to the submergence of U-shaped loops that develop at the apex of the kinked tube.
The conservation of magnetic helicity requires that the writhing of the tube due to the kink instability is of the same sense as the twist of the field lines. Hence for a kinked emerging tube, the rotation or tilt of the emerging magnetic bipole from the east-west polarity orientation defined by the Hale’s polarity rule should be related to the twist of the tube. The rotation or tilt should be clockwise (counterclockwise) for right-hand-twisted (left-hand-twisted) flux tubes. This tilt–twist relation can be used as a means to test the model of kinked flux tubes as the origin of \(\delta \)-sunspots (Tanaka 1991; Leka et al. 1994, 1996; López Fuentes et al. 2003). Observations have found with both consistent and opposing cases (Leka et al. 1996; López Fuentes et al. 2003). A study (Tian et al. 2005) which includes a large sample (104) of complex \(\delta \)-configuration active regions shows that 65–67% of these \(\delta \)-regions have the same sign of twist and writhe, supporting the model of kinked flux tubes.
Another scenario that explains the origin of the compact \(\delta \)-sunspot configuration is the “multi-buoyant segment model” (see review by Toriumi and Wang 2019) first simulated by Toriumi et al. (2014) to explain the quadrupolar flux emergence pattern of active region (AR) NOAA 11158. In this scenario, two adjacent buoyant segments of a single subsurface twisted flux tube rise as an M-shaped loop towards the surface to form two bipolar emerging regions. The adjacent inner polarities with opposite signs of the two emerging bipolar regions collide to form a compact \(\delta \)-sunspot with highly sheared and compressed polarity inversion line (PIL), reminiscent of the flux emergence pattern observed in AR 11158. The inner polarities collide to form the compact \(\delta \)-sunspot because they are connected beneath the surface by downward moving U-shaped fields. In a recent realistic radiation MHD simulation of active region formation in a deep convecting domain that encompasses both the bulk of the convection zone and the near surface layer, Toriumi and Hotta (2019) successfully simulated the spontaneous formation of \(\delta \)-sunspots through a multi-buoyant segment scenario that naturally develops due to the interaction between the emerging magnetic flux and turbulent convection (see Sect. 9.1).
3D MHD simulations of buoyant flux tubes in a stratified convective velocity field
General considerations
To understand how active region flux tubes emerge through the solar convection zone, it is certainly important to understand how 3D convective flows in the solar convection zone affect the rise and the cohesion of the buoyant flux tubes. 3D MHD simulations in the presence of stratified convective flows are needed to fully address the above question. The thin flux tube model (e.g., Weber et al. 2011) suggests that in order for convection not to significantly affect the buoyant rise of the flux tube, the magnetic buoyancy of the flux tube should dominate the downward hydrodynamic force from the convective downflows:
$$\begin{aligned} \frac{B^2}{8 \pi H_p }> C_{\mathrm {D}}\frac{\rho v_{\mathrm {c}}^2}{\pi a} \,\,\Rightarrow \,\, B > \left( \frac{2 C_{\mathrm {D}}}{\pi } \right) \left( \frac{H_p}{a}\right) ^{1/2} B_{\mathrm {eq,downflow}}, \end{aligned}$$
(33)
where \(B_{\mathrm {eq,downflow}}\equiv (4 \pi \rho )^{1/2} v_{\mathrm {c}}\) is the field strength at which the magnetic energy density is in equipartition with the kinetic energy density of the convective downdrafts, \(v_{\mathrm {c}}\) is the flow speed of the downdrafts, \(H_p\) is the local pressure scale height, a is the tube radius, and \(C_{\mathrm {D}}\) is the aerodynamic drag coefficient which is of order unity. In Eq. (33) we have used the aerodynamic drag force as an estimate for the magnitude of the hydrodynamic forces. The estimate (33) leads to the condition that the field strength of the flux tube needs to be significantly higher than the equipartition field strength by a factor of \(\sqrt{H_p / a}\). For flux tubes responsible for active region formation, \(\sqrt{H_p / a} > 3\) near the bottom of the solar convection zone. Thus we call \(B \gtrsim 3 B_{\mathrm {eq,downflow}}\) the “magnetic buoyancy dominated regime”, and expect \(B < 3 B_{\mathrm {eq,downflow}}\) to be the regime where convective downflows become dominant.
Simulations in a local Cartesian geometry without rotation
Fan et al. (2003) carried out direct 3D anelastic MHD simulations of the evolution of a buoyant magnetic flux tube in a stratified convective velocity field in a Cartesian box that spans 3 density scale heights. The density contrast between the bottom and top of the domain is approximately equal to that between the bottom of the solar convection zone and about 37 Mm depth the photosphere. The basic result of the simulations is illustrated in Fig. 28.
They first computed a 3D convective velocity field in a superadiabatically stratified fluid, until the convection reaches a statistical steady state. The resulting velocity field (see top-left image in Fig. 28) shows the typical features of overturning convection in a stratified fluid as found in many previous investigations. In the bulk of the convecting domain, the downflows are concentrated into narrow filamentary plumes, some of which extend all the way across the domain, while the upflows are significantly broader and are of smaller velocity amplitude in comparison to the downdrafts. A uniformly buoyant, twisted horizontal magnetic flux tube having an entropy that is equal to the entropy at the base of the domain is inserted into the convecting box (see top-right image in Fig. 28).
In the case where the field strength of the tube is in equipartition to the kinetic energy density of the strongest downdraft (left column in the bottom panel of Fig. 28), i.e. \(B=B_{\mathrm {eq,downflow}}\), the evolution of the tube depends sensitively on the local condition of the convective flows. Despite being buoyant, the portions of the tube in the paths of downdrafts are pushed downward and pinned down to the bottom, while the rise speed of sections within upflow regions is significantly boosted. \(\varOmega \)-shaped emerging tubes can form between downdrafts. It is found that the three-dimensional evolution and the cohesion of the flux tubes with \(B = B_{\mathrm {eq,downflow}}\) no longer depend sensitively on the initial twist of the tube, in contrast to the results obtained in the absence of convection. In the case of convection-dominated evolution, the flux tube is being bent by the flow in an incoherent manner along the tube. The flux tube is no longer able to develop vortex tubes coherent along the tube axis, which is the main reason for the flux tube to break up and cease to rise in the absence of convection. Despite being severely distorted, the \(\varOmega \)-tube that emerge between downdrafts can not be ruled out as a possible source of solar active regions, given the fact that the observed emerging magnetic regions often show complex morphologies.
On the other hand in the case where the tube field strength is 10 times the equipartition value (right column in the bottom panel of Fig. 28), the horizontal flux tube rises under its uniform buoyancy, nearly unaffected by the convection. In this magnetic buoyancy dominated regime, a sufficient initial twist is needed to prevent the tube from breaking up into two vortex tubes and rise coherently, similar to the result shown in the previous simulations of rising flux tubes in the absence of convection (Sect. 5.3).
In the anelastic simulations of Fan et al. (2003) discussed above, the physical values of the downflow speed and \(B_{\mathrm {eq,downflow}}\) are all scaled to the value of the superadiabaticity \(\delta _r\) at the base of the domain used for the reference state. If we assume a value of \(2 \times 10^{-8}\) for the superadiabaticity near the base of the solar convection zone for \(\delta _r\), the \(B_{\mathrm {eq,downflow}}\) described in the above result would correspond to about 10 kG.
The 3D simulations of Fan et al. (2003) have very limited spatial resolution with the initial radius of the tube being resolved by about 6 grid points and the simulations also incorporate explicit viscosity and magnetic diffusion. As has been shown in Martínez-Sykora et al. (2015), who have carried out 3D MHD simulations of buoyant flux tubes in the absence of convection achieving a much higher spatial resolution (and hence significantly lower diffusions) through the use of AMR, the results change significantly when the low-diffusion regime is reached compared to the high-diffusion regime. Therefore the results such as those discussed above in Fan et al. (2003) of buoyant flux tubes in convection need to be further examined with higher resolution simulations. However it is difficult to achieve the low-diffusion regime using AMR in the case where convection is present and small scale features develop everywhere.
Global rotating convective spherical-shell simulations
Jouve and Brun (2009) have carried out the first set of global anelastic MHD simulations of the buoyant rise of an initially toroidal flux ring in a rotating, fully convective spherical shell, possessing self-consistently generated mean flows such as meridional circulations and differential rotation, representative of the conditions of the solar convective envelope (see, e.g., review by Miesch 2005). They inserted into the fully developed convecting envelope a buoyant toroidal flux ring with different initial field strengths, twist rates, and initial latitudes, to study how the flux tube rises in the presence of convection and the associated mean flows, and how the dynamic evolution depends on the above initial parameters.
It is found that the magnetic field strength corresponding to the value that is in equipartition with the kinetic energy of the strongest downflows is rather high, \(B_{\mathrm {eq,downflow}}\approx 6.1 \times 10^4 {\mathrm {\ G}}\). The initial field strength B of the toroidal flux rings considered in the simulations are all significantly greater than \(B_{\mathrm {eq,downflow}}\), being \(2.5 B_{\mathrm {eq,downflow}}\), \(5 B_{\mathrm {eq,downflow}}\), and \(10 B_{\mathrm {eq,downflow}}\). Thus, except for the case with \(B= 2.5 B_{\mathrm {eq,downflow}}\), all of the other cases simulated are in the magnetic buoyancy dominated regime (with \(B > 3 B_{\mathrm {eq,downflow}}\) as discussed in Sect. 5.6.1). As a result, the simulations recovered many of the findings obtained from previous simulations in the absence of convective flows. These include the dependence of the poleward deflection of the tube on the initial tube field strength (e.g., Choudhuri and Gilman 1987; Fan 2008), the critical dependence on the initial twist for the cohesion of the buoyantly rising flux tube (e.g., Emonet and Moreno-Insertis 1998; Abbett et al. 2000), and the dependence of the tilt angle of the emerging tube on the initial twist (Sect. 5.4 and Fan 2008). Due to the relatively high magnetic diffusivity in the code, flux tubes with a very large initial field strength (ranging from \(1.5 \times 10^5 {\mathrm {\ G}}\) to \(6 \times 10^5 {\mathrm {\ G}}\)) and a large radius, corresponding to a total flux on the order of a few times \(10^{23} {\mathrm {\ Mx}}\), significantly greater than the typical active region fluxes, are considered, such that the rise times of the flux tubes are \(\lesssim \) the diffusive time scale of about 14.5 days. Because most of the cases considered are essentially in the magnetic buoyancy dominated regime, the rising toroidal flux tube only develops rather moderate undulations by the influence of the convective flows (see Fig. 29), and \(\varOmega \) tubes with undulations extending the depth of the convection zone are not found.
It is also found that flux tubes introduced at lower latitudes (e.g., at \(15^{\circ }\)) have difficulty reaching the top of the domain (even with a strong initial field strength of \(5 B_{\mathrm {eq,downflow}}\approx 3 \times 10^5 {\mathrm {\ G}}\)), and the authors attributed the cause of this to the differential rotation. For the weakest field strength case (with \(B = 2.5 B_{\mathrm {eq,downflow}}= 1.5 \times 10^5 {\mathrm {\ G}}\)), it is found that portions of the toroidal ring are pinned down by the convective downdrafts, and eventually the tube loses its buoyancy due to magnetic diffusion and is unable to rise to the top (see top panel of Fig. 29).
Jouve et al. (2013) have extended the above work to consider initial toroidal flux rings at the base of the convection zone with a longitudinally localized buoyancy distribution to simulate the buoyant rise of a single \(\varOmega \)-shaped loop of flux in the rotating, convective spherical shell with self-consistently generated mean flows (i.e. the differential rotation and meridional circulation). It is found that the rise and the characteristics of the emerging regions are strongly affected by the convective motions when loops with initial field strengths \(\le 10^5\) G are considered, however emerging regions with the correct tilt and a dominant leading polarity are still found in these simulations. However most of these simulations have used initial toroidal flux tubes with a right-handed initial twist in the northern hemisphere, which is opposite to the observed preferred sign of twist (left-handed) for active regions in the northern hemisphere (e.g., Pevtsov et al. 2001). It has been shown in the simulations in the absence of convection (e.g. Fan 2008) that with a magnitude of the twist that reaches the critical twist needed for the buoyant flux tube to rise cohesively, the orientation of the tilt angle of the final emerging region is strongly influenced by the sign of the twist of the rising \(\varOmega \) loops, where a left-handed twist tends to produce a tilt that is opposite to the observed tilt of solar active regions in the northern hemisphere. It is still not clear whether, in the presence of the rotating convection, the buoyant rise of an \(\varOmega \)-shaped tube with a left-handed twist in the northern hemisphere (consistent with the observed hemisphere preference of the sign of active region twist) can produce an emerging region with the correct tilt and a dominant leading polarity, as were found for the right-hand twisted cases in Jouve et al. (2013).
It should be emphasized that so far, all 3D simulations of isolated rising flux tubes in a global rotating spherical convective envelope have to use initial toroidal flux tubes with a large radius, and hence a large total flux (a few times \(10^{23}\) Mx), about an order of magnitude greater than the flux of typical large solar active regions of \(10^{22}\) Mx, due to the large numerical diffusion (that erodes the magnetic buoyancy) resulting from the limited numerical resolution. Clearly significantly higher resolution simulations with a reduced magnetic diffusion are necessary to model the evolution of rising flux tubes from the bottom of the solar convection zone in more realistic parameter regimes. Specifically, it is important to model cases with a weaker initial field strength (\(10^4 {\mathrm {\ G}} \lesssim B \lesssim 10^5 {\mathrm {\ G}}\)) under the significant influence of the rotating solar convection, as well as the influence of the turbulent magnetic field of the convective dynamo (e.g., Pinto and Brun 2013), to study whether \(\varOmega \)-shaped emerging flux bundles with properties consistent with solar active regions can develop.