Introduction

The fundamental processes that result in triggering of a substorm have been a topic of intense debate and research over several decades. The energy variables that strongly influence the buildup and subsequent release of energy in a substorm are the magnetotail lobe current and associated lobe magnetic field, the cross-tail voltage, the plasma sheet magnetic and thermal pressure, and the speed of parallel flows along the stretched tail magnetic field lines. The ionosphere and inner magnetosphere influence the growth phase, onset phase, expansion phase and recovery phase of a typical substorm. These regions may behave very differently under the strongly variable solar wind conditions. Their interaction may be different under various types of substorm activity, for example isolated substorms that occur during relatively quiet periods, storm time substorms or sawteeth events which are sometimes referred to as periodic substorms (Partamies et al. 2009).

A standard view of the substorm temporal development is through a nonlinear energy buildup and unloading process (Baker et al. 1999). The growth phase begins when the IMF turns southward for a certain period of time, then plasma sheet thinning develops, and then reconnection occurs. This is followed by a rapid dipolarization during the expansion phase. The growth phase is not always clearly observable because of intermittent northward and southward fluctuations in the IMF. The precise mechanism and sequence of events during substorm onset are still under investigation (Sergeev et al. 2012), but reconnection in the tail is widely accepted to be the point when dipolarization occurs. This begins the expansion phase of a substorm. Finally, after plasma energy is lost through plasma outflow and ohmic dissipation, the substorm moves into the recovery phase and the magnetotail reverts back to a stretched tail configuration.

The nonlinear loading and unloading character of geomagnetic substorms under southward IMF conditions has been investigated by several authors (Klimas et al. 1992, 1996; Blanchard and McPherron 1993). Vassiliadis et al. (1995) employed a complex nonlinear filter approach for describing the solar wind-magnetosphere coupling and later to predict the AL index (Klimas et al. 1998). Weigel et al. (1999) used a neural network technique to predict the AL index. The nonlinear dynamical WINDMI model was used by Horton et al. (2003) to classify substorms into 3 categories, including the so-called northward turning triggered substorm (Gallardo-Lacourt et al. 2012). Spencer and Patra (2013) investigated the effect of enhanced ionospheric conductance on the substorm dynamics.

Here we proceed to develop and analyze the WINDMI model characteristics following (Horton and Doxas 1996) and (Spencer et al. 2007) as a tool to forecast the loading–unloading type of substorm. We note that a strongly fluctuating solar wind may trigger surges in the AE index (Pulkkinen et al. 2006), but this effect is not represented in the physics of the WINDMI model. The current version of the WINDMI model is available at the NASA Community Coordinated Modeling Center for real-time forecasts of space weather geomagnetic storm and substorm activity (Mays et al. 2009).

Kalmoni et al. (2015) used auroral brightening signatures from ground-based auroral imagers to identify onset times. These onset times may indicate the beginning of substorms or pseudo-breakup events. Using these onset times as a rough guide, and combining them with AL and SML (SuperMAG lower auroral electrojet index) indices obtained from WDC Kyoto and SuperMAG, respectively, we tune the WINDMI model to produce substorms to establish bounds for the model parameters under different magnetospheric and solar wind conditions.

The paper is outlined as follows. In the next section the WINDMI model is described. In the third section we discuss the data and methodology employed to analyze the substorms. We then present the results of our analysis in the fourth section and finally draw some conclusions in the fifth section.

WINDMI model

The plasma physics-based nonlinear dynamical WINDMI model uses an equivalent solar wind voltage \(V_{\rm sw}\) generated by solar wind-magnetosphere coupling functions as input to eight ordinary differential equations that accounts for the transfer of power between the global energy components of the nightside magnetosphere. The WINDMI model is discussed in some details in Doxas et al. (2004), Spencer et al. (2007) and more recently in Patra et al. (2011). The equations of the model are given by:

$$L \frac{{\text {d}}I}{{\text {d}}t}= V_{\mathrm{sw}}(t) - V + M \frac{{\text {d}}I_1}{{\text {d}}t}$$
(1)
$$C \frac{{\text {d}}V}{{\text {d}}t} = I - I_1 - I_{\mathrm{ps}} - \Sigma V$$
(2)
$$\frac{3}{2} \frac{{\text {d}}p}{{\text {d}}t} = \frac{\Sigma V^2}{\Omega _{\mathrm{cps}}} - u_0 p K_\Vert ^{1/2} \Theta (U) - \frac{pVA_{\mathrm{eff}}}{\Omega _{\mathrm{cps}} B_{\mathrm{tr}} L_y}-\frac{3p}{2\tau _E}$$
(3)
$$\begin{aligned} \frac{{\text {d}}K_\Vert }{{\text {d}}t}= I_{ps} V - \frac{K_\Vert }{\tau _\Vert } \end{aligned}$$
(4)
$$\begin{aligned} L_I \frac{{\text {d}}I_1}{{\text {d}}t}= V - V_I + M\frac{{\text {d}}I}{{\text {d}}t} \end{aligned}$$
(5)
$$\begin{aligned} C_I \frac{{\text {d}}V_I}{{\text {d}}t}= I_1 - I_2 - \Sigma _I V_I \end{aligned}$$
(6)
$$\begin{aligned} L_2 \frac{{\text {d}}I_2}{{\text {d}}t}= V_I - (R_{\mathrm{prc}} + R_{A2})I_2 \end{aligned}$$
(7)
$$\begin{aligned} \frac{{\text {d}} W_{\mathrm{rc}}}{{\text {d}}t}= R_{\mathrm{prc}} I_2^2 + \frac{pVA_{\mathrm{eff}}}{B_{\mathrm{tr}} L_y} - \frac{W_{\mathrm{rc}}}{\tau _{\mathrm{rc}}} \end{aligned}$$
(8)

The coupled nonlinear equations track the exchange of electromagnetic, electric and mechanical energy through eight pairs of energy-conserving terms. The remaining terms in each equation describe the loss of energy from the magnetosphere–ionosphere system through plasma outflows, ionospheric losses and ring current particle energy decay.

The coefficients in the model are physical parameters related to the geometrical and plasma configuration of the magnetosphere–ionosphere system. The parameters \(L, C, \Sigma , L_I, C_I\) and \(\Sigma _I\) are the magnetospheric and ionospheric inductances, capacitances and conductances, respectively. \(A_{\rm eff}\) is an effective aperture for particle injection into the ring current that on the dusk side merges with the Alfven layer Doxas et al. (2004). The resistances in the partial ring current and region-2 current, \(I_2\) are \(R_{\rm prc}\) and \(R_{A2}\), respectively, and \(L_2\) is the inductance of the region-2 current. The coefficient \(u_0\) in Eq. 3 is a heat flux-limiting parameter.

The energy confinement times for the central plasma sheet, parallel kinetic energy and ring current energy are \(\tau _E, \tau _k\) and \(\tau _{\rm rc}\), respectively. The effective width of the magnetosphere is \(L_y\), and the transition region magnetic field is given by \(B_{tr}\). The pressure gradient current due to the magnetic field curvature is given by \(I_{ps} = L_x(p / \mu _0 )^{1/2}\), where \(L_x\) is the effective length of the earth’s magnetotail. The pressure gradient current flows in the y direction to balance the pressure gradient in the x direction anti-sunward as given by the equilibrium MHD momentum equation \(-\nabla p + {\mathbf {J}} \times {\mathbf {B}} = 0\). The outputs of the model are the AL and Dst indices, which are taken to be proportional to the magnetospheric field-aligned currents. Estimates of the physical parameters are listed in Spencer et al. (2007). The AL index is calculated from the region 1 current \(I_1\) index by assuming it to be proportional with a constant \(\lambda _{AL}[A/nT]\), resulting in \(\Delta B_{AL}=-I_1/\lambda _{AL}\).

The input coupling function used for the model is the rectified \(vB_s\) formula (Reiff and Luhmann 1986), given by:

$$\begin{aligned} V_{sw}^{Bs} = 40(kV) + v_{sw}B_s^{IMF}L_y^{eff}(kV) \end{aligned}$$
(9)

where \(v_{sw}\) is the x-directed component of the solar wind velocity in GSM coordinates, \(B_s^{IMF}\) is the southward IMF component, and \(L_y^{eff}\) is an effective cross-tail width over which the voltage is produced. For northward or zero \(B_s^{IMF}\), a base viscous voltage value of 4 kV is used as input into the system.

Substorm onsets are triggered in the WINDMI model through a switching function \(\Theta (U)\). The pressure unloading function \(\Theta (U)=\frac{1}{2}[1+\tanh {U}]\) where \(U=(I-I_c)/\Delta I\) in Eq. (3) is turned on by a geotail critical current \(I_c\) over an interval \(\Delta I\) when a transition to plasma loss along newly opened magnetic field lines with an equivalent parallel thermal flux \(q_{||}\) occurs. The parallel thermal flux goes from zero to one as the geotail current I rises to \(I_c\). The unloading function used here corresponds to current gradient-driven tearing modes or cross-field current instabilities, as outlined in Yoon et al. (2002).

In Fig. 1, we show the result of driving the WINDMI model with the solar wind velocity Vx being 450 km/s and IMF Bz of − 6.67 nT. This is a synthetic signal which was chosen to make the product of Vx and Bz a certain value. With the effective width of the magnetosphere taken to be 10 earth radii, this results in a solar wind voltage of 191 kV. A substorm is triggered when \(C = 1000 F\), \(\Sigma = 10 S\), \(I_c = 3.2 MA\) and \(\Omega =4000 R_E^3\).

Fig. 1
figure 1

WINDMI under 191-kV input. The WINDMI model is driven with a synthetic southward IMF voltage signal with the solar wind velocity Vx being 450 km/s and IMF Bz of − 6.67 nT. A substorm is produced when sufficient energy is stored in the magnetotail and a critical current \(I_c\) is reached

Substorm data and methodology

Kalmoni et al. (2015) used a set of substorm and pseudo-breakup events to study how the growth rate of auroral beads is related to possible instability mechanisms in the near-earth plasma sheet. We used the same set of events but studied the substorm energy and triggering conditions using solar wind and IMF as drivers. The reason we used the events from that paper is because it would be possible to compare the auroral brightening times to the surges in the westward auroral electrojet due to the aforementioned solar wind driving. There were 17 events in total. Acceptable solar wind data from ACE were available for 13 events, and the model was able to trigger a substorm on 9 events. Of these, only 7 events had all the characteristics of a substorm needed for matching to the Kyoto lower auroral electrojet index AL and the SuperMAG lower auroral electrojet index SML. Four events did not have acceptable ACE data, meaning some data were missing, or unusable. For another 4 events, the solar wind input was not strong enough or did not last long enough for this model to trigger a substorm. Finally, in another 2 substorms, the AL and SML signatures were not present or did not coincide in timing, with the solar wind forcing, propagated to the nose of the magnetosphere.

Several motivating factors were considered when selecting parameters to tune in order to establish the substorm-related characteristics of the model. Firstly, depending on solar wind conditions and auroral electrojet intensities, by fixing the average \(B_z\) in the central plasma sheet, the effective width \(L_y\) and \(\Omega _{cps}\), we may estimate the mass density present in the central plasma sheet from the capacitance values needed to trigger a substorm. The plasma sheet Bz is used implicitly in the calculation of the plasma sheet capacitance C.

Secondly, the perpendicular \(v_E = {\mathbf {E}} \times {\mathbf {B}}\) flow velocity with \(B_z= 10\) nT goes as 100 km/s per 1 mV/m electric field. Thus we will be able to track the cross-tail electric field if consistent satellite data could be found in future to constrain the model. Thirdly, from the geotail current we could track how the magnetic curvature \({\text {d}}{\mathbf {b}}/{\text {d}}s\) changes during the substorm development, which is a condition for ballooning instability, Wong et al. (2001). Fourthly we could use the critical current parameter \(I_c\) to estimate the conditions when current-driven instabilities may be triggered. Finally, the pressure gradient current \(I_{ps}\) in the model is an estimate of the earthward pressure gradient dp/dx, which is also related to conditions for ballooning instability.

The parameters that have the strongest influence on the strength and character of substorms in the model are the geotail inductance L, the plasma sheet capacitance C, the critical current \(I_c\), the heat flux parameter \(u_0\) and the plasma sheet volume \(\Omega _{cps}\). These parameters are varied in the model in order to capture substorm events. However, in order to minimize the number of independent degrees of freedom, we fixed \(\Omega =10000 R_E^3\) in this work. In addition, we do not show the value of \(\Sigma _I\), the ionosphere conductivity value, used to scale the output auroral electrojet to the measured AL and SML indices. This scaling is influenced by several factors, including the width and height of the electrojet.

Our analysis consists of two steps. In the first step, we obtain parameters for each of the substorm or pseudo-breakup cases in Table 1, by tuning the WINDMI model to produce a substorm driven by solar wind input, coinciding as closely as possible with the auroral brightening times observed by Kalmoni et al. (2015). In the second step, we obtain parameters for each of the cases by tuning the model to match the AL and SML indices as closely as possible. The results are discussed in the next section.

Table 1 WINDMI model parameters obtained when tuned to auroral brightening observations

Results

The first set of results obtained were through tuning the model parameters in order that a substorm is triggered around the time of the observation of auroral brightening, and so that a classical growth, expansion and recovery phase can be observed, as shown in Fig. 1. Because the model is driven by solar wind conditions measured at ACE and propagated to the nose of the earth’s magnetosphere, the model can only produce a substorm if the solar wind energy is present and sufficient for triggering to occur. In Table 1, we list the events, the associated times when auroral brightening was observed, the onset triggered in the WINDMI model, the time difference between the two and the values of \(\Sigma\), \(I_c\) and C that were obtained. Overall, the parameters do not vary over large ranges, except that the model onset times are distributed over a 10–12-min interval around the observed auroral brightening times. There were sometimes several substorms triggered in the model due to solar wind forcing, but we chose the start of the event nearest to the observed auroral signatures.

When the model is tuned to the same set of events but now compared to auroral electrojet activity measured by either the AL or SML index, the results are different. The new parameter values obtained in this case are shown in Table 2. Two of the seventeen events prove to be exceptions. For the 2 October 2008 event, no SML or AL signature appears although solar wind forcing is present and the model can be triggered. For the 7 March 2010 event, SML and AL show substorm activity 15–20 min later, but the model triggers too early and cannot be tuned to fit the AL/SML indices. The remaining events have new parameter values. Here we show 3 of these events in Figs. 2, 3 and 4. In each of these figures, the top panel shows the solar wind-rectified voltage given in Eq. 9, and the bottom panel shows the output of the model compared to the AL and SML indices. We show these events in order to make some observations that may be of interest.

Table 2 WINDMI model parameters obtained when tuned to auroral electrojet activity given by AL/SML indices
Fig. 2
figure 2

Event 06:26 UT 22 February 2006: In the bottom panel, substorm onset time from auroral observations is depicted with the red-dotted line. The WINDMI model output is compared to the AL and SML indices. The top panel shows the solar wind input measured by the ACE satellite and propagated to the nose of the magnetosphere

Fig. 3
figure 3

Event 05:50 UT 07 March 2007: In the bottom panel, substorm onset time from auroral observations is depicted with the red-dotted line. The WINDMI model output is compared to the AL and SML indices. The top panel shows the solar wind input measured by the ACE satellite and propagated to the nose of the magnetosphere

Fig. 4
figure 4

Event 05:15 UT 07 March 2010: In the bottom panel, substorm onset time from auroral observations is depicted with the red-dotted line. The WINDMI model output is compared to the AL and SML indices. The top panel shows the solar wind input measured by the ACE satellite and propagated to the nose of the magnetosphere

For the event on 22 February 2006, the auroral observations of brightening put the onset at roughly 6:26 UT. From Fig. 2, we observe that a burst of activity occurs before the event, and the WINDMI model triggers twice, once before the brightening, and once 10 min after. Because the model nonlinear behavior correlates strongly with the loading–unloading paradigm of substorms, the solar wind input provides energy that needs time to accumulate before unloading can be triggered.

For the event on 07 March 2007, we make two observations, referring to Fig. 3. First, there is sufficient energy in the solar wind to cause the model to produce strong substorm activity before the auroral brightening observations, but the AL and SML indices show no activity. Second, the model output corresponds well with the AL and SML indices after the brightening occurs, building up to a substorm and subsequently recovering. We observe also that the SML index shows a sharp spike toward the end of the substorm period, but this is not seen in the AL index.

For the event on 07 March 2010, shown in Fig. 4, the solar wind input decays early, but the AL and SML indices builds up into what looks like a large substorm. The same signatures appeared on 02 October 2008. For these two events, we were unable to trigger what appears to be a longer substorm that occurred over roughly 70–80 min, according to the AL and SML signatures. Whether this is an internally triggered mode is unclear.

The differences between the AL and SML are perhaps due to the way the indices are calculated from station observations. There are three possible reasons: (1) differences in baseline technique, (2) Kyoto has stations that are not yet included in SuperMAG, (3) errors in data. However, this requires further investigation.

Conclusions

In this work we have used the WINDMI model to study substorms from the point of auroral brightening signatures as well as the AL and SML indices. The WINDMI model can be used to identify substorm triggering, based on input solar wind conditions, for a subset of cases when there is sufficient energy in the solar wind to drive events. We used the onset times identified from auroral brightening and then compared the WINDMI triggering and subsequent behavior to AL and SML activity. However, it appears that some substorms cannot be explained with this type of loading–unloading model.

The WINDMI model is a fast near real-time model running at the NASA Community Coordinated Modeling Center (CCMC), giving forecasts of substorms and storms from solar wind data obtained from satellites at the L1 point (ACE or DSCOVR). The motivation for our study is firstly to improve the forecasting capability of the model and secondly to track intermediate variables in the model and use them as a proxy to compare to satellite data. The parameter and state variable values in the model could be used to establish bounds for instabilities that trigger substorm onset. However, many issues remain to be resolved. The model can capture a subset of events, but sometimes overpredicts the occurrence (multiple triggers) or intensity of an event, and sometimes, it does not capture an event, especially when the solar wind parameters are not consistent with the AL and SML indices. Additionally, we need measurements of solar wind parameters, ground-based indices, satellite measurements and identification of substorm signatures to all coincide. There are many instances when one or more do not exist, or are not easily interpretable, to help constrain the model. Investigation of these issues is left for future work.