Physics of kinetic Alfvén waves: a gyrokinetic theory approach

The transverse shear Alfvén wave (SAW) is a fundamental anisotropic electromagnetic oscillation in plasmas with a finite background magnetic field. In realistic plasmas with spatial inhomogeneities, SAW exhibits the interesting spectral feature of a continuous spectrum. That is, the SAW oscillation frequency varies in the non-uniform (radial) direction. This continuum spectral feature then naturally leads to the phase-mixing process; i.e., time asymptotically, the effective radial wave-number increases with time. Any initial perturbation of SAW structures will, thus, evolve eventually into short-wavelength structures; termed as kinetic Alfvén wave (KAW). Obviously, one needs to employ kinetic theory approach to properly describe the dynamics of KAW; including effects such as finite ion-Larmor radius (FILR) and/or wave–particle interactions. When KAW was first discovered and discussed in 1975–1976, it was before the introduction of the linear electromagnetic gyrokinetic theory (1978) and nonlinear electromagnetic gyrokinetic theory (1982). Kinetic treatments then often involved the complicated procedures of taking the low-frequency limit of the Vlasov kinetic theory and/or employing the drift-kinetic theory approach; forsaking, thus, the FILR effects. In recent years, the powerful nonlinear gyrokinetic theory has been employed to re-examine both the linear and nonlinear physics of KAWs. This brief review will cover results of linear and nonlinear analytical theories, simulations, as well as observational evidences. We emphasize, in particular, that due to the enhanced electron–ion de-coupling in the short-wavelength regime, KAWs possess significantly enhanced nonlinear coupling coefficients and, thereby, play important roles in the heating, acceleration, and transport processes of charged particles in magnetized plasmas.


Introduction
In 1942 (Alfvén 1942), Hannes Alfvén discovered that if a perfectly conducting medium (e.g., a fully ionized gas; i.e., a plasma) is immersed in a finite background magnetic field B 0 ; electromagnetic waves can then propagate within it. The reason is that, while the free electron mobility remains extremely large along B 0 , its mobility perpendicular to B 0 is inhibited. That is, a magnetized plasma is an anisotropic conducting medium with an extremely large parallel (to B 0 ) and finite perpendicular (to B 0 ) conductivities. Thus, correspondingly, these propagating electromagnetic waves, called Alfvén waves, have nearly vanishing parallel (to B 0 ) and finite perpendicular (to B 0 ) electric fields.
There are two types of Alfvén waves; the compressional (fast) and shear Alfvén waves (Jackson 1998). The compressional Alfvén wave (CAW) compresses the magnetic field as well as plasma and its group velocity propagates almost isotropically. The shear Alfvén wave (SAW), meanwhile, is nearly incompressible and, thus, more readily excitable by either external perturbations (e.g., solar wind, antenna) or intrinsic collective instabilities (Chen and Zonca 2016). This brief review is focused on the SAW or, more specifically, its kinetic extension; i.e., the kinetic Alfvén wave (KAW).
In a uniform plasma immersed in a uniform background magnetic field, B 0 = B 0ẑ , and adopting the ideal magnetohydrodynamic (MHD) fluid description, it is well known that the SAW satisfies the following linear dispersion relation: Here, and k = k ⟂ + k ∥ b 0 are, respectively, the wave angular frequency and wavevector, b 0 = B 0 ∕B 0 , k ⟂ is the perpendicular (to B 0 ) component of k , k ∥ = k ⋅ b 0 , v A = B 0 ∕(4 m ) 1∕2 is the Alfvén speed with m ≃ n 0 m i being the mass density, and m i ≫ m e . The corresponding wave polarization is: with E ⟂ and B ⟂ denoting, respectively, the fluctuating components of electric and magnetic field perpendicular to B 0 . Equation (1) indicates that SAW is an anisotropic electromagnetic wave; i.e., while its phase velocity can propagate in any direction, its group velocity, v g = v A b 0 , propagates only along B 0 . This property, of course, has the direct bearing on the feature of Alfvén wave resonant absorption (Gajevski and Winterberg 1965;Pridmore-Brown 1966;Grad 1969;Chen and Hasegawa 1974a, b).
In a non-uniform plasma, SAW attains the interesting property of a continuous spectrum. To illustrate this feature, let us consider the simplified slab model of a cold plasma with a non-uniform density, m = m (x) , and a uniform B 0 = B 0ẑ . Assuming at t = 0 a localized initial perturbation B y (x, t = 0) =B y (x, 0) = exp(−x 2 ∕ 2 x ) , |k y x | ≪ 1 , and B y ∕ t = 0 , the perturbation then evolves according to the following wave equation: Here, 2 A (x) = k 2 z v 2 A (x) and the solution is: Equation (4) shows that every point in x oscillates at a different frequency, A (x) . With a continuously varying A (x) ; the wave frequency, thus, constitutes a continuous spectrum. While the above result is based on a model with a one-dimensional nonuniformity in x, this general feature of SAW continuous spectrum also holds in magnetized plasmas with two-or three-dimensional non-uniformities (Grad 1969;Chen and Cowley 1989;Schulze-Berge et al. 1991;Chen and Zonca 2016). A good example is geomagnetic pulsations in the Earth's magnetosphere. Figure 1 shows the oscillations in the Earths' magnetic field as observed by the satellite AMPTE CCE (Engebretson 2011; Engebretson et al. 1987), illustrating the three-component dynamic power (4) B y (x, t) =B y (x, 0) cos A (x)t .  Engebretson (2011), Engebretson et al. (1987)]. The geomagnetic B R , radially outward from the center of the Earth; B E , magnetically Eastward; and B N , approximately along local magnetic field lines correspond to, respectively, B x , B y , and B z . L, MLT, and MLAT correspond, respectively, to the equatorial distance of the magnetic field line (in unit of the Earth radius), magnetic local time, and magnetic latitude 1 Page 4 of 37 spectrum of magnetic field data from a full orbit from 02:30 to 17:30 UT March 6, 1987. Apogee is at the center of the figure. As the satellite moved outward from the morning side, A should decrease due to the decreasing |B 0 | and |k ∥ | (increasing fieldline length), and this was clearly exhibited in the wave frequency of B E , the azimuthal (East-West) component of B (i.e., the effective B y ). B E also shows that the wave frequency increases as the satellite moved inward toward the dusk side; consistent, again, with A . Furthermore, the observed wave frequency consisted of several bands, which could be understood as harmonics of standing waves along the field line; i.e., different |k ∥ |.
B y (x, t) given by Eq. (4) also indicates an unique and important property of SAW continuous spectrum; i.e., the spatial structure evolves with time. Specifically, the wave-number in the non-uniformity direction is, time asymptotically, given by: That |k x | increases with t is significant, since it implies that any initially long-scale perturbations will evolve into short scales. This point is illustrated in Fig. 2; showing the evolution of a smooth B y at t = 0 to a spatially fast varying B y at a later t (Qiu 2010;Qiu et al. 2011).
That is, B x decays temporally due to the phase mixing of increasingly more rapidly varying neighboring perturbations. This property also explains why, in Fig. 1, the radial component of B , B R , is much weaker than B E . We remark that, while we focus in this paper on the formation of SAW continuous spectrum due to nonuniform A , similar effects can be expected due to flow shear, which often exists in laboratory and astrophysical plasmas (Kim 2007;Maiorano et al. 2020). Noting that, as t → ∞ , |k x | → ∞ , it, thus, suggests that the perturbation will develop singular structures toward the steady state. Indeed, taking t = −i , the SAW governing wave equation for the cold-plasma becomes (Chen and Hasegawa 1974a, b): B x , thus, exhibits a logarithmic singularity at the Alfvén resonant point (layer), x 0 , where 2 = 2 A (x 0 ) along with a finite resonant wave-energy absorption rate. Note that at the isolated extrema of the SAW continuum, | � A | = 0 , phase mixing vanishes; consequently, perturbation remains regular and experiences no damping via resonant absorption. This feature has important implications to Alfvén instabilities in laboratory plasmas (Chen and Zonca 2016).
That the solution exhibits singularities naturally suggests that the microscopic length-scale physics neglected in the ideal MHD fluid description should be included in the long-time-scale dynamics of SAWs. For low-frequency SAWs, one can readily recognize the relevant perpendicular (to B 0 ) microscopic scales are either the ion-Larmor radius, i = v ti ∕ i with v ti and i being, respectively, the ion thermal speed and ion cyclotron frequency, and/or s = c S ∕ i with c 2 S = T e ∕m i and T e being the electron temperature. Including the effects of finite i and/or s in the SAW dynamics then led to the discovery of the so-called kinetic Alfvén wave (KAW) Chen 1975, 1976). The pioneering discovery of KAW was carried out before the introduction of linear electromagnetic gyrokinetic theory (Catto 1978;Antonsen and Lane 1980) and, later, nonlinear electromagnetic gyrokinetic theory (Frieman and Chen 1982). The analyses employed, therefore, involved taking the low-frequency ( | | ≪ | i | ) limit of the Vlasov dynamics. This makes theoretical analysis of KAW dynamics in non-uniform plasmas with realistic B 0 (x) intractable; especially when dealing with the nonlinear physics. Indeed, previous nonlinear analyses adopted either the drift-kinetic or the two-fluid description (Mikhailovskii et al. 2007;Onishchenko et al. 2004a, b;Pokhotelov et al. 2003Pokhotelov et al. , 2004Zhao et al. 2011). As our later discussions will show, such approximations not only are inadequate for treating realistic plasma regimes; but also often leave out important physics. The above discussions have, thus, motivated us Zonca 2011, 2013;Zonca et al. 2015) to re-visit and explore further the KAW physics employing the powerful gyrokinetic theories.
Section 2 presents a brief review of the linear gyrokinetic theory (cf. Sect. 2.1) and its applications to KAW (cf. Sects. 2.2 and 2.3 ) along with KAW observations by satellites (cf. Sect. 2.4). The nonlinear gyrokinetic theory is then briefly reviewed in Sect. 3.1. It is then applied to examine the physics of the three-wave parametric decay instabilities, the modulational instabilities associated with the spontaneous generation of convective cells, and the quasi-linear phase-space Here, b 0 = B 0 ∕B 0 , is the gyroradius vector, v ∥ = v ⋅ b 0 , is the magnetic moment adiabatic invariant ( = v 2 ⟂ ∕2B 0 at the leading order) and, assuming there is no equilibrium electrostatic potential, E is an equilibrium constant of motion.
In the guiding-center phase space, charged particle dynamics is naturally separated into the fast cyclotron motion and the slow guiding-center motion. One can then apply the gyrokinetic orderings and systematically average out the fast cyclotron motion (i.e., the gyrophase averaging) and obtain the asymptotically dominant (in terms of the smallness parameter ) perturbed distribution function response. This perturbed distribution function in the guiding-center phase space can then be inversely transformed back to the charged particle phase space and applied toward the field equations (i.e., Maxwell's equations) for a self-consistent kinetic description (Catto 1978;Antonsen and Lane 1980).
For the purpose of the present review, we shall limit our considerations to that of a simple uniform plasma with an isotropic Maxwellian equilibrium distribution function. Readers interested in the detailed analyses and/or broader applications may consult References (Antonsen and Lane 1980; Chen and Hasegawa 1991). Assuming, furthermore, (ratio between the plasma and magnetic pressures) ≪ 1 , such that there is negligible magnetic compression, the particle velocity distribution is then given by: is the Maxwellian distribution function, v t is the thermal speed: T = mv 2 t ∕2 , g satisfies the following linear gyrokinetic equation: and ⟨…⟩ denotes averaging over the gyrophase angle, . Here, the field variables are the scalar and vector potentials, and A , with A ∥ = A ⋅ b 0 and the ⋅ A = 0 Coulomb gauge. The operator e ⋅ , meanwhile, represents the transformation between the particle and guiding center positions.
The corresponding field equations are the Poisson's equation and the parallel Ampère's law, ∇ 2 A ∥ = −4 J ∥ ∕c . In the low-frequency and |k D | 2 ≪ 1 limit with D being the Debye length, Poisson's equation can be approximated as the quasi-neutrality condition; is the velocity-space integral, and subscript j runs over the particle species. Meanwhile, substituting the parallel Ampère's law into the ⋅ J ≃ 0 quasi-neutrality condition as derived by Eq. (15) yields a generalized linear gyrokinetic vorticity equation, which is often convenient to use in studying SAW/KAW dynamics (Chen and Hasegawa 1991;Zonca 2011, 2016;Zonca et al. 2015).

Linear KAW properties
For plane-wave ( , k) perturbations, Eq. (15) gives: Note, here, that J 0 is the Bessel function and J 0 (k ⟂ ) corresponds to the gyro-averaging of the coordinate transformation, that is: In SAW/KAW analyses, it is sometimes convenient to introduce an effective induced parallel potential defined by b 0 ⋅ = − t A ∥ ∕c or: , thus, gives rise to the induced parallel electric field; that is, the net parallel electric field is given by: The quasi-neutrality condition then straightforwardly yields: (Chen and Hasegawa 1991) Here, kj = ∕|k ∥ |v tj , Z kj = Z( kj ) with Z being the well-known plasma dispersion function, and 0kj = I 0 (b kj ) exp(−b kj ) with I 0 the modified Bessel function and b kj = k 2 The linear gyrokinetic vorticity equation, meanwhile, is given by: Chen and Hasegawa (1991) Noting that, for KAW, |k ⟂ i | ∼ O(1) and |k ⟂ e | ≪ 1 and, thus, 0ke ≃ 1 , Eqs. (22) and (23) then become: sk k = 1 + ke Z ke + 1 + k ki Z ki k = 1 + ke Z ke + k 1 + ki Z ki k , and Here, = T 0e ∕T 0i , b k = b ki , k = 0ki , and sk is the dielectric constant for the slowsound (ion-acoustic) wave (SSW). It is also instructive, as done in some literatures, to define the effective parallel potential, ∥k = k − k , and rewrite Eqs. (24) and (25) as: and Equations (26) and (27) demonstrate the coupling between SAW and SSW via the finite |k ⟂ s | term. In the |k ⟂ i | ∼ O(1) short-wavelength limit, SAW evolves into KAW due to both the finite |k ⟂ i | and |k ⟂ s | effects. More specifically, the coupled KAW-SSW dispersion relation becomes: Let us concentrate on the KAW branch and, to further simplify the analysis, assume , and, keeping only the lowest order O(1) terms: From Eq. (28), we then have: k for different values is given in Fig. (3). As to wave polarizations, which are useful for wave identification in observations, we can readily derive:

Linear mode conversion of KAW
Equation (33) has a significant implication in non-uniform plasmas. Consider, again, a slab plasma with a non-uniform 2 A (x) and k 2 ⟂ = k 2 x (x) being the WKB wave-number in the non-uniformity direction, x. Equation (33) then indicates that KAW is propagating That v g⟂ is finite also suggests that, in contrast to SAW, an initial smooth perturbation will not only evolve into short wavelengths, but also propagate toward the lower-2 A (x) region. These features are illustrated in Fig. 6b; where the spatial-temporal evolution of KAW is solved explicitly according to the following wave equation: Fig. 6a, with L indicating the profile length-scale, so that the KAW wave-packet frequency is assumed to be consistent with the SAW frequency at x = 0 . Figure 6b shows the propagation of the KAW wave-packet in the direction of radial non-uniformity, consistent with Eq. (35).
That there exists a finite perpendicular group velocity also implies, in the steady state, the removal of "singular" resonance and linear mode conversion process (Hasegawa and Chen 1976). More specifically, the corresponding wave equation is given by: Here, 0 is the external driving frequency. In the ideal SAW ( ̂→ 0 + ) limit, there is the resonance singularity at  (37) can be approximated as an inhomogeneous Airy equation and solved analytically. Equation (37) can then be solved, with appropriate boundary conditions, by connecting the solutions valid away from the x = x 0 resonance layer via the analytic solution of the inhomogeneous Airy equation valid near x = x 0 Chen 1975, 1976). The solutions away from the singular layer are given by: where The corresponding numerical solutions are plotted in Fig. 7.
Both the analytical results and mode conversion process exhibit two important features. One is, instead of being singular, the amplitude at x = x 0 (where A (x 0 ) = 0 ) is amplified by the Airy swelling factor; (L A ∕̂) 2∕3 . Here, we recall L A is the scale length of A and ̂ , from Eq. (34), is of O( i ) , and, hence, |L A ∕̂| ≫ 1 . The other is the singularity at x = x 0 is being replaced by the Airy scale length; there then exists a KAW formation time scale given by ( 0 ∕L A )t 0 ≃ 1∕ 0 ; i.e., 0 t 0 ≃ (L A ∕̂) 2∕3 . Taking, for an example, a typical laboratory plasma, L A ∕̂≃ O(10 3 ) , we have 0 t 0 ≃ O(10 2 ) , suggesting that it is reasonable to anticipate, in the presence of SAW continuous spectrum, the appearance of KAW in such plasmas.

Satellite observations of KAWs
Due to the diagnostics constraints in laboratory plasmas, most of the KAW observations were made by satellites in the Sun-Earth space plasma environments. Shear Alfvénic oscillations in the magnetosphere have been linked to drivers from the upstream solar wind. Due to the collisionless nature of space plasmas, kinetic effects create large-amplitude waves and pressure pulses in the foreshock region upstream from the quasi-parallel bow shock. The foreshock is found to be an important source of (magnetic) pulsating continuous (Pc) magnetospheric waves in the Pc3 (period 10-45 s), Pc4 (period 45-150 s), and Pc5 (period 150-600 s) ranges (Fairfield et al. 1990;Engebretson et al. 1991;Chi et al. 1994;Clausen et al. 2009;Wang et al. 2019). The mode conversion process associated with the compressional modes of (38) 1 Page 14 of 37 the foreshock waves has been suggested as a directly driven mechanism for the generation of the frequently observed discrete harmonic frequencies of shear Alfvénic field-line resonances (see Fig. 1) (Hasegawa et al. 1979;Hasegawa and Chen 1975;Lee et al. 1994). Indeed, near the magnetopause boundary, a sharp transition is frequently found in wave polarization from predominantly compressional waves in the magnetosheath to transverse in the boundary layer (Song et al. 1993;Rezeau et al. 1989;Chaston et al. 2008). THEMIS observations by Chaston et al. (2008) show a direct evidence of a turbulent spectrum of KAWs at the magnetopause with sufficient power to provide massive particle transport. Using coordinated observations in the foreshock and the magnetosphere, Wang et al. (2019) found direct evidence of Pc5 field line resonances driven by the foreshock perturbations. As remarked earlier, the main mode identification method for KAWs is based on the measurement of the wave polarization, |c E ⟂ ∕v A B ⟂ | . Two cases are illustrated here. One is observation by the Van Allen Probes in the Earth's inner magnetosphere (Chaston et al. 2014) (cf. Fig. 8); the other is observations by the Cluster satellites in the solar wind (Salem et al. 2012) (cf. Fig. 9). Both observations showed the measured polarizations, |c E ⟂ ∕v A B ⟂ | , agree qualitatively and/or quantitatively with those theoretically predicted for KAWs.
Finally, we remark that KAW physics has also been applied theoretically in laboratory fusion plasmas Chen 1975, 1976;Zonca 2013, 2016). Realistic plasma non-uniformities and magnetic field geometries often play crucially important roles in determining SAW/KAW mode structures and stability properties in such plasmas (see, e.g., Ref. Chen and Zonca (2016)). For example, in  toroidal fusion plasmas, the Kinetic Toroidal Alfvén Eigenmodes (KTAEs) (Mett and Mahajan 1992) may exist within the SAW continuum and their dynamics are intrinsically related to those of KAWs. Furthermore, laboratory plasma experiments have shown evidence of coupling between SAW eigenmodes and KAWs (Wong et al. 1996) that may also be externally driven by mode conversion of fast modes (Fasoli et al. 1996). Since KAW carries significant implications to plasma heating and transport, it will be interesting to see more focused investigations on KAW physics in laboratory plasma experiments and/or simulations.

Nonlinear KAW physics
In this section, we first discuss the nonlinear gyrokinetic orderings and present the corresponding equations. We then apply the nonlinear gyrokinetic equations to the fundamental three-wave parametric decay instabilities. Here, we emphasize the qualitative and quantitative differences between the results of nonlinear gyrokinetic theory and those based on the ideal MHD theory. Corresponding simulations not only support the gyrokinetic theory results, but also suggest the excitation of k ∥ ≃ 0 fluctuations; i.e., convective cells. This motivated the studies on the spontaneous excitations of convective cells by KAWs. The results demonstrate the significant effects of finite ion-Larmor radius; and, thus, the nonlinear gyrokinetic theory as a powerful theoretical tool. Finally, we present a quasi-linear description of plasma transport due to KAWs.

Nonlinear gyrokinetic theory
In extending the linear gyrokinetic theory to the nonlinear regime, one allows the fluctuations to be of finite amplitudes with, however, the constraint that the corresponding nonlinear frequencies, n , be much less than the cyclotron frequency. In other words, consistent with the linear gyrokinetic orderings: Here, u ⟂ represents the fluctuation-induced particle (guiding-center) jiggling velocity. Taking, for example, and | ⟂ | ∼ 1∕ i , we then obtain the following nonlinear gyrokinetic orderings (Frieman and Chen 1982): Here, we have taken the background distribution to be Maxwellian, and g satisfies the following nonlinear gyrokinetic equation (Frieman and Chen 1982): L g given by Eqs. (16) and (17), and: Expanding in terms of plane-wave solutions, Eq. (43) yields: and ( k , k) satisfy frequency and wave-vector matching conditions; i.e., k = k � + k �� and k = k � + k �� . The field equations remain the same; i.e., the Poisson's equation or the quasi-neutrality condition and the parallel Ampère's Law or the generalized nonlinear gyrokinetic vorticity equation. The quasi-neutrality condition is formally the same as in the linear theory, that is: with ≡ T e ∕T i , consistent with the definition introduced below Eq. (25) and where we dropped the subscript "0" on equilibrium temperature; and J k = J ki for brevity. The nonlinear gyrokinetic vorticity equation (Chen and Zonca 2016;Chen et al. 2001;Zonca and Chen 2014a, b), meanwhile, is given by: , consistent with the definitions introduced below Eq. (22): and We remark that (NL) A corresponds to the Maxwell stress term due to the J ∥ b 0 × B ⟂ force with J ∥ mainly carried by electrons due to m e ≪ m i . (NL) , meanwhile, is the gyrokinetic stress tensor, which is dominated by ions and reduces to the well-known fluid Reynolds stress in the k 2 ⟂ 2 i ≪ 1 limit (Chen and Zonca 2016;Chen et al. 2001).

Parametric decay instabilities
Parametric decay instability (PDI) is a fundamental nonlinear process involving three nonlinear coupled waves/oscillators (Jackson 1967;Liu and Rosenbluth 1976). One is the pump ("mother") wave and the other two are the decay ("daughter") waves. The PDI can be either resonant if both decay waves are marginally stable or weakly damped normal modes, or non-resonant if one of the decay waves is a heavily damped quasi-mode. Since the pump wave can be either spontaneously or externally excited, PDI, thus, is an important channel for wave energy transfer along with its associated consequences on plasma heating, acceleration, and transports.
Interested readers may refer to the original work  for the detailed derivations of the KAW PDI dispersion relations. Here, we will just present the key points and results. Let the three interacting waves be the pump wave 0 = ( 0 , k 0 ) , the low-frequency daughter SSW s = ( s , k s ) , and the daughter KAW − = ( − , k − ) with − = s − 0 and k − = k s − k 0 ; consistent with frequency and wave-vector matching conditions. Let the small but finite pump wave amplitude be denoted as 0 = e 0 ∕T e . As s could be a quasi-mode, we then need to retain O(| 0 | 2 ) terms to properly account for non-resonant PDI. Carrying on the straightforward algebra (Chen and Zonca 2011), we then derive the KAW PDI dispersion relation: Here: and are the linear dielectric constants of, respectively, the s -SSW and − -KAW decay waves. Meanwhile, again, b k = k 2 ⟂ 2 i ∕2 , k = I 0 (b k ) exp(−b k ) , and, from Eq. (29), A− , as will be further discussed later, corresponds to nonlinear ion Compton scattering: sk where 2 s = 2 i and b s− = b − . Note that G ≥ 0 from Schwartz inequality. C k on the right-hand side of Eq. (52) represents the nonlinear coupling coefficient between s and − daughter waves via the pump wave | 0 | , and: with Furthermore, in the PDI dispersion relation, Eq. (52), we have dropped the term associated with nonlinear frequency shift to focus on the stability property .
Let us first consider the resonant decay, which occurs when both decay daughter waves, s and − , are weakly damped normal modes. This generally requires ≡ T e ∕T i ≳ 5 Chen 1975, 1976) to minimize the ion-Landau damping of the s (SSW) mode. In this case, letting, s = sr + i as well as noting skr ( sr ) = 0 and Ak−r ( A−r ) = Ak−r ( sr − 0 ) = 0 , Eq. (52) reduces to: where dA− and ds are, respectively, the linear damping rates of the KAW and SSW daughter waves. We also note that, to have a parametric growth ( > 0 ), the square bracket term on the right-hand side of Eq. (61) must be positive; which can be shown to dictate sr 0 > 0 . Thus, the KAW decay wave has its normal-mode real frequency lower than that of the KAW pump frequency, 0 , by the amount of the SSW normal-mode frequency, sr . Noting that, for ≪ 1 , we have Thus, to satisfy the frequency and wave-number matching conditions for the resonant decay, | sr | ∼ |k ∥s c S | , we must have k ∥A− ≃ k ∥0 or k ∥s ≃ 2k ∥0 . Consequently, we have ( 0 ∕k ∥0 )( A−r ∕k ∥A− ) < 0 ; i.e., the decay KAW daughter wave, − , has parallel (to B 0 ) group velocity opposite to that of the pump wave. In other words, − can be understood as a KAW due to backscattering of the 0 pump wave by s fluctuations. Finally, | 0 | must be over a threshold value set by ds and dA− to achieve > 0.
Note, from Eqs. (61) and (62), that the parametric decay instability growth rates increase with the nonlinear coupling coefficient, The decay instabilities are, thus, strongest when |k ⟂ i | ∼ O(1) ; and it clearly demonstrates the necessity of keeping FILR kinetic effects in dealing with the decay instabilities of KAW.
Finally, it is illuminating to compare the decay instabilities of KAWs versus those of SAWs in the MHD regime (Sagdeev and Galeev 1969). In a nutshell, employing the ideal MHD fluid theory, the PDI dispersion relation takes the form similar to the KAW PDI dispersion relation, Eq. (52), with KAW terms replaced by corresponding SAW terms; e.g., Ak− by A− etc. The more fundamental change lies in the nonlinear coupling term; that is, C k is replaced by C I given as: Here, 0 is the angle between k ⟂0 and k ⟂− , and i is the ion ratio of specific heats.
In addition to the significantly enhanced PDI growth rates, there is, perhaps, more significant qualitative difference between KAW and SAW PDI in terms of the wavevector of the scattered daughter wave with respect to that of the pump wave. Note, from Eq. (64), C I ∝ cos 2 0 and, thus, the SAW scattering maximizes around 0 = 0 and ; i.e., when k ⟂− is parallel or anti-parallel to k ⟂0 ; or k 0 and k − are co-planar. In contrast, we have, from Eq. (65), C k ∝ sin 2 0 and, thus, the KAW scattering maximizes around 0 = ± ∕2 ; i.e., k ⟂0 and k ⟂− are orthogonal. This difference not only affects, as might be expected, the nonlinear evolution of KAW turbulence, but also, as we will argue further below and perhaps more significantly, charged particle transports induced by the KAW decay processes.
Let us consider the pump wave be the mode-converted KAW at the Earth's dayside magnetopause; thus, k ⟂0 = k ⟂0r with r being in the Sun-Earth radial direction. Now, according to the ideal MHD theory, the decay wave tends to have k ⟂− = k ⟂−r and, thus, the East-West azimuthal symmetry is in general kept. In other words, charged particle's East-West azimuthal generalized momentum, P , is conserved, which implies no or little radial transport (Chen 1999). On the other hand, in the KAW regime, the decay wave would have wave-vector in the East-West azimuthal direction; i.e., k ⟂− = k ⟂−̂ and, hence, the East-West azimuthal symmetry is broken by the daughter wave and, consequently, P is no longer conserved and finite radial transports could occur (Chen 1999). These features are observed in the numerical simulations to be discussed below. In addition, the MHD fluid theory would suggest that the turbulence in the perpendicular to B 0 plane to be preferentially anisotropic in the r direction, while KAW turbulence would tend to be more isotropic.
Insights to the above qualitative and quantitative transitions in the nonlinear coupling coefficient between the long-wavelength MHD fluid and the short-wavelength KAW regimes can be also gained by examining the responsible nonlinear coupling mechanisms. More specifically, while in the MHD regime, slow-sound fluctuations are nonlinearly generated by the ( J ⟂ × B ⟂ ) ⋅ b 0 ∕c parallel (to B 0 ) force; in the KAW regime, the nonlinear force is due to the m i n i ( u ⋅ ) u ∥ convective nonlinear term. Similarly, while in the MHD regime, scatterings of the SAW by the slowsound fluctuations occur via the n s ( u 0 ∕ t) nonlinear ion density modulation; scatterings of the KAW occur, again, via the n i ( u ⋅ ) u 0 convective nonlinearity.
Numerical simulations on the linear mode conversion of KAW and the ensuing nonlinear wave generations were carried out by Lin et al. (2012) using a threedimensional hybrid model, in which ions are treated as fully kinetic particles and electrons are treated as a massless fluid. Readers are referred to the original work for details. Here, we summarize and discuss the essentials. Specifically, consider a slab plasma with B 0 = B 0ẑ and non-uniformities in the x (radial) direction. Simulations demonstrated that an incoming fast compressional Alfvén wave mode converted into a short-wavelength KAW with |k x i | ∼ O(1) localized about the Alfvén resonance point. This mode-converted KAW then serves as a pump KAW and nonlinearly excited secondary KAWs with, preferentially, short azimuthal wavelengths; i.e., |k ⟂ i | ∼ |k y i | ∼ O(1) . To analyze the nonlinear wave generation mechanism in more details, Lin et al. (2012) further carried out dedicated simulations with a prescribed pump KAW in a uniform plasma. The resultant (k ∥ , ) spectra of B x and E ∥ are shown in Fig. 10.
In the right plot of E ∥ , we can see the pump KAW at 0 = (k ∥0 = 0.2, 0 = 0.6) and the slow-sound wave at s = (k ∥s ≃ 2k ∥0 = 0.4, s = 0.2) . Correspondingly, in the left plot of B x , we see the backscattered decay KAW with − = (k ∥− = −0.2, 0 = 0.4) . Note, since T e ∕T i ≃ 0.4 , the slow-sound wave, s , is a heavily ion-Landau damped quasi-mode, and the PDI corresponds to the nonlinear ion-induced scattering. Both the − and s modes have preferentially short wavelengths in the ŷ direction; i.e., |k ⟂s i | ∼ |k ys i | ∼ |k y− i | ∼ O(1) . The simulation results are, thus, consistent with analytical theories discussed above.
As noted by Lin et al. (2012), the B x spectrum also showed excitations around (k ∥ ≈ 0, ≈ 0) with |k y i | ∼ O(1) short wavelengths; which, as suggested, correspond to magnetostatic convective cells (Chu et al. 1978). The nonlinear excitations of convective cells also explain the appearance of B x fluctuations at k ∥0 and 0 , since the pump KAW with k y0 ≃ 0 has B x0 ≃ 0 . These interesting simulation results, thus, naturally lead to the following gyrokinetic analytic theory on excitations of convective cells via the modulational instabilities of a KAW pump wave.

Nonlinear excitations of convective cells
Convective cells have been of theoretical interests since the 1970s (Chu et al. 1978;Lin et al. 1978;Okuda and Dawson 1973;Taylor and McNamara 1971), since they lead to vortex dynamics perpendicular to the confining magnetic field and, consequently, carry significant implications to the cross-field transport (Sagdeev et al. 1978). Historically, convective cells have been classified into two categories, the electrostatic convective cells (ESCC) with E = E ⟂ (Okuda and Dawson 1973;Taylor and McNamara 1971) (Chu et al. 1978). In recent years, there has been renewed interest in convective cells since they may be regarded as paradigms of the so-called zonal structures in laboratory fusion plasmas (Chen and Zonca 2013;Zonca et al. 2015). Zonal structures are fluctuations with k ⋅ B 0 = 0 and varying only in the radial direction (Hasegawa et al. 1979). Zonal structures may have frequencies either around = 0 or a finite frequency (i.e., the so-called geodesic acoustic mode (Winsor et al. 1968)). The = 0 zero-frequency zonal structures could be either zonal flow or zonal field/ current; corresponding, respectively, to ESCC and MSCC. In this respect, zonal structures may be regarded as subset of convective cells.

and the magnetostatic convective cells (MSCC) with
Since convective cells have ≈ 0 , they are nominally damped by either viscosity and/or resistivity; and, thus, generally require nonlinear excitations in order to achieve finite intensities. In laboratory fusion plasmas, nonlinear excitations of convective cells (i.e., zonal structures) usually occur via mode-mode couplings of ambient drift-wave and/or Alfvén-wave instabilities. In this respect, zonal structures may be regarded as spontaneous growth of corrugations of the radial equilibrium profiles, which, in turn, scatter the ambient instabilities into the radially short-wavelength stable domain. Zonal structures, therefore, provide self-regulatory mechanisms for the ambient turbulences and the associated transports. We refer to the recent review (Chen and Zonca 2016) for readers interested in this important topic.
In the present review, we will focus on nonlinear excitations of convective cells by KAWs in uniform plasmas to explore in sufficient details the underlying physics mechanisms. Since convective cells have k ⋅ B 0 = 0 , their nonlinear excitations involve couplings between co-propagating SAWs with the same k ∥ ; which vanishes in the ideal MHD limit due to the cancelation between the Reynolds and Maxwell stresses; i.e., the pure Alfvénic state (Alfvén 1942(Alfvén , 1950Walén 1944). It, thus, has long been recognized that only non-ideal MHD fluctuations, such as KAW, can nonlinearly excite convective cells (Chen and Zonca 2013;Mikhailovskii et al. 2007;Onishchenko et al. 2004a, b;Pokhotelov et al. 2003Pokhotelov et al. , 2004Zhao et al. 2011). Furthermore, since having = 0 , it is also recognized that it takes the form of modulational instabilities for the spontaneous excitations of convective cells by KAWs; that is, deviation from the wave periodic behavior is further reinforced by nonlinearity, which may lead to spectral sidebands and possibly to breaking of the periodic fluctuation into modulated pulses (Benjamin and Feir 1967;Chen and Zonca 2013;Zonca et al. 2015). Previous theoretical studies, however, suffer from two limiting considerations; (1) employing two-fluid or drift-kinetic descriptions, and (2) assuming that ESCC and MSCC are de-coupled. By (1), effects due to finite ion-Larmor radii (FILR) are ignored. Both limiting considerations have been adopted to simplify the theoretical analysis and, as will be shown here, lead to erroneous conclusions on the nonlinear excitation mechanisms. Here, we will employ the nonlinear gyrokinetic equation and demonstrate that both the FILR as well as the finite coupling between ESCC and MSCC play qualitatively crucial roles in the dynamics of the modulational excitations of convective cells. Only key points of the theoretical analysis and results will be highlighted here. Readers are referred to the original works for details.
Carrying out the standard perturbative analysis to O(| 0 | 2 ) , we then derive, after some straightforward but lengthy algebra (Zonca et al. 2015), the following coupled equations between z and z ≡ 0 A ∥z ∕(k ∥0 c): Here, we have let z = i z and k z ⟂ k 0 to maximize the nonlinear couplings: is the frequency mismatch between the normal-mode frequency of KAW at k ± and Furthermore, since ESCC and MSCC are, indeed, strongly coupled, and arbitrary de-coupling assumptions could lead to erroneous conclusions on the stability. It is readily seen that the threshold in Eq. (78) is minimized th,min ≪ 1 for effective mode excitation, which is hard to meet at long wavelength. For this reason, in the original works on CC nonlinear excitation by KAW via modulational instability (Chen and Zonca 2013;Zonca et al. 2015), it was noted that nonlinear excitations of convective cells by KAW are always suppressed in the long-wavelength limit, although only the Y ≃ ( − ) < 0 root was discussed therein and in the recent review on this subject (Chen and Zonca 2016).
The other limit is the short-wavelength limit; i.e., |b k | ≫ 1 , where FILR effects exhibit distinctively. Taking this limit and, to further simplify the analysis, assuming b z ≪ b 0 ; we can readily show that Eq. (75) yields the following unstable solution: where Equation (81) indicates that, in this |b k | ≫ 1 short-wavelength limit, convective cells can be modulationally excited when the pump KAW amplitude exceeds the following threshold value, noting 0 ≃ 0 ∕(1 + ): , we shall assume B y ∕B 0 = 2 B ⟂0 ∕B 0 in the comparison of numerical simulation results with theoretical predictions discussed above. The complete dispersion relation is numerically solved in the (k x i , k y i )plane for fixed k ∥0 i = 0.02 , = 1 and e = i = 0.2 and different values of B y ∕B 0 . Figure 11 shows the marginal stability curves. It clearly demonstrates, consistent with the above analytical predictions, the crucial roles of the finite k ⟂ i effects in the stability properties. Marginal stability curves demonstrate the existence of a necessary condition for instability, b z > k 2 y 2 i ≡ b z , given by: Zonca et al. (2015) where z ≡ z (b z = b z ) , which holds for b 0 ≫ 1 and arbitrary b z . Figure 12, meanwhile, plots the calculated growth rates vs. B y ∕B 0 for (k x i , k y i ) = (0.8, 0.6) and (k x i , k y i ) = (1.0, 0.8) . Corresponding hybrid simulations have also been carried out to investigate the nonlinear excitations of convective cells by a pump KAW (Zonca et al. 2015).
The observed growth rates, as shown in Fig. 12, agree reasonably well with the theoretically predicted values. Meanwhile, simulations also show that, for k z =ŷk y , the ESCC ( E yz ) and MSCC ( B xz ) are coupled and both are spontaneously excited; consistent, again, with the theoretical predictions. For z ∕ 0 = O(1) , in general, it is necessary to solve for CC dispersion relation and polarization from Eq. (120) in the satisfied. Such energy-momentum exchanges, thus, could lead to efficient acceleration/heating, current/flow, as well as cross-field transports; that is, wave-induced collisionless transports in the charged particle's phase space. The self-consistent analysis of charged particle's phase-space dynamics and the corresponding dynamics of collective electromagnetic fluctuations represents, indeed, fundamental and complex investigations on the frontier of plasma physics research. Such complexities, in one aspect, are associated with the complexities of phasespace dynamics of charged particles in the presence of electromagnetic fluctuations that vary, self-consistently, in space and time. Detailed analyses on this topic are beyond the intended scope of this review. Interested readers are referred Ref. Chen and Zonca (2016) for further discussions. In this review, we assume the fluctuations have sufficiently broad spectral widths and finite but small intensities, such that charged particles diffuse stochastically in the phase space and we may employ the quasi-linear description. Furthermore, we will limit our considerations to KAWs in a slab plasma. More general analyses in realistic geometries can be found in Chen (1999).
Let x be the non-uniformity (radial) direction and 0 = B 0 (x)ẑ . Assuming ≪ 1 , B 0 is then approximately constant. The particle distribution function, f, can be decoupled into an "equilibrium" component, F 0 , and a fluctuating component, f , that is: where t with ≪ 1 denotes that F 0 is slowly varying in time and vary spatially only in the non-uniformity x direction. f , meanwhile, is given by the linear gyrokinetic equations, Eqs. (14) and (15) with, however, F M and (−qF M ∕T) replaced, respectively, by F 0 and (q∕mv ∥ )( F 0 ∕ v ∥ ) . Employing the nonlinear gyrokinetic equations (Frieman and Chen 1982;Brizard 1995), it is then straightforward to show that F 0 satisfies the following quasi-linear gyrokinetic equations (Chen 1999): where L g is given by Eqs. (16) and (17), that is: Meanwhile, G res in Eq. (90) represents the contribution of resonant particles to f (Chen 1999), i.e., from Eq. (15):  . We can also employ Eq. (90) to derive the equation for energy transport and heating. Letting K = ⟨mv 2 F 0 ∕2⟩ v , it is then straightforward to show (Chen 1999): where q x = q xc + q xd is the energy flux with: and S is the local heating rate: Similarly, we can derive the following equation for parallel momentum transport and generation (Chen 1999): where P ∥ = ⟨mv ∥ F 0 ⟩ v , x = xc + xd is the parallel momentum flux: and is the local effective parallel force due to KAWs. Multiplying Eq. (116) by (q/m) naturally leads to the equation of current transport and generation.
As noted in Chen (1999), the transport equations derived above have the appealing physical pictures that transports as well as acceleration/heating may be viewed as "collisions" between charged particles of energy = mE = mv 2 ∕2 and generalized x � F 0 ⟩ v momentum = = m + q ∕c with wave packets or quasi-particles of energy = and momentum = . The transport equations also clearly demonstrated that, in collisionless plasmas, wave-particle resonances are responsible for phase-space transports, and the transports consist of both convective and diffusive components with coefficients depending critically on the spectral properties of, in this case, KAWs.

Conclusions and discussion
In this paper, we argue that short-wavelength KAWs are ubiquitous in realistic nonuniform magnetized plasmas due to the very existence of SAW continuous spectra. Employing the powerful theoretical tool of gyrokinetic equations, we then re-examine and explore further the linear and nonlinear physics of KAWs. Our analyses clearly demonstrate that kinetic effects due to, e.g., finite ion-Larmor radii, can qualitatively and quantitatively modify the nonlinear processes. More specifically, we show that in contrast to the MHD fluid description, the FILR effects lead to the significantly enhanced electron-ion de-coupling, which, in turn, leads to significantly enhanced nonlinear coupling coefficients. Our analyses, furthermore, suggest that the KAW turbulence spectra will be more isotropic than those according to the MHD description. In addition, convective cells could be nonlinearly excited only in the short-wavelength regime. These spectral properties obviously carry important implications to the symmetry-breaking wave-induced transports of charged particles.
In other words, based on our theoretical studies, we submit that one needs to employ first-principle-based self-consistent kinetic or gyrokinetic theories to develop a reliable and accurate understanding of KAW physics; especially, when effects associated with nonlinearities, realistic non-uniformities, and geometries are considered.
Since the primary aim of the present paper is to illuminate physics of KAWs based on the gyrokinetic theory approach, our focus, therefore, has been on the fundamental processes. This paper, thus, is not, and, indeed, never intends to be a comprehensive review of all aspects of the rich KAW physics. For complimentary readings, we refer to the monograph by Wu (2012), the review article by Chen and Zonca (2016), and the recent works by Qiu et al. (2019) for KAWs in fusion tokamak plasmas.
As we, hopefully, have demonstrated, there are many interesting issues; especially, in the nonlinear regime, associated with the KAW physics. Some of them remain little explored; for example, the phase-space dynamics of nonlinear wave-particle interactions as well as the physics of the fully developed KAW turbulence, including frequency/wave-number cascading and, possibly, filamentary structures via nonlinear excitations of convective cells. Obviously, careful studies of these physics issues employing the powerful gyrokinetic approach analytically and/ or via numerical simulation will make significant impacts to our deep understandings of the charged particle dynamics and Alfvenén wave turbulences in nature and laboratory plasmas.