1 Introduction and motivation

An overwhelming amount of observational data provides clear and compelling evidence for the presence of dark matter (DM) on a wide range of scales. This component, which is believed to be responsible for the “missing” mass in our universe, is the main ingredient for all the structures we have in our universe. This is one of the oldest unsolved problems in cosmology, being traced back to the 1930s (Zwicky 1933; Bertone and Hooper 2018), and also one of the best measured ones. The evidence for dark matter first emerged from the study of the rotation curves of galaxies. From pioneering studies (Rubin and Ford 1970), it was already evident that the amount of matter necessary to fit the flat observed rotation curves did not match the theoretical curves predicted, assuming Newtonian mechanics and taking into account only the visible matter present in those galaxies. Dark matter was proposed as an additional (non-luminous) component to explain this discrepancy.

Nowadays, the evidence for dark matter comes from precise measurements on a wide range of scales. From sub-galactic and galactic scales, to clusters, going up to the large-scale structure (LSS). On cosmological scales, the observed anisotropies of the cosmic microwave background (CMB) (Ade et al. 2016), together with data from Type Ia Supernovae, determine the total energy density of matter with high precision. This together with the bounds on the abundance of the light chemical elements from Big Bang Nucleosynthesis, which constrains the amount of baryonic matter in the universe, strongly shows the need for a clustering component of non-baryonicFootnote 1 origin, that does not interact (strongly) with photons, and that dominates the matter content of the universe, accounting for approximately \(85\%\) of all matter. The same non-luminous and clustering component is necessary to explain the structures we see in our universe today, as is evident in observations of the large-scale structure of our universe (Anderson et al. 2014; Tegmark et al. 2004).

With all this evidence coming from precise astrophysical and cosmological observations, cosmologists have converged to a phenomenological model to describe our universe, the \(\varLambda \)CDM model. This model is currently the concordance model of cosmology and it accumulates a number of observational successes. It exhibits outstanding agreement with current cosmological observations (Anderson et al. 2014), which is manifested in the parameters of this model being constrained at the percent and sub-percent level. This incredibly simple model is described by only six parameters and parametrizes a large amount of the universe’s history. It describes a universe that is flat and seeded by nearly scale invariant perturbations, composed of baryons, which amount to approximately \(5\%\) of the energy density of the universe, a small radiation component, but in its majority is composed of two unknown ingredients. The energy budget of the universe is dominated (\(\sim 70\%\)) by a component responsible for the current accelerated expansion of the universe called dark energy, and a clustering component, the dark matter, making up to \(\sim 25\%\) of our universe. These large-scale observations give a coarse-grained description of these non-baryonic components in the hydrodynamical limit where dark matter is described as a perfect fluid with very small pressure (\(w\approx 0\)) and sound speed, \(c_\mathrm {s} \approx 0\), that does not interact, at least strongly, with baryonic matter. Dark energy is parametrized by a cosmological constant, the simplest model for the present accelerated expansion of our universe.

Therefore, within \(\varLambda \)CDM, the Cold Dark Matter (CDM) paradigm emerged from the large-scale observations and describes the component responsible for the formation of the structures of our universe through gravitational clustering. In the CDM model, DM is described by a perfect fluid that must be massive, sufficiently cold, which means non-relativistic at the time of structure formation, and collisionless to explain the observational data on large linear scales. This coarse-grained description of a CDM is very successful in fitting the linear, large-scales observations from the CMB, LSS, to clusters, and general properties of galaxies.

However, even though we know the hydrodynamical properties of DM on large-scales to a very high precision, the microphysics of the DM component remains unknown. This allows for the creation of a plethora of possible models of DM. Those models recover the large scale properties of CDM, but invoke very different objects and phenomena to play the role of DM.

This incredible variety of viable models of DM can be seen in the huge range of masses those models cover, as shown in Fig. 1. This figure shows many different broad classes of DM models, and each of which might contain many different specific models. It spans more than 80 orders of magnitude and shows very different hypothesis for DM, from new elementary particles, to composite objects (Jacobs et al. 2015; Khlopov 2019), up to astrophysical size primordial black holes (for a review on recent bounds, see (Carr et al. 2020; Carr and Kuhnel 2020)). This shows us that although we have gathered a lot of knowledge about the gravitational properties of DM, the nature of DM is still elusive, with the current data still allowing a huge amount of highly different models.

Fig. 1
figure 1

Sketch (not to scale) of the huge range of possible DM models that have been conceived. They span many orders of magnitude in mass, with DM represented by very distinct phenomena, ranging from new elementary particles to black holes

The possibility that dark matter could be a long lived particle is very appealing, especially, if these candidates are expected candidates from extensions of the standard model of particle physics. One class of models that became the preferred candidates for the DM particles are WIMP, weakly interacting massive particles, which represent new elementary particles that interact with baryons not only gravitationally but also through the weak force or a new force of comparable strength (Roszkowski et al. 2018; Lin 2019). The strong motivation for this candidate is because if it is thermally produced in the early universe, the relic abundance of particles that have mass of the order of the electroweak scale, and a coupling of order one, corresponds precisely to the abundance of DM in our universe. The possibility that WIMP could also be discovered by direct detection experiments is also an important motivation to search for this candidate. There is a great experimental effort to constrain the properties of WIMP DM with the parameter space being very restricted over the past few years. Given the complex phenomenology from the possible models of WIMP DM and their interaction with the standard model particles, the translation of those bounds to the exclusion of WIMP models is not straightforward. The cosmological and astrophysical behaviour of all the classes of WIMP models is similar to CDM, so the avenue to probe this scenario is through direct, indirect and collider experiments (for a complete review of all the searches, current and projected limits on WIMP detection for both spin-dependent and spin-independent models, together with indirect detection and collider searches, see (Arcadi et al. 2018).)

Another candidate that comes from extensions of the standard model of particle physics is the QCD axion. The axion was introduced to address the strong CP problem of quantum chromodynamics (QCD) (Peccei and Quinn 1977; Weinberg 1978; Wilczek 1978). The axion can be used in many different contexts in cosmology, including as a candidate for DM. The QCD non-perturbative effects induce a potential for the axion. During the radiation dominated period, the QCD axion starts to oscillate at the bottom of its potential and the axions behaves like dust, contributing to the energy density of the universe as non-relativistic matter. The QCD axion couples weakly to the standard model, which motivated an experimental effort for its direct detection (see these references for a review of axion direct detection searches (Di Luzio et al. 2020; Sikivie 2020; Graham et al. 2015a; Battesti et al. 2008)).

Although we have these very well motivated candidates from particle physics, we still have no conclusive evidence for electroweak or other non-gravitational interactions for dark matter. All the knowledge we have about dark matter is gravitational. We know that CDM describes the behaviour of DM very well on large-scales. However, this beautiful and simple coarse grained description of DM as the CDM is challenged by some curiosities that appear on small scales.

As the observations and simulations of the small non-linear scales and galactic scales improve, a number of challenges have emerged for this coarse grained description from \(\varLambda \)CDM. These discrepancies have been around for decades, such as the cusp–core problem, the missing satellite problem and the too big to fail problem. A particularly curious challenge is the regularity/diversity of rotation curves. One thing that is surprising about galaxies is that they are extremely diverse, but at the same time they are incredibly regular. This fact is manifest in several empirical scaling relations, such as the well-known Baryonic Tully–Fisher relation (BTFR; McGaugh 2005, 2008). The BTFR shows the correlation between the total baryon mass (including stars and gas) of the galaxy with the asymptotic rotation velocity in galaxies. The measured BTFR follows a scaling relation different from the one predicted by \(\varLambda \)CDM, and it holds for a range of 6 orders of magnitude in mass, with very small scatter. The significance of these discrepancies is disputed and addressing these challenges is an active field of research. Those challenges emerge on scales where baryonic physics is relevant and simulations including several baryonic effects have been perfected pointing in the direction that baryons could possibly explain some of these observations within \(\varLambda \)CDM.

As the physics of these baryonic processes is complex and as there is no final consensus about the status of theses discrepancies, an alternative explanation for these discrepancies on small-scales could be that DM is not the usual CDM, but a component that has different phenomenology on small-scales. Even setting aside the small-scale problems, given that the observational constraints on these scales are less strong than on cosmological scales, the dynamics on small-scales can offer a chance to probe the properties of DM in the hope to help find the microphysics of this component. Therefore, the small-scales are a good laboratory to probe the nature of DM models that have impacts on astrophysical scales.

The simplest modifications of the DM paradigm that have a different phenomenology on small-scales, and can potentially address some of the small-scales discrepancies is the warm dark matter (WDM) model (Colin et al. 2000). In this model, DM has a small mass leading to a thermal velocity dispersion, modifying its behaviour on astrophysical scales while maintaining the large-scale predictions of CDM. Even with a small velocity, DM free streams out of potential wells and is enough to suppress the formation of small scale structures addressing some of the small-scale problems. Another popular model inspired by those discrepancies is the self interacting DM (SIDM) (Spergel and Steinhardt 2000), where the DM particles have a self-interaction in a way to also suppress the formation of structures on small-scales.

In the past few years, another class of alternative models has (re)emerged as an appealing class of DM models given their rich phenomenology on small-scales. These are the ultra-light dark matter models (ULDM), where DM is composed by ultra-light bosons with masses in the range \(10^{-24}~\mathrm {eV}< m < \mathrm {eV}\). Given the small masses of these bosons, DM forms a condensate or a superfluid on galactic scales. The idea is that the wave nature of DM on galactic scales provides a non-CDM behavior which leads to a different and rich phenomenology for DM on those scales. On large scales, DM behaves as CDM, although with different initial conditions for the ULDM in comparison to CDM, maintaining the observable successes of CDM on those scales.

There are many different realizations of this interesting non-CDM phenomenology on small-scales. Depending on the modelling of the ULDM, these produce distinct condensate structures and lead to a different phenomenology. There are many different models in the literature describing these possibilities. We classify them in this review into three categories, according to the different condensate structure they form. These three classes are the fuzzy dark matter, when the ultra-light scalar field system is only subjected to gravity; the self-interacting Fuzzy DM when the system also presents (weakly) self-interaction, and superfluid DM, where DM forms a superfluid on galactic scales.

This classification is general and based only on the non-linear structure it forms in the halo of galaxies, which is a consequence of the non-relativistic theory they describe. These can be purely phenomenological models of ULDM on small-scales, in the absence of a microscopic description. These classes also contain microscopic scalar field theories like QCD axion, axions (coming from other origins) or axion like particles (ALP), which are part of the FDM class. Each of these categories have different properties which lead to different astrophysical consequences, that can be probed by current and future astrophysical observations. For this reason, ULDM models have regained interest in the community in the past few years, with new and exciting experimental effort to probe many aspects of the small astrophysical scales, opening the avenue to test these models and answer some questions about the nature of DM.

Motivation for this review and detailed plan

This review has the goal of giving an overview of the ultra-light dark matter (ULDM) candidates, focusing in their gravitational effects and mostly in their different phenomenology on small-scales.

There are many very good and complete reviews in the literature describing specific models of ULDM or the microscopic models that can be part of the ULDM class like. There are many reviews of axions in cosmology (Sikivie 2008; Arvanitaki et al. 2010; Wantz and Shellard 2010; Kim and Carosi 2010; Kawasaki and Nakayama 2013; Marsh 2016a) and ALPs (Ringwald 2014; Arias et al. 2012; Graham et al. 2015a; Marsh 2018; Niemeyer 2019; Powell 2016). Axions and ALPs have a whole rich phenomenology of its interactions to the standard model that will not be explored here, but that can be seen in the following reviews (Marsh 2016a; Graham et al. 2015a). The FDM model also has a huge body of literature with many excellent reviews like (Hui et al. 2017; Suárez et al. 2014; Ureña-López 2019).

We propose to do something different in this review. Instead of studying one single model or a specific microscopic theory, we study many ULDM models interested in the gravitational phenomenology that these models present. We study the ULDM models by dividing them into classes according to their dynamics on small-scales. The three classes proposed in this review encompass many of the models cited above, with the inclusion of (weakly) self-interacting models and the DM superfluid model. We believe this classification is instrumental and shows the general behaviour and phenomenology that each of these model have inside each class. Therefore, we hope to bring not only a new classification that encompasses many of the models present in the literature, but also to include new models, trying to make a big overview of the entire class of ULDM models.

Another new feature this review brings is a brief review of BEC and superfluidity, and the different descriptions of these phenomena. Condensation in each of these classes might arise in a different way, given their different descriptions. Bose–Einstein condensation and superfluidity are very well understood and well studied macroscopic quantum phenomena in condensed matter physics, being largely studied theoretically and experimentally. However, these phenomena are not so well understood in gravity. Therefore, understanding their definition, description and differences is particularly important to understand if condensation arises in theses models, and the difference in the condensate structure that is expected to form in each of the classes of ULDM models.

I take this opportunity to also discuss briefly the different views in the literature about the formation of a condensate and the scales where this effect takes place.

With that, we aim to give a general picture of the state of the field to date, trying to describe all the classes of ULDM present in the literature. We hope this review can be a resource to researcher entering this exciting field.

The review is organized as follows. First, in Sect. 2, we start by describing the small scale challenges of \(\varLambda \)CDM, as a motivation to show the discrepancies these alternative models of DM aim to address. The goal of this section is not only to show the problems that some of the ULDM models might solve, but also to introduce the reader into some of the concepts of galactic astrophysics. In this way, the reader can understand some of the interesting phenomenology that the ULDM models have on small-scales that differ from the ones predicted by the CDM paradigm. Next, in Sect. 3, we introduce the basic concepts of the quantum phenomena of BEC and superfluids. In this section, we describe these phenomena, describe approximations and and the structures formed in those system with and without rotation, all in the context of condensed matter physics where they are well defined, understood and tested. The goal is to give a sound basis to the reader so they can understand with a critical view how these concepts can be applied to the case of DM in the next section, given the analogies, approximations and generalizations done in the literature of ULDM. Following this we are ready to describe the main topic of the review, the ULDM models in Sect. 4. We start by describing the three classes that we propose to classify the models of ULDM based on the type of non-linear they describe. We then talk about the fuzzy DM and the self-interacting BEC DM models, showing the conditions for them to condense on galactic scales. We then focus on the fuzzy DM model, showing how and in which conditions the model attempts to solve the small-scale challenges, and the interesting astrophysical consequences this class presents. We then talk about the superfluid DM model describing its condensation on galactic scales, the formation of the superfluid core and its observational consequences. We also discuss the stability of this construction, and its possible extension to cosmology. The constraints in these models and new windows of observations of the effects of these models are discussed in Sect. 5. We will constraints from different observations. In the case of the FDM, the current bounds show that the mass range where an interesting phenomenology is expected on small-scales is strongly constrained. We conclude the review summarizing our discussion.

Since there is no unique literature this review is based on, but a series of reviews and articles referring to specific topics, the main references used are cited in the corresponding sections. The only exception is Sect. 2 that is based mainly in the following reviews (Bullock and Boylan-Kolchin 2017; Del Popolo and Le Delliou 2017; Famaey and McGaugh 2012).

Conventions In the entire review, natural units are used, where \(c=\hbar =1\), unless stated otherwise. The exception is Sect. 3 where all the \(\hbar \) factors are present. With that, the reduced Planck mass is given by \(M^2_{\mathrm {pl}}=1/8 \pi G\), where G is the Newtonian gravitational constant. Unless stated otherwise, the metric signature used is \(\left( +, -, -, - \right) \), and Greek letters are indices going from \(\mu , \nu = 0, \dots , 3\). Sometimes for simplicity we describe partial derivatives as \(\partial _{\mu }=\partial / \partial x^{\mu }\). In the text gray boxes bring definitions necessary for the understanding of the topics in the section or following sections. Frames text and equations refer to important results or discussions that we would like to highlight.

2 Small-scale challenges of cold dark matter

In the concordance model of cosmology, DM is described by the CDM paradigm. This hydrodynamical description for DM is in very good agreement with observations from large-scales. This can be seen in the power spectrum (P(k)), which is the Fourier transform of the two-point correlation function of the matter density perturbations,

$$\begin{aligned} {\Delta } ^2 (k) = \frac{1}{2 \pi ^2} k^3 \, P(k)\,, \end{aligned}$$
(1)

represented here by the dimensionless power spectrum where k is the wavenumber of the fluctuation, shown in Fig. 2. The large-scales (around \(k \lesssim 0.1 \, \mathrm {Mpc}^{-1}\)), measured by the CMB and LSS galaxy surveys, show a good compatibility with the CDM model. This agreement is also robust for the non-linear intermediary scales (\(k \sim [10^{-1} - 10] \, \mathrm {Mpc}^{-1}\)) with constrains from clusters, weak lensing and Ly-\(\alpha \) forest. As we go to smaller and highly non-linear scales (\(k \gtrsim 10 \, \mathrm {Mpc}^{-1}\) equivalent to \(M \lesssim 10^{10}\, M_{\odot }\)), these constraints are less strong, and might retain important information about possible deviations from the CDM paradigm. We can see on the right side of Fig. 2 on galactic and subgalactic scales, different models of DM would behave very differently from the expected linear behaviour of CDM and this could be probed by the observations on those scales (Zavala and Frenk 2019; Kuhlen et al. 2012).

Fig. 2
figure 2

In this figure, inspired by Kuhlen et al. (2012), we show how the dimensionless power spectrum can be probed by many large-scale and small-scale observables, which can be seen as a function of the wavenumber k. The solid line shows the linear dimensionless power spectrum coming from a \(\varLambda \)CDM universe. To show how the small-scales might reveal different behaviour for different DM components, we show the linear power spectrum of warm DM (WDM) with mass of \(10 \, \mathrm {keV}\) (red dotted line), WDM with mass of \(4 \, \mathrm {keV}\) (green dashed line), and for fuzzy DM with mass \(10^{-22} \, \mathrm {eV}\) (orange dash-dotted line). The gray dotted horizontal line represents the limit from linear to non-linear regime, where \({\Delta } \sim 1\). The power spectra for \(\varLambda \)CDM and for WDM were generated using the Boltzmann code CLASS (Lesgourgues 2011; Lesgourgues and Tram 2011), and for the fuzzy DM using AxionCAMB (Lewis et al. 2000; Hložek et al. 2015)(The parameters used to generate these power spectra were: \(\varOmega _b h^2 = 0.022\), \(\varOmega _c h^2 = 0.12\), \(h=0.67\), \(n_s=0.96\), \(A_s=2.2 \times 10^{-9}\), and \(\tau =0.09\).)

On small-scales, the formation of structures is highly non-linear and the evolution of structures is studied using large numerical simulations. In the past few years, those simulations improved in size and precision, simulating the cosmological and small-scales. But when compared to the observations of galaxies, a number of discrepancies emerged, revealing some curious behaviour on small scales. Given the enormous success of the concordance model, these discrepancies attract a lot of interest of the community. They might represent that we need to better take into account the astrophysical processes that happen inside those regions, which indeed have a complex dynamics. Or this might indicate that the CDM model is not good to describe the physics on small-scales and the coarse grained CDM paradigm needs to be revised. An even a more radical approach would be to modify gravity on smaller scales.

In this section, we present very concisely the theory of non-linear structure evolution. We show how the numerical predictions assuming the concordance model might be in tension with the current observations of galaxies. These tensions are seen in the counts and density of low-mass objects, and in the scaling relations that show the tight regularity that galaxies present. We highlight in this section some of the concepts that are going to be used in the ULDM section and that might not be too familiar for researchers from fields of dark matter phenomenology and cosmology.

figure a

Footnote 2

2.1 Dark matter halos and substructures

A halo can be described as a virialized spherical mass concentration of dark matter. Halos are formed by gravitational collapse of a non-linear overdense regions that stopped expanding to collapse into a sphere in virial equilibrium.Footnote 3 The virialization of the halo happens through violent relaxation, where the DM particles scatter on small fluctuations of the gravitational field present in this distribution, taking a time \(t_{\mathrm {dyn}}\), the dynamical time, to fully cross the sphere. Once this process is completed, at \(t_{\mathrm {coll}}\), the dark matter halo has a radius approximately 1/6 of the radius of the region it collapsed from, and average density (Schneider 2015)

$$\begin{aligned} \langle \rho \rangle = \left( 1 + \delta _{\mathrm {vir}} \right) {\bar{\rho }}(t_{\mathrm {coll}})\,, \end{aligned}$$
(2)

where \({\bar{\rho }}\) is the mean density, and \( \left( 1 + \delta _{\mathrm {vir}} \right) \approx 178 \, \varOmega _{\mathrm {m}}^{-0.6}\). Given this, the dark matter halo is defined as the spherical region where the density is approximately 200 times the critical density of the universe at a given redshift, with mass given by

$$\begin{aligned} M_{200}=\frac{4\pi }{3} R_{200}^3 \, 200 \rho _{\mathrm {cr}}\,, \end{aligned}$$
(3)

where \(\rho _{\mathrm {cr}}=3 H^2(z)/ (8\pi G)\). The virial velocity is given by the mean circular velocity at the virial radius, \(V^2_{200} \equiv G M_{200} / R_{200}\). With that, one can express the evolution of the mass and virial radius with respect to \(V_{200}\): \(M_{200}={V^3_{200}/10\,G H(z)}\) and \(R_{200}=V_{200}/ 10 H(z) \). We can see from these expressions that halos that form early in the evolution of the universe are less massive, while late-forming halos are more massive and larger.

This definition is not unique and depends on the choice of the virial overdensity parameter,Footnote 4\(\varDelta \), which above was taken to be \(\varDelta =200 \rho _{\mathrm {cr}}/{\bar{\rho }}\). More generally, (3) can be written as \(M_{\mathrm {vir}}=\left( 4\pi /3 \right) R^3_{\mathrm {vir}} \, \varDelta \, {\bar{\rho }}\). The values of \(\varDelta \) can vary in the literature, with some common definitions being \(\varDelta = 333\) at \(z=0\) for a fiducial cosmology given by (Ade et al. 2016), which asymptotes to \(\varDelta =178\) at high-z (Bryan and Norman 1998); or a fixed \(\varDelta =200\) at all redshifts, usually denoted by \(M_{200\mathrm {m}}\).

We identify the DM halos from numerical simulations, the N-body code \(\hbox {P}^3\)M (Efstathiou et al. 1985), and can extract from them the abundance of halos as a function of their mass for a given redshift. The individual halos can also be analyzed in those simulation and the radial mass profile can be determined. A surprising feature encountered in those simulations is that halos appear to have a universal density profile, averaged over spherical shells. Their functional form is characterized by the Navarro, Frenk and White (NFW) profile (Navarro et al. 1997),

$$\begin{aligned} {\rho _{\mathrm {NFW}} (r)= \frac{\rho _{\mathrm {s}}}{\left( r/r_{\mathrm {s}} \right) \, \left( 1+r/r_{\mathrm {s}} \right) ^2} \rightarrow \left\{ \begin{array}{lr} 1/r \,, &{} \, \,\,\mathrm {for }\,\,\, r \ll r_{\mathrm {s}} \\ 1/r^3\,, &{} \, \,\, \mathrm {for }\,\,\, r \gg r_{\mathrm {s}} \end{array} \right. } \end{aligned}$$
(4)

where \(r_{\mathrm {s}}\) is the radius where the slope of the profile changes and \(\rho _{\mathrm {s}}=\rho (r_{\mathrm {s}})\). We can see that this profile diverges towards the center of the halo, presenting a cusp. The amplitude of the density profile can be written in terms of \(R_{200}\), as we can see from (3),

$$\begin{aligned} {\bar{\rho }}=\frac{3}{4\pi R_{200}} \int _0^{R_{200}} 4 \pi r^2 \rho (r) \, {\text {d}}r = 3 \rho _{\mathrm {s}} \int _0^1 \frac{x^2}{cx \left( 1+cx \right) ^2} \, {\text {d}}x\,, \end{aligned}$$
(5)

where \(x=r/R_{200}\), and \(c := R_{200}/r_{\mathrm {s}}\) is the concentration index and describes the shape of the distribution. With that, the NFW profile can be determined completely by \(R_{200}\) (or \(M_{200}\) or any other halo radius definition), and the parameter c. The shape of the concentration can be inferred from the same \(\hbox {P}^3\)M simulation, where \(c \propto \left( M/M^{*} \right) ^{-1/9} \left( 1+z \right) ^{-1}\). We can see that, early-forming halos have a smaller radius, and they are denser than the larger ones, given the higher concentration. The NFW profile can be generalized for a three-parameter profile that better fits the DM profile of halos for all ranges in mass (Einasto 1965; Navarro et al. 2004; Gao et al. 2008).

Above we presented the spherically averaged density profiles of DM halos, described by the NFW profile. Although this presents a good fit to DM N-body simulations (that assume spherical symmetry and use shells that are distributed radially) and some observations, halos are not spherical. From halo and large cosmological simulations (Schneider et al. 2012; Jing and Suto 2002; Kasun and Evrard 2005; Bailin and Steinmetz 2005; Allgood et al. 2006), we can see, however, that the majority of the DM halos are elliptical or triaxial, with their axis aligned with the cosmic web structure. This non-spherical structure and intrinsic alignment might come at formation of the halos from the tidal field. The halo triaxiality plays a crucial hole in the interpretation of lensing data, cluster morphology and Sunyaev–Zeldovich measurements (for reviews on this topic, see (Joachimi et al. 2015; Kiessling et al. 2015), and needs to be taken into consideration.

figure b

Footnote 5Footnote 6

2.2 Discrepancies in comparison with observations

In this section, we will show how some of the theoretical predictions from simulations of the small-scales considering the \(\varLambda \)CDM model compare with respect to astrophysical observations. However, this comparison is not straightforward, since we indirectly probe the dark matter inferring it from the visible matter that traces the gravitational potential of galaxies and clusters. There are a few approaches to connect the information of galaxies and the dark matter halos like forward modelling, abundance matching and kinetic measurements, and each of those methods has its difficulties and limitations. The result of this comparison is a series of discrepancies that challenges the results of the simulations, and in some cases limitations in observations. We will present some of these challenges in this section. Some of those challenges might have complementary origin and solution, and are indeed connected, as we will discuss below.

2.2.1 Cusp–core

As we saw above, the expected density profile from collisionless simulations is the NFW profile which is cuspy towards the central region of the halo. Given the complex dynamics of baryonic matter in some galaxies, good laboratories to probe the halo structure are low surface brightness (LSB) galaxies and late-time dwarfs. Those systems are dominated by DM throughout their halo up until the central regions. Measuring the rotation curves of dwarf galaxies, (Flores and Primack 1994; Moore 1994) found that those measurements preferred cored isothermal profiles. Many other measurements of the rotation curves of those systems (McGaugh and de Blok 1998; Côté et al. 2000; van den Bosch and Swaters 2001; Borriello and Salucci 2001; de Blok et al. 2001a, b; Marchesini et al. 2002; Simon et al. 2005; Gentile et al. 2005, 2007; Kuzio de Naray et al. 2006, 2008) have confirmed this discrepancy, showing a constant density core with a profile with a slope \(\gamma =\) 0–5 (considering the profile at small radius given by \(\rho \sim 1/r^\gamma \)). The smallest values for this slope from dissipationless simulations are too large in comparison to the ones obtained by observations.

The recent measurement of nearby dwarf galaxies from the survey THINGS (HI Near Galaxy Survey) (Oh et al. 2011) and LITTLE THINGS (Oh et al. 2015) confirmed this discrepancy. Measuring the rotation curves from 7 to 26 nearby dwarfs, they found that the inner slope is much smaller than the NFW one (\(\gamma =-1\)), with \(\gamma = 0.29 \pm 0.07\) for the LITTLE THINGS survey, as we can see in Fig. 3.

Fig. 3
figure 3

Figure adapted from Oh et al. (2011), showing the results from the THINGS and LITTLE THINGS surveys. The plot shows a comparison of the velocity versus radius (rotation curve) (left panel) and density versus the radius (right panel), normalized by \(R_{0.3}\), and \(V_{0.3}\) and\(\rho _{0.3}\), from the theoretical parametrizations of the NFW potential (solid line) and the pseudo-isothermal (dashed line), with the simulated galaxies (Governato et al. 2010), represented in the plots by the legend DG1 and DG2. In the plots observational data from 7 dwarf galaxies measured by THINGS are represented by the other points. The crosses represent the median values of the LITTLE THINGS rotation curves and density profile. We can see that the galaxies seem to follow a cored profile, while NFW predicts a cusp

The situation is more complex for high surface brightness (HSB) objects given its complex inner density structure; or for galaxies with large mass, like spiral galaxies, where at small radii is dominated by baryonic matter. Even in the case of dwarf galaxies, it was pointed out that some systems present cuspy profiles, while others cored ones, presenting an unexpected diversity in the rotation curves (Oman et al. 2015). Since different results were obtained by different techniques for the same system, this shows that determining the inner slope of galaxies is a hard task.

The origin for these discrepancies can come from the fact that the simulations take into account only DM, while the properties of galaxies are also influenced by the presence of baryons. The newest hydrodynamical simulations obtained by many independent groups have shown that baryonic feedback can in fact soften the inner cusps in the profile and generate core-like profiles like the ones observed for dwarf galaxies. The main effects are supernova feedback flattening and dynamical friction from baryonic clumps (for a more detailed list of these and other baryonic processes, see (Del Popolo and Le Delliou 2017)). These simulations show a threshold mass of \(M_{\mathrm {vir}}\sim 10^{10}\, M_\odot \) below which the simulation predict profiles that are cusped (Governato et al. 2012; Munshi et al. 2013; Madau et al. 2014; Oñorbe et al. 2015; Tollet et al. 2016; Fitts et al. 2017).

However, not all simulations agree with this result. Additionally, modelling those baryonic feedback effects is challenging, and introduce many new parameters and uncertainties in modelling assumptions. Finally, not all baryonic processes that might influence the formation and dynamics of galaxies were included in the simulations, and that might reveal to be important for the result. It is clear that the inclusion of baryonic effects is hinting in the right direction, but until consensus is achieved, alternatives need to be considered. As mentioned before, a modification of the properties of DM might in a simple way account for that, as we will show for the case of Bose–Einstein condensate DM. An early solution to the cusp–core problem, and that explains the rotation curves with exquisite precision is a modification of the dynamics of gravity on small-scales, the MOdified Newtonian Dynamics (MOND). This is also a solution for the regularity versus diversity challenge, and its main points and shortcomings will be presented at the end of this section.

2.2.2 Missing satellites

Structure formation is hierarchical in nature and it is expected that the DM halos are also populated by small subhalos. This is confirmed in \(\varLambda \)CDM simulations of Milky-Way-sized halos, which show that the subhalo mass function diverges toward low masses, limited only by the numerical limit. Those simulations then predict several hundreds of subhalos with \(v_{\max } \sim 10\)\(30 \, \mathrm {km/s}\), that are large enough to host a galaxy (\(M_{\mathrm {peak}} \gtrsim 10^7 \, M_\odot )\), where \(M_{\mathrm {peak}}\) is the maximum virial mass the halos had when they formed. On the other side, until 2005 only 12 MW classical satellites were known, with 15 more confirmed ultra-faint satellite galaxies until 2014, with the data from Sloan Digital Sky Survey (SDSS) (Drlica-Wagner et al. 2015). To date, with the inclusion of Dark Energy Survey (DES) data, a few more ultra-faint candidates were discovered, with the known count of satellites of more than 50. However, the number of MW galaxy satellites is still much smaller than the number predicted from simulations. This is known as the missing satellite problem, and not only appears in the MW, but also in the Local Group.

DES and future observations are expected to discover more of those ultra-faint galaxies, which can alleviate this discrepancy, but there is still a debate if this will solve the problem. Another possibility is that low-mass subhalos are there, but we just cannot see them since they have very low baryonic content. One can expect that for low mass subhalos, galaxy formation is suppressed since the photoionizing background heats the gas, reducing its cooling rate and inhibiting gas accretion for \(M_{\mathrm {vir}} \sim 10^9\, M_\odot \) (Efstathiou 1992; Bullock et al. 2000; Benson et al. 2002; Bovill and Ricotti 2009; Sawala et al. 2016). Star formation is also suppressed since supernova-driven winds could strip the gas out of these halos (Dekel and Silk 1986). Other mechanisms can also suppress the baryon content in the low-mass galaxies, see (Del Popolo and Le Delliou 2017), like reionization suppression. So, the visible subhalos are only a set of the entire distribution of halos that contains the non-visible faint end. It was shown recently in the hydrodynamical simulations APOSTLE (Sawala et al. 2016; Zolotov et al. 2012; Zhu et al. 2016) that apparently this mechanism can solve the difference in the number of predicted and observed satellite galaxies, thus solving the missing satellite problem. But the question remains if this process needs to be too finely tuned to solve the problem.

Too big to fail

The above mechanism that could solve the missing satellite problem leads to another challenge: the too big to fail problem. When we say that the visible subhalos of the MW are only a set representing the most massive subhalos in the total distribution of subhalos, to have agreement with \(\varLambda \)CDM simulations these visible MW subhalos need to correspond to the most massive subhalos predicted by the simulations. But, the most massive subhalos predicted by those simulations have central massesFootnote 7(\(V_{\max } > 30\, \mathrm {km/s}\)) that are too large to host the observed satellite galaxies (Boylan-Kolchin et al. 2011, 2012), and the ones that have central mass like the expected by the MW (with \(12< V_{\max } < 25\, \mathrm {km/s}\)) are not the most massive ones. So, the puzzle is why should the most massive subhalos, where the gravitational potential is the strongest and the striping gas mechanisms cited above are not important, be too big to fail to form stars and galaxies? This is illustrated in Fig. 4. This problem also appears in the galaxies in the Local Group and Local Volume (Papastergis et al. 2015; Garrison-Kimmel et al. 2014), so it is not a specific property of the MW.

Fig. 4
figure 4

Figure adapted from Bullock and Boylan-Kolchin (2017), showing the circular velocity versus the radius of specific subhalos from the Aquarius simulation that have \(V_{\max } > 30 \, \mathrm {km/s}\) (magenta lines). Those are known to have halos that are very massive and expected to host the formation of starts. However, as we can see from the data points corresponding to classical MW satellites with masses \(M \sim 10^{5}\)\(10^{7} \, M_{\odot }\), in the most massive of those subhalos, with \(M > 10^8 \, M_{\odot }\) shown in the gray region of the plot, we do not observe satellites. This means that the more massive subhalos predicted by the simulations are too big to fail to form stars and galaxies

This higher central mass from the most massive sub-halos predicted by simulations in comparison to the MW dSphs seems to be a more general feature that appears in simulations. This discrepancy might indicate a more delicate issue related to the internal structure of the sub-halos. In this way, the too big to fail problem is more than just the problem related to the missing satellites problem, as stated above.

Like for the other problems, it was proposed that some astrophysical processes driven by baryons could be important on those scales and solve the too big to fail problem. However, these solutions seem to only work for the MW and for very efficient feedback, like the supernova feedback that only solves the too big to fail problem if very efficient. This is an intense topic of debate and no consensus appears to have been reached. As these notes were being written, there has been claims that the too big to fail problem has been solved (Ostriker et al. 2019).

As for the cusp core problem, different DM physics could solve those problems by having a mechanism that suppresses the formation of small-scale subhalos, and that reduces the central densities of massive subhalos (or modifies the dynamics of the central regions). We are going to show how the models with Bose–Einstein condensation address some of those problems.

2.2.3 Diversity vs. regularity: scaling relations

Although our universe came from very smooth initial conditions, nowadays the diversity of galaxies that we find in the universe is extraordinary. This incredible diversity of galaxies, though, presents a surprising regularity. This fact is manifest in several scaling relations that are shown to hold very tightly for a diverse range of galaxies. These relations relate the dynamical and baryonic properties of galaxies, and hold even for DM-dominated systems, and they are one of the most tantalizing aspects of galaxy phenomenology, representing the most pressing challenge for \(\varLambda \)CDM on small-scales.

The most famous of those relations is the Baryonic Tully–Fisher relation (BTFR) (McGaugh 2005, 2008), which relates the total baryon mass (including stars and gas) of the galaxy to the asymptotic circular velocity in galaxies, \(V_{\mathrm {f}}\) (this is the velocity measured at the flat portion of rotation curves):

$$\begin{aligned} V_{\mathrm {f}}^4=a_0 G M_{\mathrm {b}}\,, \end{aligned}$$
(6)

where \(a_0\) is the critical acceleration, a scale that appears in observations. Its value can be obtained from the data and given by \(a_0 \sim 1.2 \times 10^{-8} \, \mathrm {cm/s}\). The BTFR expands the regime of validity of the Tully–Fisher relation which relates the luminosity, instead of the total mass, to the circular velocity. Luminosity is a probe of the stellar mass, and in the BTFR, the observed gas mass is also considered on top of the stellar mass. This extends the validity of the scaling relation by many decades in mass. This empirical scaling relation is shown to hold for large ranges of masses, six generations, with a very small scatter, compatible to the size of the error bars. The left panel of Fig. 5 presents the BTFR. As we can see, the slope of the BTFR is different from the one predicted by \(\varLambda \)CDM, \(V_{\mathrm {f}}^3 \propto M_{\mathrm {b}}\), shown by the dashed line.

Fig. 5
figure 5

Left panel: the Baryonic Tully–Fisher Relation (BTFR), which shows the relation between the baryonic total mass (\(M_{\mathrm {b}}\)) and the asymptotic circular velocity. Dark and light blue points represent star and gas dominated stars, respectively. The dashed line represents the relation expected for \(\varLambda \)CDM, with slope equal to 3; while the dotted line which better fits the data, has slope 4. Image reproduced with permission from from Famaey and McGaugh (2012), copyright by the authors. Right panel: Plot of the Radial Acceleration Relation for 153 SPARC galaxies. The fit to the data is given by the solid line while the dotted line is the unit line. The insert is a histogram of the residuals. The red uncertainty bars represent the uncertainty in each individual point. The lower panel shows the residuals, and the red uncertainty bar shows the mean uncertainty on individual points. The dashed lines represent the rms value in each bin and the solid red lines represent the observational uncertainties and variation between the stellar mass-to-light-ratio from galaxies. Image reproduced with permission from McGaugh et al. (2016), copyright by APS

There is another general scaling relation that also displays the interesting behaviour of galaxies: the mass discrepancy acceleration relation (MDAR). This is more general since the BTFR can be obtained from the MDAR at large distances in the disk. The MDAR is a relation between the gravitational acceleration from baryons alone (\(g_{\mathrm {bar}}\)), from the distribution of gas and stars in galaxies (McGaugh et al. 2016; Lelli et al. 2017), and acceleration inferred from rotation curves (\(g_{\mathrm {obs}}=V^2/r\)). As it can be seen in the right panel of Fig. 5, this scaling relation shows a remarkably tight correlation between these quantities for very diverse and large number of galaxies This can be seen by comparing the interval determined by the solid red lines and the uncertainty in each individual point, represented by the red uncertainty bars on the top figure, with the dashed lines show that the data is compatible with negligible scatter.

This relation shows us that in regions of high acceleration, where \(g_{\mathrm {obs}} > a_0\) and baryons dominate, one has \(g_{\mathrm {obs}} \sim g_{\mathrm {bar}}\). For low accelerations, in the central regions where it is expected to be DM dominated, this relation deviates from the unit line. This suggests a very curious behaviour: the baryon mass distribution dictates the behaviour of the rotation curve at all radii, even for the regions expected to have less baryons. And this behaviour holds even for galaxies that are DM dominated.

These empirical relations, coming directly from observations, show the surprising feature that in galaxies the dynamics is dictated by the baryon content, even when DM dominates. Even more unexpected these relations are very tight, showing very little spread, even if they come from very diverse types of galaxies. As pointed out in Bullock and Boylan-Kolchin (2017), what dictates the dynamics in these correlations is the baryon mass, which is the sum of gas and stars, and not only the stellar mass, which is the one that is expected to correlate more with the total feedback energy.

Recently, it was shown by many groups that these relations can be explained within the \(\varLambda \)CDM paradigm (Navarro et al. 2017; Ferrero et al. 2017; Garaldi et al. 2018; Dutton et al. 2019; Navarro 2019) using the latest hydrodynamical simulations like EAGLE (Schaye et al. 2015; Crain et al. 2015), APOSTLE (Sawala et al. 2016), Illustris (Illustris 2014), ZOMG (Borzyszkowski et al. 2017; Romano-Diaz et al. 2017; Garaldi et al. 2017), and NIHAO (Wang et al. 2015). Those simulations include several baryonic effects (like star formation, stellar evolution, metal enrichment, gas cooling/heating, galactic outflows and BH feedback, among others) to their \(\varLambda \)CDM simulations.Footnote 8 Those new large volume and high-resolution simulations, like Illustris and EAGLE, have also been able to reproduce the features of the rotation curves of galaxies within \(\varLambda \)CDM. This large amount of progress in the simulation side is very encouraging.

However, some questions still remain. While the BTFR and the MDAR trends can indeed be reproduced by those simulations, it is pointed out by most of the authors that the scatter obtained in the scaling relations is larger than the one expected from data (some authors claim that this spread is correlated with the errors in the stellar feedback). The question remains though whether this is a matter of improving the feedback models and/or resolution of the simulation, or if given the stochastic nature of the feedback effects, they will ever be able to give such tight correlations. Another point that is important to be answered is about the importance of these baryonic feedbacks since these groups do not agree on how very sensitive to the feedback model the simulations are, which is intriguing. The way those effects are introduced in the simulations is by parametrizing their effects, instead of introducing all of these feedback mechanisms from first principles, which is understandable given the complexity of each of those phenomena. This includes many new parameters to the simulations. And different simulations might use different parametrizations. Those simulations also still do not go all the way until dwarf galaxies,Footnote 9 which are DM dominated and where most of the tension is. In summary, this is a very challenging and exciting field and a lot of progress has been done on the simulation side with results that are very encouraging to explain the formation and dynamics of galaxies. But there are some uncertainties in those results and the simulations still do not fully reproduce the observations.

2.2.4 What the small-scales tell us

As we saw above, the small-scales hold precious information that can help us understand astrophysical processes, or even the nature of DM. This is revealed by the challenges presented above, which show rich dynamics on galactic and sub-galactic scales. There are a number of ways that these discrepancies can be addressed. Within \(\varLambda \)CDM, this can done by including baryonic effects, which as we saw in the previous sections seem to address partially or completely some of those puzzles. Another proposal for solving some of the puzzles of galactic evolution is more radical and proposes a universe without DM that has a modified force law for small accelerations, the MOdified Newtonian Dynamics (MOND). See box below for a discussion of MOND.

A third avenue is to modify the DM paradigm. Different models of dark matter can affect the formation of structures in distinct ways, both in the linear and in the non-linear regimes. Therefore, the small-scales offer an opportunity to probe the microphysics of DM, beyond the hydrodynamical large-scale CDM paradigm. The non-linear regime can be specially changed by modifications of this paradigm, as we can see in Fig. 2. This regime can be probed using galaxies, and for even smaller scales satellite galaxies and studying substructures. This could help find new properties of DM, that could help elucidate its nature.

figure c
figure d

Footnote 10

The goal of this section was twofold. First, we wanted to give an overview of the so-called small-scale problems of the CDM, which was the motivation for some ULDM models to be proposed. We wanted to introduce the problems in a way that the reader can understand why the mechanisms proposed by the models in Sect. 4 address and solve each of those small-scale controversies. For example, we are going to see that the FDM model has to have a certain range of mass to solve the cusp–core problem and the satellite problem; or that the DM superfluid model has a modified dynamics of small-scales, reproducing the MOND behaviour, which explains the rotation curves of galaxies and the scaling relations.

The second goal, and perhaps the most important was to give an brief overview of the rich astrophysics that takes place in galaxies, introducing important concepts and observations available on these scales. This is important since the main feature of the ULDM models is to present a new phenomenology on small-scales coming from the non-CDM behaviour of ULDM. In this way, the small-scale observables offer an important window to test the nature of DM.

3 Bose–Einstein Condensation and Superfluidity

In this section, we present a short review of Bose–Einstein condensation (BEC) and superfluidity. The goal of this section is to give an introduction to the basic theory, properties and the methods used to describe those systems, so they can be applied to the case of DM in the next section. The different description of those systems and their limit of validity are very important to be able to understand the construction and validity of the DM models presented next and why they present different phenomenologies and astrophysical consequences.

Bose–Einstein condensation is one of the most fascinating phenomenon of quantum mechanics. Since it was theorized in the year of 1920s, by Satyendra Nath Bose and Albert Einstein, its experimental realization opened the door for many advances in the physics of many-body systems, and even to the application of this phenomenon in other fields like cosmology. Its first experimental realization was done in 1995 by two independent groups using laser and magnetic cooling device to cool down rubidium atoms gas (Anderson et al. 1995; Davis et al. 1995). Nowadays, BECs are observed in helium, ultra-cold atomic gases, quasi-particles in solids, multi-component (mixtures) of BECs, among other systems.

Following on the works of Bose (1924), which described the quantum statistical properties of photons, Einstein extended this concept to a gas of non-interacting particles of integer spin, later called bosons as a tribute to Bose, that follows a Bose–Einstein statistics (Schay 1924). This Bose gas has the property that at low temperatures a large number of these bosons, described then as quantum oscillators, condense into the lowest momentum state, exhibiting long range coherence. This physical phenomenon initiated the idea of Bose–Einstein condensation.

A BEC is defined as a system where at very low temperatures a large fraction of the bosons of the system occupy the lowest energy state of a configuration. This macroscopic occupancy of the ground state is an inherently quantum mechanical phenomenon. Physically we can interpret it as a consequence of the wavelike nature of these particles at low temperatures, where the de Broglie wavelength of these bosons is larger than the inter-particle separation, and their wavepackets superpose and form a coherent macroscopic wavefunction describing the entire system. The BEC is then described by a single wavefunction of the system, linking to the long range coherence property of a condensate.

A few years after BEC was theorized by Einstein, another intriguing macroscopic quantum mechanical phenomenon was discovered: superfluidity. In 1937, Kapitza (1938) and independently Allen and Misener (1938), conducting experiments with helium-4 realized that after cooling down this liquid to a certain temperature, the fluid starts flowing without friction, even climbing the walls of the container where it was stored. Fluids that exhibit this behaviour, characterized by a zero viscosity, are called superfluids. Landau provided a phenomenological description of this effect which rendered him the Nobel prize in 1962. It was proposed by London (1938), after the development of laser cooling techniques for atomic gases, that the properties of \(\hbox {He}^4\) superfluid are related to BEC. This was not obvious given that the (textbook) description of BEC as an ideal non-interacting Bose gas, contrary to \(^4\)He that is a strongly interacting fluid. This gave relevance to the, until then, only theoretical ideas of Einstein, and BEC became a rich topic of research. The relation between superfluids and BECs was confirmed years later in ultracold atomic gases where almost the entire fluid at low temperatures is condensed and exhibits superfluidity.

It is very challenging to describe the strongly interacting helium system. A weakly interacting Bose gas was then proposed by Bogoliubov, as a modification of the non-interacting Bose gas model, to study the Bose–Einstein condensation and superfluidity. In this way, superfluids can be modelled by a Bose–Einstein condensate that has self-interaction, and superfluidity is described as being achieved through interactions in a BEC. Notice that BEC can happen even in the absence of self-interaction, as seen above, since it is a statistical property of a gas of bosons in low temperature, but this system does not exhibit superfluidity. The weakly interacting theory is used to describe many superfluid systems at certain limits. This description, tough, evolved in the last few years to extend and generalize this framework to finite temperature systems, mixtures and even stronger interacting system corrections. New frameworks also emerged to describe different systems that cannot be modelled by the weakly-interacting theory. One of those ideas based on the hydrodynamical description is to write these systems as an effective field theory (EFT) order to describe the system macroscopically using symmetry alone without the need of working its microscopic description. This EFT, depending on the symmetries of the system, can recover at some limits the weakly interacting superfluids, but also can be used to describe more general superfluids, superconductors and even systems like the unitary Fermi gas (Kevrekidis et al. 2001; Baker 1999), which is a gas of fermions that interact through a strong two-body coupling that is a superfluid in the ground state.

The theoretical description of those condensed matter systems, together with the experimental efforts, is a field of research that is in fast development. In this review we are going to describe the basic concepts on BEC and superfluid, and detail the different descriptions and properties of these systems. We start by describing the non-interacting ideal gas, where condensation was first conceptualized to present in more detail the definition and the conditions for condensation. We then start to describe superfluidity. We show first the definition of superfluidity as defined in Landau’s theory of superfluidity. We then describe a more concrete model for a BEC where superfluidity is present, the weakly interacting Bose gas. This is the simplest example of superfluidity. We show how this model describes condensation and superfluidity. We then follow to show the field theory description of the superfluid, where the system is described as a system which undergoes spontaneous symmetry breaking caused by the condensate. This description brings advantages and makes clear the study of many features in BECs and superfluids. As a low-energy description of the superfluid, we present the EFT of superfluids as another description of more general superfluid systems. We finalize describing what happens when we rotate a superfluid, showing the nucleation of vortices upon rotation.

Not linked to what we will discuss in the review, but an important fact. Nowadays, it is known that superfluidity is not necessarily linked to condensation. Recent investigations seem to point that there are states where you can have superfluidity for the majority or all the particles in the system, while only a small fraction is condensed. This happens for example for liquid helium below a certain temperature.

3.1 Non-interacting ideal gas

We start our discussion with the non-interacting Bose gas. The properties of this system are a consequence purely from the quantum statistic of indistinguishable bosons. We will see that in the grand canonical ensemble we can write the Bose distribution function in which we can see the conditions for Bose–Einstein condensation.

We want to describe here a theoretical gas of many non-interacting bosons in a box. In a system with a large number of particles (N), it is impractical to try to determine the state of each particle or even the collective many-body wavefunction that describes this system \(\varPsi ({\mathbf {r}}_1, \ldots , \,{\mathbf {r}}_N,t)\). In this sense, to describe a system with many particles that can occupy many different states, we represent the system using a statistical ensemble description. To describe the state of this collective system, one does not need to label the state of each particle, but to determine the number of particles in each state of the system. The ensemble that is specially convenient for this task of deriving the probability of microscopic states is the grand canonical ensemble (GCE). Since our system is composed of bosons, which are indistinguishable particles \(\varPsi ({\mathbf {r}}_1,\, {\mathbf {r}}_2, \ldots , \, {\mathbf {r}}_N,t) = \varPsi ({\mathbf {r}}_2,\,{\mathbf {r}}_1, \ldots , \, {\mathbf {r}}_N,t)\), called Bose symmetry, this ensemble is useful to describe the system where many particles can occupy the same state.

The GCE is a statistical ensemble that describes a system that is in contact in thermal and chemical equilibrium with a large reservoir, in a way that there is exchange of energy and particles with the reservoir. This exchange of particles with the reservoir makes the number of particles in the system to fluctuate, although the number of particle of the system plus reservoir is constant. As the system is in equilibrium, the energy and particle number fluctuate around an average. This ensemble can be described by the following constants: the chemical potential (\(\mu \)) and the temperature (T), which hold for a system (of volume V). If the GCE is applied to small systems, an additional condition is necessary: that the gas is diluted. In principle, the probability of finding the system in a state s with energy \(\epsilon _s\) and \(n_s\) particles, or occupation number \(n_s\), is given by

$$\begin{aligned} P_\mathrm {s} =\frac{1}{Z^{\tiny {\mathrm {GC}}}_s} e^{\beta \left( \mu \, n_s - \epsilon _s \right) }\,, \quad \mathrm {with} \quad Z^{\tiny {\mathrm {GC}}}_s = \sum _{s} e^{\beta \left( \mu \, n_s - \epsilon _s \right) }\,, \end{aligned}$$
(10)

where \(\beta =1/(k_{\mathrm {B}} T)\), with \(k_{\mathrm {B}}\) the Boltzmann constant. The chemical potential \(\mu =\left( \partial E / \partial N \right) _{S,V}\) is the energy required to add one particle to the isolated system, fully determined by N, the total number of particles, and T. The chemical potential is defined to be negative (so no unphysical negative occupation occurs). The total energy of the system is given by \(E=\sum _s n_s \epsilon _s\). The normalization \(Z^{\tiny {\mathrm {GC}}}_s\) is the grand canonical partition function.

With the GC distribution function, we can then evaluate the average occupation number,

$$\begin{aligned} \langle n_s \rangle =\sum _{n_s} n_s P_s = \frac{1}{e^{\beta \left( \epsilon _s -\mu \right) } - 1} \,, \end{aligned}$$
(11)

where the sum converges for \(\mu < \epsilon _s\). This is the Bose–Einstein distribution. This gives us the total number of particles in the system:

$$\begin{aligned} N= \sum _s n_s =\sum _s \frac{1}{e^{\beta (\epsilon _s -\mu )}-1} \,. \end{aligned}$$
(12)

We can separate the total number of particles into two contributions,

$$\begin{aligned} N=N_0 + N_{\mathrm {T}}\,, \end{aligned}$$
(13)

where \(N_0=1/e^{\beta (\epsilon _0 -\mu )}-1\) is the number of particles with \(\mathrm {s}=0\), which is the number of particles in the condensate, with \(\epsilon _0\) indicating the lowest energy of the single particle state. The number of particles that are not in the ground state, not in the condensate, also called the thermal component of the gas, \(N_{\mathrm {T}}={\textstyle \sum }_{s \ne 0} n_s\). We can replace the sum for an integral and, from the partition function, the thermal component is given by

$$\begin{aligned} N_{\mathrm {T}}=\frac{V}{\lambda _{\mathrm {T}}} \int _0^\infty {\text {d}}\epsilon \, \frac{\epsilon ^{1/2}}{e^{ \beta (\epsilon - \mu )}}\,, \end{aligned}$$
(14)

where \(\lambda _{\mathrm {T}} = \sqrt{2\pi \hbar ^2 \beta }\). For a fixed temperature, \(N_{\mathrm {T}}\) reaches a maximum when \(\mu =\epsilon _0\). So, \(N_{\mathrm {T}}\) is limited, meaning that in this limit there is a finite number of particles not in the ground state. At this same point in this limit, \(N_0\) can diverge showing that the number of particles in the ground state grows becoming macroscopically occupied. This macroscopic occupation of the ground state is seen as a condensation and this phenomenon is called Bose–Einstein condensation.

The critical temperature \(T_{\mathrm {c}}\) defines the temperature below which there is the formation of the BEC. We can define it as the temperature above which all the particles of the system are not going to be in the condensate: \(N_{\mathrm {T}} (T_{\mathrm {c}}, \mu =\epsilon _0)=N\). The chemical potential can be zero at \(T_{\mathrm {c}}\), and from (14), for the maximum \(\mu =\epsilon _0=0\), we can get that

$$\begin{aligned} T_{\mathrm {c}} = \left( \frac{2 \pi \hbar ^2}{m k_{\mathrm {B}}} \right) \left( \frac{n}{\zeta (3/2)} \right) ^{2/3}\,, \end{aligned}$$
(15)

where n is the total number density and \(\zeta (3/2)\approx 2.612\) is the Riemann’s zeta function. With that, for \(T<T_{\mathrm {c}}\), we expect that most of the particles are going to be in the condensate, and the number of particles in the condensate is

$$\begin{aligned} N_0/N= 1-\left( \frac{T}{T_{\mathrm {c}}} \right) ^{3/2} \,. \end{aligned}$$
(16)

Complementary, the number of particle in the thermal component is \(N_{\mathrm {T}} = N \left( T/T_{\mathrm {c}} \right) ^{3/2}\). From that expression we can see that the occupation becomes macroscopic towards small temperatures, as we can see in Fig. 6. This indicates the formation of a BEC. This condition that a BEC can form for \(T<T_{\mathrm {c}}\) can be translated into the condition: \(n \lambda _{\mathrm {dB}}^3 \gg 1\), where \(\lambda _{\mathrm {dB}} = \sqrt{2\pi \hbar ^2/(m k_{\mathrm {B}} T)}\) is the thermal de Broglie wavelength that gives the coherence length of the gas. This condition indicates that the gas needs to be dilute for condensation to happen. This condition is also equivalent to having the de Broglie wavelength of the particles overlap and the system being described by a macroscopic wave-function. When \(T=0\), all the particles of the system will be in the ground state and the condensate is described by a single macroscopic wavefunction. As we see from Fig. 6, at high temperatures, condensation is broken and the system behaves as a gas of individual massive particles.

figure e
Fig. 6
figure 6

In this figure, we plot the number of particles in the ground state, normalized by the total number of particles, with respect to the temperature for the non-interacting Bose gas. We show schematically that for higher temperatures \(T\ge T_{\mathrm {c}}\) the system is in the normal state where the particles behave as free particles and occupy all energy levels. As the temperature is lowered, when \(T<T_{\mathrm {c}}\) we have the formation of a condensate described by a macroscopic wavefunction. When \(T=0\), all particles of the system are in the ground state and we have a pure BEC, described by a single wavefunction

In summary, Bose–Einstein condensation can happen for ideal gases. The condition for condensation is that the occupation number of the ground state is so large that becomes macroscopic when \(T<T_{\mathrm {c}}\). This can be translated in a condition for condensation: if \(n \lambda _{\mathrm {dB}}^3 \gg 1\), there is the formation of a BEC. With that we can see that this very simple theoretical model already shows this intriguing macroscopic quantum phenomenon that would be confirmed experimentally many years later.

figure f

3.2 Landau’s superfluid model and criteria for superfluidity

A few years after BEC was theorized, another striking macroscopic quantum phenomenon was observed, superfluidity. Landau constructed a phenomenological theory to explain the results of superfluidity in helium, which was observed to flow in thin capillaries. This phenomenological theory, however, is quite general to describe superfluids and gives general conditions for the appearance of superfluidity.

This theory has the goal of explaining why in superfluids charge is transported without friction. As we described above, according to London’s ideas, to have superfluidity, one needs to have a BEC. The condensate has the role of transporting charge. So we consider a superfluid as the condensate that transports charge without losing energy. Dissipation of the condensate, which is equivalent to friction in the fluid, is caused by exciting particles out of the condensate. We have a superfluid in the limit of no or low dissipation, and the superfluid is lost in the limit of high dissipation. We present now the conditions for that to happen.

Consider a superfluid moving through a capillary with velocity \(v_{\mathrm {s}}\). The energy of elementary excitations is given by Pitaevskii and Stringari (2016),

$$\begin{aligned} E=E_{\mathrm {kin}}+\epsilon _p+{\mathbf {p}} \cdot {\mathbf {v}}_{\mathrm {s}}\,, \end{aligned}$$
(17)

in the rest frame of the capillary. The kinetic energy of the fluid is given by \(E_{\mathrm {kin}}\), and \(\epsilon _p>0\) and \({\mathbf {p}}\) are the energy of the excitation and momentum in the frame of the fluid, and translated to the frame of the capillary. Dissipation happens when \(\epsilon _p+{\mathbf {p}} \cdot {\mathbf {v}}_{\mathrm {s}}<0\). This can only be negative if its minimum, when \(\epsilon _p+p v_{\mathrm {s}} \cos (\theta )\) where \(\theta =n\pi \) for n integers, is smaller than zero: \(\epsilon _p-p \, v_{\mathrm {s}}<0\). With that we can determine the critical velocity:

$$\begin{aligned} v_{\mathrm {c}} = \min _p \frac{\epsilon _p}{p}\,. \end{aligned}$$
(18)

For \(v_{\mathrm {s}}<v_{\mathrm {c}}\), with \(v_{\mathrm {c}} \ne 0\), the system transports charge without dissipation and the coherence of the BEC is maintained. This is the first criteria for superfluidity. The second necessary criteria is that \(v_{\mathrm {c}}\) cannot be zero, so we need to have a condensate that transports the charge. A non-interacting (pure) Bose gas like we saw in the previous section has \(v_{\mathrm {c}}=0\), so it cannot be a superfluid. A weakly interacting Bose gas has \(v_{\mathrm {c}} \ne 0\) and it is a good representation of a superfluid.

As we are going to see in the next section in the case of the weakly interacting BEC because of the spontaneous breaking of U(1) symmetry, a Goldstone mode appears, the phonon. This mode is gapless \(\epsilon _{p=0} = 0\) and it is an elementary excitation of the superfluid. Even for that mode, the critical velocity is not zero, so there is some cost for producing the gapless excitation. For this weakly interacting Bose gas, the critical velocity is the fluid sound speed, and Landau’s criteria for superfluidity becomes

$$\begin{aligned} v_{\mathrm {s}} < v_{\mathrm {c}} = c_{\mathrm {s}}\,. \end{aligned}$$
(19)

In summary, given that \(v_{\mathrm {c}}\) is nonzero, and that we have a condensate (by construction) in this system, if \(v<v_{\mathrm {c}}\), we can say that there is superfluidity. This result, however, is only valid for zero temperature.

Landau also developed the theory for a superfluid at finite temperature, the two-fluid model. At finite temperatures, the fluid has two components: the superfluid component that flows without friction and a normal fluid which describes the excitations. In this theory, then there are two sounds speeds, for each degree of freedom. In the case of weakly interacting Bose gas, the first sound is \(c_{\mathrm {s}}\) associated with the oscillation in density, and the second sound is \(c_{\mathrm {s}}/\sqrt{3}\) that corresponds to the speed of propagation of the temperature oscillations.

This phenomenological theory is still an important topic of research as a condition for superfluidity. From simulations to experiments, it is interesting to ask if the Landau criteria is fulfilled as a criteria for superfluidity. This criterion seems to be valid only in the regime of linear perturbations. This is the case since in Landau’s theory the superfluid dissipates only into elementary excitations. However, we know that it is also possible to exist quantum vortices, topological defects present in rotating superfluids. This phenomenological theory does not take that into account, which can change the critical velocity of the superfluid to smaller values, reaching the dissipative regime of the superfluid before than if using Landau’s critical velocity Ianeselli et al. (2006).

figure g

3.3 Weakly interacting Bose gas superfluid

We now turn to the discussion of interacting systems. Inspired by Landau’s phenomenological theory, the weakly interacting Bose gas was proposed as the simplest system to study superfluidity, and as a realistic model to understand condensation. In this section, we are going to model a superfluid by a Bose–Einstein condensate that has self-interaction, and show that, although a BEC can be formed both in the case of the non-interacting and interacting Bose gas, the presence of interaction is crucial for superfluidity (Pitaevskii and Stringari 2016; Barenghi and Parker 2016; Rogel-Salazar et al. 2004).

We present here a microscopic description of superfluid which arises upon condensation. The microscopic system where this happens is a weakly interacting gas of bosonic particles. To describe this interacting gas, first we need to understand how to describe the excitations in this system.

One of the conditions for condensation is that the gas is dilute. However, even in a dilute gas, the interaction can play an important role. The way we describe interactions or collisions in a Bose gas is somewhat different than in a classical fluid. Since now we describe it using their wavefunction, we need to have a interatomic potential \(V_{\mathrm {int}}\) to enable these collisions. In a dilute system at low temperatures, three-body interactions are suppressed, so we are going to describe this system with binary collisions. In such a system, the two-body collisions depend only on one parameter a, the s-wave (or coherent) scattering length (Harko and Lake 2015), which is the zero energy limit of the scattering amplitude \(a= \lim _{T \rightarrow 0} f_{\mathrm {scat}}\). This is valid only for low energies when the other length scales of the problem \( d \gg a\). In this limit, the elastic scattering cross-section becomes constant \(\sigma =8\pi a^2\), and the two-body interatomic potential can be written as \(V_{\mathrm {int}}({\mathbf {r}}-{\mathbf {r}}')=(4\pi a \hbar ^2 /m) \, \delta (r-r') \equiv g \, \delta (r-r')\), which is short-ranged and present only when the atoms interact. The s-wave scattering length a can be positive or negative depending on the system described, representing a repulsive or an attractive interaction. With this we can model the effective interaction Hamiltonian as \({\hat{H}}_{\mathrm {int}} = \int d {\mathbf {r}} d {\mathbf {r}}' \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) \, V_{\mathrm {int}}(\mathbf {r-r'}) \, {\hat{\varPsi }}(\mathbf {r'}) {\hat{\varPsi }}(\mathbf {r'}) \approx \int {\text {d}}^3 r \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) {\hat{\varPsi }}({\mathbf {r}}) {\hat{\varPsi }}({\mathbf {r}})\), where \(\varPsi \) is the Bose operator.

The dynamics of this many-body interacting system is given by the second-quantized N-body Hamiltonian,

$$\begin{aligned} {\hat{H}}=\int {\text {d}}^3 r \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) \left[ -\frac{\hbar ^2 \nabla ^2}{2m} + V_{\mathrm {trap}}({\mathbf {r}}) \right] {\hat{\varPsi }} + \frac{g}{2} \int {\text {d}}^3 r \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) {\hat{\varPsi }}^{\dagger }({\mathbf {r}}) {\hat{\varPsi }}({\mathbf {r}}) {\hat{\varPsi }}({\mathbf {r}}) \,, \nonumber \\ \end{aligned}$$
(20)

where the brackets is the single particle Hamiltonian, and \(V_{\mathrm {trap}}({\mathbf {r}})\) is the trapping potential, an external potential applied to the system (that in the next section could be the gravitational potential). In the Heisenberg description, we can then write the Heisenberg equations of motion,

$$\begin{aligned} i \hbar \frac{\partial {\hat{\varPsi }}({\mathbf {r}},t) }{\partial t}= [{\hat{\varPsi }}({\mathbf {r}},t), \, {\hat{H}}]=\left( -\frac{\hbar ^2 }{2m} \nabla ^2 + V_{\mathrm {trap}}({\mathbf {r}}) \right) {\hat{\varPsi }}({\mathbf {r}},t)+ g \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}},t) {\hat{\varPsi }}({\mathbf {r}},t) {\hat{\varPsi }}({\mathbf {r}},t)\,, \end{aligned}$$
(21)

with the brackets indicating the commutator. This is the Schrödinger equation for the Bose field operator \({\hat{\varPsi }} ({\mathbf {r}},t)\).

The Bose field operator \({\hat{\varPsi }}^{\dagger }({\mathbf {r}})\) and \({\hat{\varPsi }}({\mathbf {r}})\) create and annihilate a particle at position \({\mathbf {r}}\), and obeys the canonical commutation relations with only non-zero commutator given by \([{\hat{\varPsi }}({\mathbf {r}}), \, {\hat{\varPsi }}^{\dagger }({\mathbf {r}})]=\delta ({\mathbf {r}}- {\mathbf {r}}')\). The Bose field operator describes a continuum spectrum of single particle position eigenstates, and can be re-written in the single-particle basis as

$$\begin{aligned} {\hat{\varPsi }}^\dagger = \sum _i {\hat{a}}_i^\dagger \phi _i^{*} ({\mathbf {r}})\,, \quad {\hat{\varPsi }} = \sum _i {\hat{a}}_i \phi _i ({\mathbf {r}})\,, \end{aligned}$$
(22)

where \(\phi _i\) is the states wavefunction, and the creation and annihilation operators, \({\hat{a}}_i^\dagger \) and \({\hat{a}}_i\), create and annihilate a particle from the state \(\phi _i\). They obey the Bose commutation relations,Footnote 11 with only non-zero component given by \([{\hat{a}}_i, {\hat{a}}_j^\dagger ]=\delta _{ij}\).

Many-particle systems described by the Hamiltonian (20) are very difficult to be solved. With the exception of a few simple models, to find solutions to this problem and be able to study its properties we need to make simplifications. For that we use Bogoliubov’s prescription or mean-field approximation. For the general case of a non-uniform gas, the mean-field approximation can be written, in the Heisenberg picture as

$$\begin{aligned} {\hat{\varPsi }} ({\mathbf {r}}, t) = \psi ({\mathbf {r}}, t) +\delta {\hat{\varPsi }} ({\mathbf {r}}, t)\,, \end{aligned}$$
(25)

where \(\psi ({\mathbf {r}}, t) \equiv \langle {\hat{\varPsi }} ({\mathbf {r}}, t) \rangle \) is classical field called the wavefunction of the condensate. The density of the condensate is fixed by: \(n_0 = | \psi ({\mathbf {r}}, t) |^2 =n\). Like we described for Landau’s theory, \(\delta {\hat{\varPsi }} ({\mathbf {r}}, t) \) is a small perturbation of the system with \(\langle \delta {\hat{\varPsi }} ({\mathbf {r}}, t) \rangle =0\) and describes depletion of the condensate.

Effectively this approximation leads the many-body problem to be reduced to a single body problem by describing the averaging the effects of all other particles. Given that the interactions are weak, and that the gas is diluted, quantum fluctuations on the condensate are suppressed. The mean field approximation is valid for dilute system with \(n\, a^3 \ll 1\). When this conditions is not met there are deviations of the mean-field approximation. We can treat these deviations in perturbation theory, where we invoke non-vanishing moments for the fluctuation operator, like the Hartree–Fock–Bogoliubov, which considers a non-zero \(\langle \delta {\hat{\varPsi }} \rangle \), or the Hartree–Fock–Bogoliubov–Popov, for terms up to second order in the perturbation.

With this approximation, we can write the generalized Gross–Pitaevskii (GP) equation:

$$\begin{aligned} { i \hbar \frac{\partial \psi ({\mathbf {r}},t) }{\partial t}= \left( -\frac{\hbar ^2 }{2m} \nabla ^2 + V_{\mathrm {trap}} ({\mathbf {r}}) + g \, |\psi (\mathbf {r,t}) |^2 \right) \psi ({\mathbf {r}},t) \, .} \end{aligned}$$
(23)

The GP equation is a non-linear Schrödinger equation, with non-linearity arising from the self-interaction term. This equation describes the dynamics of the zero-temperature dilute weakly-interacting Bose system by allowing us to determine the shape of the single particle wave function, the condensate.

We can study the case of stationary solutions. The stationary solution can be taken as the solution that provides us with the condensate, the ground state wavefunction. The ground state is the lowest energy state of a quantum mechanical system, with the excited states being the states with higher energy than the ground state. The stationary states \(\varPsi \) are the eigenfunctions of the Hamiltonian operator, with eigenvalues \(\mu \) related to the energy of the system, in a way that for the wavefunction we have

$$\begin{aligned} i \hbar \frac{\partial \varPsi }{\partial t} = {\hat{H}} \varPsi = \mu \varPsi \,. \end{aligned}$$
(26)

With that we can write the stationary solution as

$$\begin{aligned} \psi ({\mathbf {r}}, t) = \psi _{\mathrm {s}} ({\mathbf {r}} ) e^{-\frac{i}{\hbar } \mu t}\,, \end{aligned}$$
(27)

where the eigenvalue of the Hamiltonian \(\mu \) is also called the chemical potential, and \(\phi _{\mathrm {s}}\) is real field with \(\int d{\mathbf {r}} \, \psi ^2 = N_0 = N\). The Gross–Pitaevskii equation becomes

$$\begin{aligned} \left( -\frac{\hbar ^2}{2m} \nabla ^2 + V_{\mathrm {trap}} ({\mathbf {r}}) + g \, | \psi _{\mathrm {s}}({\mathbf {r}}) |^2 \right) \psi _{\mathrm {s}} ({\mathbf {r}}) = \mu \psi _{\mathrm {s}} ({\mathbf {r}})\,. \end{aligned}$$
(28)

In the Thomas–Fermi limit, which is the approximation where the interaction energy is bigger than the kinetic energy for a large number of particles, the kinetic energy can be neglected so we have \(\mu \psi =(g \, n+ V_{\mathrm {trap}}) \psi \). As a solution of this equation, we get that in the Thomas–Fermi limit,

$$\begin{aligned} n({\mathbf {r}}) = |\psi ({\mathbf {r}}) |^2 = \left\{ \begin{array}{ll} ( \mu - V_{\mathrm {trap}} ({\mathbf {r}})) / g \,, &{} \, \,\,\mathrm {for }\,\,\, {\mathbf {r}} \,\,\, \mathrm {where}\,\,\, ( \mu - V_{\mathrm {trap}} ({\mathbf {r}})) / g > 0 \,, \\ 0\,, &{} \, \,\, \mathrm {otherwise} \,. \end{array} \right. \end{aligned}$$
(29)

Fluid description

We can decompose the complex macroscopic condensate wavefunction into

$$\begin{aligned} \psi ({\mathbf {r}},t)=|\psi ({\mathbf {r}},t)| \, e^{i \theta ({\mathbf {r}},t)}\,, \end{aligned}$$
(30)

where, as we saw above , \(\psi \) is normalized to the total number of particles, \(|\psi ({\mathbf {r}},t)| = \sqrt{n({\mathbf {r}},t)}=\sqrt{\rho ({\mathbf {r}},t)/m}\), and \(\theta ({\mathbf {r}},t)\) is the phase distribution. Inserting this into the GP equation, we get two equations. We make the following redefinition

$$\begin{aligned} {\mathbf {v}}({\mathbf {r}},t) \equiv \frac{\hbar }{m} \, \nabla \theta ({\mathbf {r}},t)\,. \end{aligned}$$
(31)

This, together with (30) is called Madelung transformation. With those new variables, the GP equation results in two equations, the Madelung equations (Madelung 1927):

$$\begin{aligned}&\frac{\partial \rho }{\partial t} + \nabla \cdot (\rho \mathbf {v}) = 0\,, \end{aligned}$$
(32)
$$\begin{aligned}&\rho \frac{\partial \mathbf {v}}{\partial t} + \rho \, (\mathbf {v} \cdot \nabla )\mathbf {v} = -\nabla \left( P_{\mathrm {int}} + P_{\mathrm {QP}} \right) - n \, \nabla V_\mathrm{trap} \,. \end{aligned}$$
(33)

They are a representation of the GP equation in “hydrodynamical” equations since they have a similar form as the continuity equation and Euler equation for a perfect fluid. However, the second Madelung equation describes a fluid with a potential flow, given the definition of the velocity, with zero vorticity \(\mathbf {\nabla \times {\mathbf {v}} =0}\). This corresponds to the main characteristic of the superfluid that it flows without friction, has irrotational flow. This equation also differs from the perfect fluid Euler equation by the presence of the quantum pressure term.

The second Madelung equation, the Euler-like equation, reveals more interesting properties of the superfluid. This equation has two pressure terms, \(P_{\mathrm {int}} \) and \(P_{\mathrm {QP}} \) that are respectively, the pressure term, and the quantum pressure:

$$\begin{aligned} P_{\mathrm {int}}&= K \, \rho ^{(j+1)/j} \Bigr |_{j=1} = \frac{g}{2} n^2 = \frac{g}{2m^2} \rho ^2 \,,\nonumber \\ \nabla P_{\mathrm {QP}}&=-n \nabla Q=- n \nabla \left[ \frac{\hbar ^2}{2m^2} \frac{\nabla ^2 \sqrt{n}}{\sqrt{n}} \right] \,. \end{aligned}$$
(34)

The pressure term comes from the self-interaction which gives a polytropic type of pressure. For the two-body interaction, which is the case we show here, \(j=1\). If we have a three-body interaction, for example, the pressure would have polytropic index \(j=1/2\), giving \(P_{\mathrm {int}} \propto \rho ^3\). The constant K depends on the interaction constant. The quantum pressure is defined in terms of the quantum potential \(Q=-(\hbar ^2/2m) \nabla ^2 \sqrt{n}/\sqrt{n}\) (see definition below).

figure h

Footnote 12

Condensate solution

Having established how we describe the weakly interacting system in the mean-field approximation, we can now describe what is the ground state, the condensate. We can solve analytically the GP equation for a few simple cases and obtain the condensate solution as for the cases with no interactions, in harmonic potential, and other simple systems (Barenghi and Parker 2016). We are going to present some interesting cases here.

In the case of a uniform gas, the wavefunction \(\varPhi _0 ({\mathbf {r}})= \sqrt{N} \phi _0/\sqrt{V}\) describes the condensate formed, while the remaining functions form a complete set of functions orthogonal to the condensate.

Solitons Solitons are a localized solution of the one-dimensional GP equation with \(V_{\mathrm {trap}}=0\), which is integrable in this limit. They are also called solitary waves, and are solutions described by a permanent and localized wave, that maintains its shape and velocity upon collisions. They are obtained when the dispersion term, the kinetic spatial term, and the non-linear term from the interaction cancel out. If the interaction is repulsive (\(g>0\)), a dark soliton is formed,which is given by the solution \(\psi _{\mathrm {d}}(x) = \psi _0 \tanh (x/\sqrt{2 \,\xi })\), where \(\psi _0=\lim _{x\rightarrow \infty } \psi \). A bright soliton is the solution for attractive interactions (\(g<0\)), with \(\psi _{\mathrm {b}}(x) = \psi _0 e^{-i\mu t/\hbar } \cosh (\sqrt{2 m |\mu |/\hbar } \, x)^{-1}\). They are called dark and bright since they represent a decrease and a concentration of condensate, respectively.

Collective excitations, dispersion relation and sound speed

The excitations in the superfluid which represent perturbations of the condensate, are an important part of this system. They represent sound waves, called the phonons that propagate through the condensate. We are going to show how they arise.

Here we are going to work in the case \(V_{\mathrm {trap}}=0\), for simplicity. The case where a trapping potential is present, like for example a condensate in the presence of a gravitational potential, is studied in Sect. 4.1. For a homogeneous condensate, to study the perturbations around the condensate, we perturb the classical wavefunction \(\psi (\mathrm {r},t)\),

$$\begin{aligned} \psi (x,t)=\psi _0 + \delta \psi ^{(1)} + \delta \psi ^{(2)}+ \cdots \,, \end{aligned}$$
(37)

where we assumed that the motion is only in the x-directions without lost of generalization. The perturbations are small and \(\delta \psi ^{(i)}\) indicate the \(i^{\mathrm {th}}\) order in perturbation. To linear order, we can re-write the linearized GP equation: \(i \hbar \partial _t \delta \psi ^{(1)}=-(\hbar ^2/2m) \partial ^2_x \delta \psi ^{(1)} + \mu ( \delta \psi ^{(1)}+\delta \psi ^{* \, (1)})\). We make an ansatz for the solution as travelling waves, \(\delta \psi ^{(1)} = A \, e^{i(kx-\omega _k t)}+ B \, e^{-i(kx-\omega _k t)}\). The parameters \(A,\, B\) are determined by the initial conditions. Substituting this ansatz into the linearized GP equation, we can see that the dispersion relation is given by

$$\begin{aligned} \omega _k^2 = c_s^2 k^2 + \frac{\hbar ^2}{(2m)^2} k^4\,, \end{aligned}$$
(38)

where the sound speed is defined as the term coming from the linear part of the dispersion relation

$$\begin{aligned} c_s^2=\frac{g \, n_0}{m}\,. \end{aligned}$$
(39)

The sound speed appears because of the presence of the interaction; therefore, since for a superfluid the presence of an interaction is crucial, we can say that a superfluid is a fluid that has a sound speed. With this we can easily see the definition and properties of a superfluid.

  • Superfluid: In the presence of interactions, a sound speed is present which determines the behaviour of the excitations on large-scales. - For large wavelengths (small k), the higher k terms do not contribute and the dispersion relation is given by

    $$\begin{aligned} \omega _k = c_s k\,, \end{aligned}$$
    (40)

    which is the dispersion relation of a sound wave. The superfluid is characterized by excitations that propagate as waves, the phonons. Because of this property phonons can mediate a long- range forces (\(F \sim 1/r^2\)). This long-range force is the responsible for the effective dynamics of a superfluid: flowing without friction. We are going to see later (in the field theory description) that this gapless mode can also be viewed as the Nambu–Goldstone modes coming from the spontaneous symmetry breaking of U(1) symmetry of the system caused by the formation of the BEC.

    This limit where the quantum pressure term can be ignored, or more specifically when \(P_{\mathrm {int}} \gg P_{\mathrm {QP}}\), is the Thomas–Fermi approximation we saw above, and can alternatively be defined by wavelengths bigger than the healing length.

    This is the case for a repulsive interaction (\(g>0\)). The situation is different for an attractive interaction (\(g<0\)), where \(w_k\) is imaginary and the solutions is unstable given by exponentially growing or decaying functions. This means that it is not possible to form a stable condensate in these cases.

    For small wavelengths (large k), the quantum pressure term dominates, and the dispersion relation is given by \(\omega _k = \hbar k^2 /2m\), which describes a free particle. In this limit, the system stops exhibiting superfluidity.

    In general, for intermediary frequencies, the full dispersion relation (38) does not propagate as a wave and shows two degrees of freedom: a gapless mode (the phonon), that propagates as a wave, and a massive mode, related to particle creation. We are going to discuss these again in the field theory section.

  • No interactions—BEC: In the limit where \(g \rightarrow 0\), the BEC stops exhibiting superfluidity. The phonon becomes gapped, the dispersion relation is given by \(\omega _k = \hbar k^2 /2m\), which is the dispersion relation of a free particle. Since \(\omega _k^2>0\) in this case, the solution of the linearized GP equation (without a external potential) is a stable oscillatory solution.

We showed above that the superfluid can decay into collective elementary excitation, the long-wavelength sound-wave quanta, the phonon. These are excitations with linear dispersion relation that behave as periodic fluctuations in density in the superfluid. When studying linear perturbations around a classical condensate background, only the phonon excitation is expected. However, a superfluid can also deplete into other excitations called rotons and maxons. The linear part of the dispersion relation is only a part of the dispersion relation at small momenta. For higher momenta, the dispersion relation presents a maximum and then a minimum. Near the maximum we have the maxons excitations, and near the minimum, for even higher momenta, we have the roton excitations. So, phonons, rotons and maxons are excitations described by different parts of the same dispersion relation. And lastly, for a rotating superfluid, there is also the vortex nucleation of the condensate, as we will see in Sect. 3.5.

figure i

Footnote 13 Footnote 14

Some comments are in order. When defining \(\psi \) as the condensate wavefunction, this quantity is actually a mean-field value of the wavefunction, the degree of freedom that defines the condensate. This description of averaging selecting the condensate is consistent with the theory of critical phenomena, like phase transitions. This Bose system can be seen as a system with spontaneous breaking of a symmetry of the description. In our case the U(1) symmetry which is the symmetry of the Hamiltonian. This is analogous to the the spontaneous symmetry breaking in a ferromagnet. The difference is that, since we have a Bose system, the idea of spontaneous symmetry breaking to the thermodynamical limit of a finite size Bose gas defines the number of particles of the system, and this is only consistent with the picture of having a condensate that can change the number of particles in the ground state, if the number of particles is conserved.

We can also understand this argument for the symmetry breaking of the system by analysing another approach to the weakly interacting system initially developed by Penrose and Onsager (1956), and Beliaev (1958). In this approach, the condensate wavefunction is identified, using the density matrix \({\bar{\rho }}({\mathbf {r}},\, {\mathbf {r}}')=\langle {\hat{\varPsi }} ({\mathbf {r}}') {\hat{\varPsi }} ({\mathbf {r}}) \rangle \), to the number density of particles \(n({\mathbf {r}})={\bar{\rho }}({\mathbf {r}},\, {\mathbf {r}})=\sum _i n_i |\phi ({\mathbf {r}})|^2\), where we are working in the stationary case. The formation of a BEC, which means that the ground state has a macroscopic occupation, leads to the factorization \(N=n_0+\sum _{i \ne 0} n_i\), which means \({\bar{\rho }}({\mathbf {r}},\, {\mathbf {r}}')= \varPhi _0 ({\mathbf {r}}') \varPhi _0 ({\mathbf {r}}) + \sum _{i \ne 0} n_i \phi _i^{*} ({\mathbf {r}}' )\phi _i ({\mathbf {r}})\). The field operator then can be factorized in the presence of a condensate into \({\hat{\varPsi }} ({\mathbf {r}}) = \varPhi _0 ({\mathbf {r}}) + \delta {\hat{\varPhi }} ({\mathbf {r}})\), with \(\langle \delta {\hat{\varPsi }} ({\mathbf {r}}) \rangle =0\). This means when a condensate form,

$$\begin{aligned} \langle {\hat{\varPsi }} ({\mathbf {r}}) \rangle = \varPhi _0 ({\mathbf {r}}) \ne 0\,. \end{aligned}$$
(43)

A state that has conserved particle number has \(\langle {\hat{\varPsi }} \rangle =0\). So this condition above is seen as describing a symmetry breaking, more specifically Bose symmetry breaking, and the consequence is a system where particle number is not conserved in the condensate. In the absence of a condensate, this goes back to the particle conserving condition:

$$\begin{aligned} \varPhi _0 \left\{ \begin{array}{lr} = 0 \,, &{} \, \,\,\mathrm {for }\,\,\, T >T_{\mathrm {c}}\,, \\ \ne 0 &{} \, \,\, \mathrm {for }\,\,\, T <T_{\mathrm {c}} \,. \end{array} \right. \end{aligned}$$
(44)

This suggests that condensation comes from a spontaneous symmetry breaking theory.

With that, the interacting condensate system above can be understood as a particle conserving system of bosons with U(1) symmetry, described by the classical field \(\psi \), the wavefunction of the condensate, where the formation of a Bose–Einstein condensate is a phase transition, coming from a spontaneous breaking of the symmetry (that can be seen as spontaneous coherence). This description of the condensate makes us see a parallel with the formalism used in field theory.

3.3.1 Field theory description

Given the suggestion that we can describe condensation as spontaneous symmetry breaking process, we turn now to the description of this system using a field theory language. The methods from field theory are very appropriate to describe this type of system where spontaneous symmetry breaking is present. Given the description we had above for the superfluid and the properties of this system, we represent this system as a massive complex scalar fieldFootnote 15   with self-interactions with a global U(1) symmetry that is spontaneously broken by the presence of the ground state, the condensate, with superfluidity arising upon condensation. We are going to work here, to best illustrate this description, in the homogeneous case, where there is no trapping or external potential applied to the system.

Given that, we describe this system by the Lagrangian density for a two-body interaction,

$$\begin{aligned} {\mathcal {L}}&= (\partial _\mu \varPsi )^{*} (\partial _\mu \varPsi ) - m^2 \varPsi ^{*} \varPsi - \frac{g}{2} (\varPsi ^{*} \varPsi )^2 \end{aligned}$$
(45)

where we consider \(g > 0\) to get a stable condensate with long-range coherence, as discussed above. As we saw above, this system has a U(1) global symmetry, which means that it is invariant under continuous rotations of the phase of the complex scalar field,

$$\begin{aligned} \varPsi \rightarrow \varPsi e^{i \alpha }\,, \quad \varPsi ^{*} \rightarrow \varPsi e^{-i \alpha }\,, \end{aligned}$$
(46)

where \(\alpha \) is a constantFootnote 16. The equation of motion is given by,

$$\begin{aligned} \partial ^{\mu } \partial _{\mu } \varPsi + m^2 \, \varPsi + g | \varPsi |^2 \varPsi =0 \,. \end{aligned}$$
(47)

To describe the condensate, like we did previously, we separate the condensate contribution to the perturbations, and assume the mean-field approximation,

$$\begin{aligned} \varPsi ({\mathbf {x}}, t) = \psi ({\mathbf {x}}, t) + \delta \varPsi ({\mathbf {x}}, t) \,, \end{aligned}$$
(48)

where \(\psi ({\mathbf {x}}, t) \) is the background that gives the condensate field, and \(\delta \varPsi ({\mathbf {x}}, t) \) are the excitations of the condensate that are considered small with respect to the background solution.

As we saw in (27), the stationary solution determines the ground state of the system. Using that, the condensate wavefunction can be written as

$$\begin{aligned} \psi = v \, e^{i \mu t}\, , \end{aligned}$$
(49)

where \(\theta _{\mathrm {bg}} =\mu t\) is the phase of the ground state where \(\mu \) is the chemical potential.Footnote 17 This is the background solution for the equation of motion (47) as long as,

$$\begin{aligned} \mu ^2=m^2+g\, v^2\,. \end{aligned}$$
(50)

When we have a spontaneous symmetry breaking, only the ground state, the state that broke the symmetry, is not invariant under U(1) anymore. This means that we still have a conserved current given by \(j^\nu = i (\varPsi ^\dagger \partial ^\nu \varPsi - \varPsi \partial ^\nu \varPsi ^\dagger )\), as we saw above in the definition of SSB. Using the equations of motion one can see that this current is conserved. For the ground state (49), this current gives the number density: \(n=2\mu v^2\). In the field theory description, the conservation of the norm (\(\int |\varPsi |^2\)), related to the number density in the condensate and conservation of the number of particles in the condensate seen in the quantum mechanical approach above, is interpreted as a consequence of the global symmetry of the system which leads to a conserved Noether charge, \(Q = \int j^0 \propto \int n \propto \int |\varPsi |^2 \) (conservation of the number density of field quanta).

figure j

Footnote 18

This is the background solution corresponding to the condensate. This ground state is responsible for spontaneously breaking the U(1) global symmetry. And we can see that explicitly. Since this solution represents the ground state, we can compute the energy functional for this system,

$$\begin{aligned} E=\int {\text {d}}^3 x \,\, {\mathcal {U}}= \int {\text {d}}^3 x \, (\partial _0 \varPsi \partial ^0 \varPsi + \partial _i \varPsi \partial ^i \varPsi + V_{eff})\,, \end{aligned}$$
(52)

where \(V_{eff}=m^2 \varPsi ^{*} \varPsi + g \varPsi ^{*} \varPsi \). The ground state is given by the stationary, minimum energy state. This can be found by finding the minimum of the energy (51), which amounts to finding the minimum of the potential energy. The set of solutions for the minimum of this potential energy is

$$\begin{aligned} \left\{ \begin{array}{lcl} v_{0,\, \mathrm {s}}=0\,, &{} \quad \mathrm {for }\,\,\, m^2 > 0 \,, &{} \quad \mathrm {Symmetry \, restaured} \\ v_{0,\, \mathrm {ssb}}=\pm \sqrt{\frac{|m^2|}{g}}\,, &{} \quad \mathrm {for }\,\,\, m^2 < 0\,, &{} \quad \mathrm {Symmetry \, broken - condensate} \,. \end{array} \right. \end{aligned}$$
(53)

The value of the minima \(v_{0,\, \mathrm {ssb}}=\pm m/\sqrt{g}\) are called vacuum expectation value and are the value that the field assumes at the ground state, apart from a phase. As we can see from on the left panel of Fig. 7, the symmetry restored phase when \(m^2 > 0\) has a well-defined minimum at \(v_{0,\, \mathrm {s}}=0\), and this is the normal phase, with no condensation. This ground state \(\psi _{\mathrm {s}}=0\) is preserved under the symmetry of the system (U(1) symmetry), the rotations of the phase of the complex field.

The symmetry breaking phase when \(m^2 < 0\), there is a continuous set of minima with the ground state given by \( \psi _{\mathrm {ssb}} = v_{0,\, \mathrm {ssb}} e^{i \alpha }\), corresponding to all the possible phases in the circle \(\alpha \in [0,\, 2\pi )\), as seen in the right panel of Fig. 7. All of these classical backgrounds are not invariant under U(1) symmetry, the symmetry of the system, which means that \(\phi _{i,\, \mathrm {ssb}} \rightarrow \phi _{j,\, \mathrm {ssb}}= v_{0,\, \mathrm {ssb}} e^{i \alpha } \ne \phi _{i,\, \mathrm {ssb}}\). In this way, we say that the symmetry is spontaneously broken by this condensate ground state. From the SSB, we can see that the condensate has long-range order, with the field having the role of the order parameter of the system.

Fig. 7
figure 7

Potential of the weakly self-interacting system. Left panel: Potential of the symmetry restored phase, \(m^2>0\) which has a minimum at \(\psi _{\mathrm {s}}=0\). This ground state is symmetric, respects the symmetry of the system. Right panel: Symmetry breaking potential, for \(m^2 <0\). This phase has a degenerate minima not invariant under the symmetry of the system. It represents the condensate state

Excitations

We consider now fluctuations around the classical condensate configuration to study the spectrum of this system. Considering small fluctuations means perturbing each degree of freedom of the field around the condensate (48). This is equivalent in the polar notation to:

$$\begin{aligned} \psi ({\mathbf {x}}, t) = (v+\rho ) \, e^{i(\mu t+\pi )}\,, \end{aligned}$$
(54)

where \(\rho ({\mathbf {x}},t)\) can be interpreted as a perturbation in the radial direction, and \(\pi ({\mathbf {x}}, t)\) a perturbation in the angular/phase direction. Plugging this into the Lagrangian, we have

$$\begin{aligned} {\mathcal {L}}=-(\partial _\mu \rho )^2+(v+\rho )^2 \left[ g\,v^2 + 2\mu {\dot{\pi }}+{\dot{\pi }}^2 -(\partial _i \pi )^2 \right] - \frac{g}{2} (v+\rho )^4\,. \end{aligned}$$
(55)

With this expansion is already easy to see that \(\rho \) has a mass term (the term accompanying the \(\rho ^2\) term), so the perturbation in the radial direction is massive. The perturbation in the phase has no mass term and it is going to be massless. This massless excitation was already expected from the Goldstone theorem (see below), where a SSB leads to the appearance of a massless excitation, the Nambu–Goldstone (NG) boson \(\pi \). In the context of a superfluid, this gapless excitation is the phonon.

For low-energy theories, the phonon is the only degree of freedom that is excited in the theory, as the massive mode can be integrated out. Therefore, at low energies a superfluid is completely described by the phonon excitations.

Low energies here mean energies lower than the mass gap of the massive mode \(\rho \). We are going to work on this limit here to obtain the spectrum of this NG boson. In this limit, the kinetic term of \(\rho \) can be neglected and the equation of motion becomes:

$$\begin{aligned} g(v+\rho )^2 = g\, v^2 + 2\mu {\dot{\pi }} + {\dot{\pi }}^2 - (\partial _j \pi )^2 \equiv X \,. \end{aligned}$$
(56)

Using this, we can rewrite the on-shell Lagrangian in terms of X as

$$\begin{aligned} {\mathcal {L}}=\frac{X^2}{2g} = \frac{1}{2g} \left[ g\, v^2 + 2\mu {\dot{\pi }} + {\dot{\pi }}^2 - (\partial _j \pi )^2 \right] ^2 \,. \end{aligned}$$
(57)

This Lagrangian depends only on the phonon \(\pi \), as \(\rho \) was integrated out, and is invariant under shift symmetry, \(\pi \rightarrow \pi + c\), inherited from the U(1) symmetry of the complex scalar field. This is the effective Lagrangian at leading order in derivative expansion for the phonon. To obtain the dispersion relation we can expand this Lagrangian, using \(\pi _{\mathrm {c}}=\mu \pi \),

figure k

where we used that \(n=2\mu v^2\). Taking the Fourier transform of the field, we compute the equation of motion and can see that the dispersion relation of the phonon is given by

figure l

where \(c_{\mathrm {s}}\) is the sound speed of the superfluid. This expression shows us the general behaviour we already seen in the excitations calculated in the QM approach. For long wavelengths (small k), the higher order term in k is suppressed, and the dispersion relation is \(w_{k,\, \mathrm {long}}^2=c_{\mathrm {s}} k^2\), which is the dispersion relation characteristic of a sound wave. Therefore, the phonon propagates as a wave with sound speed \(c_{\mathrm {s}}\) in the long-wavelength regime.

The second term of the dispersion relation is characteristic of a massive particle. This means that for short wavelengths, the system behaves as a system of normal particles propagating, and not a superfluid.

We can re-write these expressions in the non-relativistic (NR) regime, which is the regime we are interested in comparing to the previous approach (and also the limit where we describe, in the next section, the behaviour of DM in galaxies). In this limit, \(g \, v^2 \ll m^2\), which implies \(\mu ^2 \approx m^2\). The dispersion relation in this case is given by

$$\begin{aligned} w_{k,\, \mathrm {NR}}^2=c_{\mathrm {s}}^2 k^2 + \frac{1}{4m^2} k^4\,, \quad \mathrm {with}\,\,\, c^2_{\mathrm {s}}= \frac{g\, n}{m}\,, \end{aligned}$$
(60)

which is equivalent to the one found in (38).

In the absence of interactions, we recover the ideal Bose gas from Sect. 3.1, with background solution is given by \( \varPsi _{\mathrm {ideal}}=ve^{imt}\), and dispersion relation of a massive particle \(w_k^2= (1/4m^2) k^4\), showing again that although this system condenses in to a BEC, in the absence of interactions there is no superfluidity. It is easy to see from (58) that in the relativistic regime, \(g \, v^2 \gg m^2\), \(c^2_{\mathrm {s}}=1/3\).

In this way, we showed that the field theory description of the superfluid is very good to describe the general properties of the superfluid. To properly compare with the results obtained above in the QM approach, we are going to show that in the non-relativistic limit, the field theory description gives us the GP and Madelung equations obtained above.

Before doing that, one comment is in order. In (57), we showed that in the low-energy regime, we can re-write the microscopic theory of a superfluid as an effective theory only of the phonon with a non-canonical kinetic term. We did this here in the case of a weakly self interacting system with two body interaction, but in Sect. 3.4 we are going to extend this idea to general superfluid systems and show the construction of the EFT of superfluids.

figure m

Footnote 19

Recovering the other approaches

We want to show that we can recover the GP theory presented in the previous subsection, and emphasize that the field theory description is compatible to describe the superfluid. The field theory presented above is a fully relativistic theory, which means that the action is invariant under Lorentz transformations on top of the global U(1). However, GP description shown above is non-relativistic. Therefore, to recover the GP and Madelung equations we need to take the non-relativistic limit of the relativistic field theory above.Footnote 20

Starting from our field theory for the weakly interacting bosons (45), we take the non-relativistic limit of the Lagrangian. We do that by talking the limit \(c \rightarrow \infty \) and assume that in this limit the field has a very fast phase rotation in time, which allows us to rewrite the fields as

$$\begin{aligned} \varPsi ({\mathbf {x}}, t) = \frac{1}{2m} \psi ({\mathbf {x}}, t) e^{-imt}\,. \end{aligned}$$
(61)

With that, the Lagrangian can be written as

$$\begin{aligned} {\mathcal {L}} = \frac{i}{2} \left( {\dot{\psi }} \psi ^{*}-\psi \dot{\psi ^{*}} \right) -\frac{1}{2m} |\nabla \psi |^2 - \frac{g}{16m^2} (\psi ^{* } \psi )^2\,. \end{aligned}$$
(62)

From this non-relativistic Lagrangian, we can evaluate the equation of motion for the scalar field \(\psi \),

$$\begin{aligned} i {\dot{\psi }} = \left( - \frac{1}{2m} \nabla ^2 + \frac{g}{8m^2} |\psi |^2 \right) \psi \,, \end{aligned}$$
(63)

which is exactly the Gross–Piatevskii equation like shown above, in the absence of a trapping/external potential.

From that, we can also derive the Madelung hydrodynamical equations, if we make the following substitution:

$$\begin{aligned} \psi \equiv \sqrt{\rho /m} e^{i \theta }\,, \quad {\mathbf {v}} \equiv \frac{1}{m} \nabla \theta \,. \end{aligned}$$
(64)

The vorticity of the superfluid is zero and the momentum density has a non-zero curl. Plugging this in the equations of motion, we recover the Madelung equations (32) and (33) in the absence of an external potential. This shows again that we can recover the equations that describe the interacting BEC using the field theory approach.

In this section, we showed how to describe a weakly interacting BEC. We showed the standard treatment of the theory, where the many-body quantum system is described by the GP equation. We showed that condensation can be thought as a spontaneous symmetry breaking process and showed that the system can be described in a equivalent way using the field theory approach. We specialized in both cases in the mean-field theory, which is valid for dilute systems, and simplifies the significantly the study of the system.

It is important to comment on the validity of this theory and the approximations made. The theory presented above is only valid for zero temperatures and in the mean-field approximation that holds for \(na^3 \ll 1\), where we can describe the condensate as a classical wavefunction and the limit where quantum corrections are not important. As we cited above, there are correction to the mean field and other approximations where one can study this model (see Pitaevskii and Stringari 2016 for some examples). For finite temperature, one has to describe the superfluid as a two-fluid model.

In the cases we are going to study, we will extend a bit the validity of the zero temperature description, as a first approximation, since in galaxies the temperature is obviously not zero. However, since the occupation number will be very high in our problem, the classical description is safe.

Two-fluid model

The description presented above for the superfluid is valid for a zero temperature dilute weakly interacting Bose gas. However, as already described in Landau’s phenomenological theory, for finite temperature system, the superfluid has to be described as a two-fluid model: a mixture of a superfluid and a normal fluid. As we saw previously in this description, for \(T<T_c\), the fluid is a mixture of normal and superfluid with most of the fluid in the superfluid phase, while for temperatures higher than \(T_c\), the coherence of the superfluid is broken and the fluid is mostly in the normal phase.

Connecting Landau’s theory and the microscopic description of a superfluid by a weakly interacting theory, a two-fluid relativistic theory of superfluid was developed. This can be linked to the non-relativistic one described above, and a “hydrodynamical” and field theory approaches are developed. We are not going to describe this in detail in this review since this is not used to describe the current ULDM models in the literature. However, the two-fluid description should be used for finite temperature systems to describe a superfluid and we believe it is important to describe realistic DM superfluid models. For a review of the two-field model, see Schmitt (2015). In Sect. 4, we present one work where the two-fluid formalism is used to describe the self-interacting BEC DM.

The weakly interacting Bose system studied in this section is the prototypical description of a BEC system that presents superfluidity. It is a microscopical description that shows the behaviour of the condensate and its excitations. But this description is limited and cannot be used to describe all the models of possible superfluids and realistic experimental systems. The interesting point is that this theory can be recast as a spontaneous symmetry breaking theory of U(1) symmetry of the many-body system. BEC and superfluidity are a consequence solely of the spontaneous symmetry breaking, independent on the specific model chosen. Therefore, this hints us to describing these symmetry breaking systems using the effective field theory approach, where the Lagrangian of the system is described by symmetry alone. The hope is to be able to describe more complicated superfluid systems. We are going to describe this approach then in the next subsection.

3.4 Effective field theory of a superfluid

We described in the previous section the construction of a microscopic effective theory for the weakly interacting BEC that can be used to describe superfluids. This description is based on London’s idea (London 1938), which has its roots in the superfluid/superconductor hydrodynamics, that the BEC can be described by a theory with spontaneous symmetry breaking caused by the condensate, with superfluidity arising upon condensation and being described by the Goldstone boson, the phonon, at low energies. We saw that we can write an effective Lagrangian for the phonon that describes the behaviour of the superfluid, matching many observations, in the low-energy and low-momentum regimes.

This procedure shows us that we can describe the hydrodynamical degrees of freedom of a theory by the Goldstone modes created by the SSB of a symmetry, the global U(1) symmetry in our case. This is more general than the simple weakly interacting two-body interaction case showed above. This is already the case of hydrodynamics that describes macroscopically the behaviour of low-energy variables and interactions of system given a symmetry, coarse-graining over the smallest scales. This is the perfect playground for the use of effective field theories (EFT), and EFT techniques are very appropriate for this task. An EFT describes, the low energy (long distances) behaviour of a system, without having to refer to its underlying microscopic theory, by parametrizing our ignorance of those short scales.

The idea is to use EFT methods to describe the dynamics of a superfluid. This was developed in Son (2002); Dubovsky et al. (2006, 2012); Son and Wingate (2006), where they develop the general formalism to describe fluid hydrodynamics without dissipation as an EFT. In this approach the Lagrangian that describes the system is constrained by symmetry alone.Footnote 21 This approach is very powerful since it not only can be applied to many different systems, but describes the system without the need of its microscopic understanding. It is also very powerful since it is an expansion over momentum. At leading order in the expansion, we describe the low-energy theory. But this description allows us to go beyond leading order in the long-wavelength expansion.

In our case, we want to construct the EFT that reproduces, in the long wavelength regime (low energies), the superfluid hydrodynamics, as presented above. This theory is a theory of the phonon, which is the only degree of freedom that is excited at low energies. This is the Goldstone mode produced by the SSB of the U(1) global symmetry by the ground state. We are going to work here in the non-relativistic regime, but one can see the references above for the relativistic case. Restricting to the non-relativistic regime does not imply any loss of generality of the argument, with the system only subjected to different symmetries than the relativistic case.

Inheriting the knowledge of a superfluid from previous sections, at low energies, the only dynamical degree of freedom that describes a superfluid is the phase of the condensate, the phonon. Therefore, in the non-relativistic regime, we need to construct the EFT of this phase \(\pi \). The superfluid is described by a theory where the symmetry is spontaneously broken by the ground state,

$$\begin{aligned} \theta = \mu t -\pi \,. \end{aligned}$$
(65)

The theory that is described by this phase \({\mathcal {L}}_{\mathrm {eff}} (\theta )\) is invariant under shift-symmetry \(\theta \rightarrow \theta +c\) which is inherited from the U(1) symmetry of the complex scalar field.

EFT states that to construct the effective Lagrangian for the phonon, we have to write all the terms that are compatible with the symmetries of the problem. This system has shift symmetry and Galilean symmetry. For the shift symmetry, for the Lagrangian to be invariant under this symmetry, only terms that are acted by a derivative can appear in the theory: \({\mathcal {L}} = {\mathcal {L}} \left( {\dot{\theta }}, \partial _i \theta \right) \). This may contain terms with any power of the derivative of the field.

However, this theory has more symmetries. In a generic space-time and adding a gauge field, which is a natural extension of the simple scalar field model, we require that this Lagrangian is invariant with respect to three-dimensional general coordinate transformations and gauge invariance. The most general Lagrangian \({\mathcal {L}}={\mathcal {L}} \left( D_t \theta , g^{ij}D_iD_j \theta \right) \) that is invariant under these symmetries, shift symmetry, gauge invariance and general coordinate invariance, is given by,

$$\begin{aligned} {\mathcal {L}}=P\left( X \right) \,, \quad \mathrm {with} \,\,\,\,\, X=D_t \theta - \frac{g_{ij}}{2m} D_i \theta D_j \theta \,, \end{aligned}$$
(66)

where \(D_t \theta = {\dot{\theta }}+A_0\) and \(D_i \theta = \partial \theta - A_i\). In flat space, \(g_{ij}=\delta _{ij}\), the general coordinate invariance corresponds to a Galilean symmetry.

As it can be seen in Son and Wingate (2006), Galilean symmetry is not enough to constrain the NLO terms and one needs to consider the full general coordinate invariance. This is equivalent to considering an additional constraint for the theory, which is known from fluid hydrodynamics,

$$\begin{aligned} T_{0i}=m\, j_i\,, \end{aligned}$$
(67)

where \(T_{0i}\) is the off-diagonal component of the energy–momentum tensor. As stated in Greiter et al. (1989) and Dubovsky et al. (2012), this additional constraint (with an analogous constraint in the case of relativistic systems, Dubovsky et al. 2012) states that only one degree of freedom carries all the current and momentum. Introducing a new symmetry, full general coordinate invariance is equivalent to assuming this relation.

With the two symmetries of the system, in the absence of gauge fields \(A_0=A_i =0\), the Lagrangian that describes this system is given by

$$\begin{aligned} {\mathcal {L}}=P\left( {\dot{\theta }} - \frac{(\partial _i \theta )^2}{2m} \right) \,. \end{aligned}$$
(68)

This is a Lagrangian that has a non-canonical kinetic term and it obeys (67).

At the background \(\theta = \mu t\), and \(T=0\), this Lagrangian density is equal to the pressure \(P=P(\mu )\). With that we can evaluate the particle number density,

$$\begin{aligned} n=P' (X)\,, \end{aligned}$$
(69)

where \('\) indicates the derivative with respect to \(A_0\). For the condensate, then the equilibrium number density at chemical potential \(\mu \) is \(n(\mu )=P' (\mu )\), where \(P(\mu )\) is the thermodynamical pressure, defined up to a constant.

Given the Lagrangian (68) with \(\theta =\mu t + \phi \), we can write the Lagrangian as:

$$\begin{aligned} L=P(\mu )-n{\dot{\phi }} + \frac{1}{2} \frac{\partial n}{\partial \mu } {\dot{\pi }}^2- \frac{n}{2m} (\partial _i \pi )^2\,. \end{aligned}$$
(70)

We can see from that the phonon speed of sound

$$\begin{aligned} c_{\mathrm {s}} = \sqrt{\frac{n}{m} \frac{\partial \mu }{\partial n}} = \sqrt{\frac{\partial P}{\partial \rho }}\,, \end{aligned}$$
(71)

where \(\rho = mn\) is the mass density.

One limit of this EFT Lagrangian is the quadratic Lagrangian for the weakly interacting BEC, shown in (57). We can see that by considering the special case where P(X) is written as a polynomial, \(P(X) \propto ({\dot{\theta }}/m)^n\). Depending on the power chosen we will have a superfluid with a different equation of state:

$$\begin{aligned} \left\{ \begin{array}{lll} n=2: &{} \quad P \sim \rho ^2 &{} \quad \mathrm {BEC/Sf \, (2-body)} \\ n=3/2: &{} \quad P \sim \rho ^3 &{} \\ n=5/2:, &{} \quad P \sim \rho ^{5/3} &{} \quad \mathrm {Unitary \, Fermi \, gas} \end{array} \right. \end{aligned}$$
(72)

These represent different systems. The case \(n=2\) is equivalent to the previous case, in Sect. 3.3, where we had a superfluid with a two-body interaction described by the microscopic weakly interacting theory, where we obtained \(P(X)=X^2\) for the low-energy (57). The case with \(n=3/2\) can correspond to the same theory as the previous case, a weakly interacting theory, but with a three-body interaction, with this effective Lagrangian obtained by integrating out the massive radial mode, like done in the previous section. These two equivalences shows us an interesting aspect of this EFT and from the hydrodynamics of superfluids: the interaction is linked to the equation of state of the superfluid, and this can be seen by a different choice of P(X) in the EFT. The case \(n=3/2\) can also represent another completely different superfluid system, like we will see in the case of the superfluid DM in the next section, where the this case does not come from a weakly interacting microscopic theory with three-body interaction. This is also the case for the other example we show here, the unitary Fermi gas, which cannot be described by a microscopic theory like we did in the previous section, and being described with this EFT if \(n=5/2\).

One comment is in order here. Usually in quantum field theory having fractional exponents can be problematic, leading to caustics or superluminal propagation. However, in the case of the superfluid, this is not a problem. Before reaching these regimes (like the formation of caustics), the superfluid coherence is broken, and the EFT description of the superfluid is no longer valid.

If one wants to add an external or trapping potential \(V_{\mathrm {ext}}\), like, for example, if the gas is under the influence of a gravitational potential, this corresponds to making \(A_0=V_{\mathrm {ext}}\). With that, the Lagrangian is given by (66), with \(X={\dot{\theta }} - \frac{(\partial _i \theta )^2}{2m}-V_{\mathrm {ext}}\). In the case of the condensate in a gravitational potential, this is given by

$$\begin{aligned} {\mathcal {L}}=P\left( X \right) \,, \quad \mathrm {with} \,\,\,\,\, X={\dot{\theta }} - \frac{(\partial _i \theta )^2}{2m}-m \varPhi \,, \end{aligned}$$
(73)

where \(\varPhi \) is the gravitational potential. This is going to be the case studied in the next section for the DM superfluid.

With that Lagrangian we are able to describe the theory as the other approaches we used to describe the BEC and superfluid theories.

figure n

Superfluid hydrodynamics

From this formalism, we can also describe the superfluid hydrodynamics. From (69), we re-write the field equation with respect to the number density to obtain the continuity equation:

$$\begin{aligned} {\dot{n}}+ \frac{1}{m} \partial _i \left( n \partial _i \pi \right) = 0\,. \end{aligned}$$
(74)

The gradient of the field \(\pi \) can define the velocity of the superfluid, \({\mathbf {v}}_{\mathrm {s}} = -\mathbf {\nabla } \theta /m = \mathbf {\nabla } \pi /m\), we can derive the second equation of superfluid hydrodynamics:

$$\begin{aligned} {\dot{\pi }}= -\mu (n) - \frac{m v_{\mathrm {s}}^2}{2}\,. \end{aligned}$$
(75)

Validity of the EFT

As we saw above, this theory is valid for low energies, or long wavelength, and breaks for high energies. The Lagrangian shown here is valid in leading order in derivative expansion. In this regime, it reproduces the results from hydrodynamics of superfluids. But this framework also allows us to go beyond leading order in this momentum expansion, the next-to-leading order (NLO) Lagrangian. In Son and Wingate (2006), they show a prescription to take into considerations next-to-leading order terms. This can be done in this framework at arbitrary order, only requiring that the NLO Lagrangian respects the symmetry of the system, and at the cost of introducing new free parameters. This might allow the study of those systems in a regime that is challenging for the microscopic perturbative description.

However, the validity of this effective theory constructed here still needs to be checked as higher order terms in the Lagrangian can only be neglected if the sound speed of the theory is not too small. In Sect. 4.2.5, we describe the validity of the EFT for that concrete example of superfluid and show that the theory is valid for the parameters of the model.

This theory is also only valid in the absence of dissipation. A discussion of how to describe dissipative phenomenon in this EFT approach can be seen in Berezhiani (2020).

The formalism of the EFT presented here is used to describe the low-energy dynamics of the superfluid Goldstone mode, the phonon. This formalism is more general, tough, and translates into an EFT language the hydrodynamics of fluids at zero temperature and without dissipation, so it can be generalized to describe superfluids with different equation of state, superconductivity, unitary Fermi gas, among other systems. This low-energy EFT approach is very useful to describe the dynamics of various physical systems and writing the superfluid in this macroscopic effective Lagrangian offers us the chance to study the dynamics of this system without having to work out the details coming from the microscopic short distance physics. With that, this formalism allows us to study the behaviour of more complicated superfluids, with different equations of state that might come from different and more complicated interactions. This will be useful to describe the DM superfluid model, in Sect. 4.

3.5 Rotating superfluid—quantum vortices

When we rotate a normal fluid, the fluid rotates together with the recipient in a homogeneous way, like a rigid body, with vorticity \(\nabla \times \mathbf {v_n} = \nabla \times \mathbf {\mathbf {\varOmega } \times {\mathbf {r}}} \ne 0\), where the normal fluid velocity \(\mathbf {v_n}\) of a rotational fluid is given by the cross product of the angular velocity and the position.

As we saw above, a superfluid, described by a weakly interacting BEC, has irrotational flow: \(\mathbf {v_s}=(\hbar /m) \nabla \theta \) which gives \(\nabla \times {\mathbf {v}} = 0\), where viscosity is absent. This is the defining property of a superfluid. In another language, this means that the circulation around a closed contour \({\mathcal {C}}\) in a superfluid is given by

$$\begin{aligned} \varGamma = \oint _{\mathcal {C}} {\mathbf {v}}\cdot \mathrm{{d}}{\mathbf {l}}=\frac{\hbar }{m} \oint _{\mathcal {C}} \nabla \theta \cdot \mathrm{{d}}{\mathbf {l}} = \frac{\hbar }{m} \int \nabla \times \nabla \theta =0\,, \end{aligned}$$
(76)

where \(\mathrm{{d}}{\mathbf {l}}\) is a length element on the path \({\mathcal {C}}\) and A is the area enclosed by this contour. When a superfluid is rotated, this property says that the superfluid would not rotate, but would remain stationary. So, how can we rotate a superfluid and maintain the irrotational flow? This is possible if the superfluid phase presents a singularity.

A superfluid is a state where the system is described by one macroscopic wavefunction. In the presence of this singularity, this wavefunction is single valued, \(\psi (\theta ) =\sqrt{\rho /m} \, e^{i\theta }= \psi (\theta +2 \pi n)\), which leads to the quantization of the circulation:

$$\begin{aligned} \varGamma = \oint _{{\mathcal {C}}} \nabla \theta \cdot d{\mathbf {l}}= \frac{\hbar }{m} \varDelta \theta = 2\pi n \frac{\hbar }{m}\,. \end{aligned}$$
(77)

This property above describes a vortex. The way a superfluid rotates is inhomogeneous by forming quantized vortices (Tsubota et al. 2013a, b; Barenghi et al. 2001).

From that we can see that the azimuthal velocity of a irrotational fluid is given by \(v_{\phi } = n (\hbar /m) (1/r)\), where r is the distance to the center of the closed loop \({\mathcal {C}}\) and \(\phi \) is the azimuthal angle. At the center, we have the vortex core, as \(\lim _{r \rightarrow 0} \psi \rightarrow 0\), of size equal to the healing length, where the density \(\rho \) vanishes and \(\theta \), the phase, rotates by \(2\pi \) around the core. The flow in the center is given by the vortex line. The vorticity of the rotating superfluid is given by

$$\begin{aligned} \nabla \times {\mathbf {v}}_s = \frac{\hbar }{m} \sum _{i=1}^{N_v} 2\pi \delta ({\mathbf {r}}_i) {{\hat{\mathbf{z}}}}\,. \end{aligned}$$
(78)

where \({\mathbf {r}}_i\) is the location of the \(N_v\) vortices, and we considered that the vortex lines are in the z-direction. The vorticity is non-zero only at each vortex. So the flow is irrotational in most of the superfluid, except in the vortices. Given that the vortices are line singularities, they form a lattice of uniformly distributed vortices in the superfluid and carry the angular momentum of the rotation \(L_z=n \, L_{qm}\), where \(L_{\mathrm {qm}} = \hbar N \) is the minimum angular momentum necessary to have one quantized vortex and \(N=\int |\psi |^2\) is the number of particles in the condensate. This configuration is energetically preferable (instead of for example concentric sheets around the superfluid). With that, the spatially averaged vorticity is given by

$$\begin{aligned} \langle \nabla \times {\mathbf {v}}_s \rangle = \frac{\hbar }{m} n_v \, {{\hat{\mathbf{z}}}}=2 \varOmega \, {{\hat{\mathbf{z}}}} \,, \end{aligned}$$
(79)

where \(n_v=2\varOmega /(\hbar /m)\) is the density of vortices, which is related to the angular velocity (Feynman 1955). Although most of the superfluid has irrotational flow, the rotational flow is given by the vortices in a way that the whole superfluid then effectively flows as a normal fluid \(\langle {\mathbf {v}}_s \rangle ={\mathbf {v}}_n\), allowing the superfluid to rotate.

Fig. 8
figure 8

Left panel: schematic of a superfluid in a rotating box. A lattice of vortices is homogeneously formed in the superfluid. Right panel: the experimental observation of a highly ordered vortex lattice. Image reproduced with permission from Abo-Shaeer et al. (2001), copyright by AAAS

Given that the superfluid has irrotational flow, if it is rotated by a small \(\varOmega \), the superfluid will remain stationary. There is a critical angular velocity, \(\varOmega _{\mathrm {c}}\), above which the nucleation of vortices in the superfluid occurs. The critical velocity is given by Guadagnini (2017):

$$\begin{aligned} \varOmega _c=\frac{1}{mR^2} \ln \left( \frac{R}{\xi } \right) \,, \end{aligned}$$
(80)

where R is the radius of the cylinder where the superfluid is contained. Through the dependence in the healing length we can see that the critical angular velocity depends on the interaction, for the case of a weakly interacting BEC that behaves as a superfluid. The case where the interaction is attractive, \(g < 0\), it is known not to produce vortices. This is also the case where the BEC has a finite size coherence, producing only smaller soliton cores. This condition for the formation of cores can also be re-written as \(L \gg L_{\mathrm {qm}}\), where L is the angular momentum of the applied rotation to the system.

If there is the formation of vortices, the area density of vortices is given by \(n_{\mathrm {v}}=n/A_{\mathrm {BEC}}=m \varOmega /\pi \hbar \). From that, we can calculate the size of the vortices created (\(\pi R_{\mathrm {v}}^2=1/n_{\mathrm {v}}\)) which depends on the mass and the angular velocity of the fluid, \(R_{\mathrm {v}}= \sqrt{\hbar /m \varOmega }\).

But what exactly is the structure of these vortices and how are they described? As we saw before, we defined the vortex as a singularity, where the wavefunction is zero at the vortex line, with a quantized circular irrotational flow around the vortex line. Given this, we represent the vortex as an object in cylindrical coordinates: \(\psi ({\mathbf {r}})=f(r,z) e^{in\phi }\). To describe the density distribution, their structure and size, one has to solve the Gross–Pitaevskii equation for this object, which in the case of the weakly interacting Bose gas is

$$\begin{aligned} -\frac{\hbar ^2}{2m} \left[ \frac{1}{r} \frac{\partial }{\partial r} \left( r \frac{\partial f}{\partial r} \right) + \frac{\partial ^2 f}{\partial r^2} \right] + \frac{\hbar ^2}{2m} \frac{n^2}{r^2} f + V(r,z) f + \frac{g}{8m^2} f^3 = \mu f \,. \end{aligned}$$
(81)

In the limit \(n=0\), we recover the standard Gross–Pitaevskii equation. So the term that contains n is called the centrifugal barrier and it is the kinetic energy term from the azimuthal velocity of the vortex. Solving this equation gives the density profile of the vortex, from which we can determine the size, density and general structure of the vortex.

The presence of a trapping or external potential, changes the density of the condensate which also changes the dynamics of the vortices. For details of the formation of vortices in the presence of a trapping potential, see also Fetter (2009).

Since its discovery in Helium 4 superfluid, vortices have been one of the central topics in the research of superfluidity (Madison et al. 2000). They have been experimentally observed in many systems like Helium 4 and Helium 3, superconductors and atomic BECs, including multi-component BECs. These new observational advances allowed us to visualize and study those vortices, as it can be seen in the left panel of Fig. 8.

3.6 BEC in wave turbulence—kinetic theory

We studied in this section, the BEC and the superfluid, and showed that their dynamics is described by the Gross–Pitaevskii equation, which is a non-linear Schrödinger equation. This equation expresses the evolution of the condensate, which is described by the collective wavefunction formed by the macroscopic occupation of the ground state by the bosons, in the mean field self-potential. This theory is capable of describing many different condensate systems and describe the properties of BECs that can be tested with experiments. There is, however, an alternative way to describe the physics of a BEC, using the theory of wave turbulence.

Wave turbulence is the theory that describes non-equilibrium statistical systems using random non-linear interacting waves. These random interacting waves are the fundamental constituent of the theory. Waves in a non-linear medium interact and behave very differently than waves in vacuum and wave turbulence describes these systems of interacting non-linear waves. This approach can be used to describe many different physical system from quantum fluids to astrophysics, which is our main interest. Wave turbulence arises in classical context in systems like non-linear optics, surface waves in water, magnetic turbulence in interstellar gases, among many others (see Nazarenko 2011a for a list of examples and references). Quantum fluids are also physical systems where wave turbulence occurs, and this approach can be used to explain exotic superfluids, atomic BEC, second sound waves in \(^4\)He superfluid, and other systems as it can be seen in Kolmakov et al. (2014). Wave turbulence is a formalism specially useful to model these systems numerically and for experimental studies.

Wave turbulence can arise either in strong or weak interacting system of random non-linear waves. Weak turbulence is a limit where it is considered that all the waves are weak and have random phases. It provides a theoretical framework for wave turbulence theory to describe many physical system, representing those systems by a wave kinetic equation. This equation describes the evolution of the wave spectrum, two point correlation function or the probability distribution function, via averaging over random phases, of a system. This kinetic equation is different for each system and can have a three-wave non-linear interactions, four-wave interactions or even higher order depending on the system described. Higher moments of this equation allow the study of deviations of Gaussianity or to explore the limits of validity of the wave turbulence.

Wave turbulence can also be used to describe Bose–Einstein condensates and superfluids. It can offer a way of describing different stages of a BEC systems in a nearly equivalent way of its proper description through the non-linear Schrödinger equation presented above. We are going to show now how and for which limits a BEC can be described using wave turbulence approach. We are going to show that wave turbulence with four wave interaction, which reproduces the non-linear Schrödinger equation, can only describe the classical initial evolution of a BEC, breaking as the condensate evolves. We also show that the quantum evolution of a condensate can be described by this theory. Wave turbulence also offers a description for a BEC and superfluids outside the weak interaction limit, but we are not going to discuss this case here.

We want to develop a statistical description of the wavefunction and of the non-linear Schrödinger equation, and determine the kinetic equation which is the main equation of the wave turbulence theory. This is done by describing the evolution of an ensemble of waves (in contrast to the Boltzmann equation that is the equation that describes the evolution of a distribution of particles). For that we introduce the Wigner distribution (Wigner 1932)

$$\begin{aligned} f_{{\mathbf {p}}}({\mathbf {x}},t) \equiv \int d{\mathbf {y}}\, e^{-i\mathbf {p.y}} \langle \psi ({\mathbf {x}}+\frac{{\mathbf {y}}}{2}) \psi ^{*}({\mathbf {x}}-\frac{{\mathbf {y}}}{2}) \rangle \,, \end{aligned}$$
(82)

where the average is the average with respect to the random initial phases of the field. The Wigner distribution is a quasi-probability distribution, it behaves like a probability distribution but it does not obey all the axioms of probability theory by Kolmogorov (Kolmogorov 1933). It can be used to represent this ensemble of waves with random initial phases, or of random classical fields. As a quasi-probability, it can acquire negative values indicating interference of waves in phase-space (Choi 2006) describing the undulatory behaviour that can be captured in this description.

We now want to describe the evolution of this ensemble. We want to have a kinetic theory that is nearly equivalent to the GP equation (25) for a weakly interacting system. For that, as we can see from the interacting Hamiltonian of this theory (20), we need a four wave interaction. With that we can rewrite (25) using (82) as

$$\begin{aligned} \frac{\partial f}{\partial t}+\frac{{\mathbf {p}}}{m} \cdot \nabla _{x} f = 2 \, \mathrm {Im} \int {\text {d}}{\mathbf {y}}\, e^{-i\mathbf {p.y}} \langle \psi ({\mathbf {x}}+\frac{{\mathbf {y}}}{2}) \psi ^{*}({\mathbf {x}}-\frac{{\mathbf {y}}}{2}) U_{\mathrm {tot}}({\mathbf {x}}+\frac{{\mathbf {y}}}{2}) \rangle \,, \end{aligned}$$
(83)

where we wrote \(f_{{\mathbf {p}}}({\mathbf {x}},t)\) as f for simplicity of notation. Here we are considering that \(U_{\mathrm {tot}} = g |\phi (x)|^2\). This is the kinetic equation or the wave kinetic equation.

This equation gives the evolution of the distribution that describes the an ensemble of waves that is the two point correlation function of the field. This distribution can be interpreted as the wave action or the particle number density or particle occupation number, where this second interpretation leads us to the idea we had before that \(\psi \) can be thought as the classical limit for the quantum wavefunction or field of a weakly interaction Bose gas (Dyachenko et al. 1992; Mendonça et al. 2005).

We want to obtain the closed form of the kinetic equation, expressed in terms of f. The above kinetic equation presents, in the case of the four wave interaction, a four point correlation function. For the case of weak coupling that we are studying here, the Wick theorem is approximately valid and we can express this four-point function into the sum of two-point functions, reducing the higher order problem into a lower order one:

$$\begin{aligned} \langle \psi _1 \psi ^{*}_2 \psi _3 \psi ^{*}_4 \rangle = \langle \psi _1 \psi ^{*}_2 \rangle \langle \psi _3 \psi ^{*}_4 \rangle + \langle \psi _1 \psi ^{*}_4 \rangle \langle \psi _3 \psi ^{*}_2 \rangle + \langle \psi _1 \psi ^{*}_2 \psi _3 \psi ^{*}_4 \rangle _{\mathrm {conn}}\,, \end{aligned}$$
(84)

where we simplified the notation writing \(\psi _i = \psi (x_i)\), and the last term represents the connected part of the correlation function, the non-diagonal part of the operator. The connected part is non-zero if the distribution is non-Gaussian, and it is also higher order in the interaction.

In the regime of small non-linearities and taking the random phase approximation (Navez 2005), which allows us to ignore the higher order correlations, we can write the closed form for this equation. We wish to work in Fourier space, so we take the Fourier transform of the field. The above approximations correspond to ignoring higher Fourier moments and writing Eq. (83) in terms of the two-point correlation function \(\langle \psi _{k_i} \psi _{k_j} \rangle = n(k_i) \, \delta (k_i - k_j)\), where \(\delta (k_i - k_j)\) is the Dirac delta (Nazarenko and Onorato 2006; Nazarenko 2011b; Zakharov et al. 1992). Given these assumptions, we can re-write the kinetic equation in closed form:

$$\begin{aligned} \dot{n}_k = 4\pi \int n_1 n_2 n_3 n_4 \left[ \frac{1}{n_k} + \frac{1}{n_3} - \frac{1}{n_1} - \frac{1}{n_2} \right] \delta (\mathbf {k}+ \mathbf {k}_3- \mathbf {k}_1 - \mathbf {k}_2 )\, \delta (k^2 + k^2_3 -k_1^2 -k_2^2)\, d\mathbf {k}_1 d\mathbf {k}_2 d\mathbf {k}_3 \,. \end{aligned}$$
(85)

The above equation is the equation for the evolution of the wave spectrum. One can also write the equation for the higher moments if interested in investigating deviations from Gaussianity or the limits where this description breaks, among other phenomena.

This equation reproduces the non-linear Schrödinger equation, which is the equation that governs the evolution of a wavefunction. Any wavefunction. It can be thought as the quasi-classical limit at high occupation number of the quantum kinetic equation (we discuss this later in this section). This equation describes the evolution of any ensemble of classical waves that is described by the Schrödinger equation. It describes a non-condensed system since in this description condensate density, which is the density of particles in the ground state which in the language of waves translates into waves with \(k=0\), is relatively small (in comparison to a strong BEC deep into the region \(T\ll T_{\mathrm {cr}}\), where the number of particles in the ground state is almost equal to the total number of particles). When the condensate density is large or the population of lowest momentum states is large, the system is not weakly non-linear anymore and the above description breaks. However, this description can be used to the initial stages of a condensate, when the condensate density is still small. We call this limit “weak” condensate. When we have self-interactions the system can form a superfluid, and for this four-wave system this equation describes the vicinity of superfluid transition. Therefore, the kinetic equation with four-wave interaction can describe the classical initial evolution of a BEC, when we have a “weak” condensate.

As temperatures drop and the condensate density becomes larger, we have a “strong” condensate. In this stage of the evolution of the condensate the above description with a four wave interaction is not a description of the system, and the evolution of the occupation number of the condensate needs to be described by the wave kinetic equation with three-wave interaction (Nazarenko 2011b; Zakharov et al. 1992; Dyachenko et al. 1992; Lvov et al. 2003). This three-wave interaction representation describes the later phase in the evolution when the condensate is strong and fluctuations on top of the condensate are only given by phonons. Only at this stage the theory can again be described by weak turbulence, but now involving three waves. The intermediary regime between these two description is more complicated. After the four-wave description breaks down, the system is highly non-linear composed by a gas of hydrodynamical vortices. Only after these vortices annihilate and most of the systems are in the condensed phase, one can use the three-wave description.

One possible solution of (85) is the Kolmogorov–Zakharov (KZ) spectra, which is a non-equilibrium steady-state solution. Within this solution, we can have turbulent cascade processes, with a dual cascade for different direction of the energy flux. The interpretation of these dual cascade processes in BEC is interesting and corresponds to techniques used in experimental realizations of BEC. Inverse cascading, which is the non-equilibrium transfer of particle to the lowest energy momentum can be though as condensation. The initial process of BEC formation can be achieved in this process as a non-equilibrium condensation. The forward cascading is a processes is the energy transfer to higher momentum states, higher energy level. When the condensate is in a trap, these particles are going to leave the trap, and this is called evaporative cooling.

We studied above the case without a trapping potential. When in a trapping potential, the condensate density is now coordinate dependent and the behaviour in this trap will depend on the relation between the characteristic mean free path of the excitation wave packets and the size of the trap or the range of the force that produces this potential.

We are gong to see in Sect. 4.1.4 another solution of the above four-wave kinetic equation in the case of long-range interactions, like the gravitational interaction. The potential is present and it is the gravitational potential. This is described by the Landau kinetic theory, and can describe the initial stages of a condensate in the presence of gravitational interactions. Different from the KZ description of condensation discussed above, in this case, condensation does not arise from a cascading process but from a dissipation process.

We described above the four-wave classical kinetic equation. This is a good classical limit representation of the initial stages of a BEC. However, this is not a description of strong condensates, as a three wave kinetic equation needs to be adopted instead. This is also not a description of the quantum condensate. Wave turbulence theory can also be used to describe the full quantum regime of condensate, and not only the classical limit described until now. To be able to do that we first need to discuss the statistical distributions that the kinetic equations describe.

One of the distributions that is a solution of the classical equation (85) is the Rayleigh–Jeans (RJ) distribution, \(n_k = T/(k^2+mu)\), with \(T>0\) and \(\mu > 0\). This is similar to what we find when we are studying classical limit of the systems described by the Gross–Pitaevskii equation. Thu full quantum system described by (21) has a occupation number that describes a Bose–Einstein statistics. The classical limit of this system, described by the Gross–Pitaevskii equation has a mean occupation number in equilibrium described by the RJ distribution, the classical limit of the Bose–Einstein distribution: \(\langle n_k \rangle = k_B T/(\epsilon _k - \mu )\), where \(\mu <0\). This system describes a condensate only when the occupation of the ground state is macroscopic, \(\langle n_k \rangle \rightarrow \langle N_0 \rangle \).

The temperature T in this RJ distribution is related to the initial energy of the system \(E_0 = \int {\text {d}}{\mathbf {k}} \, \omega _k n_k = T \int {\text {d}}{\mathbf {k}} \), where \(\omega _k = k^2 + \mu \), in connection to thermodynamics. From this, we can see that since each degree of freedom of the theory has the same energy T, for a continuous and infinite system, the this and the energy diverge. This is the classical RJ catastrophe or the UV catastrophe.

Thus, the UV catastrophe is inherent to the ensemble of classical nonlinear waves. It is argued then that for the RJ solution to be a relevant solution of the kinetic equation, a cutoff needs to be introduced to regularize the UV catastrophe. This is as a phenomenological way of making the classical description of the system valid. Therefore, to describe a BEC in such a classical theory one needs to have a momentum cutoff in the theory. This truncated system then can be used to describe the evolution of a classical ensemble of waves via the kinetic theory derived above. In realistic systems, this cutoff sometimes comes naturally from dissipation or limits of the simulation or experiment. If one is working in a description where the UV catastrophe does not take place, where the evolution period of the system is not threatened by this divergence, then the above description for the classical condensate also holds. This will be the case of Sect. 4.1.4.

Another solution to make the description valid (avoiding the UV catastrophe) is to go to a quantum statistics. We can modify the kinetic theory to obey a Bose–Einstein statistics. We generalize the kinetic equation to:

$$\begin{aligned} {\dot{n}}_k&= 4\pi \int \, n_1 n_2 n_3 n_4 \, \left[ \left( \frac{1}{n_k} +1 \right) \left( \frac{1}{n_3} +1 \right) - \left( \frac{1}{n_1} +1 \right) \left( \frac{1}{n_2} +1 \right) \right] \nonumber \\&\delta ({\mathbf {k}}+ {\mathbf {k}}_3- {\mathbf {k}}_1 -{\mathbf {k}}_2 )\, \delta (k^2 + k^2_3 -k_1^2 -k_2^2)\, {\text {d}}{\mathbf {k}}_1 {\text {d}}{\mathbf {k}}_2 {\text {d}}{\mathbf {k}}_3 \,. \end{aligned}$$
(86)

The Bose–Einstein statistics is now a solution of this equation. This is the quantum kinetic equation and can be used to describe the full quantum BEC.

We showed above that we can use wave turbulence as an approximate description of some regimes of a BEC. The classical four wave kinetic equation can be used to describe the initial stages of a BEC in the classical limit. Different description will arise for different stages of the evolution of the condensate. Wave turbulence can also describe both classical condensate evolution and quantum evolution. A BEC is a quantum phenomena, but its classical evolution can approximately be described by wave turbulence which is a convenient description, in particular for simulations.

We are going to use kinetic theory to describe the formation of the BEC in Sect. 4.1.4.

3.7 Summary and discussion: what is a condensate?

We saw in this section, an introduction to two of the most interesting phenomena in quantum mechanics, the BEC and superfluids. We went through all these concepts in detail with the goal to give a proper definition of Bose–Einstein condensation in the context it is well understood and measured.

BEC is the phenomena of macroscopic occupation of the ground state that happens at low temperatures. BEC is a consequence of the quantum statistics of bosons, and it is an inherently a quantum phenomena. Equivalent to having a macroscopic occupation of the ground state, condensation can be though a the regime where the interparticle distance is smaller than the de Broglie wavelength of the bosons, which leads to a superposition of these wavefuctions, creating a macroscopic wavefunction that describes the condensate, which is a macroscopic quantum object. One of the main properties of a BEC is that it presents macroscopic (long range) quantum coherence.

We also saw that we can describe a more realistic condensation processes using a weakly interacting Bose gas, which exhibits superfluidity upon condensation. This theory is described by the fully quantum many-body Hamiltonian. For a large number of particles (\(N \gg 1\)), this Hamiltonian is very complicated to be studied. But, when the interactions in the BEC are weak, the BEC is diluteFootnote 22\(na^3 \ll 1\), and for large N, we can take the mean field approximation. In the mean field approximation, we can make the huge simplification that the many-body wavefunction can be approximated by an effective single-particle wavefunction. This means that the wavefunction of the condensate can be written as

$$\begin{aligned} {\hat{\varPsi }} ({\mathbf {r}}, t) = \psi ({\mathbf {r}}, t) +\delta {\hat{\varPsi }} ({\mathbf {r}}, t)\,, \end{aligned}$$
(87)

where \(\psi ({\mathbf {r}}, t) \equiv \langle {\hat{\varPsi }} ({\mathbf {r}}, t) \rangle \) (this is a classical quantity, because if it was quantum this average would be zero). The field \(\psi ({\mathbf {r}}, t)\) is the classical field or classical wavenumber. Quantum effects are suppressed in this limit (the depend on 1/N), and the BEC is well approximated by a classical theory.Footnote 23 The classical field that represents the condensate satisfies the (classical) non-linear Schrödinger equation or Gross–Pitaevskii equation. Therefore, the classical Gross–Pitaevskii or non-linear Schrödinger description of a condensate is a mean field description of condensate.

An important detail about the mean field approximation. In the classical limit of a scalar field we also have (87), where \(\psi = \langle {\hat{\varPsi }} \rangle \) is the scalar field and \(\delta {\hat{\varPsi }}\) are suppressed quantum corrections. Now, when a condensate can be treated as classical we are also in the limit where we can expand (87), but we have \(|\psi ({\mathbf {r}},t)|^2 = n_0 =n\), which means that there is a macroscopic occupation of the ground state. For the classical limit of the condensate, as the term \(\delta {\hat{\varPsi }} ({\mathbf {r}}, t)\) becomes more and more important and the mean field approximation breaks, this can be seen as depletion of the condensate. This limit can be broken if temperature or interactions are increased, then quantum and thermal fluctuations deplete the condensate.Footnote 24 At this point, the mean field approximation breaks and this classical description of the condensate cannot be used anymore.

Although the condensate to be formed depends on the quantum statistics of bosons and it is a quantum phenomena, it can be treated as classical in the mean field approximation. In an equivalent way, in the field description of BEC, the condensate can be described in the mean field limit by a classical field.Footnote 25

Summarizing A BEC can be described by a coherent classical scalar field that satisfies the Gross–Pitaevskii equation, in the mean field approximation where most of the particles are in the ground state, \(|\psi ({\mathbf {r}},t)|^2 = n_0 =n\).

The term “condensate” is used in the literature loosely meaning different things for different authors. I will use throughout this review the definition presented here. Therefore, every time I am using the term condensate, I am referring to the definition described here.

4 Ultra-light dark matter

After the introduction in the previous sections of the concepts that are going to be applied in this part of the review, we are finally ready to discuss the ULDM models.

Ultra-light DM denotes a class of models where DM is composed by ultra-light bosons. These models were introduced as a new class of DM models that can address the small scale challenges of \(\varLambda \)CDM, but mainly as models that offer a novel and rich phenomenology in galaxies that can be tested with small scale observations. The general idea of those models is that inside a virialized DM halos, ULDM thermalizes and forms gravitationally bounded cores that can be described as a BEC or a superfluid. In this way, these models behave like CDM on large-scales, with modified initial conditions, recovering the incredible observational successes of this description, while inside galaxies they present a wave-like behaviour.

To have this behaviour inside galaxies, the mass of this bosonic DM has to be very small. There are many models in the literature of ULDM that present this wave-like behaviour in galaxies, and the specific range of masses where this wave-like behaviour happens in galaxies depends on the specifics of the models. However, we can estimate in a model-independent way the range of masses of the ULDM particles to present this behaviour in galaxies. The mass of the ULDM has to be:

$$\begin{aligned} 10^{-25}\, \mathrm {eV} \lesssim m \lesssim 2 \, \mathrm {eV}\,. \end{aligned}$$
(88)

The lower bound on the mass is very general and comes from the fact that the size of the condensate core cannot be larger than the halo, since we want the condensate only on galactic scales and normal DM on larger scales. The maximum case we can have for the formation of a condensate is where the de Broglie wavelength of the ULDM particle is of the order of the size of the halo. Taking this bound at virialization, there is a maximum value on how large the de Broglie wavelength can be \(\lambda _{\mathrm {dB}} < R_{200}\). Taking \(z_{\mathrm {vir}} \sim 2\) and for halos with mass of order of \(10^{12} M_\odot \), we can see that for a spherical halo (Rindler-Daller and Shapiro 2012) this imposes a lower bound on the mass

$$\begin{aligned} m > m_{\mathrm {H}} \equiv \frac{2 \hbar }{\sqrt{3G}} (R\, M)^{-1/2} \approx 10^{-25} \left( \frac{M}{10^{12} \, M_\odot } \right) ^{-1/2} \left( \frac{R_{200}}{100\, \mathrm {kpc}} \right) ^{-1/2} \, \mathrm {eV}\,.\nonumber \\ \end{aligned}$$
(89)

We can also impose an upper bound on the mass asking the question: what is the biggest mass I can have that ULDM forms a core inside the galaxy? Again, to answer this question one needs to work with a specific ULDM model to study the Jeans theory of this model and the solutions of the equations in this region, which would give a bound within this model for the creation of these cores. However, one can try to be more general. The non-CDM behaviour happens in the regions where the wave behaviour takes place. So, the interesting non-CDM behaviour occurs on scales of the order or smaller than the scale that characterizes the wave which given by the de Broglie wavelength. This is the maximal case where the de Broglie wavelength of the ULDM particle is of the size of the galaxy. But one can also think that we can obtain a galaxy size wave as the superposition of the de Broglie wavelength of each of the ULDM particles (which are themselves smaller than the galaxy in this hypothesis). Then we can calculate the biggest mass, which means smaller de Broglie wavelength if each particle, for which this superposition yields a galaxy size wave. This translates to the condition that de Broglie wavelength of the boson DM is larger than the inter-particle distance between each boson,

$$\begin{aligned} \lambda _{\mathrm {dB}} \sim \frac{1}{mv} > l= \left( \frac{m}{\rho } \right) ^{1/3}\quad \Longrightarrow \quad m < \left( \frac{\rho }{v^3} \right) ^{1/4}\,, \end{aligned}$$
(90)

where assuming a spherical halo, the interparticle distance is defined as the radius of a sphere with density \(\rho \). This gives a bound on the mass of the DM particle. This condition is the same as the condition that a gas need to have to condensate into a BEC, as we can see in the box in Sect. 3.1 since it is equivalent to the condition of having macroscopic occupation number of the ground state for temperatures below the critical temperature of the system. This condition does not determine condensation of DM in the halo since showing that condensation happens in the halo is much more complicated than this ideal gas condition that is not realistic for the halo, but it is a 0th order condition for this phenomena together with the assumption of thermalization. But here we use it only as a condition to form a galaxy size macroscopic wave from the superposition of the individual particles’ waves.

We use the density and velocity of the dark matter halo like described in Sect. 2 from standard spherical collapse (Berezhiani and Khoury 2016), and take this bound at virialization:

$$\begin{aligned} \rho _{200}&= 200 \rho _{\mathrm {cr}} \sim 1.95 \times 10^{-27} \, \left( 1+z_{\mathrm {vir}} \right) ^3 \, \mathrm {g/cm}^3\,, \nonumber \\ V_{200}&\sim 85 \left( \frac{M}{10^{12} M_\odot }\right) ^{1/3} \sqrt{1+z_{\mathrm {vir}}} \,\, \mathrm {km/s}\,, \end{aligned}$$
(91)

where we derived these expressions assuming \(H_0 \sim 70\, \mathrm {km \, s}^{-1} \, \mathrm {Mpc}^{-1}\) and a halo mass of order of the MW. This gives the bound

$$\begin{aligned} m \lesssim 2.3 \, (1+z_{\mathrm {vir}})^{3/8} \left( \frac{M}{10^{12} M_\odot }\right) ^{-1/4} \, \mathrm {eV}\,. \end{aligned}$$
(92)

Taking \(z_{\mathrm {vir}} \sim 2\) and for halos with mass of order of \(10^{12} M_\odot \), we have an upper bound for the mass of the ultra-light DM particle to have galaxy-sized wave coming from the superposition of the wave of each particle: \(m \sim 2 \, \mathrm {eV}\).Footnote 26

I would like to bring the readers attention to following. The issue of condensation of ULDM in the halo is one of huge debate in the literature and we are going to discuss this in Sect. 4.1.5.

The range of masses showed above is just an estimate of the maximal higher and lower masses that the ULDM can have. Each specific ULDM model has a mass range where this behaviour in galaxies takes place and that it is in agreement with observations which depends on the specifics of the models. However, those bounds have to always fall within this general range (assuming ULDM is all the DM in the universe—for studies where this is not true see Hložek et al. 2015, 2018). We will see in Sect. 5 that for the FDM model, using CMB and LSS observations and assuming again the FDM is all the DM, that the lower bound in the mass is very close and within this theoretical estimation. Now for the DM superfluid model, as we will see in Sect. 4.2, respects the upper bound presented here with the mass of its particle very close to this limit.

The mass range for the ULDM presents masses that are much smaller than the ones usually considered for DM candidates and cannot be produced thermally in the early universe. Therefore, ULDM is a non-thermal relic, having to be created by a non-thermal mechanism to be cold today and behave as DM. There are many non-thermal production mechanism (see ‘DM relics’ panel below) but since in this review we are being agnostic on the type of particle that consists our ULDM, we are not assuming any creation mechanism, unless we are talking about a specific microscopic candidate.

With that, for a ULDM candidate with a mass in the range (88), we are going to have a non-CDM like behaviour coming from the presence of this core with wave-like behaviour inside the galaxy, while having a CDM like behaviour on large-scales, with different initial conditions resulting from the different mass and dynamics of the specific ULDM model. ULDM is the name used to the collection of models that have the characteristic stated above. There are many realizations of this behaviour which are present in each of the specific ULDM model that yield a different description of DM in galaxies, and a distinct and rich phenomenology in galaxies. We are now going to see that those models can be classified into three main classes according to their descriptions in galaxies.

Classification of ULDM models

The idea of having DM condensation on small-scales is not new and has been around for 30 years (Sin 1994; Ji and Sin 1994; Khlopov et al. 1985). For this reason, there are many models in the literature that were developed to describe a DM component with that behaviour on galactic scales. These models receive many names in the literature. They are either models that have a microscopic description or phenomenological models, which allow for the inclusion of different interactions and for a different dynamics to describe the evolution and non-linear structures of DM in the halo, which in turn can lead to distinct and rich astrophysical consequences on small-scales.

One possible model of ULDM is the axion. This model helped to bring a lot of attention in the literature to this class of models. In the case of the axion, we have a model that has a microscopic description and a well-defined cosmology. The QCD axions and general axions can behave like DM in a large range of parameters. This is also the case for axion-like particles (ALPs), which is another microscopic scalar theory that can describe DM. These microscopic theories behave like DM, but only present interesting phenomenology in small-scales for a more limited range of masses. On small-scales, the behaviour of these microscopic scalar theories can coincide with the behaviour of other phenomenological models of ULDM. For all of those, the non-relativistic action that yield a non-linear Schrödinger–Poisson equation.

In this review, we am going to classify the ULDM models according to the description they present on the small-scales, given by their non-relativistic dynamics on those scales. Each of these classes can contain both phenomenological and microscopic models that yield the same non-relativistic model. This classification is instrumental since it elucidates the physics responsible for the non-relativistic evolution and non-linear structures that are formed, and separates the different phenomenology each of those classes present.

These different descriptions also yield different conditions for condensation (or if it condenses of not) and formation of the condensate core. Each of these non-relativistic descriptions is going to describe a different phenomena upon condensation, being possible to have either a BEC or a superfluid, the latter in the presence of interactions.

According to this criteria, we classify the ULDM models into three categories (which somehow agrees with what was suggested in Sharma et al. 2019):

Fuzzy dark matter (FDM): The first category is given by a gravitationally bounded scalar field model. It described by a non-linear Shrödinger equation subjected to a gravitational potential, coupled to the Poisson equation (see ‘ULDM classes’ box). In this model, condensation under the influence of the gravitational potential is achieved in galaxies where the gravitational attraction is counteracted by the quantum pressure. This class of model can be called fuzzy DM since this name is already very well established for these gravitationally bounded BECs. One of its main realizations, which coined the name fuzzy dark matter is presented in Hu et al. (2000a); Hui et al. (2017), where the DM is given by a light particle with \(m \sim 10^{-22} \, \mathrm {eV}\). The FDM model is the ULDM model that was studied the most in the literature both theoretically and numerically. With a particle with this mass, the FDM model is known to be able to solve some of the challenges from small-scales presented above, and to be in agreement with large-scale observations. The mass of this model does not have to necessarily have this value and need to be determined by observations, although this is the value that gives the most appealing modifications on small-scales. Therefore, this model has one free parameter, the mass of the FDM particle m (we are considering in the review the case where all DM is composed of ULDM). Some interesting phenomenology also emerges from this model, as we will discuss in detail in the next subsections, that can be probed by current and future astrophysical observations. This model has also been called in the literature by wave DM, \(\psi \)DM, among other names (Sin 1994; Ji and Sin 1994; Seidel and Suen 1990; Widrow and Kaiser 1993; Chan et al. 2018; Guzmán et al. 1999; Matos and Guzmán 2000; Matos et al. 2009).

Axions and ALPs can also be thought to be in this class, since they yield exactly the same physics on small-scales. However, these models are more general and can describe DM for different values of their mass (like the QCD axion that does not produce such structures).

Self interacting FDM (SIFDM): The second category is called self-interacting FDM (SIFDM), but it also receives the names repulsive DM, scalar field DM, fluid dark matter, among others in the literature (Goodman 2000; Peebles 2000; Silverman and Mallett 2002; Arbey et al. 2003; Boehmer and Harko 2007; Lee 2009; Lee and Lim 2010; Harko 2011; Slepian and Goodman 2012; Dwornik et al. 2014; Guzmán et al. 2014; Harko 2014; Fan 2016; Guth et al. 2015; Chavanis 2018, 2016; Dev et al. 2017; Chavanis 2011; Chavanis and Delfini 2011; Chavanis 2012; Rindler-Daller and Shapiro 2012). In these models, DM is described by a scalar field model, in the presence of gravity, with a (usually) two-body self-interaction. The presence of the interaction makes this model present superfluidity upon condensation. This case is described by the interacting BEC presented in Sect. 3.3, which is the simplest example of a superfluid. The presence of the interaction controls the stability of the core and for this reason this model presents a different phenomenology depending not only on the mass of the particle, as for FDM, but given the strength and sign of the interaction. For a repulsive interaction, the condensate has a long range coherence and presents superfluidity. The 2-body case is characterized by having an equation of state (EoS), \(P \sim n^2\), as we saw in the previous section. Higher order interactions describe SIFDM with different equations of state.

DM superfluid: The third category is called DM Superfluid (Berezhiani and Khoury 2015, 2016; Khoury 2016; Hodson et al. 2017; Berezhiani et al. 2018). This theory was proposed with the goal of reproducing the MOND empirical law on small-scales, presenting a natural framework for the emergence of this theory. Different from the case of SIFDM, to reproduce MOND on small-scales, it requires that the EoS is given by \(P \sim n^3\), like what is expected by MOND, with a more intricate dynamics describing the small-scales. To accomplish that, this model is described using the EFT of superfluids which allows us to describe superfluids with a more general dynamics and EoS.

figure o
Fig. 9
figure 9

Map of the ULDM classes of models

There are many amazing reviews in the literature that focus in different parts of the ULDM class of model, either focusing in microscopic models, like describing axions Sikivie (2008); Arvanitaki et al. (2010); Wantz and Shellard (2010); Kim and Carosi (2010); Kawasaki and Nakayama (2013); Marsh (2016a) or ALPs (Ringwald 2014; Arias et al. 2012; Graham et al. 2015a; Marsh 2018; Niemeyer 2019; Powell 2016), or focusing in one of the classes like the FDM Hui et al. (2017); Suárez et al. (2014); Ureña-López (2019), for which the axions and ALPs describe the same non-linear theory (given by a Schrödinger–Poisson system in the absence of self interactions). This review follows a classification between the different classes according to their non-relativistic description and includes not only the FDM, but also the other classes of ULDM.

Given that, in this review, we are going to be describing the dynamics each of these classes of models present on small-scales, and the different cosmological and astrophysical consequences this new phenomenology brings. The exception is in the case of the FDM where we also are going to talk a little bit about the cosmology of the axions and ALPs. However, in general, we are going to remain agnostic about the origin of this field and we are going to work out only the gravitational consequences they have.Footnote 27 Understanding these consequences will prepare the field for the next section, where we discuss constraints on these models. We briefly discuss in the end of this section the big role that simulations have in studying these models and also the regimes where these ultra-light fields can behave as dark energy.

figure p
figure q

Footnote 28

4.1 FDM and SIFDM

In this section, we are going to describe the FDM and the SIFDM models. Although they both describe different non-relativistic dynamics and structure formation in the halo, and according to our classification are in different classes, we describe them in this section together since they can be described by a relativistic action. Even tough we are interested in the non-CDM phenomenology of these models on small-scales, these models can also modify the initial conditions for the evolution of the matter perturbations, and depending on the mass, modify the evolution of the model. In this way, the relativistic theory allows us to study the cosmology of this model and with that make predictions that can be tested by large-scale observations like CMB and LSS. This helps us to describe the model in different scales and use observations from large and small-scales to constraint the parameters of the model.

The ULDM is described as a very light scalar field minimally coupled to gravity given by the action,

$$\begin{aligned} S= S_{\mathrm {EH}}+S_{\phi } = \int {\text {d}}^4 x \sqrt{-{\bar{g}}} \left[ \frac{R}{16\pi G} + \frac{1}{2} g^{\mu \nu } \partial _\mu \phi \partial _\nu \phi -\frac{1}{2} m^2 \phi ^2 -\frac{g}{4!} \phi ^4 \right] \,, \end{aligned}$$
(93)

where \(S_{\mathrm {EH}}\) is the Einstein–Hilbert metric, R is the Ricci scalar, \(g_{\mu \nu }\) is the metric, \({\bar{g}}\) is the determinant of the metric and g is self-interaction coupling.

The axion or ALPs are described by an action like this. In this case, this action has a microscopic theory behind it. This potential comes from non-perturbative effects in QCD, for the axion, or other concrete string theory models, and gives a small mass for the axion or ALP. Since in these cases this action comes from a well-defined microscopic theory, the parameters relate to scales from this theory and the mechanisms that originated this particle. There is only a range in the parameter space where the ALPs behave like DM, and that also gives an interesting modification of structure formation. This happens for masses around \(m \sim 10^{-22} - 10^{-20}\, \mathrm {eV}\), which is similar to the range of masses for the FDM model.

But this relativistic action could also be phenomenological that allows the coupling to have a different sign and values, different from the axion case. We are going to see soon that the attractive and repulsive interactions yield different phenomenologies, with the repulsive allowing for much bigger collapse cores.

In this section, we are going to explore the cosmological consequences of this relativistic action, and after go to the non-relativistic regime to study the structure formation in the FDM and SIFDM models and describe the condensate formed in the center of the galaxy. To explore cosmological evolution, we are first going to study the concrete case where this action is the action for the ALPs. We also comment a little on the case of axion. This is very useful since in this case when we have a microscopic theory, we can identify the scale of the parameter and their relations, determine the initial conditions and the DM abundance.

4.1.1 Formation: ALPs

We are going to briefly describe here the formation mechanism for ALPs (see Marsh 2016a for a more complete description).

An ALP is a pseudo-Nambu–Goldstone resulting from the spontaneous symmetry breaking of a global U(1) symmetry described by the complex scalar field

$$\begin{aligned} \varPsi = v e^{i\phi /f_a}\,, \end{aligned}$$
(94)

where \(f_a\) is the scale of the spontaneous symmetry breaking and \(\theta _a = \phi /f_a\) is the misalignment angle. In the case of the QCD axion, the symmetry broken is the chiral global \(U(1)_{PQ}\), the Peccei–Quinn symmetry, and the introduction of this Goldstone boson solves the strong CP problem. When the symmetry is broken, the massive radial component v is fixed at the vacuum expectation value (vev), \(v_{0,ssb}=f_a/\sqrt{2}\), making the radial field non-dynamical while there is a continuous set of minima with the ground state given by \(\varPsi = v_{0,ssb} e^{i\theta _a},\) corresponding to all the possible phases in the circle (if the reader wants to refresh the memory on SSB, see Sect. 3.3.1 and the box that refers to SSB in that section). The pseudo-Goldstone boson \(\phi \) is invariant under shift-symmetry, inherited from the U(1) symmetry of the complex scalar field.

However, non-perturbative effects, coming from string theory models or from instantons in the case of the axion, can induce a potential that break the shift symmetry explicitly, although softly, which leads to a residual discrete symmetry. This potential gives a small mass to the ALPs, and has the form

$$\begin{aligned} V(\phi ) = \varLambda _a^4 \left[ 1-\cos \left( \frac{\phi }{f_a} \right) \right] \,, \end{aligned}$$
(95)

where \(\varLambda _a^4\) is the scale of spontaneous symmetry breaking.Footnote 29 This potential is not unique and the overall constant added was chosen arbitrarily. For small field values \(\phi \ll f_a\), this potential can be expanded into

$$\begin{aligned} V(\phi ) = \frac{1}{2} m^2 \phi ^2 + \frac{g}{4!} \phi ^4 + \cdots \,, \end{aligned}$$
(96)

where \(m=\varLambda _a^2/f_a\) and \(g = -\varLambda _a^4/f_a^4 < 0\). Since the spontaneous symmetry breaking scale \(f_a\) is usually much higher than the explicit symmetry breaking scale \(\varLambda _a\), the mass is usually very small, with the self-interaction coupling g even smaller. For the QCD case, \(\varLambda _{\mathrm {QCD}} \sim 200 \, \mathrm {MeV}\) (Marsh 2016a), and \(10^9 \, \mathrm {GeV} \lesssim f_a \lesssim 10^{17} \, \mathrm {GeV}\) coming from astrophysical constraints, then we can see that \(10^{-10} \, \mathrm {eV} \lesssim m_{\mathrm {QCD}} \lesssim 10^{-2}\, \mathrm {eV}\). For string theory models, there is a variety of cases, but typically \(\varLambda _{st} \sim T_{\mathrm {SUSY}} \, e^{-S_{i}}\), where \(S_{i}\) is the instanton action which generally is \(S_{i} \gg 1\).

Although very small, given the approximate shift-symmetry, the ALP mass is protected against radiative corrections, and interactions with the standard model are suppressed by powers of \(f_a\).

For the case of the QCD axion, the mass generated by the non-perturbative QCD effects has a time dependence that scales with the power law of the temperature. For a discussion about that, see, (Marsh 2016a).

Given all that, the action for the ALP is given exactly by the action (93) (usually the self-interaction is omitted since this is suppressed by powers of \(f_a^3\)).

4.1.2 Cosmological evolution

Having established that there is a microscopic theory where the action (93) can come from, we are going to study the cosmology in the general case, which describes any model in the FDM and SIFDM classes. If we ignore the small interaction, this model corresponds to the FDM class. In the presence of interaction, this model corresponds the SIFDM class, where the interaction can be attractive or repulsive, while in the case of ALPs and axions, it can only be attractive (\(g<0\)).

We are going to focus only in the matter sector now and omit the Einstein–Hilbert action (which gives GR, the background theory where our field evolves). The action we are going to work with is

$$\begin{aligned} S_{\phi }= \int {\text {d}}^4 x \sqrt{-g} \left[ \frac{1}{2} g^{\mu \nu } \partial _\mu \phi \partial _\nu \phi -V(\phi ) \right] \,, \end{aligned}$$
(97)

where the potential can be given by only the mass term, describing the FDM, or both the mass and the interaction, describing SIFDM.

We can study the evolution of this field in a flat Friedmann–Robertson–Walker background (FRW) background, given by the metric \({\text {d}}s^2 = {\text {d}}t^2 - a^2(t){\text {d}}x^2\), where a(t) is the scale factor. The equation of motion for the ALP is given by

$$\begin{aligned} \ddot{\phi } + 3H {\dot{\phi }} + \frac{{\text {d}}V}{{\text {d}}\phi } = 0\,, \end{aligned}$$
(98)

where \(H={\dot{a}}/a\) is the Hubble parameter. The cosmological evolution is going to depend on the competition between the potential term and the Hubble friction term. Let uthe ULDM is subdominant s consider the case of the FDM, where \(V(\phi ) = (1/2) \, m^2 \phi ^2 \) to illustrate the cosmological evolution. The background evolution of this field proceeds in the following way. In the early universe, \(H \gg m\), the Hubble friction dominates and the solution is a constant given by the initial conditions \(\phi _{\mathrm {early}} = \phi (t_i) \). In the case of the ALP, this initial condition is known and given by the formation of the ALP with \(\phi _i= f_a \theta _a (t_i) = f_a \theta _i\). At early times, the ULDM is subdominant and has equation of state \(w = -1\) behaving like dark energy. As the universe expands, the Hubble parameter becomes smaller and smaller, until a point where it is smaller than the mass of the field \(H \ll m\). The solution of the equation in this case is oscillatory. As the field oscillates, the equation of state also oscillates around zero, giving an averaged out equation of state of a dust like component \(w =0\). In this limit, the field behaves like DM and the energy density evolves as \(\rho = \rho (a(t_{*})) \, (a(t_{*})/a)^3\), where \(t_{*}\) is the time when \(H(t_{*}) \sim m\).

We can already see that the lower the mass is, more and more will take until the Hubble parameter to become smaller than the mass, prolonging the early period where the field behaves as dark energy. For higher masses, the FDM behaves like DM earlier in the history of the universe. Therefore, the mass completely controls when the period of DM domination starts.

In the case of ALP, for them to behave like all the DM of the universe, it has to start oscillating before matter-radiation equality, which gives \(m > 10^{-28} \, \mathrm {eV} \sim H(a_{\mathrm {eq}})\). However, not all of this regime the ALP as DM presents interesting phenomenological consequences for structures since for heavier masses, ALP behaves closer and closer to CDM. We are going to see this in more detail later, but the sweet spot in mass for the ALP to have this distinct regime on small-scales is \(m \sim 10^{-22} - 10^{-20}\, \mathrm {eV}\).

To obtain the full solution of Eq. (98), one needs to solve this equation coupled to the Friedman equation that describes the evolution of the scalar factor according to the components of the universe: \( H^2 = (1/M_{pl})( \rho _{\phi } + \rho _r + \rho _b + \rho _{\varLambda } )\), where the contributions in the energy density come from the FDM or SIFDM, radiation, baryons and cosmological constant, respectively. The energy density and pressure of this scalar particle are given by

$$\begin{aligned} \rho _{\phi } = \frac{1}{2} {\dot{\phi }}^2 + V(\phi )\,, \quad p_{\phi } =\frac{1}{2} {\dot{\phi }}^2 - V(\phi )\,, \end{aligned}$$
(99)

where in the case that the field is oscillating can be averaged over time. In the case of the FDM, we can take \(a(t) \propto t^p\), which is valid for the period of radiation and matter dominations, and we have:

$$\begin{aligned} \phi (t) = a^{-3/2} \left( \frac{t}{t_i} \right) ^{1/2} \left[ C_1 \, J_n (mt) + C_2 \, Y_n (mt) \right] \,, \end{aligned}$$
(100)

where \(C_{1,2}\) are determined by the initial conditions, \(n=(3p-2)/2\) and \(J_n\) and \(Y_n\) are the Bessel functions of first and second kind.

We can then compute the relic density of FDM, which is the energy density of those particles today. For the case where FDM is the ALP and behaves like DM, and all the DM is made of ALPs, the density fraction is

$$\begin{aligned} \varOmega _{\mathrm {ALP}} \sim \frac{1}{6} \, \left( 9\, \varOmega _r \right) ^{3/4} \left( \frac{m}{H0} \right) ^{1/2} \left( \frac{\phi _i}{m_{pl}} \right) ^2\,. \end{aligned}$$
(101)

The initial value of the field displacement determines the relic density of ALPs, and to have the DM density observed today, the initial value of the field must be \(\phi _i > 10^{14} \, \mathrm {GeV}\) for the masses of the ALP that correspond to DM behaviour. A similar calculation can be made for the axion and this can be found in Marsh (2016a).

With that we saw that we have two scales that are important for the DM ALPs: \(f_a\) the spontaneous symmetry breaking scale that determines the initial conditions of the ALPs, and \(\varLambda _a\) the scale of explicit breaking that determines the mass given the previous scale. The temperature \(T_{*}\) associated with \(t_{*}\), the time when the field starts to oscillate and behave like DM, is set after the mass is determined. This is similar in the case of the FDM, where the only one degree of freedom is m, although the initial condition \(\phi _i\) can be unknown. If one assumes the FDM is described by an ALP, then the initial condition is determined.

In the case of the SIFDM, we have an extra parameter in comparison to the FDM, the interaction strength. The interaction term acts as a pressure term in the equation of motion (98). This pressure can be attractive or repulsive. There is then a competition between the Hubble friction, the mass and the pressure. We are going to study more about these effects in the next section.

Cosmological perturbations:   We have studied the background evolution and now we need to study the cosmological perturbations. This is important in order for us to study the cosmological consequences of this scenario that can be tested with cosmological observations. We are not going to present here an extensive description, but a summary of the most important results. Notice that like for the cosmological evolution, the procedure here is general for any FDM or SIFDM.

First, we perturb the scalar field and the metric into small perturbations on top of the background values:

$$\begin{aligned} g_{\mu \nu } ({\mathbf {x}}, t) = g_{\mu \nu }^{(0)}(t) + \delta g_{\mu \nu } ({\mathbf {x}}, t)\,, \quad \phi ({\mathbf {x}}, t) = \phi _0(t) + \delta \phi ({\mathbf {x}}, t)\,, \end{aligned}$$
(102)

where the 0 indicates the background quantities. We are only going to be interested in the scalar perturbations, in this review. We are going to work on conformal time \(\eta \) defined as \({\text {d}}\eta = {\text {d}}t/a\). The perturbed metric for the scalar metric perturbations is described by four functions (following the convention from Mukhanov 2005)

$$\begin{aligned} {\text {d}}s^2 = a^2(\eta ) \left\{ \left( 1+ 2\varPhi \right) {\text {d}}\eta ^2 +2B_{,i} \, {\text {d}}\eta {\text {d}}x^i + \left[ \left( 1 - 2\varPhi \right) \delta _{ij}-2 E_{,ij} \right] {\text {d}}x^i {\text {d}}x^j \right\} \,. \nonumber \\ \end{aligned}$$
(103)

When doing perturbation theory in general relativity, due to gauge invariance, new degrees of freedom that are fictitious and not physical might be introduced. One of the procedures to deal with this is to fix a gauge, which fixes these spurious variables. There are many ways of doing that which leads to the many possible gauges. The final physics given by all these gauges is the same, but each gauge offers a better description of different phenomena. The Newtonian gauge (\(E=0\) and \(B=0\)) is useful in the Newtonian limit. One can also choose the comoving gauge (\(B=0\) and \(v=0\)), where v is the velocity of the matter fluid. We are going to use both gauges whenever they are useful.

After a gauge is chosen, one substitutes the perturbations in the action (97), ignoring the interaction for simplicity, to obtain the second-order action for the perturbations. And from that one can obtain the equation of motion for the axion perturbation.

We can also re-write the perturbations in terms of the fluid variables: \(\rho _{\phi } = \rho _{\phi ,\, 0} + \delta \rho \) and \(p_{\phi } = p_{\phi ,\, 0} + \delta p\). From Eq. (99), we can identify the perturbations in the fluid variables with the ones of the scalar field and metric (Hwang and Noh 2009)

$$\begin{aligned} \delta \rho= & {} \langle \dot{\phi _0} \dot{\delta \phi } - {\dot{\phi }}_0^2 \varPhi +m^2 \phi _0 \delta \phi \rangle \,, \quad \delta p = \langle \dot{\phi _0} \dot{\delta \phi } - {\dot{\phi }}^2_0 \varPhi -m^2 \phi _0 \delta \phi \rangle \,, \nonumber \\&a (\rho + p) v = k \langle \dot{\phi _0} \dot{\delta \phi } \rangle \,, \end{aligned}$$
(104)

where we have taken the Fourier transform of the fields with k denotes the wavenumber, and the background pressure and energy density are averaged.

We are interested in obtaining the sound speed of the FDM particle. We showed before that the FDM behaves like dark matter at the background level, but we also need to show that the sound speed is small as expected for dust. This different sound speed is going to give a different Jeans scale and give a different structure formation for this model. Since we are interested in calculating the sound speed, we are going to work in the comoving gauge. In that gauge, the equations, assuming the averaged background equation of state (which is zero), for the scalar perturbations of the metric and fluid simplify and can be combined to give

$$\begin{aligned} \ddot{\delta } + 2H {\dot{\delta }} - 4\pi G \rho \, \delta + \frac{k^2}{a^2} \frac{\delta p}{\rho } = \ddot{\delta } + 2H {\dot{\delta }} +\left( \frac{k^2}{a^2} c_s^2 - 4\pi G \rho \right) \delta = 0\,, \end{aligned}$$
(105)

where \(\delta =\delta \rho /\rho \) is the density contrast and \(\omega _k^2 = \frac{k^2}{a^2} c_s^2 - 4\pi G \rho \) is the dispersion relation.

By definition, the sound speed is defined as the term that accompanies the gradient, the term with \(k^2\). To obtain the expression for the sound speed, we need to compute the perturbation of the pressure \(\delta p\). For that, make the simple procedure from Hwang and Noh (2009), where we assume an ansatz for the field perturbation, \(\delta \phi (x,t) = \delta \phi _{+}(x,t)\, \sin (mt)+\delta \phi _{-}(x,t)\, \cos (mt)\) and substitute that in Eq. (104). This gives us \(\delta p\), and the sound speed can be written as:

$$\begin{aligned} c_s^2 = \frac{k^2}{4m^2a^2} \left( \frac{1}{1+ \frac{k^2}{4m^2a^2}} \right) \,. \end{aligned}$$
(106)

This is the (relativistic) sound speed of our FDM fluid. It is valid outside the Hubble horizon and inside. For sub-Hubble horizon modes, when \(k/(ma) \ll 1\) the sound speed becomes

$$\begin{aligned} c_s^2 \xrightarrow [{k/ma \ll 1}]{} c_{s, \, n}^{2} = \frac{k^2}{4m^2 a^2}\,. \end{aligned}$$
(107)

This corresponds to sound speed that can be obtained in the non-relativistic Newtonian theory, as we will see in the next section.

There are two competing terms in Eq. (105), which are the terms inside the dispersion relation. The scale \(k_{J}\) for which \(\omega _k (k_J) = 0\) separates the regimes where each of those terms dominates in the equation. We can also write this in terms of \(\lambda _J=2\pi a /k_J\), the Jeans length. For modes with \(\lambda < \lambda _J\), the dispersion relation is negative, and the solution of (105) is that the perturbations oscillate. While when \(\lambda > \lambda _J\), perturbations grow. So there is only gravitational instability for the modes that are outside the Jeans length. In a theory of DM with a finite Jeans length, the growth of perturbations will be suppressed for scales smaller than \(\lambda _{\mathrm {J}}\). That is exactly the effect the small mass of the ULDM models has. This leads to important cosmological consequences for these models.

4.1.3 Evolution on small-scales

We have finally reached the section where we are going to describe the behaviour of the FDM and of the SIFDM on small-scales. It is this different behaviour that is used to classify the models into different classes.

The action that describes the SIFDM and the FDM is (93). We are interested in studying the behaviour of DM in galaxies, so we are on sub-Hubble scales. In this limit, \(H \ll m\) and the field is oscillating fast and behaving as DM. Inside the Hubble horizon and for the small velocities, we have in galaxies (\(v_{\mathrm {vir}} \ll c\)), we are in the non-relativistic limit of our theory. In the Newtonian gauge, the limit where \(B=0\) and \(E=0\) in (103), and with no anisotropic stress \(\varPhi = \varPsi \), we can write the action for the ULDM field (Niemeyer 2019):

$$\begin{aligned} S_{\phi } = \int {\text {d}}^4 x \, a^3 \left[ \frac{1}{2} (1-4\varPhi ) \, {\dot{\phi }}^2 - \frac{1}{a^2} (\partial _i \phi )^2 - (1-2\varPhi ) V(\phi ) \right] \,. \end{aligned}$$
(108)

Since in the non-relativistic limit the field varies slowly, the fast oscillations that we had for the field can be factored and we can re-write the field as

$$\begin{aligned} \phi = \frac{1}{\sqrt{2ma^3}} \left( \psi \, e^{-imt} + \psi ^{*} \, e^{imt} \right) \,. \end{aligned}$$
(109)

With the field in this form and assuming the \({\dot{\psi }} \ll m\psi \), we have the total non-relativistic action, including the Einstein–Hilbert action, that describes this theory given by (Chavanis 2011, 2018)

$$\begin{aligned} S= & {} \int {\text {d}}^4 x \left[ \frac{i}{2} \left( \psi \partial _{t} \psi ^{*} - \psi ^{*} \partial _t \psi \right) - \frac{|\nabla \psi |^2}{2m}-\frac{g}{16m^2} |\psi |^4 - m (\psi \psi ^{*} - \langle \psi \psi ^{*} \rangle ) \varPhi \right. \nonumber \\&\left. -\, \frac{a}{8\pi G} (\partial _i \varPhi )^2 \right] \,. \end{aligned}$$
(110)

The equations of motion of the action yield the Schödinger–Poisson system of equations:

$$\begin{aligned}&i {\dot{\psi }} =-\frac{3}{2} i H\psi -\frac{1}{2m a^2} \nabla ^2 \psi + \frac{g}{8m^2} |\psi |^2 \psi +m\varPhi \, \psi \,, \end{aligned}$$
(111)
$$\begin{aligned}&\nabla ^2 \varPhi = 4 \pi G \left( \rho - {\bar{\rho }} \right) \,. \end{aligned}$$
(112)

If we consider time scales much smaller than the expansion, we can ignore expansion of the universe and write the equation of our system as

$$\begin{aligned} i {\dot{\psi }} = -\frac{1}{2m} \nabla ^2 \psi + \frac{g}{8m^2} |\psi |^2 \psi +m\varPhi \, \psi \,. \end{aligned}$$
(113)

This is the non-linear Schrödinger equation. The gravitational potential term can be re-written in the form \(-Gm^2 \psi \int {\text {d}}^3 x^{'} |\psi ({\mathbf {x}}^{'})|^2/|{\mathbf {x}}-{\mathbf {x}}^{'}|\). This non-linear equation is the Gross–Pitaesvkii equation described in the previous section and describes the evolution of a wavefunction or a field. We can use this equation to analyze the properties of this system, analytically and numerically.

figure r

We can also rewrite the field theory above as a set of hydrodynamical-like equations, in this long wavelength limit. For that, if we identify (using the theory in the presence of expansion):

$$\begin{aligned} \psi \equiv \sqrt{\frac{\rho }{m}} e^{i\theta }\,, \quad {\mathbf {v}} \equiv \frac{1}{a \, m} \mathbf {\nabla } \theta = \frac{1}{2 i m a} \left( \frac{1}{\psi } \mathbf {\nabla } \psi - \frac{1}{\psi ^{*}} \mathbf {\nabla } \psi ^{*} \right) \,. \end{aligned}$$
(114)

The vorticity of the superfluid is zero and the momentum density has non-zero curl. The comoving equations of motion for \(\psi \) are

$$\begin{aligned}&{\dot{\rho }} + 3H\rho +\frac{1}{a} \mathbf {\nabla } \cdot \left( \rho {\mathbf {v}} \right) = 0\,, \end{aligned}$$
(115)
$$\begin{aligned}&\dot{{\mathbf {v}}}+H{\mathbf {v}}+\frac{1}{a} \left( {\mathbf {v}} \cdot \mathbf {\nabla } \right) {\mathbf {v}} = - \frac{1}{a} \mathbf {\nabla } \varPhi + \frac{\mathbf {\nabla } P_{\mathrm {int}}}{\rho }+ \frac{1}{2a^3 m^2} \mathbf {\nabla } \left( \frac{\nabla ^2 \sqrt{\rho }}{\sqrt{\rho }} \right) \,. \end{aligned}$$
(116)

These set of equations are the Madelung equations, generalized for an expanding universe. The second term in the right-hand side of equation (116) comes from the self-interaction term, where \(P_{\mathrm {int}}\) is the pressure from the interactions. The last term of the second equation is the quantum pressure. This is present even in the absence of interaction and it is going to be important for the effects and formation of the condensate for the FDM model. The quantum pressure has the role of not allowing the FDM to cluster and collapse, which also makes the density of this collapsed region to have a finite value. In this way, this model has naturally a cored profile inside the condensate region, addressing the cusp–core problem. This form of the equations is useful for numerical simulations that can reveal some properties of the DM scalar field. However, as we can see from the quantum pressure term, these equations are not defined for \(\rho = 0\).

The quantum pressure term is present in the ULDM models and it is not present in other candidates of DM. As we discussed above, the behaviour of FDM or SIFDM in each regime depends on a competition between the gravity, the pressure term and the quantum pressure. We can roughly say that then the non-CDM behaviour expected for our models will take place on scales where the quantum pressure term dominates. The sign and size of the interaction might affect this a lot. A naive estimate of this effect is that the scales where this quantum pressure term matters is for scales smaller than the de Broglie wavelength of the particle, \(\lambda < \lambda _{\mathrm {dB}}\). The de Broglie wavelength for a typical MW-like galaxy is given by

$$\begin{aligned} \lambda _{\mathrm {dB}} \simeq 0.2 \, \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{-1} \, \left( \frac{V_{200}}{v} \right) \, \mathrm {kpc}\,, \end{aligned}$$
(117)

where we used the virial velocity (91). For a particle with \(m=10^{-22}\, \mathrm {eV}\), this would mean that the wave-like behaviour of the ULDM particles would be relevant in a MW-like galaxy on scales smaller than \(0.2 \, \mathrm {kpc}\). If we consider dwarf galaxies, for example, where virial velocities are much smaller \(v_{\mathrm {vir}}^{\mathrm {dw}} \sim 10 \, \mathrm {km/s}\), these non-CDM effects would take place on scales of order of their halo size. For scales \(\lambda > \lambda _{\mathrm {dB}}\), the quantum pressure term is not important anymore and the particles behave like free particles, in a CDM-like way.Footnote 30

This scale where the condensate behaviour becomes important is called the coherent length. We are going to show now a more precise determination of this coherent length in both the FDM and SIFDM. And together with this analysis, it is going to be possible to better understand what is going on in the center of halos that drives this non-CDM behaviour.

4.1.4 Description of the condensate

Now that we have our description of the FDM and SIFDM on small scales, we want to understand what takes place inside the halos, where the Schödinger–Poisson equations describe the evolution of the system. We want to describe here what is the picture we have in mind for what happens on those small-scales.

The special feature of the ULDM models is that they present a non-CDM behaviour in galaxies. As we saw in the previous section, the ULDM have a cosmological evolution very close to CDM for large scales. Different than in CDM, the ULDM have a non-zero Jeans length showing that on small-scales this component is going to behave differently than CDM. Therefore, outside galaxies, ULDM behaves like CDM but with a suppressed power spectrum, and inside galaxies, in those homogeneous sub-Jeans scales ULDM can have a non-CDM behaviour.

Inside these homogenous sub-Jeans regions inside galaxies, the ULDM thermalizes and forms gravitationally bound compact objects, called Bose stars or solitons, where a Bose–Einstein condensation or superfluid is formed. This was described in many references in the literature (Lee and Pang 1992; Jetzer 1992; Kolb and Tkachev 1993; Guth et al. 2015; Semikoz and Tkachev 1997; Khlebnikov 2000; Sikivie and Yang 2009; Erken et al. 2012), both in the presence and in the absence of interactions. The coherence length of this condensate sets the region where the wave behaviour of the condensate is important and changes the dynamics. Outside the condensate, on scales larger than the coherent length, the ULDM behaves as particles following the particle description as CDM (decoupled axion following (Sikivie and Yang 2009)), with different initial conditions than in CDM.

There have been many studies of the properties of these condensed gravitationally bounded objects, in particular in the context of axions (Ruffini and Bonazzola 1969a; Chavanis 2011; Barranco and Bernal 2011; Eby et al. 2015, 2016a, b, 2019, 2018; Braaten et al. 2016; Visinelli et al. 2018). In this section, we are going to study the thermalization and formation of these compact objects not only in the context of FDM, but also in the context of the SIFDM. It is interesting to see that the size and phenomena, BEC or superfluidity, described by each of these models can differ a lot in each case, and in the case of the SIFDM it differs with the sign of the self-interaction.

The formation process of this Bose–Einstein condensate by gravitational interaction in the center of halos or in axion miniclusters is shown to take place in our universe, with relaxation times dominated by faster gravitational relaxation time, which is smaller than the age of the universe (Semikoz and Tkachev 1997; Khlebnikov 2000; Sikivie and Yang 2009; Schive et al. 2014a; Levkov et al. 2018; Kirkpatrick et al. 2020). We are going to show here we can describe this formation of the condensate and obtain the relaxation times in the case of the FDM and the SIFDM.

Summarizing, the picture that we have to have in mind is shown in Fig. 10. Inside the halos of galaxies, a condensate core is formed.Footnote 31 These Bose clumps are called solitons or Bose stars. This is also the picture that we have for the DM superfluid model, presented in Sect. 4.2.

Fig. 10
figure 10

Schematic figure of the behaviour of ULDM in galaxies, where a condensate core is expected to form in the inner parts of the galaxy, while DM behaves like normal DM in the outskirts or outside galaxies

As we saw from the description of the ULDM given above, we are describing the theory purely classically. It is valid then to ask if our ULDM can be treated as a classical theory or not, especially since this model has such small masses.

Classicality is an emergent concept that can be applied for system that are composed by a large number of constituents. In this limit, the quantum effects of the theory are suppressed by this macroscopic number of particles, and the theory can be described by a classical theory. In this way, there is a limit where this classical approximation breaks and the quantum effects become important.

Lets first talk about what happens in the halo. When taking the classical limit of a theory, we can think that we decompose our quantum field \({\hat{\psi }}\) into

$$\begin{aligned} {\hat{\psi }} = \psi + \delta {\hat{\psi }}\,, \end{aligned}$$
(118)

where \(\langle {\hat{\psi }} \rangle = \psi \) is the classical field (since if it was quantum this expectation value would be zero) and \(\delta {\hat{\psi }}\) are the quantum corrections on top of the classical field. These quantum correction are suppressed by large occupancy number of the states, and for a coherent state is given by: \(\delta {\hat{\psi }}/ \psi \propto 1/{\mathcal {N}}\). Therefore, if we are in a system with a large number of quantum constituents, the quantum corrections can be neglected and the system can be treated as classical. And this classical field is going to obey classical equations. This is, very roughly since this can be made much more precise mathematically, the definition of a classical field. On top of that, it is a condensate, there is a large occupation number of the ground state, so \({\mathcal {N}}={\mathcal {N}}_0\).

We can then estimate the occupancy number that we have in a halo. If we consider a MW-like galaxy, and we take as an example the axion, then the number density of axions in the galaxy is

$$\begin{aligned} n_{\mathrm {gal}} = \frac{\rho _{\mathrm {gal}}}{m} \approx \frac{\mathrm {GeV/cm}^3}{10^{-5} \, \mathrm {eV}} \,. \end{aligned}$$
(119)

Since we can write the occupation number roughly as \({\mathcal {N}} \sim n \lambda _{\mathrm {dB}}^3\), given the de Broglie wavelength of an axion with mass \(m \sim 10^{-5} \, \mathrm {eV}\), we have that in a galaxy \({\mathcal {N}} \sim 10^{46}\). This shows that in galaxies today, this ultra-light particle has a huge occupancy number and we can consider a classical evolution. We can extend this estimation for earlier times, if we assume the QCD axion, where the potential arises from non-perturbative effects on scales \(\varLambda _{\mathrm {QCD}}\). For those axions, the energy density (assuming they are most or all the DM in the universe) is given by \(n_a \sim \rho _a/m \sim (T_{\mathrm {eq}}/T_{\mathrm {QCD}}) \, \rho _{\mathrm {tot}} \sim T_{\mathrm {eq}} T_{\mathrm {QCD}}^3 /m\). The de Broglie wavelength can be estimated to be at most of the size of the Hubble horizon at that time \(\lambda \sim H_{\mathrm {QCD}}^{-1} \sim T^2_{\mathrm {QCD}}/M_{pl}\). This yields an even larger occupation number \({\mathcal {N}} \sim 10^{61}\).

If you start your theory with a classical scalar field described by classical equations of motion, then this description is valid for almost all the evolution of the universe (early times might require quantum treatment tough), specially the cosmological times we are considering. So it is a good description for the evolution of the ULDMFootnote 32.

As we saw in Sect. 3, in the classical limit, we can describe the properties of the (classical) condensate using a classical theory. And that is what we are going to do here. In the classical point of view, the condensate formed is going to be described by a slowly varying in space, homogeneous and stable field that presents long-range correlation (Guth et al. 2015). In the case of FDM and SIFDM, condensation takes place in the presence of gravity, and of both gravity and self-interactions in the case of SIFDM. So we need to show how this gravitational thermalization takes place. We are going to describe these condensates, showing their size, given by their correlations length, and the condensate solution using linear theory.

SIFDM

We are first going to treat the SIFDM model, which is described by the presence of a self-interaction (Chavanis 2011; Guth et al. 2015; Chavanis 2016, 2018). We specialize here to the two-body interaction, since higher order interaction are usually suppressed in low-energy systems like the one we are interested. But it is easy to generalize this for higher order interactions. From what we saw in the previous section, the theory of a self-interacting condensate describes a superfluid in certain regimes. We are going to see here for which conditions this occurs in the SIFDM model.

We are going to work in the limit where we ignore gravity to investigate the effect of the self interaction in the model in a very similar way as described in Sect. 3.3. The Schrödinger equation that describes model is given by

$$\begin{aligned} i {\dot{\psi }} = -\frac{1}{2m} \nabla ^2 \psi + \frac{g}{8m^2} |\psi |^2 \psi \,. \end{aligned}$$
(120)

We decompose the field into a homogeneous background solution, which represents the condensate, plus a perturbation part: \(\psi ({\mathbf {x}},t)= \psi _{\mathrm {c}}(t) + \delta {\bar{\varPsi }} ({\mathbf {x}},t)\). The condensate part satisfies the Gross–Pitaevskii equation,

$$\begin{aligned} i {\dot{\psi }}_{\mathrm {c}} = \frac{g}{8m^2} |\psi _0|^2 \psi _{\mathrm {c}}\,, \end{aligned}$$
(121)

that has a simple periodic solution \(\psi _{\mathrm {c}}(t) = \psi _0 \, e^{-i \mu _{\mathrm {c}} t}\), where \(|\psi _0|^2 = n_0\) is the number density of particles that fixes the amplitude of \(\psi _0\), and \(\mu _{\mathrm {c}} = g n_0/8m^2\).

The equation describing the evolution of the perturbation, making the field redefinition \(\delta {\bar{\varPsi }} = \psi _{\mathrm {c}} \delta \varPsi \), is

$$\begin{aligned} i \delta \varPsi = -\frac{1}{2m} \nabla ^2 \delta \varPsi + \frac{g n_0}{8m^2} \left( \delta \varPsi + \delta \varPsi ^{*} \right) \,. \end{aligned}$$
(122)

Since \(\delta \varPsi \) is a complex scalar field, we can decompose the field into a real and a imaginary parts, \(\varPsi = A+iB\). We want to determine the dispersion relation of this system, so we write the equation of motion in Fourier space:

$$\begin{aligned} \frac{{\text {d}}}{{\text {d}}t } \begin{pmatrix} A_k \\ B_k \end{pmatrix}= \begin{pmatrix} 0 &{} \frac{k^2}{2m} \\ -\frac{k^2}{2m} - \frac{g n_0}{4m^2} &{} 0 \end{pmatrix} \begin{pmatrix} A_k \\ B_k \end{pmatrix}\,, \end{aligned}$$
(123)

where we call \(\zeta _k=k^2/2m + gn_0/4m^2\). The dispersion relation is given by

$$\begin{aligned} \omega _k^2= \frac{g n_0}{4m^2} \frac{k^2}{2m} +\frac{k^4}{4m^2}\,. \end{aligned}$$
(124)

We can see that for \(\omega ^2_k >0\) we have an oscillatory solution:

$$\begin{aligned} \delta \varPsi _k = Z \left( \omega _k+\zeta _k \right) e^{ i \omega _k t} + Z^{*} \left( \omega _k - \zeta _k \right) e^{- i \omega _k t}\,, \end{aligned}$$
(125)

where Z is an arbitrary complex parameter. When \(\omega _k^2 < 0\), the solution of the equation for \(\delta \varPsi _k\) is given by exponentials,

$$\begin{aligned} \delta \varPsi _k = c_1 \left( \gamma _k - i \zeta _k \right) e^{\gamma _k t} + c_2 \left( \gamma _k + i \zeta _k \right) e^{- \gamma _k t} \,, \end{aligned}$$
(126)

where \(\gamma _k = (k/ \sqrt{2m}) \sqrt{- \zeta _k}\) are the eigenvalues of the matrix, and \(c_1\) and \(c_2\) are constants given by the initial conditions.

The regimes where the dispersion relation is positive or negative are separated by the modes with wavenumber,

$$\begin{aligned} \omega _k^2 = 0 \quad \Longrightarrow \quad k^2_{*}=- \frac{g \, n_0}{2m}\,, \end{aligned}$$
(127)

from where we can also determine the wavelength \(\lambda _{*}=2\pi /k_{*}\) that divides these regimes. This wavelength is proportional to the healing length calculated in the previous sections. However, can see from this scale that what is actually going to determine if we have a stable oscillatory solution or a exponentially growing instability is the sign of the interaction.

For a repulsive interaction, \(g>0\), the homogeneous configuration is always stable, and it is always going to be described by an oscillatory solution, either if \(\lambda \) is bigger or smaller than \(\lambda _{*}\) . This means that for all wavelength we can have a stable solution that can describe a condensate. From the dispersion relation, we can also see that, in the long wavelength regime

$$\begin{aligned} \omega _k \simeq c_s k\,, \end{aligned}$$
(128)

which is the dispersion relation of the phonon, that propagates as a wave, mediating long-range correlation. This means that the SIFDM with a repulsive interaction is a superfluid, with the theory fully described by this propagating phonon. On the other hand, for very small wavelengths, large k, the term \(\omega _k= k^2/2m\) dominates, which is the dispersion relation of a free massive particle, and the system stops exhibiting superfluidity. In the intermediary regime, where we should consider both terms of the dispersion relation, and the theory described by two degrees of freedom the phonon and the massive particle associated with particle creation away from the condensate. The scales that determine what we mean by long-wavelength regime is where \(\lambda \gg \lambda _{*}\) in a way that the linear term dominates the dispersion relation. Since \(\lambda _{*}\) is proportional to the healing length, this condition for the long wavelengths is equivalent to the condition in Sect. 3.3 that the healing length gives us the scale where quantum pressure (QP) can be neglected, which is what we are describing here.

When we have an attractive interaction, given by \(g<0\), we have two regimes of stability. For \(k < k_{*}\) (\(\lambda > \lambda _{*}\)), we have exponential growing solutions, which means that perturbations grow parametrically. Given this instability, the condensate cannot be formed on these scales. For \(k > k_{*}\) (\(\lambda < \lambda _{*}\)), the solution oscillates and is stable, forming a condensate. This stable configuration, however, is different than in the case for repulsive interaction, forming a localized object, with maximum size given by \(\lambda _{*}\). This localized stable solutions are called soliton. Therefore, it makes sense that this is the healing scale is the scale of the stability, since this is the scale below which the interaction “heals” perturbations of the condensate.

figure s

The case of the attractive interaction is not a superfluid even in the stable localized regions, the solitons. The only stable regions are for \(\lambda < \lambda _{*}\) and this is the regime where the linear term is always subdominant in the dispersion relation. The long-wavelength regions in this case are the regions where instability happens and there is no formation of a condensate.

Condensate solution Having studied the stability of the system and determined the regimes where we have stable and unstable solutions, we can now describe the background solution for each case. As we saw in the “condensate solution” part of Sect. 3.3, one possible solution for the weakly interacting BEC is given by the solitons. For repulsive interactions, the condensate solution is a dark soliton, while for a attractive interaction one has a bright soliton.

What is important is that for the repulsive interaction we can create a condensate of any size desired. So the size of the condensate is not limited and it will depend only on the choice of mass and strength of the interaction. For the case of attractive interaction, we can only have localized stable solutions, giving a finite size bright soliton, with maximum size \(\lambda _{*}= 2\pi \sqrt{2m/|g| \, n_0}\). Therefore, we can see that we could only have a soliton, for an attractive interaction, that is relevant on galactic scales if the mass is very small but with a not very small coupling. An interplay between these two parameters need to occurs then if one wants galactic sized \(\lambda _{*}\).

This condition also shows us that the QCD axion is not a good candidate for a ULDM (at least not to represent most of the ULDM, but it can still represent a fraction of DM). For the QCD axion the interaction, given by \(g_{\mathrm {a}} = -\varLambda ^4 / f_{\mathrm {a}}^4\) is negative, and it is extremely small with \(g_{\mathrm {a}} \sim -10^{-48}\), because \(\varLambda \sim 0.1 \, \mathrm {GeV}\), for the typical QCD scale, and \(f_{\mathrm {a}} \sim 10^{11} \, \mathrm {GeV}\), for typical Peccei–Quinn scale. Given that the mass of the axion is approximately \(m \sim 10^{-5} \, \mathrm {eV}\), the soliton length, for \(n_0 \sim n_{\mathrm {gal}}\), is \(\lambda _{\mathrm {s}} \sim 2.8 \times 10^{11} \, \mathrm {km} \sim 9 \times 10^{-6} \, \mathrm {kpc}\). This is much smaller than any galactic scales, which are of the order of tens to hundreds of kpc. The QCD axion, then, produces these small and localized clumps of axions.

Occupancy number evolution -   With those solutions in hand, we can understand how the evolution of the occupancy number for the condensate will behave for each mode. Determining the evolution of the occupation number is very important since having high occupancy number of the ground state of the system st what is actually the definition of a condensate. And this definition is independent of the system described, representing the best way of showing that condensation took place. This is given by \({\mathcal {N}} = |\psi _k|^2 / V\), where \(\psi _k\) is constructed from the exponential and oscillatory solutions described above, with a random phase. The average occupation number evolves as

$$\begin{aligned} \begin{array}{ll} \langle {\mathcal {N}}_k (t) \rangle = \langle {\mathcal {N}}_k (t_i) \rangle \left\{ 1+ \frac{1}{2 \gamma ^2_k} \left( \frac{g n_0}{4m^2} \right) \sinh ^2 \left[ \gamma _k \left( t-t_i \right) \right] \right\} \,, &{} \quad \mathrm {for} \,\, \omega _k^2 < 0 \\ \langle {\mathcal {N}}_k (t) \rangle = \langle {\mathcal {N}}_k (t_i) \rangle \left\{ 1+ \frac{1}{2 \omega ^2_k} \left( \frac{g n_0}{4m^2} \right) \sin ^2 \left[ \omega _k \left( t-t_i \right) \right] \right\} \,, &{} \quad \mathrm {for} \,\, \omega _k^2 > 0 \end{array} \end{aligned}$$
(129)

For \(g > 0\), an repulsive interaction, \(\gamma _k\) is imaginary, with \(\gamma _k = i \omega _k\), so the occupation number oscillates and the oscillations are stable. The ratio \(\langle {\mathcal {N}}_k (t) \rangle / \langle {\mathcal {N}}_k (t_i) \rangle \) which has the largest value is obtained for modes that minimize \(\omega _k\), which are the modes with \(k \rightarrow 0\). These are the longest wavelengths. Since the long wavelength dominates, this means that long-range correlation is present, and we can have a long range condensate.

For \(g < 0\), the occupation number grows exponentially. The fastest growth is given by the modes \(k=k_{*}\) that maximize \(\gamma _k\). So the modes \(k > k_{*}\) or \(\lambda < \lambda _{*}\), where \(*\) denotes the characteristic scale where instability sets in, will dominate and the stable configuration of the system will be localized clumps. The size of these clumps will be given by the mass and interaction of the model.

FDM: only gravity

We are now going to describe a model without interaction, where the ultra-light particles are under the influence of the gravitational potential. We can think that gravitational potential has the same effect as an attractive interaction, in a way that quantum pressure has to counteract the gravitational collapse. This gives a good picture to what to expect, from the knowledge obtained for the SIFDM. However, one needs to remember that the FDM model is described by a non-interacting theory, which means that it condenses into a BEC, but does not exhibit superfluidity in any regime.

The Schrödinger equation for DM in a gravitational potential, in the absence of interaction is

$$\begin{aligned} i {\dot{\psi }} = -\frac{1}{2m} \nabla ^2 \psi + m\varPhi \, \psi \,, \end{aligned}$$
(130)

which is coupled to the Poisson equation:

$$\begin{aligned} \nabla ^2 \varPhi = 4 \pi G \left( m |\psi |^2 - {\bar{\rho }} \right) \,, \end{aligned}$$
(131)

where the average background density, \({\bar{\rho }}\), was subtracted. Expanding the field as done previously \(\psi ({\mathbf {x}},t)= \psi _{\mathrm {c}}(t) + \delta \varPsi ({\mathbf {x}},t)\), the equation for the condensate is trivial and \(\psi _{\mathrm {c}} = \psi _0 = \mathrm {const.}\). For the fluctuations, we can write the linearized systems of equations that govern the evolution of the perturbation:

$$\begin{aligned}&i \dot{\delta \varPsi } = - \frac{1}{2m } \nabla ^2 \delta \varPsi + m \varPhi \,, \end{aligned}$$
(132)
$$\begin{aligned}&\nabla ^2 \varPhi = 4\pi G \, m \, n_0 \left( \delta \varPsi + \delta \varPsi ^{*} \right) \,. \end{aligned}$$
(133)

These can be combined into the equation:

$$\begin{aligned} i \dot{\delta \varPsi } = - \frac{1}{2m} \nabla ^2 \delta \varPsi + 4\pi G m^2 n_0 \nabla ^{-2} \left( \delta \varPsi + \delta \varPsi ^{*} \right) \,. \end{aligned}$$
(134)

One can notice that the equation above is very similar to the equation we had for the interacting case (122) for an attractive interaction. With that, we expect that there is an instability for long wavelengths, and that the condensate stable solution is only given for a finite region, forming a localized core. To determine, this lets take the Fourier transform of the fields. Like in the interacting case, the instability is divided by the regimes where the dispersion relation is smaller or bigger than zero. For the parameters of the FDM, we can determine the wavenumber that separates the regimes as

$$\begin{aligned} {\tilde{\omega }}_k^2 = 0 \,, \quad \Longrightarrow \quad k_{\mathrm {J}} = \left( 16 \pi G m^3 n_0 \right) ^{1/4}\,. \end{aligned}$$
(135)

This scale is the Jeans scale and it separates the regimes where gravity dominates and collapse happens (\(k < k_{\mathrm {J}}\)), and the regime where the quantum pressure dominates and the solution is stable and oscillates (\(k > k_{\mathrm {J}}\)). In this regime, we can have a condensate. The quantum pressure term counteracts the gravitational attraction and any attempt to localize the particle is accompanied by an increase in energy. So stability below the Jeans scale arises because of the uncertainty principle.

figure t

We can estimate the size of the coherent condensate core. Rewriting the Jeans wavelength as

$$\begin{aligned} \lambda _{\mathrm {J}}= & {} \frac{2\pi }{k_{\mathrm {J}}} = \frac{\pi ^{3/4}}{2} \left( G \rho \right) ^{-1/4} m^{-1/2} \nonumber \\= & {} 94.5 \left( \frac{m}{10^{-22} \mathrm {eV}} \right) ^{-1/2} \left( \frac{\rho }{\rho _{\mathrm {crit}}} \right) ^{-1/4} \left( \frac{\varOmega _{\mathrm {m}} h^2}{0.12} \right) ^{-1/4} \, \mathrm {kpc}\,, \end{aligned}$$
(136)

where \(\rho _{\mathrm {crit}}\) is the critical density. For fuzzy DM in an overdense region \(\rho = 10^6 \, \rho _{\mathrm {crit}}\),

$$\begin{aligned} \text {Fuzzy DM:} \,\, m \sim 10^{-22} \, \mathrm {eV} \quad \longrightarrow \quad \lambda _J^{FDM} \sim 3 \, \mathrm {kpc}\,, \end{aligned}$$
(137)

forming a condensate that is of the order of the scales of the halo of galaxies. As we can see, for a MW-like galaxy, this core formed is smaller than the halo. So we expect that in the outskirts of the halo, the DM is not going to be condensed and is going to behave like normal matter, with the profile following the NFW profile. For the QCD axion, if we assume that we can have an axion without interaction, \(\lambda _{\mathrm {J}} \sim 1.7 \times 10^{-7} \, \mathrm {kpc} \), which is a very small-scale in comparison to galaxies. This stable bound system is called a Bose star.

Adding the expansion of the universe, we can see that the system of equations becomes

$$\begin{aligned}&\frac{i}{a^{3/2}} \partial _t \left( a^{3/2} \psi \right) = - \frac{1}{2m} \frac{\nabla ^2 \psi }{a^2} + m \varPhi \psi \,, \end{aligned}$$
(138)
$$\begin{aligned}&\nabla ^2 \varPhi = 4\pi G a^2 \left( m |\psi |^2 - {\bar{\rho }} \right) \,. \end{aligned}$$
(139)

Like before, we determine the linearized equations for the perturbations \(\delta \varPsi \) around the coherent homogeneous background that evolves as \(\psi _{\mathrm {c}} \propto a^{-3/2}\). The equation that describes the evolution of the perturbation is

$$\begin{aligned} i \dot{\delta \varPsi _k} = - \frac{k^2}{2ma^2} \delta \varPsi _k - \frac{3}{2} m \varOmega _{\mathrm {a}} \frac{H^2 a^2}{k^2} \left( \delta \varPsi _k + \delta \varPsi ^{*}_k \right) \,, \end{aligned}$$
(140)

where \(\varOmega _{\mathrm {a}} = m n_0/ \rho _{\mathrm {tot}}\) is the density parameter of our FDM particles. The real part of the perturbations obeys the following equation:

$$\begin{aligned} {\ddot{A}}_k + 2H {\dot{A}}_k - \frac{3}{2} \varOmega _{\mathrm {a}} H^2 A_k + \left( \frac{k^2}{2ma^2} \right) ^2 A_k = 0\,. \end{aligned}$$
(141)

All the terms in this equation would also be present from a massive DM component like CDM, except for the last term, which is related to the quantum pressure. The Jeans length in this case is given by

$$\begin{aligned} \frac{k_{\mathrm {J}}}{a} = \left( 6 \varOmega _{\mathrm {a}} \right) ^{1/4} \sqrt{Hm}\,. \end{aligned}$$
(142)

This stability analysis shows us that if we want to construct a ULDM model with the desired feature of having a condensate in galactic scales, for the FDM model we need to have a very small mass of the order \(m \sim 10^{-22} \, \mathrm {eV}\). This mass is within the bounds we had obtained earlier for all the ULDM models. For the FDM model, the only parameter that controls the size of the condensate is the mass. We need then to determine the mass of the FDM particle for which this model can address the small-scale problems, and for which the model is still in accordance with the very precise cosmological observations. This is what we show in the next subsection. We are going to put bounds on this parameter according to the observations in galaxies and on large-scales.

Condensate solution The ground state solution for the FDM is called Bose star, and it is a gravitational bound stable state in 3-dimensions. This can be obtained by minimizing the Hamiltonian that is described by system with gravity alone, at fixed particle number (N) (Guth et al. 2015). In the absence of an exact solution, one can have an ansatz for this solution, in analogy to the ground state of the hydrogen atom, we assume spherical symmetry and have

$$\begin{aligned} \psi _{\mathrm {bs}} (r) = \psi _s e^{-i \mu _s t} = \sqrt{\frac{N\, k^3_{\mathrm {bs}}}{\pi }}\, e^{-k_{BS} r} \, e^{-i\, \mu _{s} t}\,, \end{aligned}$$
(143)

which corresponds to the ground state \(E=-25G^2m^5N^3/512\), \(\mu _s < 0\) and where the characteristic wavenumber of the Bose star is given by

$$\begin{aligned} k_{\mathrm {bs}} = 5 G m^3 N/16 \sim k_{\mathrm {J}}\,, \end{aligned}$$
(144)

given that \(N \sim n_{\mathrm {bs}}/k^3_{\mathrm {bs}}\). Therefore, the Bose star wavelength coincides with the Jeans length that determines the core, the stable region with no gravitational collapse.

However, the study of the Bose star solution needs to be done numerically. We are going to see next how the formation of these Bose stars takes place in kinetic theory. We see the formation of the BEC in these gravitationally bound structures, having the Bose star, and see how this Bose star grows. We are also going to discuss the cores formed by other simulations in Sects. 4.1.6 and 4.3. With those we can determine the profile, mass and size of the core.

In the case of the SIFDM, we saw that what determines the size of the condensate is the mass and of the strength of the interactions. Like for the FDM, to construct a SIFDM model that addresses the small-scale problems and that are allowed by current observations, we need to put bounds on those parameters. This model was much less studied than the FDM model. There are a few works that present some bounds on those parameters (see references in the definition of this class), but with much less constraints than in the FDM case. This is also understandable since here we have an extra parameter in the model. A more complete analysis for the bounds of the SIFDM is going to be presented in a future publication and it is being considered by the authors while this review is being written.

Initial evolution of the condensate

The picture that we have for the ULDM is that in the interior of galaxies it forms a core where the ULDM condenses and forms a Bose–Einstein condensate, in the case of FDM, and a superfluid, in the case of the SIFDM (and of the DM superfluid which we will see in the next section). Previously, we have determined the region where condensation can happen in the halo and calculated the coherence length of the condensate in the case of FDM and SIFDM, and calculating what is the ground state solution of the condensate, the soliton or Bose star, for each case.

We want to study if there is the formation of a BEC in the center of galaxies in the presence of gravitational interactions, on top of the self-interaction in the case of SIFDM. We showed before that the ULDM is described by a classical theory. The reader, then, might be wondering that since we are using a classical theory we cannot fully describe thermalization and the formation of the condensate. Indeed, as we showed in Sect. 3.6, and also in Guth et al. (2015), classical fields follow a Rayleigh–Jeans distribution which present the UV catastrophe, and, thus, cannot describe a condensate. However, Guth et al. (2015) showed that a classical field theory is capable to describe the phase transition that represents the formation of a condensate and the theory has a well defined thermal equilibrium if one introduces a ultra-violet cutoff in the theory. We saw the same conclusion in wave turbulence, in Sect. 3.6, where the four-wave classic kinetic equation can only describe the initial stages of a BEC if a ultra-violet cutoff is present in the theory. Therefore, it is possible to describe the formation of a condensate with a classical theory.

To study the formation of a BEC in the center of galaxies here we use the kinetic equation from wave turbulence. We can use the four wave kinetic equation to describe the initial stages of evolution of a BEC. This formalism can be used in the case of ULDM to show that Bose–Einstein condensation caused by gravitational forces indeed happens in the center of halos (and of axion miniclusters). From this theory, we can also obtain a prediction for the condensation time and obtain the properties of the condensate distribution. This was done in Levkov et al. (2018) that made this analysis for the FDM model, and showed numerically the formation of the condensate and growth of the Bose stars, and in Kirkpatrick et al. (2020) where the role of interaction was also studied. This is what we are going to show now.

We are going to proceed in the same fashion as in Sect. 3.6, but with the system subjected to a gravitational potential. The difference from the previous procedure is that the gravitational interaction is long range, while in Sect. 3.6, we studied only the role of short-range interactions.

In galaxies, we are in the regime of high occupation number, and the classical description is valid. For the system we are considering, \(p^2 \ll 2mT\); therefore, for the period where the evolution is going to be described by this kinetic theory, the UV catastrophe is not a problem, and the classical kinetic description of the system is valid.

In this regime, we can see that this system described by the non-linear Schrödinger equation (113) can be described in wave turbulence by the classical kinetic equation (83) where the non-linear process is the four-wave resonant interaction. In the case of the ULDM subjected to gravitational potential, the kinetic equation is given by

$$\begin{aligned} \frac{\partial f}{\partial t}+\frac{{\mathbf {p}}}{m} \cdot \nabla _{x} f = 2 \, \mathrm {Im} \int {\text {d}}{\mathbf {y}}\, e^{-i\mathbf {p.y}} \left\langle \psi \left( {\mathbf {x}}+\frac{{\mathbf {y}}}{2}\right) \psi ^{*}\left( {\mathbf {x}} -\frac{{\mathbf {y}}}{2}\right) U_{\mathrm {tot}}\left( {\mathbf {x}}+\frac{{\mathbf {y}}}{2}\right) \right\rangle \,, \nonumber \\ \end{aligned}$$
(145)

where f is the Wigner distribution described by (82) and the potential \(U_{\mathrm {tot}}\) is given by

$$\begin{aligned} U_{\mathrm {tot}}({\mathbf {x}})&= U_{\mathrm {G}} + g \, |\psi ({\mathbf {x}})|^2 = 4\pi G m \int {\text {d}}{\mathbf {x}}' \, \varDelta ^{-1}_{{\mathbf {x}}-{\mathbf {x}}'} \left( |\psi ({\mathbf {x}})|^2 - n \right) \nonumber \\&\quad + g\, \int {\text {d}}{\mathbf {x}}' \, \delta ({\mathbf {x}}' - {\mathbf {x}}) \, |\psi ({\mathbf {x}})|^2 \,, \end{aligned}$$
(146)

and \(\varDelta ^{-1}_{{\mathbf {x}}-{\mathbf {x}}'} \) is the Green function coming from the Poisson equation

$$\begin{aligned} \varDelta ^{-1}_{{\mathbf {x}}-{\mathbf {x}}'} = \frac{1}{4\pi |{\mathbf {x}} - {\mathbf {x}}'|}\,. \end{aligned}$$
(147)

These two equations are equivalent in this limit to the Schrödinger–Poisson system that describes the evolution of the ULDM field in galaxies. This is the Landau kinetic equation for the gravitating ensemble of random phase classical waves inside a structure of radius R that in our case of interest is the halo of a galaxy, but it can be an axion minicluster or a periodic box, in the case of simulations.

To obtain the closed form of this kinetic equation we need to make some assumptions, as detailed in Sect. 3.6. In our case, we are going to assume as a initial distribution Gaussian distributed ULDM particles, described by a Gaussian random field \(|{\tilde{\psi }}_{{\mathbf {p}}}|^2 = 8\pi ^{3/2} {\tilde{N}} \, e^{-{\tilde{p}}^2} \) with random phases \(\mathrm {arg} \, {\tilde{\psi }}_{{\mathbf {p}}}\), where \(\psi = {\tilde{\psi }} v_0^2 \sqrt{m/G}\), \(\tilde{{\mathbf {p}}} \equiv {\mathbf {p}}/mv_0\) and \(v_0\) is the initial velocity. This initial configuration is the Fourier transform of \({\tilde{\psi }} (t,\tilde{{\mathbf {x}}})={\tilde{\psi }} (0,\tilde{{\mathbf {x}}})\), with \(\tilde{{\mathbf {x}}} = mv_0 \, {\mathbf {x}}\), which is an isotropic and homogeneous field. This is an uncorrelated field, a field that has minimal coherence length. This type of initial configuration reinforces that there is no seed for condensation in the halo or axion minicluster, and that condensation arises simply by the gravitational interaction or gravitational interaction plus self-interaction. This initial condition is also well motivated from axions formed by the Kibble mechanism (Fairbairn et al. 2018).

Using the Wick theorem, we can write our kinetic equation in closed form for the SIFDM as (Kirkpatrick et al. 2020)

$$\begin{aligned} \frac{\partial f}{\partial t}+\frac{{\mathbf {p}}}{m} \cdot \nabla _{x} f = F_1 + F_2 + I(f)\,, \end{aligned}$$
(148)

where the \(F_1\) and \(F_2\) correspond to the two point correlation functions in the Wick theorem, the first two terms in Eq. (84), and I(f) comes from the connected partFootnote 33, the last term of Eq. (84). The first two terms are given by

$$\begin{aligned} F_1&= 2\, \mathrm {Im} \, \langle U_{\mathrm {tot}}({\mathbf {x}}-i \nabla _{{\mathbf {p}}}/2) \rangle f({\mathbf {x}},{\mathbf {p}}) \,, \end{aligned}$$
(149)
$$\begin{aligned} F_2&= \frac{2}{(2\pi )^6} \, \mathrm {Im} \, \int {\text {d}}{\mathbf {q}} {\text {d}}{\mathbf {q}}' {\text {d}}{\mathbf {y}} {\text {d}}{\mathbf {y}}' \, \left[ 4\pi G m^2 \, \varDelta ^{-1}_{{\mathbf {y}}} + g\, \delta ({\mathbf {y}}) \right] \, e^{i \left( {\mathbf {y}}'\cdot {\mathbf {q}} - {\mathbf {y}} \cdot {\mathbf {q}}' \right) }\nonumber \\&\quad f({\mathbf {x}}+\frac{{\mathbf {y}}}{2},{\mathbf {p}}+{\mathbf {q}} ) \, f({\mathbf {x}}+\frac{{\mathbf {y}}'}{2},{\mathbf {p}}+{\mathbf {q}}' )\,. \end{aligned}$$
(150)

They are T-odd terms. They are order one in the interactions \({\mathcal {O}}(g)\) and \({\mathcal {O}}(G)\). The potential obeys: \(\varDelta \langle U_{\mathbf {tot}} \rangle ({\mathbf {x}})=(4 \pi G m)/(2 \pi )^3 \int {\text {d}}{\mathbf {p}} \, (f({\mathbf {x}},{\mathbf {p}}) - n )\).

We can separate the contribution in I(f) by coming from the the self-interaction and from the gravitational interaction \(I(f)=I_g (f) + I_{\mathrm {G}}(f)\), respectively, where

$$\begin{aligned} I_g = 2 \, \mathrm {Im}\, \int {\text {d}}{\mathbf {y}} \, e^{-i{\mathbf {p}} \cdot {\mathbf {y}}} \, g \langle \psi _{+} \psi ^{*}_{-} \psi _{+} \psi ^{*}_{+} \rangle _{\mathrm {conn}}\,, \quad I_{\mathrm {G}}= 2 \, \mathrm {Im}\, \int {\text {d}}{\mathbf {y}} \, e^{-i{\mathbf {p}} \cdot {\mathbf {y}}} \, \langle \psi _{+} \psi ^{*}_{-} U_{\mathrm {G}} \rangle \,. \end{aligned}$$
(151)

Here we simplified the notation by writing \(\psi _{\pm } = \psi ({\mathbf {x}} \pm {\mathbf {y}}/2)\). These terms are second order in the interactions. Both of those interactions act in very different scales. The self-interaction is short range and it was the case we studied in Sect. 3.6. The case of the gravitational interaction needs to be treated carefully since the gravitational interaction is long range. Here we follow Levkov et al. (2018) where the treatment from Landau for long-range Coulomb interactions was used for gravity.

For the initial Gaussian distribution, \(F_1\), \(F_2\) and I(f) vanish, and the ULDM distribution is initially static.

We are interested in describing kinetic relaxation of ULDM in the halo of a galaxy (or in an axion minicluster), which is the condensation, and obtain the relaxation time of this process. As we saw before, inside the halo \(y = \varDelta x \ll R\), the field and the potential are homogeneous. In the center of a homogeneous spherically symmetric halo, where the condensate is formed (or a homogeneous box in the case of simulations), the terms \(F_1\) and \(F_2\) vanish and we do not need to take them into account when studying these initial stages of the condensate, where the four-wave kinetic equation is valid. Therefore, in our case

$$\begin{aligned} \frac{\partial f}{\partial t}+\frac{{\mathbf {p}}}{m} \cdot \nabla _{x} f \approx I(f)\,. \end{aligned}$$
(152)

Lets first treat the gravitational part. To estimate \(I_{\mathrm {G}}(f)\), we can follow Landau’s treatment. In the regime where the above equation is valid \(\varDelta x \ll R\), and in the kinetic regime \((mv)^{-1} \ll R\), we can expand the potential as \(U_{\mathrm {G}}({\mathbf {x}}+{\mathbf {y}}/2)=U_{\mathrm {G}}({\mathbf {x}})+ ({\mathbf {y}}/2) \cdot \nabla U_{\mathrm {G}}({\mathbf {x}})\). With that the gravitational scattering integral we were calculating can be described by a diffusion process in phase space: \(I_{\mathrm {G}} = - \nabla _{{\mathbf {p}}} \cdot {\mathbf {s}}\), where the Landau flux \({\mathbf {s}}\) is

$$\begin{aligned} {\mathbf {s}}= & {} \frac{1}{(2\pi )^3} \int {\text {d}}{\mathbf {x}}' {\text {d}}{\mathbf {p}}' \, {\mathcal {F}}^{{\mathbf {x}}' {\mathbf {p}}'}_{{\mathbf {x}} {\mathbf {p}}} \, \nabla _{{\mathbf {x}}}(4\pi G m^2 \varDelta ^{-1}_{{\mathbf {x}}-{\mathbf {x}}'})\,, \nonumber \\ {\mathcal {F}}^{{\mathbf {x}}' {\mathbf {p}}'}_{{\mathbf {x}} {\mathbf {p}}}= & {} \int {\text {d}}{\mathbf {y}} {\text {d}}{\mathbf {y}}' \, e^{-i({\mathbf {p}}\cdot {\mathbf {y}}+{\mathbf {p}}'{\mathbf {y}}')} \langle \psi _{+} \psi ^{*}_{-} \psi _{+}' \psi _{+}^{*\prime } \rangle _{\mathrm {conn}}\,. \end{aligned}$$
(153)

Plugging this new \(I_{\mathrm {G}}\) in the kinetic equation, we can rewrite it as a the equation for the evolution of the four-point function \({\mathcal {F}}\), which involves a six-point correlation function. To be able to write this equation in closed form and solve it, one uses Wick theorem. Ignoring the connected part in the limit of small separations \(y \ll R\), the Landau flux is then \(s_i=\int {\text {d}}{\mathbf {p}}' \, \varPi _{ij}({\mathbf {u}}) (f'^2 \partial _{p_j} f - f^2 \partial _{p_j'} f' )\), where \({\mathbf {u}}= ({\mathbf {p}}'- {\mathbf {p}})/m\) and \(f'=f({\mathbf {x}}', {\mathbf {p}}')\). The term \(\varPi _{ij}({\mathbf {u}})\) is given by the integral

$$\begin{aligned} \varPi _{ij}({\mathbf {u}}) = \int {\text {d}}t'{\text {d}}{\mathbf {y}} \, \partial _i {\bar{\varDelta }}^{-1}_{{\mathbf {y}}} \, \partial _j \left[ {\bar{\varDelta }}^{-1}_{{\mathbf {y}}+{\mathbf {u}}t'} - g \, \delta ({\mathbf {y}}+{\mathbf {u}}t')\right] \,, \end{aligned}$$
(154)

where \(\partial _i {\bar{\varDelta }}^{-1}_{{\mathbf {y}}} = 4\pi G m^2 \varDelta ^{-1}_{{\mathbf {y}}}\). Then this Landau flux defines \(I_{\mathrm {G}}\) through \(I_{\mathrm {G}} = - \nabla _{{\mathbf {p}}} \cdot {\mathbf {s}}\). However, to take the above integral one needs to introduce a long-time and a short-time cutoffs, given the logarithmic divergence of the Poisson Green’s function. Therefore, the integral is taken in the interval \((mv^2)^-1 \ll t' \ll R/v\), which is the time that correspond to the regime where relaxation can happen since it can only happen inside the halo \(y \ll R\) where the field is homogeneous, giving the upper limit, and for distances bigger than the de Broglie wavelength \(y \gg (mv)^{-1}\), since diffusion is only sensitive to fluctuations at long distance (Kirkpatrick et al. 2020). The fact that diffusion is not sensitive to short distance scales already tells us that the self-interaction contribution be sub-dominant in the relaxation process. So the condensation in the halo happens much faster because of the gravitational interactions. In this time range, the dominant contribution of this integral yields \(\varPi _{ij} \approx \varLambda (u^2 \delta _{ij} - u_i u_j)/u^3\), where \(\varLambda \equiv \log (mvR) \) is the Coulomb logarithm. We can compute the relaxation time due to gravity with this, which yields \(I_{\mathrm {G}} \sim f/\tau _{\mathrm {G}}\).

The \(I_g\) contribution is going to be sub-dominant, but we can still evaluate their contribution to the condensate. Since the self-interaction is short range, we do not need to resort to Landau’s treatment. We can just set \(U_{\mathrm {tot}} = g |\psi |^2\), and solve Eq. (113). Considering again that we have a homogeneous distribution inside the halo and ignoring the connected part of the Wick theorem since we want the result in leading order of g, then the kinetic equation can be simplified to \({\text {d}}^2f/{\text {d}}t^2 \sim 8g^2 \langle n \rangle f\), which tells us that the relaxation rate from the self-interactions is \({\text {d}}I_g/{\text {d}}t \sim 8g^2 n^2 f = \gamma f\), where \({\text {d}}/{\text {d}}t = \partial _t + {\mathbf {p}} \cdot \nabla _{{\mathbf {x}}} /m\). This relaxation rate does not grow with time in this limit, but when the connected correlations start to become relevant the relaxation rate starts varying with time. The relaxation time from self-interactions is then given by \(\tau _g =1/\sqrt{\gamma }\).

With that the relaxation time for this weakly interacting Bose gas subjected to gravity in a halo is given by

$$\begin{aligned} \tau _{\mathrm {G}} = \frac{\sqrt{2} m v^6}{12 \pi ^3 G^2 n^2 \varLambda }\,, \quad \tau _g =\frac{1}{ \sqrt{8} \, |g| n } \,. \end{aligned}$$
(155)

This gives the condensation time. The relaxation time for the gravitational interaction is inversely proportional to \(G^2\) (and the cutoff scale), while the self-interaction one is inversely proportional to g. The gravitational relaxation time is much faster than the self-interaction one. The total condensation time is \(\tau _{\mathrm {tot}} \sim 2 \tau _{\mathrm {G}} \tau _g/(\tau _g + \sqrt{\tau _g^2 + 4 \tau ^2_{\mathrm {G}}}) \rightarrow \tau _{\mathrm {G}} \tau _g/(\tau _{\mathrm {G}} + \tau _g) \rightarrow \tau _{\mathrm {G}}\), is dominated by the gravitational one. Kirkpatrick et al. (2020) estimated this to be of the order \(\tau _g/\tau _{\mathrm {G}} \sim 10^5\) for the QCD axion. Therefore, the formation of Bose stars mainly happens due to gravity, even in the presence of the self-interactions. However, self-interaction is important as we saw above that the presence of the self-interactions lead to a different phenomenology and size of the soliton core. Notice that thermalization from gravitational interactions showed here is different then the one arising from short-range interactions through power-law turbulent cascades (Semikoz and Tkachev 1997), arising in this system through a diffusion process.

In the virialized halo, we can see that the formation of the Bose star, for the FDM and SIFDM models, can happen in the universe with formation time smaller than the age of the universe. For example, in dwarf galaxies, the condensation times for a FDM particle of mass \(m_{22}\) would take \(\sim 10^6\, \mathrm {yr}\).

This formalism shows us that we can describe the first stages of the formation of a BEC in the halo of galaxies using kinetic theory. This approximate description of the BEC can help us estimate the condensation time and study the properties of the Bose star. This formalism is particularly useful for numerical simulations which were performed in Levkov et al. (2018) for the FDM model. The formation of the condensate can be seen numerically with no seed condensate in the simulation. It is seen that the initial evolution of the condensate also follows Eq. (148) for \(t<\tau _{\mathrm {G}}\), as expected for the wave turbulence with four-wave interaction. The Bose stars grow after formation and the condensate becomes stronger (more particles condensate). The first decade if this growth shows a growth in the mass of the Bose star \(M_s \simeq c v_0 (t/\tau _{\mathrm {G}}-1)^{1/2}\), with \(c=3\pm 0.7\). Only the first decade of the growth was seen both from computational limitation, but also this description does not hold for when the condensate starts to get stronger, and the description with four-wave interaction is expected to break.

That study shows us that we can indeed have the formation of a Bose–Einstein condensate of the ULDM particles inside the galaxies, forming Bose stars and solitons. This shows that the picture of having ULDM forming condensed cores inside galaxies is indeed valid.

4.1.5 Discussion

Before going further and describing the observational consequences of the FDM and SIFDM, we are going to briefly discuss the picture presented here and some different interpretations regarding condensation that are present in the literature.

The picture that we have showed until now for the behaviour of the ULDM is the following. ULDM is described by a classical theory that gives its cosmological evolution and its non-relativistic evolution in galaxies. Inside the halos of galaxies, the ULDM thermalizes and forms gravitationally bound systems, the Bose stars or soliton, where a BEC or a superfluid is formed. This condensed core is smaller than the size of the galaxy and the coherent length gives the region where the wave-like behaviour of the condensate is manifested, being of the order of the de Broglie wavelength in the case of the FDM. The formation of these objects through gravitational interactions can occur inside the halo of galaxies, without any condensate seed, in a time smaller than the age of the universe, as shown from kinetic theory.

Therefore, in this picture, after the halo of the galaxy forms and virializes, thermalization in the inner regions of the halo happens and a condensate core is formed. This coherent length of the condensate, which changes if we are in the FDM or SIFDM cases, sets the size of the core that is smaller than the radius of the galaxy. This condensed core in the inner region of the galaxy is surrounded by a shell where DM behaves like a free particle (since \(\lambda _{\mathrm {dB}} \ll d\)) instead of a wave like it is inside the core. Outside the cores, in the outskirts of the halo, DM follows the profile predicted by CDM. A condensate core can also be formed in the center of a more massive system like a cluster. However, this condensate is very small, smaller than the ones in galaxies, in comparison with the size of the cluster. Therefore, in the picture presented here, a BEC is formed in the interior of galaxies due to the gravitational interactions.

However, many authors in the literature there is a different view. For some authors, coherence of the ULDM is established initially, in the initial stages of formation of the ULDM. This is the case for axions, as pointed out in Sect. 4.7 of Marsh (2016a) (where a very good discussion of this topic is presented). The cosmic axion field is described as a classical coherently oscillating scalar field. This classical field \(\phi \) comes from the expectation value of the quantum axion field, which can be represented in the Heisenberg picture as \(\langle \phi | {\hat{\phi }}(x) | \phi \rangle = \phi (x)\). The state of the axion field \({\phi }\) is going to be coherent state (Zee 2003; Itzykson and Zuber 1980), with this coherence established initially. For the population of axions that is produced through vacuum misalignment, in the case where the Peccei–Quinn symmetry is broken during inflation, the inflationary evolution is responsible to making the axion coherent within the Hubble radius. For the axion population that forms via the decay of topological defects, coherence of the field is obtained via thermalization after formation with rate of the order of Hubble rate \(H_0\), as presented in Marsh (2016a).

Therefore, as it is claimed in Marsh (2016a), the axions coming from any of the populations can be described cosmologically as a classical coherently oscillating scalar field. This classical description of the axion is said to hold throughout the evolution of the universe. This is discussed in Dvali and Zell (2018) where the validity of this mean field approximation is studied. They wanted to determine what is the quantum break time, which is defined as when quantum effects become important again for the description of the axion and the classical description of the axions is not valid anymore. The quantum breaking time is determined by the rate of axion scattering, and it is found that it exceeds the age of the universe for the QCD axion, but this would also be true for an axion with a smaller mass. Therefore, they conclude that it is safe to treat the axion as a classical coherently oscillating scalar field if coherence was established initially.Footnote 34

This classical description also holds inside the halos of galaxies. This has been shown in Allali and Hertzberg (2020) for a ultra-light fields that behave like DM. The transition from a quantum to a classical system is called decoherenceFootnote 35. This is an important question since decoherence is known to occur very fast in macroscopic systems due to interactions of the system with the environment. However, if we have DM, where interactions with the environment are known to be very weak interacting mainly gravitationally, decoherence might proceed less efficiently. In this reference, they study decoherence of a ULDM, including axions, in the halo (ULDM overdensity) with its environment (the diffuse hydrogen in galaxies in this study). They find that ULDM in the halo can be treated as classical inside galaxies since decoherence would take place very fast, with a decoherence rate of the order of the Hubble rate \(H_0\) for ULDM with \(m \sim 10^{-7} \, \mathrm {eV}\). The same result holds for a BEC in the halo of galaxies. If we have a Bose star that formed in the halo, the decoherence time is very fast for any ULDM that has \(m < \, \mathrm {eV}\), so the condensate, after formed, can be treated as classical and described by classical equations like the Schrödinger–Poisson system of equations.

With that, the picture that some authors have is that since coherence is established from initial conditions, and this classical picture can be maintained throughout the evolution of the universe, then when the Jeans length stable regions are formed, the axion in this region is already a coherent field. Thus, inside these regions in the interior of galaxies, coherence was established by initial conditions, so no thermalization is necessary to happen inside the halo.

However, the coherence of this classical field could be broken during the evolution of the universe by local processes. This can come from many out of equilibrium processes that occur in the universe. One example of this is the formation of halos since virialization happens through violent relaxation. In this case, the DM particles scatter on small fluctuations of the gravitational field, and coherence can be lost. The system can then be describe by an ensemble of classical incoherent waves, with very small coherence length. Also, as pointed out in Dvali and Zell (2018), even the classicality of the axion could be challenged in small overdense regions and quantum breaking could occur in these regions.

With all that, we can see that either having coherence from initial conditions or not, the ULDM field has to either thermalize or re-thermalize on galactic scales. We showed above that thermalization of ULDM can take place in the halos of galaxies in the presence of gravity or gravity and self-interaction, without any need of previous coherence of the field. And in these regions, it was shown that a BEC is going to be formed. And after its formation from the results from Allali and Hertzberg (2020) we can treat the evolution of the Bose star as classical. This picture is valid for the FDM, SIFDM and for the DM superfluid. So, for the phenomenology of ULDM in galaxies, if the field was already a coherent field or not, does not alter the conclusions we present in this review since this coherence can be reached at late times in the halo.

We just want to emphasize one last thing. Bose–Einstein condensation is a quantum mechanical phenomena. It can only arise because of quantum mechanics. The definition of a condensate is the one given in Sect. 3.7. A BEC can be described classically in the classical limit of the many-body theory or the field theory description of a condensate, or by an approximate classical theory as an ensemble of waves like wave turbulence (where we can make a parallel with the definition of a condensate as particles in the ground state which in the language of waves translates is waves with \(k=0\)).

This discussion about thermalization or re-thermalization, classical or not, in the context of ULDM has been presented in many places in the literature of ULDM with many different point of views and interpretations. For the interested reader, some of these discussions are present in the following references (Sikivie and Yang 2009; Erken et al. 2012; Marsh 2016a; Guth et al. 2015; Castellanos et al. 2014; Davidson and Elmer 2013; Davidson 2015). We hope we have presented here a clear view of what happens in the interior of halos and how this can be interpreted, unifying some aspects of these interpretations.

4.1.6 Cosmological and astrophysical consequences of the FDM

Now that we have a description of what happens cosmologically and inside the halos of galaxies for these models, we are going to discuss the rich phenomenology that these models exhibit. We are going to study their predictions and discuss their cosmological and astrophysical consequences. These models behave differently than CDM in two ways. They present a CDM like behaviour on large-scales, with modified initial conditions, and inside the halos of galaxies, they form a core in the inner part of the halo where a non-CDM behaviour is described, while a NFW behaviour is present in the outskirts of the halo, outside the core (see Fig. 10).

We need to study the predictions coming from these modified initial conditions, and from the presence of these cores and their non-CDM behaviour. We can think about three main groups of consequences of these models. First is the suppression of the small-scale structure, that is going to affect many observables both cosmological and astrophysical; the second is related to the presence of the cores in the center of galaxies; and the third is related to dynamical effects that arise from the BEC or superfluid formed in the central regions of galaxies. We are going to describe these predictions in this section. We are going to focus mainly in the FDM in this section. Each of those effects can be probed with different observables, which can lead to bounds in the parameters of the models. We are going to talk about how these effects are measured and the constraints obtained in Sect. 5. For other observational consequences of FDM, see Hui et al. (2017).

A small comment. Although we showed above, we call solitons the ground state of the SIFDM, and Bose stars the ground state of the FDM, in the literature the term solitons is used for both. Therefore, from this point, we will use the term soliton to also describe the ground state of the FDM.

Suppression of structure formation

One of the effects coming from the FDM and SIFDM classes of models is the suppression of small-scale structure. This is a consequence of the fact that these models present sizable Jeans scale which cuts off the structure formation for wavelengths smaller than \(\lambda _{\mathrm {J}}\). The Jeans length for the FDM model is given by Eq. (136). Therefore, as we saw in the previous section, for modes that are larger than the Jeans length \(\lambda >\lambda _{\mathrm {J}}\), gravitational instability takes place and structure formation can happen, while modes smaller than the Jeans length \(\lambda <\lambda _{\mathrm {J}}\) have oscillatory solution and no structure formation takes place. So structure formation suffers a cutoff on scales of the order of the Jeans length. The same happens for the SIFDM model, but the scale of this cutoff is different, analogous to the Jeans length, or the healing length, given by Eq. (127).

We can quantify this suppression by computing the power spectrum for the ULDM models. We are working in linear perturbation theory, as shown above, and we can evaluate the linear suppression of the power spectrum. In analogy to that done for the WDM model, the modifications of the power spectrum with respect to the \(\varLambda \)CDM power spectrum are encoded in a transfer function \(T_{\mathrm {FDM}}(k,z)\). We can relate the power spectra as (Hu et al. 2000b; Marsh 2016a)

$$\begin{aligned} P_{\mathrm {FDM}} (k,z) \!=\! T^2_{\mathrm {FDM}}(k,z) \, P_{\varLambda \mathrm {CDM}} (k,z) = T^2_{\mathrm {FDM}}(k,z) \, \left( \frac{D(z)}{D(0)} \right) ^2 \, P_{\varLambda \mathrm {CDM}} (k) \,,\qquad \end{aligned}$$
(156)

where \( P_{\varLambda \mathrm {CDM}} (k)\) is the power spectrum of \(\varLambda \)CDM at \(z=0\) which in turn is the primordial power spectrum transformed by an appropriate transfer functions as defined in Bardeen et al. (1986); Eisenstein and Hu (1998); and D(z) is the growth factor given by (Peebles 1993)

$$\begin{aligned} D(z) = \frac{5 \varOmega _m}{2H(z)} \int _{0}^{a(z)} \frac{{\text {d}}a'}{(a'H(a')/H_0)^3}\,. \end{aligned}$$
(157)

The FDM transfer function is given by

$$\begin{aligned} T_{\mathrm {FDM}} = \frac{\cos (x^3_{\mathrm {J} (k)})}{1+x^8_{\mathrm {J}}(k)}\,, \end{aligned}$$
(158)

where

$$\begin{aligned} x_{\mathrm {J}} (k) = 1.61 \, \left( \frac{m}{10^{-22}\, \mathrm {eV}} \right) ^{1/18} \left( \frac{k}{k_{\mathrm {J,eq}}} \right) \,, \quad k_{\mathrm {J,eq}} = 9 \, \left( \frac{m}{10^{-22}\, \mathrm {eV}} \right) ^{1/2} \mathrm {Mpc}^{-1}\,.\nonumber \\ \end{aligned}$$
(159)

The scale \(k_{\mathrm {J,eq}}\) is the Jeans length at matter-radiation equality, the time when perturbations start to grow.

This transfer function presents a fast decay at \(k=k_{\mathrm {J,eq}}\), which leads to a suppression of the power spectrum on those scales. The mode where the power spectrum decays to half of its value \(T(k_{1/2})=1/2\) is the half mode given by Li et al. (2019) \(k_{1/2} = 5.1 \, (m/m_{22})^{4/9} \, \mathrm {Mpc}^{-1}\), where \(m_{22} = 10^{-22}\, \mathrm {eV}\). If \(k_{1/2} < k_{nl} \sim 0.1\) where \(k_{nl}\) is the non-linear scale which is around \(1\, \mathrm {Mpc}^{-1}\), the suppression of the power spectrum can be probed by linear CMB and LSS observables. Otherwise, we need non-linear observables. Therefore, roughly speaking, we can probe the suppression on the power spectrum on linear scales for FDM with masses \(m \lesssim 10^{-23}\, \mathrm {eV}\). One interesting fact is that the \(k_{1/2}\) of the FDM is the same as the one for WDM, although they are different functions of the wavenumber.

Beyond linear observables, this suppression of the power spectrum on small-scales induces a suppression in the formation of FDM halos. This can be estimated by calculating the linear half-radius (\(R_{1/2,\mathrm {lin}}\)) which is the radius where half of the mass of the spherically symmetric system is contained. Then for \(R_{1/2,\mathrm {lin}} \sim \lambda _{1/2}/2\), where \(\lambda _{1/2}=2\pi /k_{1/2}\), we obtain the mass of the smallest halos that can be formed in this theory (Bullock and Boylan-Kolchin 2017; Niemeyer 2019)

$$\begin{aligned} M_{\mathrm {lin}} = \frac{4\pi }{3} \, R_{1/2,\mathrm {lin}} \langle \rho _{\mathrm {FDM}} \rangle = 4\times 10^{10} \, M_{\odot } \, \left( \frac{m}{10^{-22}\, \mathrm {eV}} \right) ^3 \left( \frac{\varOmega _m}{0.3} \right) \left( \frac{h}{0.7} \right) ^2 \,,\nonumber \\ \end{aligned}$$
(160)

where \( \langle \rho _{\mathrm {FDM}} \rangle = 3/(8\pi G) H_0^2 \, \varOmega _m\). Therefore, FDM predicts a large suppression of halos for \(M < 10^{10}\, M_{\odot }\) if the mass is \(m_{22}\). Below, we are going to see how this can be calculated more specifically given the cores in the halos and see how these predictions help address the small-scale problems.

This suppression of the power spectrum on small-scales also suppresses the formation of galaxies. It is found in simulations that the number of sub-halos in FDM in comparison to CDM is reduced by a factor of \(\sim (3M/M_{1/2})^{2.4}\). This suppression of formation of small galaxies is larger in FDM at higher redshifts, in comparison to CDM. This opens up an important question about FDM being able to produce small-scale structures at early times to be probed by Lyman-\(\alpha \) forest.

Therefore, the linear suppression of the power spectrum can affect both in the linear and non-linear part of the theory. This can lead to the following effects. This suppression can be probed by probing the linear power spectrum by the CMB and the matter power spectrum through LSS surveys. Or even better by observables that probe even lower scales, like Lyman-\(\alpha \) forest and 21 cm from neutral hydrogen. The suppression of small halos also affects the non-linear scales. The substructure of this model is going to be different than in CDM, with a suppression of substructures on small-scales. The linear theory can predict a minimal mass for the structures formed, which can be probed by the population of satellites observed (and can be related to the missing satellites problem, as we will discuss in the next subsection). The substructures can also be probed directly by gravitational sensitive probes like gravitational lensing and streams, which are affected by the substructure present in the halo, and will be affected differently if the DM presents this suppression. We are going to see these observables and the bounds they can put in the models in Sect. 5. The substructures in the FDM model are also going to be changed by details of the presence of the core, which lead to different predictions that can be tested, as we will see below.

The entire numerical calculation of the power spectrum for the FDM model can be done using the software AxionCAMB (Lewis et al. 2000; Hložek et al. 2015) or a modification of software CLASS made in Ureña López and Gonzalez-Morales (2016). For our discussion in the next section, we used power spectra generated by AxionCAMB.

We discuss now the effect of the FDM in two observables that can probe this suppression, together with other phenomenology of the FDM like the change in the rate of expansion: the CMB and the matter power spectrum.

CMB

We want to review the modifications that FDM can cause in the observables of the CMB so we can understand how we can use this observation to probe the mass of the FDM. It is also possible to probe the fraction of FDM in the universe, if this is not assumed to be all the DM in the universe. We are interested in the case where FDM is all the DM in the universe, but we briefly comment here on the case where it is not.

The low mass of the FDM can alter the CMB in many different ways. We are mostly interested here in the effects in the primary CMB and CMB lensing, which are the ones that probe the DM in the ranges of mass we are interested for the ULDM models. There are other CMB observables that can probe other aspects of the microphysics of these models, and of their formation, as we can see in Fig. 2 from Abazajian et al. (2019).

ULDM affects the CMB in two main ways.Footnote 36 The different expansion rate caused by the ULDM models affects the primary CMB, which is the adiabatic, unlensed without secondary effects CMB spectra. The suppression of the power spectrum that leads to a different clustering present in the ULDM models can be seen in the secondary lensing anisotropies.

We assume now that all the DM is given by ULDM. The primary CMB is affected by the expansion rate. Depending on the mass of the axion, their oscillations take place at different redshifts. If \(z_{\mathrm {osc}} \lesssim 1100\), then after recombination, the ultra-light field is behaving like dark energy and has a very different expansion rate than in the case of \(\varLambda \)CDM. In this case, the amount of dark energy in the universe will be much bigger since this will be composed of \(\varLambda \) plus the one from ULDM, affecting the first peak and the Sacks–Wolfe plateau (the plateau around \(\ell \sim 100\) that can be seen in Fig. 11). The amount of the component that behaves as DM will be much smaller in this case, which in turn affects the other peaks. This can be seen in the left panel of Fig. 11. If the oscillations take place at \(z_{\mathrm {osc}} \gtrsim z_{\mathrm {rec}} \approx 1100\), the ultra-light field behaves as DM before recombination, so they can alter the expansion rate during the radiation time. This affects the Silk damping tale, enhancing the higher acoustic peaks because of the reduction in the angular scale of the diffusion distance. This effect can be degenerate with changes in \(N_\mathrm{eff}\). As the mass gets heavier, which indicates that the ULDM behaves like DM since very early in the universe, the ULDM behaves more and more like CDM, and the angular power spectrum is very close to the \(\varLambda \)CDM one.Footnote 37

In this figure, we plot the angular temperature power spectrum of coming from theoretical predictions of the FDM model for different masses, obtained using AxionCAMB. We also plot for comparison the data from the Planck 2018 Cl TT power spectrum (Ade et al. 2016) and the \(\varLambda \)CDM model best fit to this data. We can see that for masses smaller than \(10^{-25}\, \mathrm {eV}\) we can even visually see the deviation of the power spectrum from the data and from the \(\varLambda \)CDM one, with an enhancement of the size of all the peaks, and changing the relation between the second and third peaks, which indicates less DM with respect to baryons. For higher FDM masses, we cannot visually distinguish it from the \(\varLambda \)CDM one. Therefore, the primary CMB can put bounds on the mass of the ULDM according to their modified expansion rate at recombination.

Other effects might arise when the density of the axions is not equal to the total DM energy density. There is a degeneracy between the amount of ULDM, \(\varOmega _{\mathrm {ULDM}}\), and the amount of curvature (or dark energy) and matter. This can be seen in more detail in Marsh (2016a).

The ULDM can also affect the secondary lensing anisotropies. The small-scale suppression of the power spectrum can be seen as a lensing deflection power on scales \(\ell > 1000\). This can be seen in the lensing convergence power spectrum. This effect from the suppression of clustering can also be seen in the matter power spectrum, as we show below. This effect is degenerate to the one coming from massive neutrinos. For smaller mass ULDM, one can use the effects of expansion to break this degeneracy, but this is not the case in the range of masses we are interested for ULDM that has an important effect on small-scales.

This shows that CMB is a powerful observable to probe many aspects of ultra-light particles, specially in the low-mass range. We are going to see in Sect. 5 the constraints obtained by CMB observations in the FDM model and discuss some forecasts.

Fig. 11
figure 11

Left panel: angular power spectrum of the temperature CMB anisotropy for the FDM model for different masses where FDM is considered all the DM in the universe. We compare this with the \(\varLambda \)CDM model best fit to the Planck data (Ade et al. 2016) showed by the gray data points. Right panel: matter power spectrum for the same FDM model as used in the left panel

Matter power spectrum

The matter power spectrum brings information about the matter density contrast in the universe with respect to the scale. The matter power spectrum contains a huge amount of information. Considering the full shape of the power spectrum, we can measured the equality scale (\(k_{\mathrm {eq}}^{-1}\)) which sets and can be inferred by the peak of the matter power spectrum (a bit hard to see in right panel of Fig. 11) and its overall shape. The other feature that is present in the power spectrum are the BAO features, from where we can obtain the BAO frequency and infer the sound horizon at baryon drag (at \(z_{\mathrm {rec}}\)).

The ULDM affect the matter power spectrum both by having a different expansion rate and from the suppression of clustering on small scales. A different expansion rate would alter the BAO, presenting a different sound horizon in the power spectrum. This is more relevant for smaller masses of the FDM particle, when the FDM behaves like dark energy for a longer time in the evolution of the universe.

The other effect which is more dramatic on small-scales is coming from the different clustering that the FDM presents. As we can see in right panel of Fig. 11, for a FDM that represents all the DM in the universe, the matter power spectrum for FDM presents a suppression of the small-scale power spectrum. As we saw, this suppression comes from the Jeans length of the FDM that suppresses the power spectrum via the transfer function. We can see that the smaller the mass is, the effect is more dramatic. For masses that affect the linear scales, this can be easily tested using observations. However, we can see that for the heavier FDM particles, this suppression occurs on the small-scales, where there is not a lot of observational data.

We can use galaxy surveys to probe the biased matter power spectrum. This can give us bounds in the mass of the FDM, as we will see in Sect. 5. However, for heavier masses we can see that the suppression in the power spectrum occur on smaller scales, not probed by galaxy surveys. For that we need new observables, like Lyman-\(\alpha \) which allows us to probe scales of order \(0.5 \, \mathrm {Mpc}/h \lesssim \lambda \lesssim 100 \, \mathrm {Mpc}/h\). This permits to constrain higher masses for the FDM. Another window of observation that allows us to probe the smaller scales is 21 cm from neutral hydrogen, which can gives the matter power spectrum on scale \(k > 10\, \mathrm {Mpc}^{-1}\). We will discuss these observables in Sect. 5.

One important point about using the power spectrum is the issue of the bias. To infer the matter power spectrum one needs to observe biased tracers of the DM distribution. The bias relates each of those tracers with the underlying DM distribution. This bias for each tracer is unknown, and each probe presents a different bias. Therefore, when obtaining constraints on the mass of the FDM, there is a degeneracy with bias. There is also the possibility of a scale dependence in the bias of ULDM. This is still a not so well-studied problem for the ULDM, with a few studies from numerical simulations for the FDM case (Cooray and Sheth 2002; Hložek et al. 2015). One can also obtain the matter power spectrum from shear measurements, coming from gravitational lensing. This are unbiased tracers and can provide measurements of the FDM mass with a complementary approach.

Halo density profile

We saw above that the suppression of the power spectrum in the FDM model leads to a suppression in formation of small mass halos. Only halos with \(M > M_{\mathrm {lin}}\) where \(M_{\mathrm {lin}}\) is given by (160) are formed in the FDM. This suppression can lead to a different number of low mass halos, modifying the halo mass function.

The halo mass function (HMF) describes the density of halos per unit of mass. To determine the HMF of the FDM one needs to resort to either simulations or semi-analytic methods. Usually simulations are performed and the HMF can be fitted, as done in Schive et al. (2016). In this reference they have a simulation of collisionless particles with initial conditions coming from the FDM. The fitted HMF obtained is given by:

$$\begin{aligned} \left( \frac{{\text {d}}n}{{\text {d}}M} \right) _{\mathrm {FDM}} = \left[ 1+\left( \frac{M}{M_0} \right) ^{-1.1} \right] ^{-2.2}\, \left( \frac{{\text {d}}n}{{\text {d}}M} \right) _{\mathrm {CDM}}\,. \end{aligned}$$
(161)

This HMF presents a suppression for low mass halos, characterized by the scale \(M_0 = 1.6 \times 10^{10} \, m_{22}^{-4/3}\). The HMF of CDM depends on the redshift and mass of the halo, while the suppression term in brackets is redshift independent, which is a consequence of the FDM modification only coming from the initial conditions. This means that this also does not take into account the effective sound speed of the FDM, only through the suppression of the initial power spectrum. This HMF is very accurate for higher masses, in agreement with the CDM HMF, but it presents an uncertainty in the low-mass end, showing that this HMF is reliable to show the suppression on those scales, but not so reliable to obtain the slope of the HMF for FDM for low halo masses.

There is also another HMF obtained using different methods. in Marsh and Silk (2014) with the aim of taking into account the scale dependent linear growth from the FDM, they obtain the HMF from a modified Press–Schechter approach. This yields

$$\begin{aligned} \left( \frac{{\text {d}}n}{{\text {d}}M} \right) _{\mathrm {FDM}} = -\frac{\rho _m}{M} \, f(\nu ) \frac{{\text {d}} \ln \sigma ^2}{{\text {d}} \ln M}\,, \end{aligned}$$
(162)

where \(\nu \equiv \delta _c / \sigma \), \(\delta _c\) is the critical collapse overdensity and \(\sigma (M)\) is the variance of the power spectrum. The variance is calculated by smoothing the power spectrum with a spherical top-hat window function. The function \(f(\nu )\) comes from the Sheth–Thormen model (Sheth and Tormen 1999) and it is given by \(f(\nu ) = A (q/ 2\pi )^{1/2} \nu [1+(q\nu )^(-2p)]\, \exp (-q\nu ^2/2)\), with parameters \(A=0.3222\), \(p=0.3\) and \(q=0.707\). The critical overdensity \(\delta _c\) brings more details about the evolution of the FDM. At \(z=0\), it can be described by comparing \(\delta _c^{\mathrm {FDM}} (k) = {\mathcal {G}}(k)\, \delta _c\), where \(\delta _c = \delta _c (z=0,k=k_0)\) and \(k_0=0.002\) is the pivot scale. The scale-dependent function\( {\mathcal {G}}(k)\) is the ratio of the scale-dependent growth factors from \(\varLambda \)CDM and FDM (Marsh and Silk 2014; Bozek et al. 2015) (in Marsh (2016b) a numerical fitting for this function was found). There are many other modifications of this HMF and alternative formulations that can be seen in Sheth and Tormen (1999), Bozek et al. (2015), Du et al. (2017b) and Schneider (2015).

We are now going to see how this prediction can be probed by the luminosity function and reionization.

Luminosity function and reionization

Observations that probe the low-mass end of the halo mass function can be used to test the FDM and put constraints in the mass. A sensitive probes is the luminosity function of galaxies, which can inherit the suppression coming from the halo mass function. Observations at high redshifts of galaxy counts, reionization history can be used to test the suppression of the HMF caused by the FDM.

The luminosity function \(\phi (L)\) is a map between the galaxy luminosity and the dark matter halo. If we want to obtain cumulative galaxy number density so that we can compare with observations, we need to relate the UV magnitude of a galaxy with the mass of the halo. If there are less halos formed at early times, this would lead to less galaxy formation. This can be one characteristic of the FDM model to be tested, but this suppression cannot be too severe otherwise this model could lead to less than expected high-z galaxies than seen in observations, like the ones from the Hubble Ultra Deep Field (Bouwens et al. 2015). This lack of galaxies could also impact the efficiency of the reionization of the intergalactic medium (IGM), which takes place through star formation. This would impact the optical depth of the CMB, which is constrained by CMB observations (Ade et al. 2016).

The relation between the HMF and the luminosity function is usually done by abundance matching (Kravtsov et al. 2004; Vale and Ostriker 2004; Conroy et al. 2006). For that, one needs to assume a mapping between the halo mass and the galaxy luminosity or the UV magnitude of the galaxy. This is done by matching the cumulative UV luminosity function \(\varPhi (L,z)\) for magnitude smaller than \(M_{\mathrm{UV}}\) (which is the same as luminosities higher than \(L_{\mathrm{UV}}\)) given by the integral of \(\phi (L)\) in this interval, with the cumulative HMF for halos masses bigger than a given \(M_h\): \(\varPhi (<M_{\mathrm{UV}},z) = n(>M_h,z)\). That, together with fixing \(\phi (L)\) at low-z with observations, fixes the mass to luminosity (or UV magnitude) of galaxies.

Given the modified HMF presented by the FDM model, we can then predict the luminosity function of this model. In Schive et al. (2016) and Bozek et al. (2015), we can see some examples of the luminosity function for FDM. The low-mass suppression in the HMF leads to a luminosity function that ends at smaller magnitudes. Therefore, depending on the mass of the FDM particle, we can predict a cumulative luminosity function that ends at different magnitudes. This can be used to put constraints in the FDM using, for example, observations of dwarf galaxies or measurements of high-z galaxies like the one from the HUDF. We discuss this in Sect. 5.

Depending on the mass of the FDM particle we can predict less galaxies at high-z, which would alter the reionization history. This can be seen given that the UV luminosity function is related to the flux of ionizing photons (Niemeyer 2019),

$$\begin{aligned} {\mathcal {F}}_{\mathrm {ion}}= f_{\mathrm {esc}} \int \phi _{UV} (L) \, \gamma (L) \, {\text {d}}L\,, \end{aligned}$$
(163)

where \(f_{\mathrm {esc}}\) is the scape fraction, related to the fraction of ionizing photons that escapes the galaxy without being absorbed, and \(\gamma (L)\) the conversion rate, which describes the conversion between the UV luminosity of galaxies to the luminosity of ionizing photon (for more details see Schive et al. 2016 and references therein). With that we can write the Thomson optical depth to CMB:

$$\begin{aligned} \tau = \int _0^z {\text {d}}z'\, \frac{(1+z')^2}{H(z')} Q(z')\, \sigma _T {\bar{n}}_H \, \left( 1+\eta _{\mathrm {He}} \, Y/4X \right) \,, \end{aligned}$$
(164)

where \(\sigma _T\) is the Thomson cross section, \({\bar{n}}\) is the mean comoving hydrogen number density, X is the hydrogen fraction, \(Y=X-1\) is the helium fraction, and \(\eta _{\mathrm {He}}\) is the ionization state of helium.

We can then use this modified reionization history, coming from the modified luminosity function, to constrain the mass of the FDM. One can use, for example, the CMB where \(\tau (z_{\mathrm {rec}})\) is measured.

Sub-halo mass function

As discussed above, the FDM model is characterized by a suppression in the formation of the small structures. This minimal mass of structures formed is going to impact the substructures in the halo. Therefore, a smaller number of sub-halos is present in the FDM in comparison to CDM. This suppression can be seen in the sub-halo mass function, which is given by \({\text {d}}n_{\mathrm {sub}} (m)/{\text {d}}\, \ln M\), where \(n_{\mathrm {sub}}\) is the number of sub-halos and M is the halo mass. This can also be obtained from simulations and semi-analytic calculations (Schive et al. 2016; Corasaniti et al. 2017; Marsh and Silk 2014; Du et al. 2017a), where a fitting form for the FDM sub-halo mass function is obtained (Du 2018; Schutz 2020):

$$\begin{aligned} \left( \frac{{\text {d}}n_{\mathrm {sub}}}{{\text {d}}\ln M}\right) _{\mathrm {FDM}} = f_1(M) + f_2 (M) \left( \frac{{\text {d}}n_{\mathrm {sub}}}{{\text {d}}\ln M}\right) _{\mathrm {CDM}} \,, \end{aligned}$$
(165)

where the functions present in this fitting formula are given by

$$\begin{aligned} f_1(M)= & {} \beta \exp \left[ - \frac{1}{\sigma } \left( \ln \frac{M}{M_1 \times 10^8 \, M_{\odot }} \right) ^2 \right] \,, \nonumber \\ f_2 (M)= & {} \left[ 1+\left( \frac{M}{M_2 \times 10^8 \, M_{\odot }} \right) ^{-\alpha _1} \right] ^{-10/\alpha _1} \,. \end{aligned}$$
(166)

The influence of tidal stripping of cores in the sub-halo mass function is studied in Du et al. (2018), showing that core stripping influences this function for sub-halo masses smaller than \(10^7 \, M_{\odot }\). Including this effect the parameters in (165) are \(\alpha _1=0.72\), \(\sigma =1.4\), \(\beta = 0.014 \, m_{22}^{3/2}\), \(M_1 = 4.7 \,m_{22}^{3/2} \), and \(M_2 = 2.0 \,m_{22}^{1.6} \). For these parameters, this fitting formula agrees with simulations for masses smaller than \(5 \times 10^{-21}\, \mathrm {eV}\).

Fig. 12
figure 12

Comparison between the sub-halo mass function from CDM and FDM for different masses of the FDM particle. In the case of the FDM, we consider that all DM is given by the FDM model

In Fig. 12, we compare the sub-halo mass function for FDM with different masses with the CDM one. Here we used the following CDM sub-halo mass function:

$$\begin{aligned} \left( \frac{{\text {d}}n_{\mathrm {sub}}}{{\text {d}}\ln M}\right) _{\mathrm {CDM}} = a_{\mathrm {CDM}} \, \left( \frac{M}{10^8 \, M_{\odot } } \right) ^{-\alpha _0} \,. \end{aligned}$$
(167)

where \(a_{\mathrm {CDM}} = 113.094\) and \(\alpha _0 =0.86\), which was obtained as a fit to the sub-halos of the Aquarium simulation (Springel et al. 2008). We can see in the figure the suppression of the small-scale structures by the redshift dependent cut in the sub-halos mass function for smaller sub-halo masses, characteristic of the FDM model.

With the sub-halo mass function expected for the FDM model, one can test if the predictions from this model are in accordance with observations.

The central soliton

We have shown in Sect. 4.1.4 that there is the formation of a soliton (or Bose star) in the central parts of the galaxy, which is the ground state solution of the Schrödinger–Poisson system. However, there is no analytical solution for the soliton and we have to obtain this solution numerically. This was done by many authors (Hui et al. 2017; Chavanis 2011; Schive et al. 2014a, b; Kaup 1968; Harrison et al. 2002; Ruffini and Bonazzola 1969a; Guzmán and Ureña-López 2004; Bar-Or et al. 2019), and here we are going to quote the numerical solutions obtained by Schive et al. (2014a, 2014b). Here we will restore the \(\hbar \) factors for clarity.

An interesting characteristic of the Schrödinger–Poisson is that tit has scaling symmetry, which allows to re-scale the quantities of this problem by an arbitrary variable \(\beta \) as \(\left\{ t, x, \varPhi , \phi \right\} \rightarrow \left\{ \beta ^{-2}\,t, \beta ^{-1}\, x, \beta ^{2} \, \varPhi , \beta ^{2}\, \phi \right\} \) (Ji and Sin 1994; Guzmán and Ureña-López 2006) . This is also valid for the SIFDM, although more subtle (Guzmán and Ureña-López 2006), and the interaction term scales as \(g \rightarrow \beta ^{-2}\, g\). The solutions also obey this scaling transformation, and the physical quantities transform as: the energy density of the soliton transforms as \(\rho _s \rightarrow \beta ^4 \rho _s\), the radius as \(r \rightarrow \beta ^{-1} r\), and the mass of the soliton as \(M_s \rightarrow \beta \, M_s\). This means that one can simple re-scale the solution to the equilibrium scale of interested, like the virial scale, if interested in the cores in galaxies, or study the axion star in QCD axion miniclusters (Niemeyer 2019).

By assuming spherical symmetry, we need only to determine one of those physical quantities, like the soliton density, and we can derive the other parameters from it.

The density of the soliton core can be approximated by (Hui et al. 2017): \( \rho _c \simeq 4 \times 10^{-3} (G m^2 /\hbar ^2)^3 M^4 \simeq (3M/4) R^{-3}_{1/2,c}\) where the half-mass radius is \( R_{1/2,c} {\bar{\rho }}_{1/2} \simeq 4 \hbar ^2 / GMm^2\), the radius where the density drops to one half of its valueFootnote 38. This can be re-written as (Schive et al. 2014a, b)

$$\begin{aligned} \rho _c \simeq \frac{ 1.9 \times 10^{-2}}{[1+0.091\, (r/R_{1/2,c})^2]^8} \, \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{-2} \left( \frac{r_c}{\mathrm {kpc}} \right) ^{-4}\, M_{\odot } \, \mathrm {pc}^{-3}\,, \end{aligned}$$
(168)

With the density, we can compute the soliton mass

$$\begin{aligned} M_c = 4\pi \int _0^{\infty } r^2 \, \rho _c(r)\, {\text {d}}r = 2.2 \times 10^8 \, \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{-2} \left( \frac{R_{1/2,c}}{\mathrm {kpc}} \right) ^{-1}\, M_{\odot }\,.\qquad \end{aligned}$$
(169)

Here we are approximating these expression by considering that \(R_{1/2,c}=r_c\) is the radius of the condensate, and the mass of the soliton core is the mass enclosed in the sphere with this radius. We can see that \(95\%\) of the soliton mass is within \(r \le 3r_c\), and this means that (168) is accurate to \(2\%\) for \(0 \le r \le r_c\).

We are interested in studying the soliton cores formed inside the galaxies, so we use the scaling to obtain the solution soliton radius in a virial halo of MW-like mass

$$\begin{aligned} r_c \simeq 0.16 \, \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{-1} \left( \frac{M}{10^{12}\, M_{\odot }} \right) ^{-1/3} \, \mathrm {kpc}\,. \end{aligned}$$
(170)

With that we can see that the FDM soliton core for a FDM with mass \(m = 10^{-22} \, \mathrm {eV}\) is going to be smaller than the halo of the MW, with size \(r_c \sim 0.16\, \mathrm {kpc}\) in the center region of the halo. In the outskirts of the halo, FDM is expected to behave as free particles, like CDM, so the core is enveloped by shell with in NFW-profile:

$$\begin{aligned} \rho _{\mathrm {halo}} \simeq \left\{ \begin{array}{lcl} \rho _c &{} \text{ for } &{} r \le r_c \\ \rho _{\mathrm {NFW}} &{} \text{ for } &{} r > r_c \end{array}\right. \,, \end{aligned}$$
(171)

which is the picture we showed in Fig. 10. We use this simplified picture to describe what happens in the halo. We can see that the soliton core is inside the Jeans length, as expected, if one compare with Eq. (136). But it is of the order of the de Broglie wavelength if we use the velocity in the soliton \(v^2_c = r/(GM_c)\),:

$$\begin{aligned} \lambda _{dB} (v=v_c) = 3.91\, r_c\, \left( \frac{r}{r_c} \right) ^{1/2}\,. \end{aligned}$$
(172)

In Schive et al. (2014b) based on the scaling of the Schorinder–Poisson system and using the result of simulations it was also derived the core–halo mass relation for the FDM, showing that \(M_c \propto (1+z)^{1/2}\, M^{1/3}\), where M is the halo mass. This shows that MW like halos with \(M \sim 10^{12} \, M_{\odot }\) at a redshift around \(z\sim 8\) will have soliton cores forming after the halo collapse with mass \(M_c \sim 10^{9}\, M_{\odot }\). These soliton cores can be seen in FDM simulations, as we will see in Sect. 4.3.

The presence of these soliton cores in the interior of the galaxies lead to a rich phenomenology. Since no gravitational clustering happens in these regions, the soliton core has a cored profile \(\rho _c(r)\), which is different than the one expected from CDM. Therefore, if we can probe the central density of galaxies and obtain a non-cored profile, we can use this to put a bound on the FDM mass. Dwarf galaxies can also be used to do that. Since those galaxies are DM dominated, one can measure their central density and compare with the bound obtained by the expected core to constrain the mass of the FDM. We are going to discuss this in detail in Sect. 5, and also in a extreme case in Sect. 4.1.7, where we discuss how the core can solve the cusp–core problem.

Dynamical effects—relaxation

In this section, we are going to discuss the third class of observational effects of the ULDM, the dynamical effects. These are effects that arise because of the wave-like behaviour that ULDM have inside the soliton cores in galaxies. This can modify the dynamics of objects that are present inside this region where condensation takes place. We are going to see here two of those effects that arise from the relaxation between the FDM and macroscopic objects that move in or through the condensed core. These effects can lead either to heating, which is seen as an energy injection in the orbit of the macroscopic object, or to cooling or dynamical friction, where orbit of the object loses energy to the FDM field.

These two effects are present in CDM and other DM models, but are modified here by the wave-like behaviour of the FDM. Effects like an enhanced heating can lead to observational effects such as the increase in the velocity dispersion of systems and their expansion. On the other hand, this modified dynamical friction can be used to explain effects like system that were expected to have merged but did not, which represents a challenge for the CDM paradigm.

In the same way that we studied relaxation between the FDM particles leading to the formation of a BEC in Sect. 4.1.4, we can use the same techniques of kinetic theory to study the relaxation of the FDM and the macroscopic objects. We describe below briefly this analysis. This was introduced in Bar-Or et al. (2019) and the reader should consult this reference for more details. There are also different ways to model dynamical friction, which we will comment below.

We want to describe the relaxation between the FDM and macroscopic objects inside the condensate core. Therefore, we can think about this as a test particle, or a contaminant, that is present in the condensateFootnote 39. We want to see how stochastic density fluctuations in the FDM core lead to diffusion of the velocity of the macroscopic objectFootnote 40.

Using kinetic theory one can treat the problem as a diffusion of a test particle in a fluctuating density field. In the case of the FDM, the FDM is represented by a wavefunction which is a collection of plane waves, \(\psi ({\mathbf {r}},t) = \int {\text {d}}{\mathbf {k}}\, \varphi ({\mathbf {k}}) \exp (i\mathbf {k.r} - i \omega (k) t)\), with \(\omega =\hbar ^2 k^2/2m\), interacting with a zero-mass particle. In this case, we can ignore gravitational interactions between the FDM particles and only focus in the interaction of the FDM and the classical test particle. The FDM can be treat in this way inside the core where it is condensed.

Computing the diffusion coefficients for this system of a test particle in the FDM condensate, it was found that it yields the same diffusion coefficients as the ones for a classical two-body relaxation where the test particle interacts with a homogeneous classical density field composed by particles with mass \(m_p\), but with a different effective mass for the FDM “particles” \(m_{\mathrm {eff}}\), a different distribution function \(F_{\mathrm {eff}}\) and a different Coulomb logarithm \(\log \varLambda _{\mathrm {FDM}}\):

$$\begin{aligned} m_{\mathrm {eff}} = \frac{(2\pi \hbar )^3}{m^3} \frac{\int {\text {d}}{\mathbf {v}}\, f^2({\mathbf {v}})}{\int {\text {d}}{\mathbf {v}}\, f({\mathbf {v}})} \,, \quad f_{\mathrm {eff}} = f^2({\mathbf {v}})\frac{\int {\text {d}}{\mathbf {v}}\, f({\mathbf {v}})}{\int {\text {d}}{\mathbf {v}}\, f^2({\mathbf {v}})}\,, \end{aligned}$$
(173)

where \(f(\mathrm {v})\) is the FDM distribution function and the effective distribution function is normalized like \(\rho _{\mathrm {FDM}}=\int {\text {d}}{\mathbf {v}}\, f_{\mathrm {eff}}({\mathbf {v}})\). The Coulomb logarithm is \(\log \varLambda _{\mathrm {FDM}} \approx \log (2b_{\mathrm {max}}/\lambda _{\mathrm {dB}}(\sigma ))\), where \(b_{\mathrm {max}}\) is the maximum scale of encounter and \(\sigma \) is the dispersion velocity.

So the picture is that for understanding the effects of a macroscopic object in the FDM core, one can think as the FDM was composed by effective particles with mass \(m_{\mathrm {eff}}\), called FDM “quasiparticles” in Bar-Or et al. (2019)Footnote 41. If this quasiparticle has a mass that is larger than the mass of these macroscopic objects \(m_t\), this leads to the object to either lose energy or otherwise to gain energy from the encounters with these quasiparticles. These two processes are the cooling (or dynamical friction) and heating.

Thus, if \(m_{\mathrm {eff}} \gg m_t\), the fluctuations in the FDM field inject energy into the orbit of the macroscopic object. This injection increases the velocity dispersion of the macroscopic object, which in turn increases its size and we have heating. The heating time scale is defined as

$$\begin{aligned} \tau _{\mathrm {heat}} = \frac{3 \sigma ^3}{16 \sqrt{\pi } G^2 \rho _{\mathrm {b}} m_{\mathrm {eff}}\, \log \varLambda _{\mathrm {FDM}}} = \frac{3 m^3 \sigma ^6}{16 \pi ^2 G^2 \rho _{\mathrm {b}}^2 \hbar ^3 \, \log \varLambda _{\mathrm {FDM}}} \end{aligned}$$
(174)

For times smaller than \(\tau _{\mathrm {heat}}\), the system can gain energy from these FDM fluctuations and heating takes place. For values of the effective mass in halos, we can see that the quasiparticle is more massive than stars. This heating of stars either in a cluster or in the disk of galaxies can lead to observational effects.

If \(m_{\mathrm {eff}} \ll m_t\), then the mass of the macroscopic object is bigger than the FDM quasiparticle, and the macroscopic object loses energy from its orbit to the FDM field. This process can be interpreted as dynamical friction. Since the macroscopic object is losing energy, this process is also known as cooling, with a cooling time given by

$$\begin{aligned} \tau _{\mathrm {cool}} = \frac{3 \sigma ^3}{8 \sqrt{2\pi } G^2 \rho _{\mathrm {b}} m_{\mathrm {t}}\, \log \varLambda _{\mathrm {FDM}}} \end{aligned}$$
(175)

This process usually happen for more massive objects like globular clusters merging with the center of the galaxy or black holes.

A similar analysis is done in El-Zant et al. (2016, 2020b), where they confirm the results from Bar-Or et al. (2019) and extend the model to include the effect density perturbations coming from stellar winds, supernova explosions or active galactic nuclei. The consequence for super massive black holes in a FDM halo was studied in El-Zant et al. (2020a) which can lead to its ejection from the core. This effect can be used to put bounds in the FDM mass.

We are going to discuss below some specific examples where these dynamical effects take place.

Gravitational heating

We can now see here how the heating affects a stellar population. We show this for a prototype case of a spherical stellar system of radius \(r_{\star }\) in a FDM gravitational potential. The initial distribution function of the stellar system is Maxwellian with a velocity dispersion \(\sigma _t\). Since star have \(m_t = m_{\mathrm {star}} \ll m_{\mathrm {eff}}\), only heating takes place.

The heating is important when \(\tau _{\mathrm {heat},\star }\) is of order of 1/3 of the age of the galaxy \(\tau _{\mathrm {age}}\), where

$$\begin{aligned} \tau _{\mathrm {heat},\star } \approx \frac{2.08 }{\log \varLambda _{\mathrm {FDM}}} \, \left( \frac{r_{\star }}{1\, \mathrm {kpc}} \right) ^4 \left( \frac{m}{10^{-22}\, \mathrm {eV}} \right) ^3 \left( \frac{m}{200\, \mathrm {km/s}} \right) ^2 \, \mathrm {Gyr}\,. \end{aligned}$$
(176)

In this case, we have a increase in the velocity dispersion and the system is going to expandFootnote 42. This happens in scales when \(r_{\star } < r_{\mathrm {heat},\star }\) with

$$\begin{aligned} r_{\mathrm {heat},\star } = 1.13\, \log \varLambda _{\mathrm {FDM}} \left( \frac{\tau _{\mathrm {age}}}{10\, \mathrm {Gyr}} \right) ^{1/4} \left( \frac{m}{10^{-22}\, \mathrm {eV}} \right) ^{-3/4} \left( \frac{m}{200\, \mathrm {km/s}} \right) ^{-1/2} \, \mathrm {kpc}\,.\nonumber \\ \end{aligned}$$
(177)

For the comparison with observation that we are going to show in Sect. 5, we are interested in the cases where this effect leads to a increase in the dispersion relation of the star, expanding the stellar system. This is going to be studied in the disks of galaxies and also in a star cluster, like Eridanus II.

Dynamical friction

The change in dynamical friction is one of the most interesting consequence in BECs and superfluids. This emergent phenomenology can lead to consequent observations that might reveal the characteristics of those systems. It is interesting to see how dynamical friction behaves in the presence of a BEC core, as in the case of the FDM. The dynamical friction in a BEC is not expected to change as dramatically as in a superfluid, which has no friction, but some change is expected nevertheless.

It is expected that the FDM changes this prediction because of three phenomena: (1) change in the rate of orbital decay because of the presence of the condensed core; (2) since the FDM produces a homogeneous core, a mechanism similar to the “core stalling” observed in N-body simulations can take place and reduce or eliminate drag from dynamical friction; and (3) the way dynamical friction is calculated must be modified by the presence of an object with large de Broglie wavelength, a quantum mechanical extension to the calculation of dynamical friction must be done.

An interesting puzzle that can potentially be explained by a modified dynamical friction is the puzzle of the Fornax globular clusters. From the standard dynamical friction expected for CDM and baryons, it is expected that globular clusters orbiting Fornax should have rapidly fallen towards its center to form a stellar nucleus. However, there is no signal of mergers and we detect 5 globular clusters orbiting Fornax.

In Hui et al. (2017), only the last effect is described and simulated for different parameters the orbital decay times for Fornax in CDM and in the FDM cases. They found that in FDM the orbital decay time is longer, and four of the five decay times simulated are bigger than \(10\, \mathrm {Gyr}\) or more, thus explaining the puzzle for why the globular cluster in Fornax survived. More simulations and observations are needed to confirm this claim, but the FDM model seems to address the dynamical friction puzzle. The ideal is to have the microscopic theory describing dissipation in the FDM model.

These qualitative results are confirmed in Bar-Or et al. (2019) using the classical two-body relaxation, shown above. See also results for how dynamical friction alters inspiral systems (Bar-Or et al. 2019).

Finally, in Lancaster et al. (2020), a detailed analytical and numerical study of dynamical friction in the FDM model was performed. To describe the dynamical friction in this model, they describe the dissipation that a perturber moving in a condensate causes. They work this dissipation theory for point-sources (satellites), extended satellites and point-like satellite in a FDM background with finite velocity dispersion. This analytical theory is then verified by their numerical-simulation that solves the Schrödinger–Poisson system in the presence of such perturber satellite, showing good agreement with the analytic methods. This framework is applied to the cases of the Fornax globular cluster, but also to the Sagittarius (Sgr) stream and the Large Magellanic Clouds (LMC) (we will see about those systems in more detail in Sect. 5). For the Fornax, they find that if the mass of the FDM model is \(m \gtrsim 10^{-21}\,\mathrm {eV}\) this model stops explaining the Fornax globular cluster merging times, which is in agreement with the mass bounds necessary for the FDM to solve the small-scale problems. For Sgr and LMC it is found that the dynamical friction on those are described by the classical limit, described by the Chandrasekhar formula. More simulations need to be done to confirm this, and it is very important to understand the dynamical friction in this regime since these bodies have a strong influence in the MW, and are the target of many studies and observations. These results already show that this modified dynamical friction in the FDM model can maybe explain some interesting astrophysical observations which might be a good opportunity to measure the ULDM on galactic scales.

Clusters: most massive halos

The FDM can also affect clusters. For distances larger than the de Broglie wavelength of the FDM, it is expected that DM behaves as standard CDM and that the halo enveloping the soliton has a NFW profile. This can be seen numerically for the mass range \(10^9 \,M_\odot \lesssim M_{\mathrm {vir}} \lesssim 10^{11} \, M_\odot \) (Schive et al. 2014a, b; Schwabe et al. 2016; Veltmaat and Niemeyer 2016), which gives an estimate for the mass of the central soliton. Extending this relation to larger halos, the FDM predicts that in the inner regions of clusters there will be a condensed core, a soliton, with mass

$$\begin{aligned} M \simeq 1.3 \times 10^{10} M_\odot \left( \frac{10^{-22}}{m} \right) \left( \frac{M_{\mathrm {vir}}}{10^{15} M_\odot } \right) ^{1/3} \,. \end{aligned}$$
(178)

which is still below the maximum mass for the soliton calculated above for a galaxy. The corresponding half-mass radius is

$$\begin{aligned} R_{1/2} \simeq 25 \, \mathrm {pc} \left( \frac{10^{-22}}{m} \right) \left( \frac{M_{\mathrm {vir}}}{10^{15} M_\odot } \right) ^{-1/3}\,. \end{aligned}$$
(179)

So, the presence of this soliton with this mass and size would be a prediction of the FDM model. But a question that remains to be answered is the following: is the presence of such solitons in the interior of clusters halos compatible with merging cluster like the Bullet cluster or the anti-Bullet cluster?

In Hui et al. (2017), they ask the question if solitons in the center of the galaxies have not been misinterpreted as super massive black holes. They compare the mass of the central dark region measured from Virgo and show that this is similar to the mass of the soliton core in a galaxy like Virgo for a mass of the FDM particle of \( m \sim 10^{-22} \, \mathrm {eV}\). However, since the observations from the Event Horizon Telescope of the black hole in the center of M87 were released, this hypothesis seem to be almost excluded, and it is indeed a super massive black hole that inhabits the center of this galaxy. We have to wait for more data to confirm this, but this is an exciting measurement that can also be used to test the FDM hypothesis.

Another interesting fact is that we know that the galaxies host a super massive black hole in their center. For this reason in Hui et al. (2017) they investigate the possibility of a super massive black hole to be created in the center of a soliton. Apparently, the black holes do not grow for the fiducial mass \(m_{22}\), in a condensate core. Their creation only starts being significant for \(m \gtrsim 5 \times 10^{-22} \, \mathrm {eV}\), which is in tension with other bounds in the mass, like the one to solve the missing satellites problems.

4.1.7 Addressing the small-scale challenges

Perhaps even more interesting than solving the small-scale problems of the CDM paradigm, it is the new and rich phenomenology that the ULDM present. But the ULDM models can also address these small-scale challenges and reconcile the small-scale observations with the CDM paradigm. Usually the scales where the non-CDM and interesting phenomenology happens coincides with the scales where these models present modifications in the small-scales that are necessary to explain the small-scale observations.

We are going to discuss now how this new phenomenology on small scales presented by the FDM can address the small-scale challenges and what are the conditions in the parameter of this class of models for that to happen. We are only going too discuss the FDM case now, and the conditions in the mass, but one can think that a similar analysis can be done for the case of SIFDM.

FDM

We want to determine the mass of the FDM candidate. From the discussion above, we saw the there is a bound for the mass to condensate in galaxies \( 10^{-25} \lesssim m \lesssim \mathrm {eV}\) for a typical MW-like galaxy. Later we saw that, for masses of order of the usual QCD axion mass, around \(m \sim 10^{-5} \mathrm {eV}\), the stable configurations are very localized and small, far from galactic scales. With that we can already see that \(m \ll 10^{-5} \mathrm {eV}\) for the FDM model. Now, we are going to see other conditions that can bound the mass and show the mass range for the FDM particle that can address the small-scale challenges.

  • Halos: minimum size, maximum density and the cusp–core problems:

    The general idea why FDM (and all ULDM models) can solve the cusp–core problem is because these systems naturally predict a core in the center of galaxies. In this core there is no structure formation (Jeans instability), and for that reason they might prevent the formation of a cusp in the center of the galaxy. We are going to investigate here how and for which mass the FDM core size and density can address the cusp–core problem.

    Since we want the non-CDM behaviour to happen inside galaxies, the de Broglie wavelength of the FDM inside galaxies cannot exceed the size of the galaxy, given by the virial radius: \(\lambda _{\mathrm {d}}B < R = GM/v^2\). Therefore, \(R \gtrsim \frac{1}{G M m^2}\), where M is the mass of the galaxy. We can write that in terms of the half-radius, radius where half of the mass of spherically symmetric system is contained (Hui et al. 2017)

    $$\begin{aligned} R_{1/2} \gtrsim 0.335 \left( \frac{M}{10^9 \, M_{\odot }} \right) ^{-1} \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{-2} \, \mathrm {kpc}\,. \end{aligned}$$
    (180)

    This bound on the radius is compatible with the half-light radii inferred from the densities of 36 Local group dwarf spheroidals (McConnachie 2012) if the mass of the FDM particle is \(m \sim 10^{-22} \mathrm {eV}\).

    With the above condition, we can also compute the upper bound in the central density:

    $$\begin{aligned} \rho _{\mathrm {c}} \le 7.05 \, \left( \frac{M}{10^9 \, M_{\odot }} \right) ^{4} \left( \frac{m}{10^{-22} \, \mathrm {eV}} \right) ^{6} \, M_\odot \, \mathrm {pc}^{-3}\,. \end{aligned}$$
    (181)

    If we compare this bound to the observations from 8 dwarf spheroidals, and we can see for the density to be comparable to the one measured for these dSph, the FDM mass needs to be \(m= 8^{+5}_{-3} \times 10^{-23} \, \mathrm {eV}\) for Draco and \(m= 6^{+7}_{-2} \times 10^{-22} \, \mathrm {eV}\) for Sextans (Chen et al. 2017). For those masses, the distribution at the center of the galaxies seem to be cored, alleviating the cusp–core problem.

    Therefore, it is necessary a mass of order \(m \sim 10^{-22} \, \mathrm {eV}\) in order for FDM to solve the cusp core problem. And we showed that this is compatible with the measurements from dwarf galaxies. However, we are going to discuss in Sect. 5.2.1 that some studies dispute this conclusion.

  • Lower bound on the FDM halo masses, and the missing satellites and too big to fail problem:

    As we saw before, for the self-gravitating FDM systems, since gravity is attractive, we have coherence on a finite scale. The size of this core depends on the mass, being larger as the mass gets smaller. So the smallest radius to be produced in the FDM model are determined by the smallest mass allowed for the particle.

    Having a limit for the smallest cores that can be created has important consequences in the abundance of low mass halos, and it is going to be different in this model than what is given by \(\varLambda \)CDM. We can see that by calculating the smallest structures formed in the FDM model. This is given when \(\lambda = \lambda _{\mathrm {J}}\), where \(\lambda _{\mathrm {J}}\) represents the last scales that can suffer gravitational instability. With the Jeans length, we can calculate the Jeans mass:

    $$\begin{aligned} M_{\mathrm {J}}&= \frac{4\pi }{3} \rho \left( \frac{1}{2} \lambda _{\mathrm {J}} \right) ^3 = 1.5 \times 10^7 M_\odot \, (1+z)^{3/4} \, \left( \frac{\varOmega _{\mathrm {FDM}}}{0.27} \right) \left( \frac{H}{70 \,\mathrm {km/s/Mpc}}\right) ^{1/2} \nonumber \\&\left( \frac{10^{-22} \, \mathrm {eV}}{m} \right) ^{3/2}\,. \end{aligned}$$
    (182)

    This is the minimum mass of substructure created in the FDM model. This is in contrast with CDM, where halos with mass below \(\sim 10^{8} M_\odot \) are highly created, with abundance \({\text {d}}n(M_{\mathrm {h}}) \propto M_{\mathrm {h}}^{-2} {\text {d}}M_{\mathrm {h}}\). In that sense, the missing satellites problem is addressed in the FDM model, since halos of smaller masses than \(M_{\mathrm {J}}\) are not created, if \(m = 10^{-22} \, \mathrm {eV}\). If the mass of the FDM is smaller than that, sub-halos with smaller mass will form, and the FDM model is not going to address the missing satellite problem anymore. Therefore this shows that to solve this problem, the FDM has a preferred mass of around \(10^{-22}\, \mathrm {eV}\). We are going to see below that tidal disruption can also act suppressing small mass halos, aiding FDM in solving the missing satellites problem. The too big to fail problem is also addressed by the FDM, since we have a mechanism to explain the fact that low-mass sub-halos are not formed, making it not necessary to invoke mechanisms that creates the too-big-to-fail problem as a by product.

Summarizing, to address the small-scale challenges, which means presenting a cored density profile inside galaxies and suppressing the formation of small mass sub-halos which would explain the smaller amount of satellites, the mass of the FDM needs to be of the order of \(\sim 10^{-22} \, \mathrm {eV}\). A component with that mass and presenting the dynamics of the FDM class of models would behave like CDM on large-scales and present the observed structures on small-scales.

However, we are going to see in Sect. 5 that, although for many years this mass range was available as a possibility for the mass of the FDM, the latest observations have been challenging this mass, and showing that the FDM has to probably have a heavier mass than the one necessary to address the small-scale problems.

4.2 DM superfluid

In this section, we are going to describe the third category of models of ULDM, the DM superfluid. In previous sections we saw the small-scale problems of \(\varLambda \)CDM and how MOND empirical law offered a very good fit to the rotation curves of galaxies and the scaling relations that emerge from the dynamics of galaxies, which might be challenging in the context of \(\varLambda \)CDM. However, as we saw, there is no present framework that can explain MOND, given that the initial proposed theory, the full MOND, and its extensions present serious problems. We present here an alternative model to DM that has the goal of reconciling CDM and MOND: the DM superfluid. This model intends to not only solve the problems that the previous models attempt to address but also offer a mechanism to describe MOND on small-scales. In this framework, DM behaves as standard CDM on large-scales, while the MOND dynamics emerges on galactic scales. And this is possible through the physics of superfluidity.

The idea of the DM superfluid model is that DM forms a superfluid on galactic scales, where superfluidity arises upon condensation. This superfluid core present in the inner regions of galaxies not only addresses the small-scale challenges of \(\varLambda \)CDM in a similar way as the previous models, but the superfluid described in these regions behaves following a different dynamics which reproduces the MOND behaviour. This is possible given that this superfluid is described by a Lagrangian similar to thee one from the MOND theory which is allowed given the EFT description of the superfluid. In this way, the long-range correlation present in MOND is going to be given by the behaviour of the phonons which mediate long-range forces. Outside the superfluid core, DM behaves as normal matter as in the previous models. This is the general vision how the DM superfluid model attempts to describe the behaviour of DM in galaxies.

In the following subsection, we will construct the DM superfluid theory showing first in which conditions DM condensates on galactic scales, following that we will present the theory that describes this superfluid phase. With that in hand we can calculate the halo profile and rotation curves to compare with data and check the fit of the theory. We present how this model explain many astrophysical systems and possible predictions. After that we show the limits of validity of this description and its relativistic completion. We briefly describe how the cosmology works in this model.

This model is constructed using the fact that in a generic superfluid we can reproduce the exact action as in MOND. This is a very specific model and serves as a toy model for the understanding of theories of DM that present an emergent dynamics on small-scales. It is important to point out that what is important is for the model to be able to explain the rotation curves and scaling relations described by galaxies. This translates into a theory that exhibits long-range correlation on Galactic scales. We present here a very specific example of a model when this occurs, where the way the long-range correlation is obtained is by restricting the Lagrangian to behave like MOND on small-scales, but this does not have to be the case. The search for a more general theory where this emerges, with a known microphysics is the final goal. Such a construction is currently being searched.

4.2.1 Conditions for DM condensation

Before describing how the DM superfluid behaves inside galaxies, we need to determine in which conditions DM condensates into a Bose–Einstein condensate in galaxies. As we saw in the previous section, two conditions need to be met for condensation: first, we need that all the particles are in a single coherent quantum state, described by a single wavefunction of the condensate; a second condition is that the DM particles are in thermal equilibrium, to be described by a Bose distribution.

In this section, we want to obtain a rough estimate of the bounds in the parameters of the model to obtain this condensed core in the inner parts of the galaxy. For that, for simplicity, we use the criteria for weakly interacting gases.

Condensate wavefunction and thermalization

We showed in the beginning of this section, the condition on the mass for the ULDM particles to condense in galaxies, showing that they should be in the range \(10^{-25}\, \mathrm {eV} \lesssim m \lesssim 2 \, \mathrm {eV}\). This is an approximate condition for the case of DM in galaxies, but it gives us an order of magnitude estimation for the mass of the DM superfluid particles. However, if we remember from Sect. 3, there is a second condition for condensation.

The second condition to form a condensate is that the particles are in thermal equilibrium. The condition to achieve thermal equilibrium is that the time scale of thermalization must be smaller than the time scale where dynamical processes happen in the halo, the dynamical time. If this condition is satisfied and thermal equilibrium is achieved, the condensate is coherent in the entire halo. The time scale of thermalization if given by the inverse of the self-interaction rate, and the condition for thermalization is given by

$$\begin{aligned} \varGamma \sim \varUpsilon v_{\mathrm {vir}} \rho _{\mathrm {vir}} \frac{\sigma }{m} \, \lesssim \, t^{-1}_{\mathrm {dyn}} = \left( 3\pi / 32 G \rho \right) ^{-1/2} \,, \end{aligned}$$
(183)

where \(\varUpsilon \sim \frac{\rho _{\mathrm {vir}}}{m} \frac{(2\pi )^3}{(4\pi /3)(mv)^3}\) is the Bose enhancement factor, which tells you if a boson is already in the state, the probability to another boson to be in that state will be enhanced by a factor of \(\varUpsilon \). The dynamical time is taken here as the time it takes to a sphere of density \(\rho \) to collapse due to gravity. This condition gives a bound in the self-interaction cross section:

$$\begin{aligned} \frac{\sigma }{m } \gtrsim 0.3 \left( \frac{m}{\mathrm {eV}} \right) ^4 \frac{\mathrm {cm}^2}{g}\,, \end{aligned}$$
(184)

where we assumed \(z_{\mathrm {vir}}=2\) and \(M = 10^{12} M_\odot \). If we want that our self-interaction satisfies the merging-cluster bound (Randall et al. 2008; Massey et al. 2011; Harvey et al. 2015), which is \(\sim 1\hbox {cm}^2\)/g, this gives another bound in mass of the superfluid: \(m \lesssim \mathrm {eV}\).

Fig. 13
figure 13

Left panel: mass of the superfluid particle with respect to the mass of the halo. The filled are represents the regime where DM is condensed and behaves as a superfluid, for \(z_{\mathrm {vir}}=2\). Right panel: approximate calculation form (Berezhiani and Khoury 2015) for the fraction of the particles in the condensate versus in the normal state, for a series of sub eV masses in accordance with our bounds. We assumed \(z_{\mathrm {vir}}=0 \)

From these conditions, we can obtain a few properties of the our DM superfluid condensate:

  • Critical temperature: With DM in thermal equilibrium, the temperature can be obtained by the equipartition theorem: \(k_{\mathrm {B}} T = m \langle v \rangle ^2 /2\), which is valid for temperatures smaller than the critical temperature. Above that temperature, the condensate is broken. The critical temperature \(T_{\mathrm {c}}\) is associated with the “critical” velocity \(v_{\mathrm {c}}\), than can be read when we saturate the bound (90):

    $$\begin{aligned} T_{\mathrm {c}} \sim 6.5 \left( \frac{ \mathrm {eV}}{m}\right) ^{5/3} (1+z_{\mathrm {vir}})^2 \, \mathrm {mK}\,. \end{aligned}$$
    (185)

    With that the temperature in a halo of mass M is given by:

    $$\begin{aligned} \frac{T}{T_{\mathrm {c}}} \sim 0.1 (1+z_{\mathrm {vir}})^{-1} \left( \frac{m}{\mathrm {eV}} \right) ^{8/3} \left( \frac{M}{10^{12} M_\odot }\right) ^{2/3} \end{aligned}$$
    (186)
  • Condensate fraction: As we saw on Sect. 3 at \(T=0\) it is expected that almost all the particles are in the condensate. However, at finite but subcritical temperature, as seen in Landau’s theory (Landau and Lifshitz 1980), it is expected that the fluid is going to be a mixture of superfluid and normal fluid, with the majority in the superfluid. Borrowing from the non-interacting BEC description, this can be estimated as

    $$\begin{aligned} \frac{N_{\mathrm {cond}}}{N}= 1-\left( \frac{T}{T_{\mathrm {c}}} \right) ^{3/2} \sim 1- 0.03 (1+z_{\mathrm {vir}})^{-3/2} \left( \frac{m}{\mathrm {eV}} \right) ^4 \left( \frac{M}{10^{12} M_\odot }\right) \,.\qquad \end{aligned}$$
    (187)

    This formula is only valid for free-particles, and a particle with interaction and trapped in the gravitational potential has a different power than 3/2, but it serves as an estimate. We can see in Fig. 13 the condensate fraction with respect to the mass of the halo for different masses of the DM particle. From this, we an see that for a MW-like galaxy for any mass smaller than \(\mathrm {eV}\), the particles are in the condensate, while for higher masses of clusters, for example, this is not true. Therefore, the mass range from the bound (88) describes a condensate that condenses on galactic scales.

The above conditions were obtained assuming that the condensate will take the entire halo. However, as mentioned in Sect. 2.1, virialization occurs through violent relaxation, which is an out-of-equilibrium process. In this way, the DM superfluid cannot thermalize. What should happen is that first, the halo virialized and the profile is the expected NFW. After this process, DM particles start to enter thermal equilibrium in the inner, most central regions of the condensate, where the interaction is more pronounced. In this way, the halo would have an inner region (\(r<R_T\)) where DM is in a condensed state surrounded by the outer part of the halo (\(r>R_T\)) that follows the NFW profile (Berezhiani et al. 2018). Since in this model the goal is to be able to describe the rotation curves of galaxies, \(R_T\) needs to be larger than the radius where the circular motion of stars and gas is observed. For \(r > R_{\mathrm {T}}\), the density profile of the halo follows the NFW profile, \(\rho \propto r^{-3}\). So we can rewrite the density and velocity with respect to the virial quantities used above: \(\rho (r)=\rho (R_{200}) \left( R_{200}/r \right) ^3\), where for a NFW we can estimate \(\rho _{200}/\rho (R_{200}) \sim 5\). With that, the thermalization bound becomes:

$$\begin{aligned} \frac{\sigma }{m } \gtrsim 0.2 \left( \frac{m}{\mathrm {eV}} \right) ^4 \left( \frac{M}{10^{12} M_\odot }\right) ^{2/3} \left( \frac{r}{R_{200}} \right) ^{7/2} \frac{\mathrm {cm}^2}{g}\,, \end{aligned}$$
(188)

which tells us that it is easier to reach thermal equilibrium in the center of the galaxies where the density is higher. This translates into a bound to the thermalization radius:

$$\begin{aligned} R_T \lesssim 310 \left( \frac{m}{\mathrm {eV}} \right) ^{-8/7} \left( \frac{M}{10^{12} M_\odot }\right) ^{1/7} \left( \frac{\sigma /m}{\mathrm {cm}^2/g} \right) ^{2/7} \mathrm {kpc}\,. \end{aligned}$$
(189)

For a MW-like galaxy with \(M=10^{12} M_\odot \), if we can measure the the circular velocity up to approximately \(60\,\mathrm {kpc}\), this will translate into a bound for the mass: \(m \lesssim 4.2 \left( \frac{\sigma /m}{\mathrm {cm}^2/g} \right) ^{1/4}\, \mathrm {eV}\).

4.2.2 Superfluid dynamics

Since we have determined that DM condenses and forms a superfluidity in the central regions of the halo, we now need to describe the evolution of this superfluid inside this region. We need to determine the dynamics of the superfluid to be able to calculate the profile of the region of the halo comprising the superfluid and with the calculate the rotation curves of galaxies. In this section, we will describe the effective field theory of superfluids and show how this is theory reproduces MOND at small scales.

As we saw in the previous section, a superfluid at low-energies is described by the effective Lagrangian that is invariant under shift and Galilean symmetries:

$$\begin{aligned} {\mathcal {L}}_{T=0}=P(X)\,, \quad X={\dot{\theta }} +\mu -m \varPhi - \frac{(\nabla \theta )^2}{2m}\,, \end{aligned}$$
(190)

where \(\varPhi = -G M(r)/r\) is the external gravitational potential for a spherical symmetric static source. The thermodynamic pressure is given by P.

We want our theory to describe the MOND dynamics at the regions where it is superfluid. Given this general Lagrangian for the phonons (190), we want it to describe the MOND Lagrangian (8). For this, we conjecture that our phonon action is given by

$$\begin{aligned} {\mathcal {L}}_{\mathrm {DM},T=0}=\frac{2 \varLambda (2m)^{3/2}}{3} X \sqrt{|X|}\,. \end{aligned}$$
(191)

This fractional power might seem strange from the point of view of a quantum field theory of fundamental fields, leading to superluminal behaviour and caustics. However, as a theory for the phonons, this is not problematic and it determines uniquely the equation of state of the superfluid. As we can see for the condensate, the background, where \(\theta =\mu t\), the pressure is given by the Lagrangian density:

$$\begin{aligned} P(\mu ) = \frac{2\varLambda }{3} (2m\mu )^{3/2}\,, \quad \Rightarrow \quad P=\frac{\rho ^3}{12 \varLambda ^2 m^6}\,, \end{aligned}$$
(192)

where, in the non-relativistic regime, \(\rho =mn\) and \(n=\partial P / \partial \mu \) is the number density of condensed particles. As expected from the result from MOND, this Lagrangian gives us an EoS for the superfluid \(P \propto \rho ^3\), which is what we wanted to reproduce MOND.

To evaluate the excitation spectrum, we write the action for the phonon excitations \(\phi \), that can be obtained by expanding (190) to quadratic order. Neglecting the gravitational potential:

$$\begin{aligned} {\mathcal {L}}^{(2)}= \frac{(2m)^{3/2}}{4\mu ^{1/2}} \left( {\dot{\phi }}^2 - \frac{2\mu }{m} (\nabla \phi )^2 \right) \,, \end{aligned}$$
(193)

from where we can infer the sound speed of the phonon excitations:

$$\begin{aligned} c_{\mathrm {s}} = \sqrt{\frac{2 \mu }{m}}\,. \end{aligned}$$
(194)

However, only those ingredients are not enough to reproduce a MOND-like force and a coupling between the phonons and the baryon density needs to be introduced:

$$\begin{aligned} {\mathcal {L}}_{\mathrm {int}} = -\alpha \frac{\varLambda }{M_{\mathrm {pl}}} \theta \rho _{\mathrm {b}}\,, \end{aligned}$$
(195)

where \(\alpha \) is a dimensionless coupling constant. Although necessary to obtain the MOND regime, this interaction Lagrangian breaks shift symmetry softly, only at the \(1/M_{\mathrm {pl}}\) level. This term is here considered as a phenomenological term to reproduce MOND. In this way, this superfluid theory has three parameters: the mass m, the scale \(\varLambda \) and the coupling \(\alpha \).

The present form of the Lagrangian to obtain MOND is not the only way of obtaining the MOND behavior in the context of the DM superfluid model. In Khoury (2016) it was used higher order corrections to generate the non-relativistic MOND action, which is inspired in the symmetron mechanism. Using the same Lagrangian (191) as the leading order Lagrangian, higher order corrections involving gradients of the gravitational potential are added to effectively modify the gravitational force. This results in the spontaneous breaking of a discrete symmetry. The symmetry is broken for small accelerations leading to MONDian gravity, and is restored in the limit of large acceleration leading to Newtonian gravity. In this theory the shift symmetry of the entire system is maintained. A difference from the present mechanism, as we are going to see later, is that cosmologically all the DM is in the normal phase, behaving like CDM, and reproducing all the results from \(\varLambda \)CDM. Here we will describe the method of adding a photon–baryon coupling since this was studied in more detail in the literature.

Finite temperature

The theory developed above is valid for a \(T=0\) superfluid. However, in reality, the DM in galaxies has a non-zero temperature. As we mentioned in Sect. 3, for finite temperatures, this Lagrangian needs to receive finite temperature corrections. In Landau’s model the finite temperature superfluid consists is described as a two-fluid theory where a superfluid component and a normal component are present. Those components must interact with each other. Following this idea from Landau’s theory, at lowest order in derivatives, we can write the general form of the EFT at finite temperatures and finite chemical potential as a function of three scalars (Nicolis 2011):

$$\begin{aligned} {\mathcal {L}}_{T\ne 0} = F\left( X, B, Y \right) \,, \end{aligned}$$
(196)

where \(X=X(\theta )\) was defined before with respect to the superfluid variables. The other new components are: B is defined with respect to the normal fluid three Lagrangian coordinates \(\varphi ^I (\mathbf {x},t)\); and Y represents the scalar product of the normal and superfluid velocities:

$$\begin{aligned} B \equiv \sqrt{\det \partial _\mu \varphi ^I \partial ^\mu \varphi ^I} \,, \quad Y \equiv u^\mu \left( \partial _\mu \theta +m \delta _\mu ^0 \right) \simeq \mu -m\varPhi +{\dot{\phi }}+ \mathbf {v} \cdot \nabla \phi \,,\qquad \end{aligned}$$
(197)

where \(u^\mu \) is the unit 4-vector from \(\varphi ^I (\mathbf {x},t)\), and in the last equality of Y we have taken the non-relativistic limit, so \(\mathbf {v}\) is the velocity vector of the normal fluid component.

There are many ways to construct the finite temperature operators. Our restriction is that we want our finite-temperature theory to generate the expected MOND profile. To construct such a Lagrangian requires first-principle knowledge of the microphysics of the superfluid. Since we still do not have a fundamental description of the DM superfluid model, we proceed empirically. We suggest the following finite-temperature Lagrangian for the model:

$$\begin{aligned} {\mathcal {L}} = \frac{2 \varLambda (2m)^{3/2}}{3} \, X \sqrt{|X-\beta Y|} - \alpha \frac{\varLambda }{M_{\mathrm {pl}}} \phi \, \rho _{\mathrm {b}}\,, \end{aligned}$$
(198)

where the finite temperature effects are parametrized by a dimensionless constant \(\beta \). When \(\beta \rightarrow 0\), we recover the \(T=0\) result; we are using the fiducial value \(\beta =2\). We included the interaction term so we could represent the entire action of the model that we are going to use next.

4.2.3 Halo profile

With the Lagrangian of the theory, we can evaluate the halo profile in the superfluid region and, after matching with an outer NFW profile, calculate the rotation curves of galaxies. And this is what we are going to do in this section: estimate the halo profile. This will be done in steps. First, we estimate the DM halo profile taking into account only the density coming from (191). Next, we include the baryons, by calculating the profile for the full action including interaction. We are going to use here the finite-temperature effective action (198) since in the case of the \(T=0\) the perturbations around a static background configuration suffer from a ghost-like instability. Although phenomenological, it retains the features of the initial superfluid Lagrangian and can give a more realistic description of the system.

DM halo profile

We can now calculate the density profile of the condensate, in the superfluid region, assuming that we have only dark matter and no baryons for simplicity. This is the halo profile given by the different equation of state that the superfluid has: \(P \propto n^3\), given by equation (192). This analysis is almost the same for the zero-temperature and finite temperature cases, with accounts for the replacement:\(\varLambda \rightarrow {\tilde{\varLambda }}=\varLambda \sqrt{\beta - 1}\). Assuming hydrostatic equilibrium, for a static and spherically symmetric halo, the pressure and acceleration are related by:

$$\begin{aligned} \frac{1}{\rho (r)} \frac{{\text {d}}P(r)}{{\text {d}}r}=-\frac{{\text {d}}\varPhi (r)}{{\text {d}}r} = -\frac{4\pi G}{r^2} \int _0^r {\text {d}}r^{\prime } \, r^{\prime 2} \rho (r^{\prime })\,. \end{aligned}$$
(199)

By making a change of variables \(\rho (r)=\rho _0 \varXi \) and \(r=\xi \left[ \rho _0 / (32 \pi G {\tilde{\varLambda }}^2 m^6) \right] ^{1/2}\), where \(\rho (r=0)=\rho _0\), this equation reduces to the Lane–Emden equation (with \(n=1/2\)),

$$\begin{aligned} \frac{1}{\xi ^2} \frac{{\text {d}}}{{\text {d}}\xi } \left( \xi ^2 \frac{{\text {d}} \varXi }{{\text {d}}\xi } \right) = - \varXi ^{1/2}\,. \end{aligned}$$
(200)

Choosing boundary conditions \(\varXi (0) = 1\) and \(\varXi ^{'}(0)=0\), we can numerically solve this equation. We can see from the change of variables that the size of the condensate and the central density are given by Chandrasekhar (1967),

$$\begin{aligned} R= \xi _1 \sqrt{\frac{\rho _0}{32\pi G {\tilde{\varLambda }}^2 m^6}}\,, \quad \rho _0 = \frac{M}{4\pi R^3} \frac{\xi _1}{|\varXi ^{'} (\xi _1)|}\,, \end{aligned}$$
(201)

where at \(\xi _1\) the numerical solution vanishes. From the numerics \(\xi _1 \sim 2.75\) and \(\varXi ^{'} (\xi _1) \sim -0.5\) gives the following halo radius and central density:

$$\begin{aligned} \rho _0&\sim \left( \frac{M_{\mathrm {DM}}}{10^{12} M_\odot } \right) ^{2/5} \left( \frac{m}{\mathrm {eV}} \right) ^{18/5} \left( \frac{\varLambda }{\mathrm {meV}} \right) ^{6/5} \left( \beta -1 \right) ^{3/5} \, 10^{-24} \, \mathrm {g/cm}^3\,, \end{aligned}$$
(202)
$$\begin{aligned} R&\sim \left( \frac{M_{\mathrm {DM}}}{10^{12} M_\odot } \right) ^{1/5} \left( \frac{m}{\mathrm {eV}} \right) ^{-6/5} \left( \frac{\varLambda }{\mathrm {meV}} \right) ^{-2/5} \left( \beta -1 \right) ^{-1/5} \, 45 \, \mathrm {kpc}\,, \end{aligned}$$
(203)

With that, we can determine the chemical potential \(\mu = \rho ^2 / \left( 8 \varLambda ^2 m^5 \right) \). For \(m \sim \mathrm {eV}\) and \(\varLambda \sim \mathrm {meV}\), we obtain realistic core sizes, which are of sizes that cover a big part of the halo, as we wanted. For this reason, we choose the fiducial values:

$$\begin{aligned} m=0.6 \, \mathrm {eV}\,, \quad \varLambda =0.2 \, \mathrm {meV}\,. \end{aligned}$$
(204)

For these values, we have a cored density profile with a condensate core of radius \(158 \, \mathrm {kpc}\) for \(M_{\mathrm {DM}}=10^{12} M_\odot \). The condensate does not make the entire halo, but we expect that this condensed core is surrounded by a NFW profile. The central density obtained is smaller than the expected from CDM simulations, which is preferred by observations. In this way the DM superfluid offers a simple resolution to the cusp–core and the “too big to fail” problems. We will see these results in more details in Sect. 4.2.4.

Including baryons

Now, we derive the condensate profile in the presence of baryons. We expect that there is this extra acceleration due to the interaction to baryons. This comes from the dynamics of the phonon excitation \(\phi \) given the Lagrangian (198). We are going to assume a static, spherically symmetric approximation: \(\theta =\mu t + \phi (r)\). The equation of motion for the phonon is given by,

$$\begin{aligned} \nabla \cdot \left( \frac{(\nabla \phi )^2 -2m{\hat{\mu }}}{\sqrt{(\nabla \phi )^2 -2m{\hat{\mu }}}} \, \nabla \phi \right) = \alpha \frac{\rho _{\mathrm {b}}}{2M_{\mathrm {pl}}}\,, \end{aligned}$$
(205)

where \({\hat{\mu }}\equiv \mu -m\varPhi \). If we ignore the homogeneous curls term, in the limit where \((\nabla \phi )^2 \gg 2m{\hat{\mu }}\) the solution is

$$\begin{aligned} |(\nabla \phi )|\, (\nabla \phi ) \simeq \alpha M_{\mathrm {pl}} \mathbf {a}_{\mathrm {b}}\,, \end{aligned}$$
(206)

where \(\mathbf {a}_{\mathrm {b}}\) is the Newtonian acceleration due to baryons only. Then acceleration mediated by \(\phi \) is,

$$\begin{aligned} \mathbf {a}_\phi = \alpha \frac{\varLambda }{M_{\mathrm {pl}}} \quad \Longrightarrow \quad a_\phi = \sqrt{\frac{\alpha ^3 \varLambda ^2}{M_{\mathrm {pl}}} \, a_{\mathrm {b}}} = \sqrt{a_0 a_{\mathrm {b}}}\,, \end{aligned}$$
(207)

for \(a_0=\alpha ^3 \varLambda ^2/M_{\mathrm {pl}}\), which is exactly the acceleration expected in the deep MOND regime, as shown in Sect. 2.2.4. In the regime \((\nabla \phi )^2 \ll 2m{\hat{\mu }}\), we recover the Newtonian acceleration given by the baryons. So, in this model, the total acceleration is given by \(\mathbf {a}_{\mathrm {b}}\), \(\mathbf {a}_\phi \), and also \(\mathbf {a}_{\mathrm {DM}}\) the Newtonian acceleration from the DM halo itself (obtained in the previous section), since we have DM in this model (different than MOND).

Halo profile algorithm

Having developed the theory of the superfluid DM above, now we want to evaluate the density profile of the DM halo and the rotation curves, and compare it with the data to make a first proof of concept of the model. To evaluate the rotation curve, we need to determine the circular velocity with respect to the radius.

As discussed in our model the galaxy contains a superfluid core in the central region of the galaxy surrounded by a NFW profile envelope. So to calculate these quantities for the galaxy we first need to evaluate them inside the superfluid core, and then at \(R=R_{\mathrm {NFW}}\) match the density and the pressure obtained for the superfluid \(\rho _{\mathrm {SF}}\) and \(P_{\mathrm {SF}}\), to the ones given by the full NFW profile.

For that, we need to evaluate these quantities in the superfluid phase. To obtain the halo density profile, we need to determine the total mass of the halo M(r). The rotation curve is the circular velocity with respect to the radius, given by \(a=v^2_{\mathrm {circ}} (r) /r\) where \(a=\partial \varPhi / \partial r\). So we need to determine the gravitational potential \(\varPhi \) to calculate the rotation curve and, also to determine M(r). The Poisson equation in the superfluid region is given by

$$\begin{aligned} \nabla ^{2} \varPhi = 4\pi G \left( \rho _{\mathrm {SF}} + \rho _{\mathrm {b}} \right) \,. \end{aligned}$$
(208)

The baryon density is given by the observations, while the superfluid density we can obtain from our theory by differentiating our Lagrangian (198) with respect to \(\varPhi \),:

$$\begin{aligned} \rho _{\mathrm {SF}} = \frac{2\sqrt{2} m^{5/2} \varLambda \left[ 3 (\beta -1) {\hat{\mu }}+(3-\beta ) \frac{(\mathbf {\nabla } \phi )^2}{2m} \right] }{3\sqrt{(\beta -1) {\hat{\mu }}+\frac{(\mathbf {\nabla } \phi )^2}{2m}}}\,, \end{aligned}$$
(209)

where we can see that \(\rho _{\mathrm {SF}} = \rho _{\mathrm {SF}} (\varPhi , \phi )\). So, to solve the Poisson equation, we need the equation for \(\phi \), which is given by its equation of motion (205). The system of equations we need to solve is given by (208) and (205), which can be very intricate to solve. One approximation that can be done to simplify this is to assume that baryon distribution is spherically symmetric (which we know it is not true, but used as a simplification). With that, the system can be solved numerically. This is done in Berezhiani et al. (2018). After having this, this solution needs to be matched to the NFW profile that describes the outskirts of the halo. With that, it is possible to evaluate the density profile and the rotation curves of galaxies.

4.2.4 Observational consequences

Table 1 Summary of observational consequences of superfluid DM, showing the behaviour of this model in each of the systems listed. Table reproduced with permission from Berezhiani et al. (2018), copyright by IOP/SISSA

In this section, we will describe the main observational consequences of the superfluid DM. A summary of all the effects already worked out can be seen in Table 1. This table compiles a list of the behaviour that this model has in different systems. We describe in this section, some of those results, but point to Berezhiani et al. (2018) for a detailed explanation of each of those cases.

In Table 1, EFE stands for external field effect, and it reveals an interesting characteristic of the superfluid DM model. This effect is an example from kinetic screening in scalar field theories, where in the presence of gradient interactions the scalar acceleration, given by the non-linearities in the scalar field gradient, can suppress scalar field effects in gravity, making the system behave as standard gravity theory. This effect was an essential aspect of MOND, but in the DM superfluid model this effect is present in the superfluid cores, coming from the phonon non-linearities. With this, we can see that for satellite galaxies that are inside the superfluid core should follow a MOND-like dynamics, while more distant satellites are outside the core, and therefore should follow Newtonian dynamics. For more details, see Berezhiani et al. (2018).

Since it is going to be used a lot in this section, we remind Landau’s conditions for superfluidity is that the fluid velocity (\(v_{\mathrm {s}}\)) is smaller than the superfluid sound speed \(c_{\mathrm {s}}\), \(v_{\mathrm {s}} < c_{\mathrm {s}}\).

  • Galaxy rotation curves:

    Berezhiani et al. (2018) numerically calculated the rotation curves of IC 2574, a low surface brightness galaxy, and UGC 2953, a high surface brightness galaxy, using the method developed above, as a proof of concept of the galactic dynamics that the DM superfluid is able to reproduce. Since for the theoretical predictions, a spherical baryonic distribution was assumed to simplify the calculations, and since this is far from the actual distribution of baryons, in these calculations a hybrid method, mixing the results calculated with the spherical distribution, was implemented. In this, the acceleration is corrected for the actual distribution leading to

    $$\begin{aligned} \mathbf {a}_{\mathrm {hybrid}} = \mathbf {a}_{\mathrm {b,real}}+\mathbf {a}_{\mathrm {DM}}+\mathbf {a}_{\mathrm {phonon}}\,, \end{aligned}$$
    (210)

    where \(\mathbf {a}_{\mathrm {b, real}}\) is the acceleration computed from Poisson’s equation for a non-spherical baryon distributions; \(\mathbf {a}_{\mathrm {DM}}\) the Newtonian acceleration from the DM halo using spherical baryon distribution; and \(\mathbf {a}_{\mathrm {phonon}}\) from (207) sourced by \(\mathbf {a}_{\mathrm {b, real}}\), but with Newtonian potential from the spherical case. Although calculated in the hybrid method, \(\boxed {\mathbf {a}_{\mathrm {phonon}} \sim \sqrt{a_0 a_{\mathrm {N,b}}}}\) as expected in MOND regime.

    The fiducial parameters used for this numerical calculation were \(m = 1\mathrm {eV}\), \(\left( \sigma /m \right) = 0.01 \, \mathrm {cm}^2/\mathrm {g}\), which are optimal for having a superfluid core that encompasses the baryonic disk of the galaxy, while still within the bounds to agree with cluster observations; \(\varLambda m^3 = 0.05 \, \mathrm {meV} \times \mathrm {eV}^3\); and \(\alpha = 5.7\). The rotation curves can be seen in Fig. 14.