1 Introduction

Effective field theories have been serving as powerful tools for physicists, allowing for focusing solely on the interactions occurring at the energy scales of interest, while neglecting other phenomena that take place in different regimes. As such, the definition of an effective theory is intimately related to the energy cutoff, being the energy scale at which the effective description is rendered invalid due to the emergence of new interactions, possibly mediated by new states. In turn, the knowledge of the effective theory cutoff – namely at which energy scale it is identified and by which new interactions it is determined – carries pivotal information about how the effective theory may be completed in the ultraviolet regime.

A prime example is Fermi’s four-fermion theory, which describes weak interactions among fermions for sufficiently small energy scales. If the energy scales that are probed increase, reaching energies as high as \(\Lambda \simeq 80\, \text {GeV}\), the theory is not able to capture the several interactions occurring at those scales, with the reason being the emergence of the non-negligible W and Z-bosons interactions, which the theory ought to account for. Indeed, we do know that Fermi’s theory can be completed within the Standard Model, for which the former serves as an effective theory valid below its cutoff, that is indeed set by the mass of the W-bosons \(\Lambda _{\tiny {\text {EFT}}} = m_W \simeq 80\, \text {GeV}\).

In effective theories that are endowed with a moduli space, the identification of a cutoff is more subtle and less clear. Indeed, the vacuum expectation values of the moduli determine the strength of the couplings that govern the interactions and, along with those, the masses of the states that, descending from the ultraviolet completion, could break down the description. Consequently, in general, effective theories with a moduli space are not characterized by a single, univocally identified cutoff: rather, in different regions of the moduli space the effective description might be broken down by different kinds of states, and at different energy scales.

Effective field theories originating from string theory are naturally endowed with moduli spaces with rich geometrical structures, and the definition of their cutoff is tied to the geometry of the moduli space. In this regards, towards the boundaries of the moduli space, some ‘hard’ effective-theory breakdowns may occur, such as the one predicted by the Distance Conjecture [1]. The conjecture predicts that in any effective theory that admits a completion within string theory an infinite tower of states becomes light as boundaries of the moduli space located at infinite distance are approached. Thus, towards such infinite-distance boundaries, the effective-theory cutoff should be consistently reduced, in order to keep the infinite tower integrated out, thus getting lower and lower as said boundary is reached.

The Distance Conjecture, as a source of effective description breakdown, is thus invariably affected by the aforementioned ambiguities regarding the definition of an effective-theory cutoff. Indeed, in different regions of the moduli space different infinite towers may first break down the effective theory, for they are the lightest ones in those regions, and the mass of their lightest state may serve as cutoff. This phenomenon is common to several effective theories stemming from string theory – see [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16] for a sample of works. In general, however, it is not clear how many towers are needed in order to realize the Distance Conjecture throughout the moduli space, and how to systematically partition the moduli space according to the regions where each of these towers is relevant.

These ambiguities in the definition of an effective-theory cutoff can be encoded within the following key, general questions:

  • given a moduli space, how many different states, stemming from the ultraviolet completion, could lead to the breakdown of the effective theory? Are they infinite in number, or do they belong to a finite set?

  • how can we concretely determine how the effective theory is broken down throughout the moduli space, and what is the minimal information required to achieve this?

Addressing these questions is the core of this work, and the strategy that we will employ to tackle them is two-fold, relying on both a top-down and a bottom-up viewpoint.

The top-down approach that we will pursue is motivated by the recently formulated Tameness Conjecture [17]. The conjecture states that the couplings that appear in any consistent effective field theory have to be tame, and belong to specific families of functions – these concepts will be reviewed in Sect. 2.2. Exploiting the conjecture, in [15] it was shown that the realization of the Distance Conjecture requires only a finite number of different infinite towers of states in any tame effective theory. In this work, we will generalize these results, and illustrate that in effective theories governed by tame couplings, only a finite number of different types of states is sufficient to define the cutoff throughout the moduli space.

The bottom-up viewpoint that we will deliver offers a concrete method to compute the slicing of the moduli space according to how the effective theory cutoff is defined. Achieving this requires assuming some additional, though minimal information about the effective theory: we need to know how the effective theory is broken down at some points of the moduli space. Then, we feed this information, serving as dataset, to some supervised machine learning algorithms - specifically, we will employ the k-nearest neighbor algorithm, and a linear support vector machine algorithm. The algorithms, utilizing the above dataset, are able to reconstruct the regions of the moduli space where a given state is the one that determines the effective-theory cutoff.

The two viewpoints are not independent of one another, rather they are complementary. In fact, the application of the machine learning algorithms to the problems at hand is rooted in the fact that the couplings of the effective theory are tame, and only a finite number of states contribute to the definition of the effective-theory cutoff. If the number of different kinds of states defining the cutoff were infinite, the application of any machine learning algorithms would be pointless and unsuccessful, for one would need to set an infinite number of inputs to begin with. The complementarity of the two pictures will be additionally illustrated in Sect. 4 with a concrete example, namely the computation of the cutoff-induced slicing of the moduli space of a four-dimensional effective theory obtained after compactifying Type IIB string theory over a toroidal orbifold.

This work is articulated as follows. In Sect. 2 we overview the statements of the Tameness Conjecture, and its implications for effective field theory in general; we also recall how the conjecture may be helpful in investigating how effective theories are broken down due to the emergence of infinite towers of states, reviewing the main findings of [15]. Section 3 collects the foundational ideas of this work, which are presented from two complementary perspectives: we illustrate how the tameness of the effective theory implies that the moduli space is partitioned in a finite number of subsets, according to the kind of states that determines the cutoff there; then, we show how such a partition can be obtained purely from a bottom-up perspective via two supervised machine learning techniques, the k-nearest neighbor algorithm and the linear support vector machine algorithm. In Sect. 4 we show how the ideas presented in Sect. 3 concretely apply to a four-dimensional effective theory obtained after compactifying Type IIB string theory over a toroidal orbifold. Finally, in Appendices A and B we briefly overview the k-nearest neighbor algorithm and the linear support vector machine algorithm employed in Sect. 3.

2 The distance conjecture in tame effective field theories

The Distance Conjecture [1] is one of the founding statements from which the Swampland Program sparked. The conjecture aims at identifying generic features that any consistent effective field theory should be equipped with towards specific field space boundaries. Although such genericity makes the conjecture appealing, it also renders the conjecture hard to prove, even in simple, concrete frameworks.

The claims of the Distance Conjecture can however be better addressed if one invokes another recently formulated conjecture, the Tameness Conjecture [17]. The latter, constraining the functional form of the couplings that appear in any consistent effective field theory, not only can help uncover novel features that effective field theories exhibit near the field space boundaries; it also delivers recipes that simplify tests of the Distance Conjecture [15].

In this section, in order to set the ground for the forthcoming discussion, we briefly recall the core statements of the Distance Conjecture, and the limitations that emerge when trying to test it. Then, we will recall how the Tameness Conjecture can help better address the Distance Conjecture by overviewing the main results of [15].

2.1 The distance conjecture, and its testing limitations

For the sake of generality, we consider an effective theory formulated in D spacetime dimensions. The assumptions on the effective field theory are minimal: beside assuming that the theory is coupled to gravity, we solely assume the theory to be endowed with a non-trivial N-dimensional, classical moduli space \(\mathcal {M}\). A local patch within \(\mathcal {M}\) can be parametrized by the coordinates \(\varphi ^A\), with \(A = 1, \ldots , N\), the moduli of the theory. Since we shall focus solely on single patches of the moduli space, in which the effective theory is well-defined, with an abuse of nomenclature, we will oftentimes denote a point in \(\mathcal {M}\) with its coordinate representation as \(\varphi ^A\). Additionally, within the given patch, the moduli space is endowed with the local metric \(G_{AB}(\varphi )\).

The effective action describing the dynamics of the moduli \(\varphi ^A\), defined for a finite cutoff \(\Lambda _{\tiny {\text {EFT}}}\) include the following contributions:

$$\begin{aligned} S^{(D)}_{\tiny {\text {EFT}}}= & {} M_{\text {P}}^{D-2} \int \textrm{d}^Dx\, \sqrt{-g} \nonumber \\{} & {} \times \left( \frac{1}{2} R - \frac{1}{2} G_{AB}(\varphi ) \partial _\mu \varphi ^A \partial ^\mu \varphi ^B + \ldots \right) \,. \end{aligned}$$
(2.1)

Here, \(M_{\text {P}}\) is the D-dimensional Planck mass, \(g = \det g_{\mu \nu }\), with \(g_{\mu \nu }\) being the spacetime metric, and R is the spacetime Ricci scalar. In (2.1), the dots denote some additional contributions that may include other fields, and are not relevant for the discussion that unfolds below; however, importantly, in this work we assume that the fields \(\varphi ^A\) are not constrained by any scalar potential.

The Distance Conjecture, first formulated in [1], delivers key statements about the geometry of the moduli space \(\mathcal {M}\) for any effective field theory of the form (2.1) that stems from a proper quantum gravity theory. The content of the conjecture can be divided into two parts:

Non-compactness of \(\mathcal {M}\): For any \(C > 0\) and for any point in \(\mathcal {M}\) parametrized by the coordinates \(\varphi ^A_0\), there exists a point, parametrized by the coordinates \(\varphi ^A\), such that

$$\begin{aligned} d(\varphi _0, \varphi ) > C, \end{aligned}$$
(2.2)

where \(d(\varphi _0, \varphi )\) is the length of a geodesic path stretching between the point \(\varphi ^A_0\) and the point \(\varphi ^A\). Namely, parametrizing such a geodesic path as \(\varphi ^A(\sigma )\), where \(\sigma \in [0,1]\), and with the endpoints \(\varphi ^A(0) = \varphi ^A_0\) and \(\varphi ^A(1) = \varphi ^A\), we define

$$\begin{aligned} d(\varphi _0, \varphi ) = \int _0^1 \sqrt{G_{AB} (\varphi ) \frac{\textrm{d} \varphi ^A}{\textrm{d} \sigma } \frac{\textrm{d} \varphi ^B}{\textrm{d} \sigma }}\, { \mathrm d} \sigma . \end{aligned}$$
(2.3)

Said differently, there exist some boundary points in \(\mathcal {M}\), that we label as \(\varphi ^A_\textrm{b}\), that are located at infinite geodesic distance from any other point in \(\mathcal {M}\).

Emergence of an infinite tower of states: There exists an infinite tower of states that become exponentially light as a boundary located at infinite field distance is reached. Specifically, let us call \(M_n(\varphi )\) the moduli-dependent masses of the states constituting the tower, with \(n \in \mathbb {N}\) the index labeling the states in the tower. Then, the Distance Conjecture asserts that the masses of the tower located at two points \(\varphi ^A\) and \(\varphi _0^A\) are related as

$$\begin{aligned} M_n(\varphi ) \sim M_n(\varphi _0)\, e^{ -\lambda d(\varphi _0, \varphi )}, \end{aligned}$$
(2.4)

with \(\lambda \) a real parameter.

The emergence of such an infinite tower of states badly breaks down the effective description that we started with: indeed, assuming the effective theory to be characterized by a finite cutoff \(\Lambda _{\tiny {\text {EFT}}}\) for any point \(\varphi ^A\), an infinite number of states should ideally be included close enough to the infinite-distance boundary. Moreover, it is worth stressing that, allegedly, the infinite tower with states behaving as in (2.4) is the relevant one that breaks down the effective field theory: there may be other infinite tower of states that also break down the effective field theory, but they are more massive than said relevant one.

In order to corroborate the conjecture, some tests have been performed in the literature during the last years. However, as remarked earlier, excluding some trivial, one-dimensional cases, tests have been revealed to be hard, even in simple setups with just a couple of moduli reaching the boundary. In fact, while the non-compactness of the moduli space seems a property that is commonly shared by all the moduli spaces that are obtained by compactifying string theory over some internal manifold, proving the emergence of an infinite tower of states towards an infinite-distance boundary, exhibiting the behavior (2.4), is far from trivial.

In principle, tests of the latter property should proceed in two steps: first, one should catalog all the the possible ‘candidate’ microscopic states that could deliver an infinite tower emerging in the effective field theory; then, among these candidates, one should find the relevant one that breaks down the effective theory before the others. Secondly, one should show that the behavior of the masses of the states within the relevant candidate tower behave as in (2.4) for some parameter \(\lambda \). It is then clear that, in order to perform such checks, not only a detailed knowledge of the microscopic theory is required; additionally, explicit checks of the behavior (2.4) require an explicit computation of the geodesic distance (2.3) which might not be analytically doable.

Furthermore, it is worth remarking that, in its original formulation, the Distance Conjecture did not cover several issues that may be pivotal in addressing concrete checks thereof. For instance, the emergence of an infinite tower of states should occur towards an infinite distance boundary regardless of how the boundary is approached. Namely, one can approach the boundary by following any arbitrary path, and may find a tower of states becoming massless with a fall-off of the masses still regulated by (2.4); however, choosing another path, another, different tower of states may be the relevant one for realizing the Distance Conjecture, being lighter than the former. Indeed, in this regard, the Distance Conjecture does not specify that the tower that is relevant along any path leading to a given boundary has to be unique. Actually, since one can approach a boundary along infinitely many inequivalent paths, and each may exhibit a different, relevant tower, the towers needed for realizing the conjecture may be infinite in number!

2.2 The tameness conjecture

The general, aforementioned issues can be addressed employing the Tameness Conjecture [17], whose statements we briefly overview. The Tameness Conjecture states the following:

Tameness conjecture: All the effective theories valid below a fixed finite energy cut-off scale can be labeled by a definable parameter space and are endowed with scalar field spaces and coupling functions that are definable in an o-minimal structure.

In this work we will not attempt to deliver a mathematical review of o-minimal structures, for which we refer to the original Tameness Conjecture paper [17] or the follow-up work [15]. Rather, we outline the consequences for the effective field theories that come by assuming the Tameness Conjecture.

For the sake of concreteness, consider a moduli-dependent coupling y appearing in the effective field theory, and assume that the o-minimal structure of interest is \(\mathbb {R}_{\text {an,exp}}\).Footnote 1 For instance, in the effective theories of the form (2.1) the coupling y may be one of the elements of the field space metric \(G_{AB}(\varphi )\), or one of the masses of the states within the tower (2.4) realizing the Distance Conjecture. Then, if the coupling y resides in a tame effective theory, supported by an \(\mathbb {R}_{\text {an,exp}}\) o-minimal structure, the functional form of y in terms of the moduli can be obtained by solving the following set of equations and inequalities:

$$\begin{aligned}{} & {} \exists \; x_1, \ldots , x_l: \quad P_I(\varphi ,\lambda ,x,y, f_1, \ldots , f_m, e^\varphi , e^\lambda , e^x, e^y)=0, \nonumber \\{} & {} \quad Q_J(\varphi ,\lambda ,x,y, f_1, \ldots , f_m, e^\varphi , e^\lambda , e^x, e^y)>0, \end{aligned}$$
(2.5)

In the locus above, \(P_I\) and \(Q_J\) are generic polynomials of their arguments. The latter include: the moduli \(\varphi ^A\) appearing in the effective field theory; some external parameters \(\lambda ^\kappa \) (which may be, for instance, internal geometrical parameters related to the specific compactification); the coupling y; some auxiliary variables \(x_1, \ldots , x_l\), which might be needed solely to recast the coupling y as the locus (2.5); some restricted analytic functionsFootnote 2\(f_1, \ldots , f_m\), with all the aforementioned variables as arguments.

As remarked in [15], although the loci (2.5) deliver general couplings of tame effective theories, in concrete string theory settings it is enough to focus on specific, though large families of tame couplings: namely, the ‘polynomially tamed couplings’ or its subset of ‘monomially tamed couplings’.

In order to define these families of couplings, it is convenient to be more specific about the patch of the moduli space where our effective field theory is defined. We are interested in near-boundary patches of the moduli space, that cover the neighborhood of a boundary of the moduli space. Within a patch so defined, which we call \(\mathcal {E}\), we separate the moduli fields \(\varphi ^A\) into two distinct sets:

Saxions \(s^i\):

(\(i = 1, \ldots , n\)): these fields span non-compact domains; we shall assume that \(s^i > 1\), and that the field space boundary around which our effective theory is defined is reached as \(s^i \rightarrow \infty \) \(\forall \, i\);

Axions \(a^\alpha \):

(\(\alpha = 1, \ldots , N - n\)): these are fields that span compact domains; after properly normalizing them, we will assume that \(a^\alpha \in [0,1[\), with the identification \(a^\alpha \simeq a^\alpha + 1\).

In sum, the patch \(\mathcal {E}\) acquires the following form:

$$\begin{aligned} \mathcal {E} = \{ |a^\alpha | < 1,\ \ s^1, s^2, \ldots , s^n > 1 \}\,. \end{aligned}$$
(2.6)

We consider a positive definite coupling y defined over the patch \(\mathcal {E}\); we then identify:

Monomially tamed couplings: These are couplings that admit the following, general expansion in terms of the moduli

$$\begin{aligned} y(s,a) = \sum _{ \textbf{m}}\rho _{\textbf{m}}(e^{-s^i},a^\alpha ) (s^1)^{m_1} \cdots (s^n)^{m_n}\,, \end{aligned}$$
(2.7)

where \(\rho _{\textbf{m}}(e^{-s^i},a^\alpha )\) are restricted analytic functions of their arguments, and \(\textbf{m} = (m_1, \ldots , m_n) \in \mathbb {Z}^n\), and they are such that they are well approximated by a monomial in \(\mathcal {E}\). The latter property is concretely expressed by requiring that

$$\begin{aligned} y(s,a) \sim (s^1)^{k_1} \cdots (s^n)^{k_n} \qquad \text {on} \quad \mathcal {E}\, \end{aligned}$$
(2.8)

for some \(k_1, \ldots , k_n \in \mathbb {Z}\). The symbol ‘\(\sim \)’ has to be understood as a double-boundness statement as follows: there exist two real, positive parameters \(C_1\), \(C_2\) such that \( C_1 (s^1)^{k_1} \cdots (s^n)^{k_n}< y < C_2 (s^1)^{k_1} \cdots (s^n)^{k_n}\) in \(\mathcal {E}\).

Polynomially tamed couplings: Analogously to monomially tamed couplings, polynomially tamed couplings exhibit the following, general expansion:

$$\begin{aligned} y(s,a) = \sum _{ \textbf{m}}\rho _{\textbf{m}}(e^{-s^i},a^\alpha ) (s^1)^{m_1} \cdots (s^n)^{m_n}\,. \end{aligned}$$
(2.9)

However, they cannot be well-approximated by a monomial as in (2.8); at most, over \(\mathcal {E}\), one can upper-bound a polynomially tamed coupling by a monomial as:

$$\begin{aligned} y(s, a) \prec (s^1)^{N_1} \cdots (s^n)^{N_n} \qquad \text {on} \quad \mathcal {E}\, \end{aligned}$$
(2.10)

for sufficiently large \(N_1, \ldots , N_n\). Here the symbol ‘\(\prec \)’ is to be understood as follows: there exists a parameter \(C > 0\) such that \(y(s, a) < C (s^1)^{N_1} \cdots (s^n)^{N_n}\) holds throughout \(\mathcal {E}\).

Clearly, monomially tamed couplings constitute a subset of the more general polynomially tamed couplings. Furthermore, such definitions can be particularized to a subset \(\mathcal {U} \subset \mathcal {E}\), and whether a coupling is monomially or polynomially tamed depends on the specific choice of set \(\mathcal {U}\): a coupling can be polynomially tamed \(\mathcal {U}\), but monomially tamed on a smaller set within \(\mathcal {U}\) or on a different subset within \(\mathcal {E}\).

As illustrated in [15], polynomially tamed couplings are enough to describe all the relevant couplings involving the vector multiplet sector of four-dimensional \(\mathcal {N}=2\) Type IIB effective theories, or the respective chiral multiplet sector of the orientifolded \(\mathcal {N}=1\) Type IIB effective theories. The deep reason resides in the fact that the couplings of these sectors descend from the periods of the internal Calabi–Yau manifold. However, polynomially tamed couplings are expected to be enough to characterize larger families of effective theories.

2.3 The distance conjecture in tame effective theories

In [15], the tameness of the couplings of the effective theory was employed to show how the predictions of the Distance Conjecture can be realized path independently on some regions of the moduli space, and that the Distance Conjecture may require only a finite number of different towers of states. Since the reasoning that leads to these findings in [15] is similar to the one we will employ in the next section, we will now briefly overview how these features are implied by the tameness of the effective field theory.

To begin with, recall that the main couplings that enter the Distance Conjecture are the factor \(e^{-\lambda d(s,a)}\) and the masses of the states within the infinite tower that realize the conjecture. However, since on different regions of the moduli space different towers may be relevant, for the sake of generality, we consider several infinite towers of states. We denote the masses of the states within these towers as \(M_n^{(t)}(s,a)\), with the index t labeling the tower. Notice that, in principle, the index t might be unbounded, for an effective theory can exhibit infinite candidate towers. In other words, the set T, to which the index t belongs, may be of infinite cardinality. Moreover, the towers may well be constituted by states with different density: namely, between some energy scales \(\Lambda \) and \(\Lambda + \delta \Lambda \), a tower may be constituted by a number \(n_1\) of states, and another by a different number \(n_2\) of states.

Given the tameness of the underlying effective theory, we may generically infer that \(e^{-\lambda d(s,a)}\) and the masses \(M_n^{(t)}(s,a)\) need to be tame functions of the axions and saxions. However, in [15] the following, stronger assumptions for these couplings were adopted:

  • the masses of all the states constituting the candidate towers \(M_n^{(t)}(s,a)\) are polynomially tamed;

  • \(e^{-\lambda d(s,a)}\) is polynomially tamed.

These assumptions were motivated in [15] by concrete, consistent effective field theory examples. It should be noted, however, that the polynomially tame behavior of the masses of candidate towers can be proven in some instances. For example, this is the case when the masses of the towers can be computed from the periods of the internal manifold, such as the masses of BPS D3-particles or BPS NS5/D5-membrane states of Type IIB four-dimensional effective theories [15].

Then, in [15] it was proposed to partition the patch \(\mathcal {E}\) of the moduli space employing the following method. First, we introduce a partition of the patch \(\mathcal {E}\) constituted by some sets \(\mathcal {U}_{\textsf{A}}\). Recall that the sets \(\mathcal {U}_{\textsf{A}}\) deliver a partition of \(\mathcal {E}\) provided that

$$\begin{aligned} \bigcup \limits _{\textsf{A}}\, \mathcal {U}_{\textsf{A}} = \mathcal {E}, \quad \text {and}\quad \mathcal {U}_{\textsf{A}} \cap \mathcal {U}_{\textsf{B}} = \varnothing \,, \text { if } {\textsf{A}}\ne {\textsf{B}}. \end{aligned}$$
(2.11)

Each subset \(\mathcal {U}_{\textsf{A}}\) of the partition is defined in such a way that, on every subset \(\mathcal {U}_{\textsf{A}}\), \(e^{-\lambda d(s,a)}\) displays a definite growth. For instance, on a single subset \(\mathcal {U}_{\textsf{A}}\), \(e^{-\lambda d(s,a)}\) may behave as a monomial – namely, it is monomially tamed, in the language of the previous section; or it may be strictly polynomially tamed (for instance, it may behave as a decreasing exponential in the saxions \(s^i\)).Footnote 3

The partition constituted by the subsets \(\mathcal {U}_{\textsf{A}}\) covers the full patch \(\mathcal {E}\), and can be always introduced, provided the tameness of the factor \(e^{-\lambda d(s,a)}\). However, the Distance Conjecture predicts the behavior (2.4) towards boundaries that are located at infinite field distance. Therefore, among the subsets \(\mathcal {U}_{\textsf{A}}\), we may restrict our attention to the sets that, in [15], were called ‘near-boundary subsets’: these are the subsets \(\mathcal {U}_{\textsf{A}}\) whose closures include the boundary, with the latter assumed to be located at infinite field distance from any other point of the moduli space.

Fig. 1
figure 1

Example of a near-boundary partition of a patch \(\mathcal {E}\) of two-dimensional moduli space parametrized by the saxions \(s^1 > 1\) and \(s^2 > 1\). The ‘near-boundary’ subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\), here highlighted with darker colors, are those that extend towards regions where \(s^1 \rightarrow \infty \) and \(s^2 \rightarrow \infty \). The partition of the patch \(\mathcal {E}\) may however be constitute by other sets, here in lighter colors, that do not include the boundary point

For the sake of clarity, we will denote with \(\widehat{\mathcal {U}}_{\textsf{A}}\) these near-boundary subsets.Footnote 4 A pictorial representation of how such a near-boundary subsets may look like can be found in Fig. 1. On each of these near-boundary subsets, since \(e^{-\lambda d(s,a)}\) is polynomially tamed, it holds that

$$\begin{aligned}{} & {} e^{-\lambda d(s,a)}\ \prec (s^1)^{N^{\textsf{A}}_1} \cdots (s^n)^{N^{\textsf{A}}_n} \qquad \nonumber \\{} & {} \quad \text {on any near-boundary}\quad \widehat{\mathcal {U}}_{\textsf{A}}, \end{aligned}$$
(2.12)

with \(N^{\textsf{A}}_1, \ldots , N^{\textsf{A}}_n \in \mathbb {Z}_{\le 0}\), with at least one \(N^{\textsf{A}}_i \in \mathbb {Z}_{< 0}\); clearly, the specific values that bound \(e^{-\lambda d(s, a)}\) as in (2.12) depend on the near-boundary subset \(\widehat{\mathcal {U}}_{\textsf{A}}\).

Now, if the Distance Conjecture holds true towards any infinite field distance boundary point \(\varphi _\text {b}\), it has to hold true, in particular, on each specific near-boundary \(\widehat{\mathcal {U}}_{\textsf{A}}\) separately. Therefore, among the candidate infinite towers of states, with masses \(M_n^{(t)}(s,a)\), there must exist (at least) one tower, the relevant tower that realizes the Distance Conjecture labeled by the index \(t^r \in T\), such that

$$\begin{aligned} M_n^{(t^r)} (s,a) \sim e^{-\lambda d(s,a)} \qquad \text {on } \widehat{\mathcal {U}}_{\textsf{A}}. \end{aligned}$$
(2.13)

It is worth stressing that this relation holds on the full subset \(\widehat{\mathcal {U}}_{\textsf{A}}\), and thus it is true for any path within \(\widehat{\mathcal {U}}_{\textsf{A}}\).

Although this will not be discussed further in this work, we recall that in [15] it was further proved that, in order for (2.13) to hold, it is just enough that it holds on specific family of paths that were therein identified as cosmic string solutions. Therefore, reversing the reasoning, by employing cosmic string solutions one can test whether the relation (2.13) holds, and on which near-boundary subset \(\widehat{\mathcal {U}}_{\textsf{A}}\).

In sum, from (2.13), one can rewrite the near-boundary region spanned by the various subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\) as follows:

$$\begin{aligned} \bigcup _{\textsf{A}} \widehat{\mathcal {U}}_{\textsf{A}}= & {} \left( \bigcup _{k_1} \widehat{\mathcal {U}}^{(1,k_1)} \right) \cup \left( \bigcup _{k_2} \widehat{\mathcal {U}}^{(2,k_2)} \right) \cup \ldots \nonumber \\{} & {} \cup \left( \bigcup _{k_M} \widehat{\mathcal {U}}^{(M,k_M)} \right) . \end{aligned}$$
(2.14)

Here the set \(\widehat{\mathcal {U}}^{(t^r,k_{t^r})}\) denotes the subset where the tower labeled by the index t is the relevant one that realizes the Distance Conjecture, with \(k_{t^r}\) labeling the subsets (among the subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\) that we started with) for which the tower \(t^r\) is the relevant one.

Importantly, since the tameness of the effective theory couplings implies that the sum on the left-hand side of (2.14) is composed by a finite number of subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\), then also the sum on the right-hand side – that is just a rearrangement of the former – is composed by a finite number of sets. As such, only a finite number of different towers of states is needed to realize the Distance Conjecture.

3 Identifying effective-theory breakdown mechanisms via machine learning techniques

In this section we present the founding idea of this work. Namely, we concretely deliver a method to compute a moduli space partition similar to the one introduced in Sect. 2.3, based on the cutoff of the effective theory, by employing machine learning techniques. For the sake of conciseness and clarity, we will divide the discussion in two parts: first, we will illustrate the properties that the moduli space is expected to exhibit from a top-down viewpoint, building on the discussion of Sect. 2.3; then, we show how we can infer these properties from a bottom-up perspective, exploiting the k-nearest neighbor and support vector machines algorithms.

3.1 Top-down viewpoint: a UV-cutoff slicing of the moduli space

Across the moduli space, the couplings that characterize the effective field theory change. Consequently, also the effective field theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) is expected to be a moduli-dependent function that may drastically change throughout the moduli space. In this section, the first, generic question that we address is: which general functional properties of the effective theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) can we infer from a top-down viewpoint?

First, we assume that the effective theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) can be related to couplings that appear in the effective field theory, and some examples supporting this assumption will be provided shortly. As such, the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) is expected to inherit all the general features that any other coupling of the effective theory shares. In particular, if the effective field theory is tame, in compliance with the Tameness Conjecture reviewed in Sect. 2.2, then \(\Lambda _{\tiny {\text {EFT}}}\) is a tame function of the moduli. However, as outlined in Sect. 2.3, concrete effective theory exhibit couplings that are not generically tame, but they rather belong to a smaller family of tame functions, namely the one of polynomially tamed functions. Therefore, we will assume that the effective theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) is a polynomially tamed function of the moduli as well.

Then, we focus on a near-boundary patch \(\mathcal {E}\) of the moduli as defined in (2.6). The patch \(\mathcal {E}\) can then be partitioned following a reasoning similar to the one performed in Sect. 2.3, based on the behavior of the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) understood as function of the axions \(a^\alpha \) and the saxions \(s^i\). Specifically, being the cutoff polynomially tamed and bounded as in (2.10), we can partition \(\mathcal {E}\) into a finite number of smaller subsets \(\mathcal {U}_{\textsf{A}}\) such that, on each \(\mathcal {U}_{\textsf{A}}\), the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) is either monomially tamed, behaving as (2.8), or strictly polynomially tamed, obeying (2.10) but not (2.8). Notice that, with respect to what we did in Sect. 2.3, in this section we will be also interested in the subsets \(\mathcal {U}_{\textsf{A}}\) whose closures do not include the boundary point, for we wish to characterize the behavior of the cutoff throughout the full patch \(\mathcal {E}\).

Therefore, analogously to what we did in (2.14), we may rearrange the subsets \(\mathcal {U}_{\textsf{A}}\) of the partition as follows: on each subset \(\mathcal {U}_{\textsf{A}}\), the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) exhibits a definite growth behavior, and is expected to be related to the behavior of a single effective field theory coupling - say \(y^{(r)}\), with r the index labeling the specific coupling. As an example, \(y^{(r)}\) can be identified with the lightest mass \(M_{\text {lightest}}^{(r)}\) among the states that could break down the effective description. Thus, as in Sect. 2.3, we denote as \({\mathcal {U}}^{(r,k_{r})}\) the subset where the coupling labeled by the index r is the one that gives the cutoff, namely

$$\begin{aligned} \Lambda _{\tiny {\text {EFT}}} (s,a) \equiv y^{(r)} (s,a) \qquad \text {on } {\mathcal {U}}^{(y^r,k_{r})}, \end{aligned}$$
(3.1)

and \(k_{r}\) labels the different subsets - among the \(\mathcal {U}_{\textsf{A}}\) - where the identification (3.1) holds. We can thus rearrange the subsets \(\mathcal {U}_{\textsf{A}}\) partitioning the patch \(\mathcal {E}\) as

$$\begin{aligned} \bigcup _{\textsf{A}} {\mathcal {U}}_{\textsf{A}}= & {} \left( \bigcup _{k_1} {\mathcal {U}}^{(1,k_1)} \right) \cup \left( \bigcup _{k_2} {\mathcal {U}}^{(2,k_2)} \right) \cup \ldots \nonumber \\{} & {} \cup \left( \bigcup _{k_M} {\mathcal {U}}^{(M,k_M)} \right) , \end{aligned}$$
(3.2)

composed just by a finite number of subsets \(\mathcal {U}_{\textsf{A}}\).

Fig. 2
figure 2

Here is an example of a partition (figure in the center) of a two-dimensional moduli space parametrized by the saxions \(s^1 > 1\) and \(s^2 > 1\) (depicted on the left). The four subsets of the patch \(\mathcal {E}\), here labeled by different colors, denote the unions of subsets \(\bigcup _{k_{r}} {\mathcal {U}}^{(y^r,k_{r})} \) where the effective cutoff \(\Lambda _{\tiny {\text {EFT}}}\) is identified with the \(y^r\)-th coupling. On the right is a schematic depiction of how the effective field theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) may appear over the partition of the two-dimensional saxionic space. On each subset, denoted with different color, the cutoff is related to a different coupling \(\Lambda _{\tiny {\text {EFT}}}\), and it may thus exhibit a different functional behavior

We stress that the fact that the partition (3.2) is composed by a finite number of subsets is nontrivial: in fact, towards boundaries located at infinite field distance, an infinite number of exponentially light states ought to appear, and there is, in principle, no guarantee that only a finite number of them are relevant in defining the cutoff at which the effective theory breaks down. Indeed, it is the tameness of the effective theory cutoff (3.1) that critically implies that the number of states breaking down the effective theory is finite, due to the finiteness of the partition (3.2).

In order to give an idea of how this partition may look like, a pictorial representation of such a partition of a two-dimensional moduli space, spanned by the saxions \(s^1\) and \(s^2\), is depicted in Fig. 2. The cutoff \(\Lambda _{\tiny {\text {EFT}}}\), which is here a function of the saxions \(s^1\) and \(s^2\), exhibits a definite growth, or fall-off on each of the subsets plotted in Fig. 2. The functional form of the cutoff \(\Lambda _{\tiny {\text {EFT}}}(s^1,s^2)\) is then the union of the various cutoff functions \(\Lambda _{\tiny {\text {EFT}}}(s^1,s^2)\) over the subsets.

The partition induced by the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) may be related to the one introduced in Sect. 2.3 in order to realize the Distance Conjecture, at least in the near-boundary subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\). In fact, the Distance Conjecture may give a recipe in order to identify the cutoff of an effective theory that admits a quantum gravity UV completion. Indeed, within each of the near-boundary subsets \(\widehat{\mathcal {U}}_{\textsf{A}}\) an infinite tower of states becoming light, and the maximal cutoff that allows the effective field theory to be valid in order to avoid the inclusion of the entirety of the tower is

$$\begin{aligned} \Lambda _{\tiny {\text {EFT}}} = M_{\text {lightest}}^{(t^r)} \qquad \text {on the near-boundary sets } \widehat{\mathcal {U}}_{\textsf{A}}, \end{aligned}$$
(3.3)

where \(M_{\text {lightest}}^{(t^r)}\) denotes the lightest among the masses \(M_n^{(t^r)} (s,a)\) of the states of the infinite tower (labeled by the index \(t^r\)) that realizes the Distance Conjecture as in (2.13) over \(\widehat{\mathcal {U}}_{\textsf{A}}\).

Therefore, the near-boundary sets \(\widehat{\mathcal {U}}_{\textsf{A}}\) of the partition \(\{ \mathcal {U}_{\textsf{A}} \}\) induced by the cutoff are expected to be similar – if not equal – to those that realize the partition as in (2.13). For example, in these subsets the cutoff \(\Lambda _{\tiny {\text {EFT}}}\) may be identified with the mass of the lightest among the Kaluza–Klein modes or winding modes; or, away from decompactification limits, \(\Lambda _{\tiny {\text {EFT}}}\) may be identified with an emergent string scale [7,8,9,10,11,12].

Alternatively, other brane states may deliver the effective theory cutoff within near-boundary subsets. For instance, in four-dimensional Type IIB effective theories, the Distance Conjecture may be realized by infinite towers of D3-particles [3,4,5,6] or D5/NS5-membrane states [13, 15]; then, the cutoff may be given by the lowest among the masses of these states, and these were proven to be polynomially tamed in [15], thus motivating the hypothesis employed at the beginning of this section.

For the sets of the partition \(\{ \mathcal {U}_{\textsf{A}} \}\) that do not include the boundary point the identification of the cutoff might be more subtle. Therein, the cutoff might be still given by the lightest mass of an infinite tower of states (that, however, does not become massless in said subset). More generally, some other state that is not necessarily part of an infinite tower can be the first one to be encountered when probing higher energies, and thus its mass may be employed as effective field theory cutoff \(\Lambda _{\tiny {\text {EFT}}}\) so as to avoid its inclusion in the effective theory that we started with.

3.2 Bottom-up viewpoint I: partitioning the moduli space via a k-nearest neighbor algorithm

We will now illustrate how the partition of the moduli space that follows the recipe in (3.3) can be uncovered from a bottom-up perspective. Namely, consider an effective field theory with a nontrivial moduli space, defined over a patch \(\mathcal {E}\) that is parametrized by the coordinates \(\varphi ^A\). Here, we shall address the following question: given an arbitrary point \(\varphi ^A_0 \in \mathcal {E}\), what is the type of states that would ‘likely’ break down the effective field theory there?

Knowing the answer to this question for a sufficient number of points would allow one to reconstruct the partition dictated by (3.3), and eventually infer some key properties about the UV-completion of the effective theory under investigation. In this section, we will first answer the question by exploiting the k-nearest neighbor algorithm. This is a supervised machine learning algorithm that is commonly used in Data Science. However, before showing how to use such an algorithm, we first need to specify the data that one should minimally feed to the algorithm to make it work.

The data: Consider a generic effective theory with a nontrivial moduli space, and assume that we are able to measure some physical quantities that the effective theory describes. Specifically, let us assume that, with appropriate measurements carried at some spacetime point \(x^\mu _1\), we are able to infer the vev’s of the moduli fields at that point, \(\varphi ^A(x_1) \equiv \varphi ^A_1\). In other words, the vev’s \(\varphi ^A_1\) are those that reproduce the value of the various physical quantities that we measure.

Assume that we know that our effective theory is valid up to some energy scale \(E \le \Lambda _1\). However, we wish to get some minimal information about the UV-completion of the effective theory. Therefore, let us assume that we can increase the energy scales E that are probed beyond \(\Lambda _1\). At some energy scale \(\Lambda _1' \ge \Lambda _1\) a new state will be detected, and the effective field theory we started with may not be enough to describe its dynamics: we say that this state is the source of the breakdown of the effective theory, and its mass \(m_1 \simeq \Lambda _1'\) may provide the cutoff of the effective theory that we started with, and we set \(\Lambda _{\tiny {\text {EFT}}}(\varphi _1) = m_1\). Let us denote the type of this state as ‘Type I’. For instance, the type may be the collection of the quantum numbers that specify the state, or it can specify the microscopic origin of the state, for instance the type of brane which it originates from.

Now, let us assume that we can move our probing instruments to some other location, at some different coordinates \(x^\mu _2\). The vev’s of the moduli at the spacetime point \(x^\mu _2\) might be different from the one at the point \(x^\mu _1\).Footnote 5 Following the same procedure as the one carried at the spacetime point \(x^\mu _1\), let us increase the energy scales probed until we find a new state that would thus break down the effective theory that we started with. As above, the mass of this state would tell us the cutoff of the effective field theory at the moduli space point \(\varphi ^A_2\), as \(\Lambda _{\tiny {\text {EFT}}}(\varphi _2) = m_2\). This state may be the same that we previously found at \(x^\mu _1\), namely a ‘Type I’ state; or it could be a different kind of state – say, of ‘Type II’.

We assume that this procedure can be replicated at several, different spacetime points, corresponding to different points in the moduli space. The set of the moduli space points \(\{\varphi ^A_k\}\) probed in this way, along with the types \(\text {Type}\, J_{k}\) of the states that first break down the effective theory at the point \(\varphi ^A_k\) constitute the data that we will be focusing on and which we will feed to the machine learning algorithm:

$$\begin{aligned} \text {Data} = \{ (\varphi ^A_{k},\, \text {Type}\, J_{k}) \}. \end{aligned}$$
(3.4)

Clearly, at the moment, we do not have access to such a phenomenological knowledge. In order to compensate to this lack, and in order to show how the algorithm of the following section works, we will rather generate the data (3.4) by exploiting the known microscopic description of the effective theory. In Data Science language, these sorts of data that are not obtained from external sources, but they are rather artificially crafted, are oftentimes called ‘synthetic data’.

Fig. 3
figure 3

An example of data for a two-dimensional field space, spanned by the saxions \(s^1\) and \(s^2\). Each point represents one measurement, and the color labels the kind of state that breaks down the effective theory. The states are here assumed to scale, in terms of the saxions, as \(\sim \frac{1}{s^2}\) (green dots), \(\sim \frac{1}{s^1}\) (blue dots), \(\sim \left( \frac{s^2}{s^1}\right) ^4\) (teal dots)

In Fig. 3 is a toy-example of such a synthetic data set, which has been obtained as follows.Footnote 6 Consider an effective theory, which we assumed to be endowed with a nontrivial moduli space. Within its moduli space, we focus on a two-dimensional subset thereof, that is spanned by two saxions, \(s^1\) and \(s^2\), while fixing the other moduli to some given vev’s. We assume that the states that could break down the effective theory could belong to three species, whose masses are as \(M^{(1)} = \frac{1}{s^1}\), \(M^{(2)} = \left( \frac{s^2}{s^1}\right) ^4\), \(M^{(3)} = \frac{1}{s^2}\). For instance, they might be the lightest states of some towers of infinite states, scaling in this fashion in the saxions \(s^1\) and \(s^2\).

Then, we randomly generate 500 moduli space points \((s^1, s^2)\), in order to simulate some phenomenological measurements probing those moduli space points. For each point so generated, we determine which is the relevant state that leads to the breakdown of the effective field theory: namely, we determine which is the lightest among the states with masses \(M^{(1)}\), \(M^{(2)}\) and \(M^{(3)}\). The different colors of the points in Fig. 3 tell which of the three species deliver the leading state breaking down the effective theory.

The k-nearest neighbor algorithm: Datasets like the one depicted in Fig. 3 constitute the sources upon which we can apply machine learning methods. As anticipated earlier, we will first employ the k-nearest neighbor algorithm. Here we will highlight some of the basic features of the algorithm, and we refer to Appendix A for a more detailed review thereof.

Preliminarily, it is worth stressing that the question at hand can be regarded as a classification problem that requires supervised machine learning techniques. Indeed, in terms of the new terminology introduced, the question, mentioned at the beginning of this section, that we wish to address can be reformulated as follows: given a point \(\varphi ^A_{\tiny {\text {new}}}\) of the moduli space that is not contained in the dataset (3.4) that we started with, what is the Type \(J_{\tiny {\text {new}}}\) of state that will likely first break down the effective theory at \(\varphi ^A_{\tiny {\text {new}}}\)? Therefore, this is a classification problem because we wish to tell to which category (i.e. the Type \(J_{\tiny {\text {new}}}\) of the breakdown state) the new variable (i.e. the point \(\varphi ^A_{\tiny {\text {new}}}\)) is associated with. As explained in Sect. 3.1, the tameness of the effective theory guarantees that the Type \(J_{\tiny {\text {new}}}\) belong to a finite set. Furthermore, the problem requires a supervised approach since we know the kind of ‘target’ that we wish to predict – i.e. the Type \(J_{\tiny {\text {new}}}\) of the breakdown state.

The k-nearest neighbor algorithm is a supervised machine learning algorithm that addresses precisely such classification problems via the following procedure. First, as any other machine algorithm, it requires a training dataset, and a test dataset. In the following, we will devote \(80\%\) of the data in (3.4) to the training set, and the remaining \(20\%\) to the test set.

Now, consider a moduli space point \(\varphi ^A_{\tiny {\text {new}}}\), not included in the training part of the dataset (3.4). The algorithm computes the Euclidean distance from the point \(\varphi ^A_{\tiny {\text {new}}}\) to all the points in the training set, and picks the closest k points. Then, among such k neighbors, the algorithm determines what is the most recurring Type \(J_{k}\) of states that break down the effective description. The most common type will be the Type \(J_{\tiny {\text {new}}}\) assigned to \(\varphi ^A_{\tiny {\text {new}}}\).

We remark that the choice of k is arbitrary, and one could tune k in a such a way that the algorithm delivers the best performance on the test set. Typically, one should expect that, the higher the value of k, the more stable the results of the algorithm are. However, a too great value of k might just trivialize the algorithm and deliver wrong predictions, and a compromise choice has to be taken.

Fig. 4
figure 4

The application of the k-nearest neighbor algorithm to the data plotted in Fig. 3 leads the regions here depicted, with the color scheme following the same as for the towers in Fig. 3. The algorithm has an accuracy of 0.97 on the training test set, and an accuracy of 0.99 on the test set

In the following, we will employ such an algorithm to plot ‘decision regions’. In order to obtain the latter we consider a grid, composed of several hundred points that cover the subset of the moduli space that we will be focusing on. Then, we apply the k-nearest neighbor algorithm for each point of the grid, with the decision based on the training set, and the decision regions are plotted joining the points of the grid belonging to the same type. The decision regions so obtained portray the approximate shape of the subsets that constitute the UV-cutoff-induced partition.

An example of output delivered by the k-nearest neighbor algorithm is contained in Fig. 4, where we applied the algorithm to the synthetic data shown in Fig. 3.

3.3 Bottom-up viewpoint II: obtaining decision boundaries with support vector machines

The k-nearest neighbor algorithm allows for obtaining a partition of the moduli space in general. However, due to the very nature of the algorithm, the decision boundaries that separate the subsets of the partition might be quite irregular – an issue that becomes more and more severe as the value of nearest neighbors k is lowered. As a result, it might be hard to determine the shape of the decision boundaries, as in Fig. 4. In order to sharpen the prediction of the decision boundaries some more sophisticate machine learning algorithms may be employed, such as the ‘support vector machine algorithms’, one of which we will now discuss.

For simplicity, we will focus on the case in which the masses of the states \(M^{(r)}\) that could break down the effective theory as in (3.3) are all monomially tamed, behaving as in (2.8) in the entire patch \(\mathcal {E}\) of the moduli space. Such masses behave as monomials in the saxions as the boundaries of the moduli space patch \(\mathcal {E}\) are approached, and \(\log M^{(r)}\) is well-approximated by a linear combination of the logarithms of the saxions:

$$\begin{aligned} \log M^{(r)} \sim \sum \limits _{i = 1}^n k_i^{(r)} \log s^i \qquad \text {on} \quad \mathcal {E}. \end{aligned}$$
(3.5)

For instance, this is the case of the states that break down the effective description in Fig. 3, where the states are actual monomials of the saxions \(s^1\) and \(s^2\).

With this simplifying hypothesis in force, the characterization of the decision boundary becomes simpler. Indeed, consider two neighboring, near-boundary subsets of \(\mathcal {E}\), \(\widehat{\mathcal {U}}^{(r^1)}\) and \(\widehat{\mathcal {U}}^{(r^2)}\): we assume that in the former the state with mass \(M^{(r^1)}\) is the one that breaks down the effective theory, serving as the effective theory cutoff, while the state with mass \(M^{(r^2)}\) serves as cutoff for the second subset \(\widehat{\mathcal {U}}^{(r^2)}\). Clearly, it needs to hold that \(M^{(r^1)} \le M^{(r^2)}\) in \(\widehat{\mathcal {U}}^{(r^1)}\), whereas \(M^{(r^2)} \le M^{(r^1)}\) in \(\widehat{\mathcal {U}}^{(r^2)}\). The decision boundary that separates the subsets \(\widehat{\mathcal {U}}^{(r^1)}\) and \(\widehat{\mathcal {U}}^{(r^2)}\) is thus the locus on which \(M^{(r^1)} = M^{(r^2)}\). However, since we assumed that the behavior of the masses \(M^{(r^1)}\) and \(M^{(r^2)}\) obeys (3.5), as the boundaries are approached, the locus where \(M^{(r^1)} = M^{(r^2)}\) resembles an hyperplane in the \(\log s^i\) variables.

Clearly, the procedure illustrated can be repeated for any other pair of neighboring near-boundary sets, which will still be approximately separated by boundary decision hyperplanes, due to the monomially tamed behavior (3.5).

Fig. 5
figure 5

Here is plotted the application of the linear support vector machine algorithm to the dataset in Fig. 3. The algorithm produces the two dashed line plotted above, which separate the three subsets of the two-dimensional saxion space where each state is relevant. The slope of the upper, darker line is approximately \(\simeq 0.99\), while the slope of the lower, lighter line is \(\simeq 0.74\), very close to the expected values of 1 and 0.75, respectively

Support vector machines are machine learning algorithms that aim at separating the data according to the category they belong. The setup and the structure of the data on which the algorithm is based on are the same as the ones that we introduced in the previous section for the k-nearest neighbor algorithm: the dataset that we start with is as in (3.4) and is composed by a set of moduli space points \(\varphi ^A_k\), alongside with the type \(\text {Type} J_k\) of state that breaks down the effective theory at \(\varphi ^A_k\). For example, the dataset depicted in Fig. 3 may well serve as the starting point for support vector machine algorithms.

Due to the hypothesis (3.5), for our scopes it is enough to employ linear support vector machine algorithms. After feeding data of the form as in (3.4), the algorithm searches for (portions of) hyperplanes that best separate the dataset according to their categorical value. Specifically, in our case, the algorithm can find hyperplanes in the \(\log s^i\) variables that best separate the moduli patch of interest according to the type of state that breaks down the effective theory. An overview of how the linear support vector machine algorithm is contained in Appendix B. As an example in Fig. 5 is the application of the linear support vector machine algorithm to the dataset depicted of Fig. 3. The algorithm delivers the boundaries that best separate the three distinct subsets where each effective theory-breakdown state is relevant.

Furthermore, it is worth mentioning that, whenever the approximation (3.5) holds, the linear support vector machine algorithm may give critical information about the functional forms of the masses of the states. In fact, consider again the two neighboring sets considered above, \(\widehat{\mathcal {U}}^{(r^1)}\) and \(\widehat{\mathcal {U}}^{(r^2)}\), with the boundary between them dictated by the condition \(M^{(r^1)} = M^{(r^2)}\). The linear support vector machine algorithm provides the slope of the line where \(M^{(r^1)} = M^{(r^2)}\). In the case where we have just two saxion fields, \(s^1\) and \(s^2\), in the approximation (3.5), such a slope is given by \(\frac{k_2^{r_2} - k_2^{r_1}}{k_1^{r_1} - k_1^{r_2}}\). Thus, by knowing such a slope – as well as the slope of other lines separating this region from other neighboring ones – one can infer information about the \(k_i^{(r)}\) factors in (3.5) determining the monomial decay of the leading states that break down the effective theory.

Although the linear support vector machine algorithms allow for having full control of the boundary decision, they clearly have some limitations. Firstly, it is worth stressing that, even with the hypothesis (3.5) in force, the decision boundaries are asymptotically hyperplanes: for small values of the saxions, far away from the boundaries of \(\mathcal {E}\), although (3.5) holds, \(\log M^{(r)}\) might not be approximated by a linear combination of the logarithms of the saxions sufficiently well. Moreover, if the couplings are not monomially tamed but, rather polynomially tamed, then the decision boundaries are not hyperplanes in general, even in the near-boundary regime. Nonlinear version of support vector machine algorithms have also been introduced: they exploit a ‘kernel trick’ consisting of artificially recasting the problem as a higher-dimensional one, where the data can be separated by hyperplanes. The application of nonlinear support vector machines to the UV-cutoff slicing of the moduli space, which are necessary when the couplings breaking down the effective theory are not monomially tamed, is left for future work.

4 A concrete example: a toroidal orbifold compactification

This section is devoted to showing how the ideas explained in Sect. 3 apply to a concrete effective field theory. The effective field theory that we consider is a four-dimensional \(\mathcal {N}=2\) supergravity effective field theory obtained after compactifying the ten-dimensional Type IIB string theory over a Calabi–Yau manifold, and we will choose the latter to be the toroidal orbifold \(T^6/(\mathbb {Z}_2 \times \mathbb {Z}_2')\) considered in [22]. The reasons for focusing on this model rely on the simplicity of the relation of the moduli with respect to the geometric data, and on the clear identification of the candidate infinite towers of states that can break down the effective description.

Indeed, we will use such a simple model in order to generate a synthetic dataset of points in the moduli space, labeled according to the state that first breaks down the effective theory. However, we stress that, in principle, the machine learning examination proposed in Sect. 3 can be applied to – and is more meaningful for – datasets for which we do not know the microscopic origin: in fact, the construction of the partition proposed in Sect. 3 may shed light on the geometrical details of the partitions. Thus, the dataset that we will artificially construct in this section should be understood as a ‘simulation’ for a case in which one has just a dataset so constructed, without ultraviolet information. On the other, clearly, working with an example for which the geometrical details are known is more instructive for understanding the interplay between the internal geometry and the moduli space partition observed at low-energy.

Let us review briefly the relevant features exhibited by the the four-dimensional compactification that we consider. Preliminarily, we recall that the ten-dimensional Type IIB effective theory that we start with, in the conventions of [23], is

$$\begin{aligned} S_{\tiny {\text {EFT}}}^{(10)}= & {} \frac{1}{2 \kappa ^2_{10}}\int d^{10}x\, \sqrt{-g} \Bigg [ e^{-2\phi } \left( R + 4 \partial \phi \cdot \partial \phi - \frac{1}{2} |H_3|^2 \right) \nonumber \\{} & {} - \frac{1}{2} |F_1|^2 - \frac{1}{2} |\tilde{F}_3|^2 - \frac{1}{4} |\tilde{F}_5|^2 \Bigg ] \nonumber \\{} & {} - \frac{1}{4 \kappa ^2_{10}} \int C_4 \wedge H_3 \wedge F_3, \end{aligned}$$
(4.1)

where \(2 \kappa ^2_{10} = 2\pi / \ell _\text {s}^8\), with the string length \(\ell _\text {s}^2 = 2\pi \alpha '\), and string mass \(M_\text {s} = \frac{1}{\ell _\text {s}}\). Here, \(\phi \) denotes the ten-dimensional dilaton, related to the string coupling as \(g_\text {s} = e^\phi \), \(H_3 = \textrm{d} B_2\) and \(F_1\), \(\tilde{F}_3\) and \(\tilde{F}_5\) being the gauge invariant Ramond–Ramond field strengths.

We will consider the following ansatz for the ten-dimensional spacetime metric:

$$\begin{aligned} \textrm{d}s^2_{10D} = e^{2A} \textrm{d}s^2_{4D} + \textrm{d}s^2_{Y}, \end{aligned}$$
(4.2)

with \(Y = T^6/(\mathbb {Z}_2 \times \mathbb {Z}_2')\) being the six-dimensional internal manifold. We shall choose the dimensionless warp factor \(e^{2A}\) in such a way that the effective theory (4.1), once recast to the ten-dimensional Einstein frame, and then reduced over the metric (4.2), delivers a four-dimensional effective theory that is expressed in the four-dimensional Einstein frame:

$$\begin{aligned} e^{2A} = \frac{M_\text {P}^2 \ell _\text {s}^8}{4 \pi V_Y}, \end{aligned}$$
(4.3)

where \(V_Y\) is the volume of the internal manifold Y expressed in the Einstein frame.

On the internal \(T^6/(\mathbb {Z}_2 \times \mathbb {Z}_2')\) orbifold: Let us denote with \(z^i\), with \(i = 1, 2, 3\) the three holomorphic coordinates of the internal torus, and with \(\hat{x}^j\), \(j = 1,\ldots , 6\) the internal, real coordinates. In the following, it will be convenient to take the coordinates \(\hat{x}^j\) dimensionless, and such that they span circles of unitary length. As such, the holomorphic coordinates \(z^i\) and the real coordinates \(\hat{x}^j\) so defined can be related as

$$\begin{aligned} \textrm{d} z^i = 2\pi \left( R_{2i - 1} \textrm{d} \hat{x}^{2i - 1} + \textbf{i}R_{2i} \textrm{d} \hat{x}^{2i} \right) \end{aligned}$$
(4.4)

where \(R_i\) is the i-th internal radius, expressed in Einstein frame.

The orbifold \(\mathbb {Z}_2\) and \(\mathbb {Z}_2'\) operations act on the internal coordinates as follows [22]

$$\begin{aligned} \theta : {\left\{ \begin{array}{ll} z^1 \rightarrow - z^1 \\ z^2 \rightarrow - z^2 \\ z^3 \rightarrow z^3 \end{array}\right. }, \qquad \theta ': {\left\{ \begin{array}{ll} z^1 \rightarrow z^1 \\ z^2 \rightarrow - z^2 \\ z^3 \rightarrow -z^3 \end{array}\right. }. \end{aligned}$$
(4.5)

A choice of symplectic basis of three-forms \(\{\alpha _I, \beta ^J\}\), obeying \(\int _Y \alpha _I \wedge \beta ^J = \delta _I^J\) and that is invariant under the orbifold action (4.5) is

$$\begin{aligned} \alpha _0= & {} \textrm{d} \hat{x}^1 \wedge \textrm{d} \hat{x}^3 \wedge \textrm{d} \hat{x}^5, \qquad \alpha _1 = \textrm{d} \hat{x}^2 \wedge \textrm{d} \hat{x}^3 \wedge \textrm{d} \hat{x}^5, \nonumber \\ \alpha _2= & {} \textrm{d} \hat{x}^1 \wedge \textrm{d} \hat{x}^4 \wedge \textrm{d} \hat{x}^5, \qquad \alpha _3 = \textrm{d} \hat{x}^1 \wedge \textrm{d} \hat{x}^3 \wedge \textrm{d} \hat{x}^6, \nonumber \\ \beta ^0= & {} - \textrm{d} \hat{x}^2 \wedge \textrm{d} \hat{x}^4 \wedge \textrm{d} \hat{x}^6, \qquad \beta ^1 = \textrm{d} \hat{x}^1 \wedge \textrm{d} \hat{x}^4 \wedge \textrm{d} \hat{x}^6, \nonumber \\ \beta ^2= & {} \textrm{d} \hat{x}^1 \wedge \textrm{d} \hat{x}^3 \wedge \textrm{d} \hat{x}^6, \qquad \beta ^3 = \textrm{d} \hat{x}^2 \wedge \textrm{d} \hat{x}^4 \wedge \textrm{d} \hat{x}^5. \end{aligned}$$
(4.6)

The moduli space: The moduli space of the four-dimensional theory is composed by three sectors. Here, we shall focus on the saxionic part of these sectors only. Firstly, the ten-dimensional dilaton \(\phi \) appears in the four-dimensional action as modulus. The saxion, defined out of the ten-dimensional dilaton, \(s_0 = e^{-\phi } = \frac{1}{g_\text {s}}\) is assumed to be large, so as the string coupling is small. A second sector is formed by the Kähler moduli \(v_i\), which are most readily associated with the areas of the internal two-dimensional tori as

$$\begin{aligned} v_i = \left( \frac{2\pi }{\ell _\text {s}}\right) ^2 R_{2i-1} R_{2i}. \end{aligned}$$
(4.7)

The third sector, composed by the tori complex structure moduli, is slightly more involved to be identified. The first step is to recast the Calabi–Yau unique holomorphic three-form \(\Omega = \textrm{d} z^1 \wedge \textrm{d} z^2 \wedge \textrm{d} z^3\) in terms of the symplectic basis (4.6) as follows:

$$\begin{aligned} \Omega = \ell _\text {s}^3 \left( X^I \alpha _I - \mathcal {F}_I(X) \beta ^I\right) . \end{aligned}$$
(4.8)

Here, \(\mathcal {F}_I(X)\) denote the derivatives of the holomorphic prepotential \(\mathcal {F}(X) = - \frac{X^1 X^2 X^3}{X^0}\) with respect to the coordinates \(X^I\). Then, inserting (4.4) and (4.6) in the identification (4.8), one obtains how the coordinates \(X^I\) and \(\mathcal {F}_I(X)\) are expressed in terms of the geometric data. Then, the complex structure saxions \(s^i\) are identified in terms of the internal \(S^1\) radii as

$$\begin{aligned} s^i = \textrm{Im} \frac{X^i}{X^0} = \frac{R_{2i}}{R_{2i-1}}. \end{aligned}$$
(4.9)

For future reference, we report here how the radii of the internal \(S^1\) are expressed in terms of the Kähler and complex structure moduli by inverting (4.7) and (4.9)

$$\begin{aligned} R_{2i - 1}^2 = \left( \frac{\ell _\text {s}}{2\pi }\right) ^2 \frac{v_i}{s^i}, \qquad R_{2i}^2 = \left( \frac{\ell _\text {s}}{2\pi }\right) ^2 v_i s^i, \end{aligned}$$
(4.10)

and we note the Einstein-frame internal volume is \(V_Y = \ell _\text {s}^6 v_1 v_2 v_3\).

4.1 Candidate effective theory-breakdown states

The four-dimensional effective field theories obtained from the compactification of Type IIB string theory over the aforementioned \(T^6/(\mathbb {Z}_2 \times \mathbb {Z}_2')\) orbifold may be broken down by different sources, as the energy scales that are probed are increased. Indeed, the microscopic description predicts the existence of several infinite towers of states that can break down the effective theory, which we now enlist.

Fig. 6
figure 6

A random set of 500 points in the two-dimension moduli subspace spanned by the complex structure saxions \(s^1\) and \(s^2\), in the range [1, 100], for the indicated value of the third complex structure saxion \(s^3\), and Kähler modulus \(v_1\). The different shapes and colors of the points label which type of states, among those listed in Sect. 4.1, first breaks down the effective theory. The other moduli, not shown in the plot, have been assumed to have the following, fixed vev’s: \(v_2 = 20\), \(v_3 = 30\), \(\phi = -10\)

Fig. 7
figure 7

The application of the k-nearest neighbor algorithm to the datasets in Fig. 6

Fig. 8
figure 8

The application of the linear support vector machine algorithm to the datasets in Fig. 6

Fig. 9
figure 9

The application of the k-nearest neighbor algorithm to a sample of 500 data points, in presence of extended objects

Fig. 10
figure 10

The application of the linear support vector machine algorithm to a sample of 500 data points, in presence of extended objects

Tower Ia: Kaluza–Klein modes: The Kaluza–Klein modes are prime candidates that could break down effective field theories stemming from a compactification of a higher dimensional theory. Moreover, as outlined in [1], since they come in infinite number, they could well be the infinite states that realize the Distance Conjecture. Here we will be specifically interested in the scalar Kaluza–Klein modes. They can originate, for instance, from the excitations of the ten-dimensional dilaton \(\phi \) appearing in (4.1) along some of the internal dimension. For the toroidal model at hand, one can identify six different types of Kaluza–Klein mode, according to the \(S^1\) along which the excitation occurs.

In order to compute the masses of the dilation-originated Kaluza–Klein modes one can start with the ten-dimensional action (4.1), switch to the Einstein frame, expand the dilaton in Fourier component along the chosen internal \(S^1\), and then compactify the theory using the metric ansatz (4.2), with the warp factor chosen as in (4.3). Finally, one gets the following, four-dimensional masses for the six different types of Kaluza–Klein modes:

$$\begin{aligned} m^{\text {KK}, i}_{k} = \frac{2 \pi k}{R_i} e^A, \end{aligned}$$
(4.11)

where \(k \in \mathbb {N}\) denotes the excitation level, and \(R_i\) is the Einstein-frame radius of the \(S^1\) delivering the excitation. Exploiting the relations (4.3) and (4.10) we can re-express the Kaluza–Klein masses in terms of the moduli as

$$\begin{aligned} m^{\text {KK}, 2i-1}_{k}= & {} M_\text {P} (2\pi k) \sqrt{\frac{\pi }{v_1 v_2 v_3}} \frac{s^i}{v_i}, \qquad \nonumber \\ m^{\text {KK}, 2i}_{k}= & {} M_\text {P} (2\pi k) \sqrt{\frac{\pi }{v_1 v_2 v_3}} \frac{1}{ s^i v_i}. \end{aligned}$$
(4.12)

Tower Ib: winding modes: A second candidate of tower of states that could break down any effective theory that originates from a compactification is the tower of winding modes. This tower is dual of the Kaluza–Klein modes tower discussed above. The masses for the winding modes can be most readily obtained from the ones of the Kaluza–Klein modes in (4.11) upon exchanging \(R_i \rightarrow \alpha '/R_i\):

$$\begin{aligned} m^{\text {w}, i}_{k} = \frac{2 \pi k}{\alpha '} e^A R_i , \end{aligned}$$
(4.13)

which can be recast, in terms of the moduli, as

$$\begin{aligned} m^{\text {w}, 2i-1}_{k}= & {} M_\text {P} k \sqrt{\frac{\pi }{v_1 v_2 v_3}} \frac{v_i}{s^i}, \qquad \nonumber \\ m^{\text {w}, 2i}_{k}= & {} M_\text {P} k \sqrt{\frac{\pi }{v_1 v_2 v_3}} s^i v_i. \end{aligned}$$
(4.14)

Kaluza–Klein and winding modes may be the source of the effective description breakdown for any theory that stems from dimensional reductions. However, let us now specialize to potential breakdown sources that are peculiar to Type IIB effective theories. These may originate from the brane content that characterizes the ten-dimensional Type IIB string theory.

Tower II: D3-particles: The ten-dimensional Type IIB string theory allows for the presence of \(\text {D}3\)-branes. If these branes are wrapped on some appropriately chosen internal three-cycles, they deliver BPS particles in the external four-dimensional spacetime.

In order to compute the masses of such \(\text {D}3\)-brane originated particles, let us first recall that the Nambu–Goto part of the effective action describing the dynamics of a generic \(\text {D}p\)-brane, neglecting the worldvolume gauge fields, is [23]

$$\begin{aligned} S_{\text {NG},\text {D}p} = \frac{(\alpha ')^{-\frac{p+1}{2}}}{(2\pi )^p g_{\text {s}}} \int _{\mathcal {W}} \textrm{d} \xi ^{p+1} \sqrt{ -h} . \end{aligned}$$
(4.15)

Here \(\xi ^\iota \), with \(\iota = 1, \ldots , p+1\) are the \(\text {D}p\)-brane worldvolume coordinates, and h is the determinant of the metric \(h_{\iota \kappa }\), pull-back of the ten-dimensional spacetime metric (4.2) over the \(\text {D}p\)-brane worldvolume \(\mathcal {W}\). The prefactor in (4.15) specifies the mass of \(\text {D}p\)-brane, which is solely dilaton-dependent at the ten-dimensional level. Particularizing to the case \(p=3\), and upon dimensionally reducing (4.15) over the metric ansatz (4.2), we find that the mass of a \(\text {D}3\)-brane-originated particle wrapping an internal three-cycle \(\Gamma \) is

$$\begin{aligned} M^{\text {D}3}(\Gamma ) = \frac{M_{\text {P}}}{4 \pi ^{\frac{3}{2}}} \frac{\text {vol}_3(\Gamma )}{V_Y}, \end{aligned}$$
(4.16)

where \(\text {vol}_3(\Gamma )\) denotes the Einstein-frame volume of the wrapped three-cycle \(\Gamma \).

Focusing on the regime of small string coupling and large complex structure and Kähler moduli, the lightest among the \(\text {D}3\)-particles that could first break down the effective theory is the one wrapped over the three-cycle Poincaré dual to the \(\alpha _0\) three-form in (4.6). The mass of such lightest \(\text {D}3\)-particle is

$$\begin{aligned} M^{\text {D}3}_\text {lightest} = \frac{M_{\text {P}}}{4 \pi ^{\frac{3}{2}} \sqrt{s^1 s^2 s^3}}. \end{aligned}$$
(4.17)

Tower IIIa: D5-membrane states: The \(\text {D}5\)-branes that populate the ten-dimensional Type IIB string theory may deliver BPS domain walls in the external spacetime, and their oscillatory modes constitute an additional, potential source of effective-theory breakdown. The tension \(\mathcal {T}^{\text {D}5}\) of \(\text {D}5\)-brane originated BPS domain walls can be obtained from (4.15), after dimensional reduction over the metric ansatz (4.2) and particularizing to the case \(p=5\):

$$\begin{aligned} \mathcal {T}^{\text {D}5}(\Gamma ) = \frac{M^3_{\text {P}} \ell _{\text {s}}^6}{32 \pi ^{\frac{7}{2}} } \frac{\sqrt{g_{\text {s}}} \; \text {vol}_3(\Gamma )}{\sqrt{s^1 s^2 s^3} V_Y^{\frac{3}{2}}}. \end{aligned}$$
(4.18)

We shall conventionally take the mass of the oscillatory modes associated to the \(\text {D}5\)-domain walls as given by \(M^{\text {D}5}(\Gamma ) = (\mathcal {T}^{\text {D}5}(\Gamma ))^{\frac{1}{3}}\) [13]. As for \(\text {D}3\)-particles, in the regime of interest the lightest \(\text {D}5\)-domain walls are those wrapped on the three-cycle dual to the \(\alpha _0\) three-form in (4.6), and has tension:

$$\begin{aligned} \mathcal {T}^{\text {D}5}_\text {lightest} = \frac{M^3_{\text {P}}}{32 \pi ^{\frac{7}{2}} } \frac{\sqrt{g_{\text {s}}}}{v_1 v_2 v_3 \sqrt{s^1 s^2 s^3}}, \end{aligned}$$
(4.19)

with the associated mass scale \(M^{\text {D}5}_\text {lightest} = (\mathcal {T}^{\text {D}5}_\text {lightest})^{\frac{1}{3}}\).

Tower IIIb: NS5-membrane states: Finally, the ten-dimensional theory Type IIB string theory may be endowed with NS5 brane that, analogously to D5-branes, may deliver supersymmetric domain walls once wrapping internal three-cycles. The tension of \({\text {NS}5}\)-originated BPS domain walls can be most readily obtained from the tension of \(\text {D}5\)-domain walls, after diving it by the string coupling \(g_{\text {s}}\):

$$\begin{aligned} \mathcal {T}^{\text {NS}5}_\text {lightest} = \frac{M^3_{\text {P}}}{32 \pi ^{\frac{7}{2}} } \frac{1}{v_1 v_2 v_3 \sqrt{g_{\text {s}} s^1 s^2 s^3}}, \end{aligned}$$
(4.20)

and the associated mass scale is \(M^{\text {NS}5}_\text {lightest} = (\mathcal {T}^{\text {NS}5}_\text {lightest})^{\frac{1}{3}}\). As such, for small string coupling, we expect the lightest \({\text {NS}5}\)-brane states to be always heavier that the lightest \({\text {D}5}\)-brane states.

4.2 Partitioning the moduli space

With the possible effective theory breakdown sources identified above, let us now show how the moduli space is sliced according to the UV-cutoff.

4.2.1 Partition with no fundamental objects

To begin with, let us first consider the case where no internal brane is present. The types of state that can break down the effective theory can then solely be the lightest among the Kaluza–Klein or the winding modes listed in Sect. 4.1.

For the ease of exposition, and clarity of data visualization, we focus on a two-dimensional subset of the moduli space at a time. Within each of these subsets we generate a synthetic dataset (3.4) as follows: we first randomly generate several hundred of points in the chosen subset; then, we will calculate which state, among those listed in Sect. 4.1 is the lightest and would first break down the effective theory. This procedure can be carried out by setting up a python code, employing the pandas library for handling large datasets.

An example of such a synthetic dataset is portrayed in Fig. 6. Therein, we considered 500 randomly generated points in the saxionic complex structure subspace spanned by \(s^1\) and \(s^2\). Furthermore, we considered three different values for the complex structure modulus \(s^3\) and Kähler modulus \(v_1\), with all the other moduli fixed at given values. For each of these points we then highlighted, using different colors, which is the type of state that first breaks down the four-dimensional effective field theories among the Kaluza–Klein and winding modes in (4.12) and (4.14). Ostensibly, in the regime we considered, the lightest among the Kaluza–Klein modes \(m^{\text {KK}, 2i}_k\) – namely, \(m^{\text {KK}, 2i}_1\) – are the states that break down the effective description first.

From a bottom-up perspective, we may then ‘forget’ the microscopic origin of the states, and just treat them as different effects breaking down the effective theory, and we can proceed to the study of how the moduli space is partitioned according to this sample of data points. Indeed, we can feed the datasets plotted in Fig. 6 to the k-nearest neighbor algorithm and the linear support vector machine algorithm as illustrated in Sects. 3.2 and 3.3, respectively. In particular, \(80\%\) of the data points of Fig. 6 form the training set for the algorithm, while the remaining ones constitute the test set. After applying the algorithms, then we can determine how the regions for which each type of states that break down the effective theory look like.

The applications of these algorithms are contained in Figs. 7 and 8. As is clear from the plots, the two algorithms act differently on the datasets of Fig. 6: the k-nearest neighbor algorithm, whose delivered results are contained in Fig. 7, delivers decision boundaries that look rather irregular. Instead, the linear support vector machine algorithm, which produces the plots in Fig. 8, delivers decision boundaries that are just lines, and they can be helpful in guessing, from a bottom-up perspective, the functional form of the masses of the states breaking down the effective theory.

4.2.2 Partition with fundamental objects

In the presence of fundamental objects, the effective theory might be broken down earlier, at least in some portions of the moduli space, by the states introduced by such objects. Then, let us assume that the effective four-dimensional theory is populated by particles originated by \(\text {D}3\)-branes, and by domain walls, stemming from either \(\text {D}5\) or \(\text {NS}5\)-branes, and let us repeat the same procedure performed in the previous section to identify the effective theory cutoff.

In Figs. 9 and 10 are illustrated, respectively, the applications of the k-nearest neighbor algorithm and the linear support vector machine algorithm for a data sample of 500 points in the saxionic space \(s^1\), \(s^2\), for the indicated values of the complex structure saxion \(s^3\) and the Kähler modulus \(v_1\). The remaining moduli have been assumed to take the following values: \(v_2 = 20\), \(v_3 = 30\), \(\phi = -1\).

Indeed, at some data points (indicated by crosses, and orange-colored) the state that breaks down first the effective theory is a \(\text {D}3\)-brane state. It is worth recalling that \(\text {D}3\)-brane states may also come in infinite towers (see, for instance, [3, 4]), and such a breakdown cannot be solved by including a finite number of states. For some other regions, instead, the state that breaks down the effective description first is the lightest mode of a Kaluza–Klein tower, and the discussion is no different from the one in the previous Sect. 4.2.1. Interestingly, \(\text {D}5/\text {NS}5\)-states are never the lightest ones, consistently with [24].

5 Conclusions

In this work we have illustrated how some difficulties in the definition of a cutoff of an effective field theory are ameliorated when the effective theory is tame. The tame structure of the effective theory allowed us to conclude that only a finite number of different states is required in order to define the effective-theory cutoff throughout its moduli space. Accordingly, the moduli space is partitioned into a finite number of subsets, in each of which a different state is the relevant one that determines the cutoff. Furthermore, we have shown that this partition can be concretely determined by employing supervised machine learning techniques, only by knowing how the effective theory is broken down at some points in the moduli space.

The ideas presented in this work can be expanded, or tested following several directions. To begin with, the support vector machine algorithm that we have utilized throughout this work is a linear one. However, as explained in Sect. 3.3, such a linear method is, in particular, useful in two circumstances: over the full patch of the moduli space whenever the states determining the cutoff are strictly monomials, or towards the boundaries of the moduli space if the couplings are monomially tamed. If the couplings are more complicated functions of the moduli – for instance, they could be polynomially tamed – different support vector machine algorithms should be employed in order to obtain a meaningful partition of the moduli space.

It would be interesting to additionally test the construction of the moduli space partition proposed here to effective field theories more complicated than the toroidal model considered in Sect. 4. In particular, it would be nice to get a partition when the masses of the states determining the effective-theory cutoff exhibit a more involved dependence on the axion fields populating the theory, and check to what extent they participate in determining the partition. It should be stressed, however, that constructing a synthetic set of data points as performed in Sect. 4 for the toroidal model requires a clean control over the numerical factors that appear in the masses of the effective-theory breakdown states, which for some models may be hard to achieve.