Reconstructing Effective Lagrangians Embedding Residual Family Symmetries

We consider effective Lagrangians which, after electroweak- and family-symmetry breaking, yield fermionic mass matrices and/or other flavoured couplings exhibiting residual family symmetries (RFS). Thinking from the bottom up, these RFS intimately link ultraviolet (UV) Beyond-the-Standard Model (BSM) physics to infrared flavour phenomenology without direct reference to any (potentially unfalsifiable) UV dynamics. While this discussion is typically performed at the level of RFS group generators and the UV flavour groups they can close, we now also focus on the RFS-implied shape of the low-energy mass/coupling matrices. We then show how this information can be used to algorithmically guide the reconstruction of an effective Lagrangian, thereby forming top-down models realizing the typical bottom-up phenomenological conclusions. As a first application we take results from scans of finite groups capable of controlling (through their RFS) CKM or PMNS mixing within the SM alone. We then extend this to recently studied scenarios where RFS also control special patterns of leptoquark couplings, thus providing proof-in-principle completions for such 'Simplified Models of Flavourful Leptoquarks.'


Introduction
The 20-22 unexplained free and physical parameters associated to the masses, mixings, and CP-violating phases of the Standard Model's (SM) flavour sector (the so-called Flavour Puzzle) represent an open challenge for theoretical constructions Beyond the SM (BSM). While these parameters are technically natural, their appearance in the quark sector is associated to an explicit breaking of the $U(3)^5$ global flavour symmetry otherwise present in the SM Lagrangian [1,2], while the observation of neutrino masses is already a definitive new physics phenomenon. Furthermore, the actual values of fermionic masses and mixings exhibit tantalizing hierarchies, including dramatically different patterns between quark and lepton sectors. These observations beg for a dynamical origin for flavour, and countless BSM models based on family symmetries have been devised to that end, with some even attempting explanations for the presence of the otherwise arbitrary flavour index ($i = 1, 2, 3$) in the first place. However, the model space is underdetermined: multiple models based on different symmetries can predict the same phenomenology, and often models based on the same family symmetry can yield different infrared (IR) predictions when (unfalsifiable) tweaks to ultraviolet (UV) Lagrangian parameters are made. Indeed, it may be impossible to determine a true theory of flavour in the absence of any convincing observation of new physics that distinguishes SM fermion generations, especially since reliable experimental constraints already exist for all but the leptonic (Dirac) CP-violating phase, absolute neutrino masses, and additional parameters depending on whether neutrinos are Majorana particles. Therefore most model predictions should actually be considered 'post-dictions.'
One might then pursue a formalism for describing BSM flavour in more model-independent ways, focusing only on connecting patterns of family-symmetry breaking (which can themselves be generically motivated, perhaps in stringy theories; see e.g. [3,4]) to the relevant IR phenomenology, and not on unfalsifiable Lagrangians based on new heavy states or dynamics that may be associated to that symmetry breaking. Residual Family Symmetries (RFS) provide just such a formalism, as they promote accidental Abelian symmetries of the SM mass sector to the residual subgroups of a UV flavour symmetry $G_F$. In bases where the physical mixing parameters appear in the SM Yukawa Lagrangian (any basis other than the mass-eigenstate basis), one then notes that the Abelian generators associated to RFS are themselves functions of the physical mixing parameters. Closing flavour groups with these generators then provides the desired, model-independent link between UV symmetries and IR mixing phenomenology, and multiple analytic and computational studies have been performed to uncover viable $G_F$.
Of course, if a particular 'simplified model' (or class of simplified models) based on the RFS formalism is singled out due to new measurements in the flavour sector, a more complete description of the physics will be desired. In this paper we provide a method to (re)construct effective Lagrangians that recover the symmetry breaking distilled in RFS scenarios. That is, we show how to construct a top-down model from a bottom-up phenomenological observation/conclusion. We do so by focusing on the intimate link between RFS generators and the implied shape of an RFS-invariant mass/coupling matrix. After all, RFS are symmetries of mass matrices and not the full SM Lagrangian and so, up to possible ambiguities associated to the group-theoretical properties of RFS generators, a specific symmetry-breaking pattern from the UV $G_F$ to a given RFS implies a specific IR mass/coupling matrix. This shape then hints at relevant multiplet charge assignments under the parent $G_F$ which, when combined with RFS-implied vacuum expectation values (VEV) for family-symmetry breaking scalar flavons, can be used to algorithmically construct an effective Lagrangian. We first apply this method to models addressing SM mixing structures alone, i.e. the $U_{PMNS}$ or $U_{CKM}$ matrices, and then also to a class of 'Simplified Models of Flavourful Leptoquarks' developed in [31,32]. These models include a new Yukawa-like coupling between the leptoquark and SM quark and lepton doublets which is, in addition to the SM mixing, controlled by RFS. They therefore generate rich, flavour-dependent phenomenology at the Large Hadron Collider (LHC) and other precision experiments, which can be used to probe their predictions.
The paper develops as follows: in Section 2 we give a pedagogical review of the RFS formalism, making explicit the intimate connection between RFS generators and implied mass shapes, while also describing bottom-up techniques to close UV flavour groups. We then discuss how to take those results and build an effective UV Lagrangian. In Section 3 we apply this recipe to models reproducing $U_{PMNS}$ or $U_{CKM}$ before moving to leptoquark applications in Section 4. We conclude in Section 5, give relevant information for the finite groups we employ in Appendix A, and also give further details on our core RFS-preserving flavon condition in Appendix B.

RFS: Bottom-Up Formalism for Top-Down Models
The core assumption of the RFS paradigm is that a parent flavour symmetry $G_F$ is broken in such a way that, after subsequent EWSB, RFS mediated by subgroups of $G_F$ are preserved in some or all of the SM mass matrices, or indeed in any other term controlled by the original flavour symmetry. For example, a natural symmetry-breaking pattern through intermediate groups $G_{L,Q}$ controlling the lepton and quark sectors of the SM is schematically illustrated in (1). Of course, other patterns beyond (1), perhaps without the intermediate $G_{L,Q}$ or addressing only the quark or the lepton sector individually, are also conceivable. $G_{F,L,Q}$ can in principle be Abelian or non-Abelian, continuous or discrete, although for the remainder of this paper we will assume that $G_{F,L,Q}$ are non-Abelian, such that irreducible multiplets of dimension greater than one can be arranged in flavour space. Furthermore, we will only work with non-Abelian discrete symmetries (NADS) when constructing explicit models in Sections 3-4, although our general approach and analysis in this section are equally applicable to non-Abelian continuous flavour groups. Also, the RFS $G_a$ with $a \in \{u, d, l, \nu\}$ must be Abelian; when they reconstruct NADS, i.e. when $G_{F,L,Q}$ are themselves discrete, they are in particular Abelian cyclic groups of order $n$, as in (2). Discrete product groups of the form $G_a \sim Z_{n_{a,1}} \times Z_{n_{a,2}} \times \ldots$ are also possible. Finally, we note that the complete flavour symmetry present in the effective Lagrangians to be considered in the upcoming sections is actually $G_F \times G_{shape}$, where $G_{shape}$ is an Abelian shaping symmetry that forbids unwanted scalar interactions. However, as will become clear, unlike $G_a$ we do not need to explicitly specify $G_{shape}$ ab initio, as our bottom-up RFS approach depends only on the fact that $G_{shape}$ can be found to exist after realizing all of the desired phenomenology.

The Infrared Lagrangian
To review how the RFS chain in (1) is naturally motivated, we follow prior discussions (see e.g. [5,10]) and first write down the SM Yukawa Lagrangian, after EWSB, in the mass-eigenstate basis, (3), where $m_a$ are diagonal matrices of mass eigenvalues and where we have assumed a Majorana neutrino mass term to illustrate our point, although our RFS approach applies straightforwardly to a Dirac mass $\propto \bar{\nu}_R m_\nu \nu_L$ as well. From (3) we observe that the Lagrangian is invariant under the Abelian transformations on the fermion fields given in (4).$^{1}$ Hence we promote these accidental symmetries to RFS, and note that, for the case of Majorana neutrinos, the (maximal) RFS generated by $T_{\nu_i}$ is a Klein four-group [5], as in (5). Similarly, the generators $T_f$ simply represent re-phasing freedoms of the three fermion generations in each family's Dirac mass term, and of course a Dirac neutrino mass term would be generically invariant under (6) instead of (5). When (2) is realized, the otherwise continuous phases in $T_f$ are quantized as in (7). Finally, from (3) one also finds that $T_{f_L} \overset{!}{=} T_{f_R}$ is required for the terms to be invariant. Of course, left- and right-chiral fermions can be charged differently in the complete flavour theory invariant under $G_{F,L,Q}$.
However, (3)-(4) tell us nothing about the physical predictions associated to the family-symmetry breaking in (1). It is only when we rotate to a basis where the Yukawa terms contain information about fermionic mixing that the RFS is useful as a bottom-up tool. Take the standard 'flavour basis' of the SM, where charged-current (CC) interactions are diagonal, as an example. Here (3) is transformed to (8), where the left-handed unitary matrices $U$ have physical effects in the CC through the presence of the CKM and PMNS overlap matrices defined in (9). Both $U_{CKM,PMNS}$ are $3 \times 3$ matrices in flavour space and are parameterized by three mixing angles $\theta^{q,l}_{12,23,13}$ and one Dirac CP-violating phase $\delta^{q,l}$. If neutrinos are Majorana particles, the PMNS matrix also encodes two additional phases $\alpha_{1,2}$.
Hence it is clear that the redefined mass matrices $m_{aU}$ (where $a$ denotes all fermions) in (8) are themselves $3 \times 3$ matrices in flavour space, and are of course related to the SM Yukawa couplings $Y_a$ through the Higgs VEV $v$, as in (10). Obviously, (10) does not hold for the Majorana neutrino mass term written explicitly above.
Let us now examine the RFS of (8). Here one observes that the Lagrangian is invariant under transformations of the form (11), as opposed to those of (4). Indeed, the RFS generator $T_{aU}$ now knows about the physical mixing matrices $U_a$, which means that any parent group $G_{F,L,Q}$ with subgroup $G_a$ generated by $T_{aU}$ can be connected to a physical mixing prediction embedded in $U_a$. In this way the RFS intimately links the IR phenomenology to the UV symmetry without reference to any of the dynamics associated to realizing (1). RFS therefore provide a powerful, bottom-up means of understanding observed patterns of flavour mixing in a rather model-independent way, as the only assumption made thus far is that the accidental flavour symmetries of the SM mass sector encoded in (4) are in fact the global RFS of a complete flavour theory broken as in (1) or its analogues.$^{2}$ In this way the RFS formalism defines a set of 'simplified flavour models,' which can easily be extended to BSM constructions as well: see [35] for a recent application of RFS to the Yukawa sector of multi-Higgs doublet models, where they were shown to be capable of controlling dangerous flavour-changing neutral currents alongside fermionic mixing, and [31,32], where it is demonstrated that they can also structure the flavour patterns of leptoquark couplings (we also address some of these models in Section 4 below). However, (8) is but one of an infinite number of bases in which the Yukawa sector can be written. In the presence of BSM couplings that introduce new (physical) mixings, one may want to work in a different one in order to preserve diagonal CC (see the 'leptoflavour basis' discussed in Section 4). Or one may be motivated to change basis due to the ease of use of certain (basis-dependent) group product rules.
Regardless, the trend as regards the associated RFS symmetry transformation is clear: a rotation on a mass-eigenstate field $a$ with unitary matrix $V_a^\dagger$ equivalently implies a basis change on the corresponding RFS mass-basis generator $T_a$ through the same matrix, as in (12). The statement holds vice versa as well, since otherwise $T_a$ would no longer be an RFS generator, as it would not leave the associated mass/Yukawa term invariant. In Section 2.3 we will apply the logically equivalent statement to (12) to study the RFS-invariant mass matrices themselves, in an effort to guide the reconstruction of an effective Lagrangian with manifest $G_{F,L,Q}$. Before doing so, it is important to address a couple of subtleties in the approach, for clarity. First, the RFS are not symmetries of the full IR Lagrangian. CC interactions do not respect them without additional assumptions relating RFS within the quark and/or lepton sector. This is realized naturally in most flavour models, however, since the breaking of $G_{F,L,Q}$ is typically only communicated to the Yukawa sector, perhaps through scalar flavons developing VEV. This is the approach we will take in what follows, although it is worth noting that family-symmetry breaking can also occur through other mechanisms, e.g. orbifold compactifications.
Secondly, we recall that a bottom-up RFS analysis alone cannot recover the exact mixing prediction associated to the model sketched by (1) unless $G_a$ distinguishes all three fermion generations, i.e. the associated RFS generator(s) $T_a$ needs to have three distinct eigenvalues (or multiple $T_{a_i}$ need to be present when $T_a$ has fewer than three distinct eigenvalues). This point becomes clear in the flavour-basis equality (13). That is, the RFS generator cannot distinguish between the mixing matrix $U_a$ and $U_a \cdot R_a$, with the latter having free parameters in the degenerate $(i, j)$ sector of $T_a$. In complete models these free parameters can either be fit to data or quantized as a result of other mechanisms, like further auxiliary or accidental symmetries of the Lagrangian. We will discuss the top-down implications of (13) in upcoming sections.

Closing Ultraviolet Flavour Groups
Given (12), one must then have a procedure for recovering the associated parent groups $G_{F,L,Q}$, as the Abelian $G_a$ alone are insufficient to model patterns of physical mixing. Many groups have attempted this by performing either analytic or computational studies of the classes of $G_{F,L,Q}$ that can break to desired subgroups $G_a$, given that specific (phenomenologically viable) shapes for $U_a$ must be achieved in a realistic model. In particular, the GAP language for computational finite algebra [36,37] has been indispensable when searching for NADS with automated techniques, as it has a large library of small groups catalogued along with vast amounts of associated group-theoretical information (conjugacy classes, order, irreducible representations, etc.).
In what follows we will use the bottom-up approach to 'reconstructing' NADS first discussed in [20,23], but recently applied to a class of BSM leptoquark models in [31,32]. Here one assumes that the RFS generators form the complete generating set for $G_{F,L,Q}$, such that the latter are recovered upon using GAP to close all elements of the former, as in (14)-(17), where (14) reconstructs a parent group generated by all family sectors, (15) forms a direct product parent group of lepton and quark symmetries, and (16)-(17) assume that the NADS only controls either lepton or quark mixing, but not both. Other scenarios could also be envisaged, e.g. one where $G_{L,Q}$ are formed as in (16)-(17), but where $G_F$ is not their direct product group as in (15), but instead any larger group containing $G_{L,Q}$. Regardless, the hatted ($\hat{T}$) notation in (14)-(17) simply indicates any basis where the generators know about physical mixing parameters, $\hat{T}_a \equiv \hat{T}_a(\theta^a_{ij}, \delta^a_{ij}, \ldots)$.
Of course, when searching for NADS in the bottom-up approach one must also apply a discretization scheme to all of these mixing parameters, and details on this procedure and other cuts made regarding group order, etc., can be found on case-by-case bases in [20,23,32]. However, it does not matter how one finds the parent symmetry with associated RFS for our present purposes. Any procedure is appropriate, as long as all relevant information about the RFS can be extracted, which we now discuss in detail.
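The closure step described above can be illustrated without GAP. The following is a minimal sketch, not the procedure of [20,23,32]: it assumes two exact integer generators, `S` and `T`, taken to be the $A_4$ triplet representation in the basis where the order-two element is diagonal (a basis chosen here only because it avoids floating-point arithmetic), and the helper names `matmul` and `close_group` are hypothetical.

```python
from itertools import product

# Assumed generators: A4 triplet in the basis where S is diagonal.
# They satisfy S^2 = T^3 = (ST)^3 = 1.
S = ((1, 0, 0), (0, -1, 0), (0, 0, -1))
T = ((0, 1, 0), (0, 0, 1), (1, 0, 0))


def matmul(a, b):
    """Multiply two 3x3 integer matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )


def close_group(generators, max_order=1000):
    """Collect every product of the generators, breadth first.

    For a finite group, right-multiplying by the generators until no new
    element appears yields the whole group they generate.
    """
    identity = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
    elements = {identity}
    frontier = [identity]
    while frontier:
        new = []
        for g in frontier:
            for h in generators:
                gh = matmul(g, h)
                if gh not in elements:
                    elements.add(gh)
                    new.append(gh)
        if len(elements) > max_order:
            raise ValueError("closure exceeds max_order; group may be infinite")
        frontier = new
    return elements


group = close_group([S, T])
order = len(group)
is_abelian = all(matmul(g, h) == matmul(h, g) for g, h in product(group, repeat=2))
print(order, is_abelian)
```

In a realistic scan the generators depend on discretized mixing parameters and are not integer-valued, so exact or tolerance-aware arithmetic and a cut on the group order (the `max_order` guard here) become essential.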

Guided Reconstruction for Effective Lagrangians
As demonstrated above, the implied shape of the generators $\hat{T}_a$ given an RFS-invariant Lagrangian is the information required for connecting the IR $G_a$ to the UV $G_{F,L,Q}$. But is there a systematic way of using the recovered non-Abelian parent group to build an effective Lagrangian $\mathcal{L}_Y$ (a model, that is) that exhibits, upon family- and electroweak-symmetry breaking, the simplified construction of (1)? A straightforward approach$^{3}$ is based on the implied RFS invariances of (a) the IR mass matrix and (b) new scalar flavons whose VEV implement the breaking patterns of (1).
Concerning (a), we note that a generic rotation $V_a^\dagger$ on the mass-eigenstate terms in (3) yields new mass matrices $\hat{m}_a$ of the form (19), where we observe that, by construction, the Hermitian combination of this term is invariant under the $\hat{T}_a$ given in (12). Hence, as $m_a$ can always be written as a diagonal matrix of mass eigenvalues, the RFS-invariant quantity $\hat{m}_a^\dagger \hat{m}_a$ can always be written out in model space, once the rotations $V_a$ are specified. In the class of simplified models we have reviewed above, $V_a$ can always be extracted from the (known) IR phenomenology that is predicted. In this way (19) provides the rubric for completing the simplified model, as an $\mathcal{L}_Y$ that reproduces it will, by construction, embed the desired RFS.
Then flavons (point (b)) provide a candidate mechanism for breaking $G_{F,L,Q}$ down to the desired RFS-invariant mass shapes. The $\hat{T}_a$-invariant mass matrices are then obtained, after the flavon expands around its VEV, from Lagrangian terms of the form (20), where $A_L$ denotes the associated $SU(2)_L$ doublet for the family sector, $A_R$ is the $SU(2)_L$ singlet, $H$ is a Higgs doublet, and $\hat{y}_a$ is the effective coupling suppressed by the new physics scale $\Lambda$ integrated out of the effective operator. To enforce $\hat{T}_a$-invariant masses, we can use the following condition regarding the VEV direction of the flavon field [5], labelled (21):
$$\hat{T}_a^\dagger \langle \phi_a \rangle = e_a^\star \langle \phi_a \rangle \iff \hat{T}_a \langle \phi_a \rangle = e_a \langle \phi_a \rangle \quad \text{with} \quad e_a^\star e_a = 1 ,$$
where $e_a$ is a (scalar) eigenvalue, and the $\iff$ is due to the fact that $\hat{T}_a$ is assumed without loss of generality (since we work with finite groups) to be unitary. From the group theory perspective, it is of course obvious that $\langle \phi_a \rangle$ (or indeed $\langle \phi_a \rangle \cdot \langle \phi_a \rangle$) should preserve the $G_a$ generated by $\hat{T}_a$ when (21) holds. However, in Appendix B we provide our own derivation of the known condition in (21), starting from the $G_F$-invariant Lagrangian term in (20). This then provides more clarity as to why (21) provides a sufficient condition on operators of the form (20) which yields mass matrices of the form in (19), and thereby realizes the simplified construction in (1).
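The invariance of the Hermitian combination can be checked numerically. The sketch below is illustrative only and fixes one convention as an assumption: the mass-eigenstate field is rotated as $a = V a'$, so that $\hat{m} = m V$, $\hat{m}^\dagger \hat{m} = V^\dagger m^2 V$ and $\hat{T} = V^\dagger T V$. The rotation angles, phase, masses and the $Z_3$ generator are hypothetical inputs, and the helper names are not from the source.

```python
import cmath
import math


def matmul(a, b):
    """Multiply two 3x3 (possibly complex) matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def dagger(a):
    """Hermitian conjugate of a 3x3 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(3)] for i in range(3)]


def diag(x, y, z):
    return [[x, 0, 0], [0, y, 0], [0, 0, z]]


def max_diff(a, b):
    """Largest entry-wise difference between two 3x3 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))


# Hypothetical unitary rotation V: a (1,2) rotation with a phase times a (2,3) rotation.
c12, s12 = math.cos(0.3), math.sin(0.3)
c23, s23 = math.cos(0.4), math.sin(0.4)
phase = cmath.exp(0.7j)
V12 = [[c12, s12 * phase.conjugate(), 0], [-s12 * phase, c12, 0], [0, 0, 1]]
V23 = [[1, 0, 0], [0, c23, s23], [0, -s23, c23]]
V = matmul(V12, V23)

# Mass-basis RFS generator (a Z3 re-phasing) and hypothetical mass eigenvalues.
omega = cmath.exp(2j * math.pi / 3)
T = diag(1, omega ** 2, omega)
m = diag(1.0, 2.0, 3.0)

# Assumed convention: m_hat = m V and T_hat = V^dagger T V.
m_hat = matmul(m, V)
H = matmul(dagger(m_hat), m_hat)
T_hat = matmul(matmul(dagger(V), T), V)

# The RFS generator leaves the Hermitian combination invariant ...
residual = max_diff(matmul(matmul(dagger(T_hat), H), T_hat), H)
# ... whereas a generic unitary matrix (here V itself) does not.
control = max_diff(matmul(matmul(dagger(V), H), V), H)
print(residual, control)
```

The `control` value is included only to show that the invariance is not automatic: it relies on $T$ commuting with the diagonal $m^2$ in the mass basis.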
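As a small illustration of the alignment condition (21), the sketch below tests whether a candidate VEV direction is an eigenvector of a generator with a unit-modulus eigenvalue. The generators `S` and `T` are assumed to be the standard $A_4$ triplet matrices in the basis where the order-three element is diagonal, anticipating the example of Section 3; the VEV directions and the helper `rfs_eigenvalue` are hypothetical choices made for this sketch, not values quoted from the text.

```python
import cmath
import math

OMEGA = cmath.exp(2j * math.pi / 3)

# Assumed A4 triplet generators in the basis where T is diagonal.
S = [[-1 / 3, 2 / 3, 2 / 3], [2 / 3, -1 / 3, 2 / 3], [2 / 3, 2 / 3, -1 / 3]]
T = [[1, 0, 0], [0, OMEGA ** 2, 0], [0, 0, OMEGA]]


def rfs_eigenvalue(generator, vev, tol=1e-12):
    """Return e if generator * vev = e * vev with |e| = 1, otherwise None."""
    image = [sum(generator[i][k] * vev[k] for k in range(3)) for i in range(3)]
    # Read the candidate eigenvalue off the largest component of the VEV.
    pivot = max(range(3), key=lambda i: abs(vev[i]))
    if abs(vev[pivot]) < tol:
        return None
    e = image[pivot] / vev[pivot]
    aligned = all(abs(image[i] - e * vev[i]) < tol for i in range(3))
    return e if aligned and abs(abs(e) - 1) < tol else None


vev_l = [1, 0, 0]   # hypothetical direction preserving the Z3 generated by T
vev_nu = [1, 1, 1]  # hypothetical direction preserving the Z2 generated by S

print(rfs_eigenvalue(T, vev_l), rfs_eigenvalue(S, vev_nu))
print(rfs_eigenvalue(T, vev_nu), rfs_eigenvalue(S, vev_l))
```

Each direction preserves one generator and breaks the other, which is the pattern needed for the two sectors to retain different residual subgroups of the same parent group.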

Summary of Effective Lagrangian Reconstruction
In summary, once a flavour group $G_{F,L,Q}$ is determined with RFS generated by $\{\hat{T}_a\}$, which are assumed to know about the physical mixings the RFS mechanism (and therefore the reconstructed model) controls, the following procedure can be followed to yield $\mathcal{L}_Y$:
1. Write out the explicit representations of $\{\hat{T}_a\}$ in a basis amenable to manipulating the group product rules of $G_{F,L,Q}$. This is typically determined by identifying $\{\hat{T}_a\}$ with specific group elements in a given irreducible representation. If this basis differs from the one in which $G_{F,L,Q}$ was originally recovered, take account of the additional transformations in the IR Lagrangian. This determines the model basis.
2. For each family sector $a$ with an active RFS, write down $N_a \geq 1$ new flavon(s) $\phi_a$ whose VEV respect (21) in the model basis, hence deriving the model-space orientation of $\langle \phi_a \rangle$. The flavon(s) $\phi_a$ are taken to be charged under the irreducible representation that $\hat{T}_a$ was identified with in Step 1, and the number of flavons $N_a$ is determined by Steps 3-4, i.e. one needs as many flavons as can successfully yield the desired mass matrix in family sector $a$.
3. Derive the expected form of the model-basis mass/coupling matrix in each family sector, which is given by (19) for a generic set of model-basis transformation(s) $V_a$ away from the mass-eigenstate basis, and form the Hermitian combination which preserves the information from the physical mixings in the theory.
4. For each family sector $a$ with an active RFS, create an effective Yukawa-like operator with $\phi_a$ and build invariants of the form in (20), or a similar invariant of the form $\propto [L_L L_L \phi_\nu \phi_\nu]_1$ for Majorana neutrinos. Multiple such invariants may be required in a given family sector, depending on the kinds of irreducible representations implied in Steps 1-3. The goal is to recover the RFS-invariant mass/coupling shapes from Step 3, with a one-to-one mapping between physical and model parameters, and of course the shapes of $\{\hat{T}_a\}$ and $\hat{m}_a^\dagger \hat{m}_a$ already hint at appropriate generation charges under $G_{F,L,Q}$.
5. Construct the Hermitian Yukawa coupling $\hat{Y}_a^\dagger \hat{Y}_a$ from (20),$^{4}$ such that a comparison with the quantities in Step 3 can be made. Map the model parameters (e.g. $\{\hat{y}_{a_i}, v, \ldots\}$) to physical parameters (e.g. $\{m_{a_i}, \theta^a_{ij}, \ldots\}$). If this mapping is not one-to-one, the model may appear to require some fine-tuning of parameters, although we will show that this could be a misleading conclusion if the expected RFS-invariant mass matrices have not been generalized with the free parameters permitted through the relationships in (13). Also be sure to check that the implied mass eigenvalues are physical. If not, additional operators may need to be added.
If Steps 1-5 are successful, the resulting model will exhibit the RFS symmetry-breaking patterns and desired phenomenology embedded in the original simplified models, thereby providing an Effective Field Theory (EFT) completion.

Application to Models of SM Flavour
In this section we apply the strategy outlined in Section 2.3 to flavour models reproducing SM mixing matrices, i.e. the PMNS or CKM matrices defined in (9).

$A_4$ Altarelli-Feruglio Model for $U_{PMNS}$
As a first application of the algorithm described in Section 2, we now show how the famous Altarelli-Feruglio model of leptonic flavour [38,39] can be reconstructed with only minimal knowledge of its low-energy predictions. In particular, its IR phenomenology is characterized by the breaking of the tetrahedral $A_4$ group to $Z_3$ and $Z_2$ RFS in the charged lepton and neutrino sectors, which are respectively generated as in (23). The model assumes Majorana neutrino masses, and its LO mixing prediction is the tribimaximal (TBM) matrix defined in (24),$^{5}$ which is realized in a special flavour basis, where $U_l = 1$ and $U_\nu \equiv U_{PMNS}$.$^{6}$ Given knowledge of (23)-(24), we now have all of the information necessary to apply our reconstruction algorithm.
$^{4}$ It is often preferred to instead construct terms in the LR basis, with operators $\propto \bar{A}_L \phi_a H A_R$. In this case, simply identify the predicted Yukawa coupling from this term as $\hat{Y}_a^\dagger$, and then proceed to build $\hat{Y}_a^\dagger \hat{Y}_a$. We will do this in some of the models below. Also, it is obvious that Majorana neutrinos do not require the construction of the Hermitian object $\hat{m}_\nu^\dagger \hat{m}_\nu$.
$^{5}$ In this particular example, our convention for the PMNS mixing is different than that adopted later in the paper, cf. (76), in order to better reproduce those of [39].
We first use these equations to infer that, in the model basis in which we will construct our effective Lagrangian, the relevant generators are given by (25), which we immediately identify as triplet $3$ representations from the $A_4$ review in [38], and which we can use to solve for the flavon VEV in each family sector, (26), and to conclude that the model-basis mass matrices invariant under them are characterized by (27)-(28), which we can further use to infer charge assignments under $A_4$ for SM fields and the new flavons in (26). We recall that $A_4$ has order 12 and four irreducible representations: a triplet $3$ and three singlets $1$, $1'$ and $1''$, with $1$ denoting the trivial representation. While $A_4$ is an exceptionally well studied finite group, we repeat the relevant product rules in this basis for completeness in Appendix A.1, where it is clear that in order to build up non-trivial Yukawa matrices, the SM $SU(2)$ doublet $L_L$ and corresponding flavons (from (26)) will need to be assigned to the triplet representation, as in (29). Given this, we then consider the charged lepton mass term and observe from (25) and (28) that SM generations do not 'talk' to one another through the $A_4$ symmetry, and so we assign a different singlet to each RH SM field, as in (30), where $l^c_R$ transform as left-handed fields.
Table 1: Relevant field and $A_4$ symmetry content from [39].
Because these fields transform as $A_4$ singlets, we need combinations of $[\phi_l L_L]$ as in (20) to themselves transform as one-dimensional representations of $A_4$. Noting this, one quickly deduces the LO effective Yukawa Lagrangian for this sector, (31), where we omit the necessary insertions of the Higgs field that make each term invariant and the $[\ldots]_{1}$ notation indicates that the bracketed fields contract to the indicated singlet under the $A_4$ product rules given in (127). Each individual term in (31) is then an $A_4$ and SM gauge singlet, once contracted with the corresponding RH isospin singlets. The additional terms implied in (31) correspond to higher-order operators in the Effective Field Theory (EFT) allowed by successive flavon and SM field insertions, given their associated symmetry assignments. We will discuss these below, along with additional symmetries irrelevant to the RFS formalism. Regardless, one immediately finds that (31) generates the desired mass matrix from (28), with the relations between masses and Lagrangian parameters easily found to be those in (32), with $v$ the Higgs VEV realizing EWSB.
Moving now to the neutrino masses, the $m_{\nu U}$ implied in (27) has non-trivial structure in all matrix sectors, a fact concurrent with 1) our observation that $\phi_\nu$ and $L_L$ should be charged as $A_4$ triplets, and 2) the fact that the Altarelli-Feruglio model predicts a Majorana neutrino mass matrix, which is itself implied by (or at least consistent with) the $Z_2$ neutrino RFS. In the low-energy EFT, a Majorana neutrino mass is necessarily $\propto L_L L_L$. We therefore conclude that an operator of the form (33) should be included in the Lagrangian. This term generates the contribution to $m_{\nu U}$ given in (34), which, while invariant under (25) (as it must be by construction), fails to realize the required neutrino phenomenology, as it has only two distinct eigenvalues: $0$ and $B$. In other words, it cannot map to the generic, RFS-invariant form in (27) that we have deduced, in the absence of (unphysical) assumptions about the mass eigenvalues embedded in it. The obvious solution is to introduce a further flavon $\xi$ whose VEV $\langle \xi \rangle = u$ does not break $G_\nu$ and which can couple to the $L_L L_L$ bilinear. To that end we introduce $\xi$ as an $A_4$ singlet, which adds an additional contribution to (33)-(34), yielding (35), where we again omit the necessary insertions of the Higgs field that make each term invariant. This matrix is still invariant under (25), is diagonalized by $U_{TBM}$, and has the mass eigenvalues given in (36), which is fully consistent with the matrix form in (27).
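The statements about eigenvalues and TBM diagonalization can be checked with a short numerical sketch. It assumes the standard Altarelli-Feruglio form of the Majorana mass matrix in the basis where the charged leptons are diagonal, with hypothetical parameters `A` (singlet $\xi$ contribution) and `B` (triplet $\phi_\nu$ contribution) in arbitrary units and overall normalizations absorbed; the matrix entries and the TBM sign convention are taken from that standard form rather than from the equations referenced above.

```python
import math


def matmul(a, b):
    """Multiply two 3x3 real matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def majorana_matrix(A, B):
    """Assumed form: singlet (A) plus triplet (B) flavon contributions."""
    return [
        [A + 2 * B / 3, -B / 3, -B / 3],
        [-B / 3, 2 * B / 3, A - B / 3],
        [-B / 3, A - B / 3, 2 * B / 3],
    ]


r2, r3, r6 = math.sqrt(2), math.sqrt(3), math.sqrt(6)
U_TBM = [
    [2 / r6, 1 / r3, 0],
    [-1 / r6, 1 / r3, -1 / r2],
    [-1 / r6, 1 / r3, 1 / r2],
]


def tbm_diagonalize(m):
    """Return the diagonal of U^T m U and its largest off-diagonal entry."""
    d = matmul(matmul(transpose(U_TBM), m), U_TBM)
    off = max(abs(d[i][j]) for i in range(3) for j in range(3) if i != j)
    return [d[i][i] for i in range(3)], off


# Triplet flavon only: two distinct eigenvalues, 0 and B.
only_triplet, off_triplet = tbm_diagonalize(majorana_matrix(0.0, 1.0))
# Adding the singlet: three distinct eigenvalues A + B, A, B - A.
with_singlet, off_singlet = tbm_diagonalize(majorana_matrix(0.3, 1.0))
print(only_triplet, with_singlet)
```

With the singlet included, the three eigenvalues are independent enough to accommodate two non-zero mass splittings, while the mixing matrix is unchanged.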
In conclusion, with the knowledge of the parent flavour symmetry $A_4$, the neutrino and charged lepton RFS in (23), and the PMNS mixing prediction given in (24), we have easily inferred the field and symmetry content in Table 1 and the LO effective Yukawa Lagrangian in (37). This is to be compared to eq. (12) in [39], where it is found to be equivalent to the non-SUSY version of the Altarelli-Feruglio Lagrangian: we have 'reconstructed' this model from the bottom up.
Here we do not concern ourselves with the UV completion of this model (or indeed other models), which can be achieved by adding appropriate messenger fields to make the underlying model renormalizable; for $A_4$ models see e.g. [40,41]. Such UV completions exist in general, and are typically more predictive than the corresponding non-renormalizable model if the messenger fields included in the complete model enable only a subset of the contractions that are allowed by the symmetries at the non-renormalizable level.
However, as is well known, the complete model of [39] is more involved than just its LO Yukawa terms. Furthermore, we made choices in the above discussion that, a priori, may seem ad-hoc. We will now discuss some of these subtleties for this particular model, as well as their broader implications for our generic approach, although in forthcoming models we will typically leave these discussions implicit, unless they become particularly relevant for the physics at hand.

Mass and Mixing Prediction Ambiguities
We have observed in the preceding section that knowledge of the IR RFS and mixing prediction is not guaranteed to tell us everything required to build the LO terms in the EFT. For one, as became clear between (34)-(35), the RFS has no control over the quantization of the mass eigenvalues, but only the mixing associated to them.$^{8}$ As we saw, (34) exhibits the required $Z_2$ invariance but is unphysical (there are two non-zero mass splittings measured for low-energy neutrinos), and so does not map to the (more) generic RFS-invariant form in (27). This motivated the introduction of the singlet $\xi$ whose VEV also breaks $G_L$, but does not break $G_\nu$, and so does not upset the TBM prediction for the PMNS matrix. In general, this is a good strategy when reconstructing a given Majorana-neutrino-sector Lagrangian for which one does not have a literature reference, as we did here: operators $\propto [L_L L_L]_1$ will always preserve a given RFS if augmented only by a scalar singlet.
Secondly, in the absence of a reference Lagrangian, one can also reconstruct a mass term associated to a different mixing prediction, when degenerate eigenvalues exist in RFS generators. This was highlighted explicitly for the mixing in (13), but of course this also has implications for the associated RFS-invariant mass matrix. In the Altarelli-Feruglio case, we observe that $T_l$ has three distinct eigenvalues, and so $U_l$ is uniquely predicted as the identity matrix. However, as only one $Z_2$ is explicitly preserved in the neutrino sector, the truly generic RFS-invariant mass matrix is given by (39), with $\theta_{13}$ and $\delta_{13}$ defined as in (13). This complex symmetric matrix is diagonalized by the mixing matrix in (40), from which $\theta_{13}$ and $\delta_{13}$ can be fit to experimental data, yielding a phenomenologically successful PMNS matrix. The point here is that knowledge of the RFS alone does not nail down the mixing or mass matrix prediction that a top-down EFT can yield, when the generators of said RFS do not distinguish all three generations.
That the Altarelli-Feruglio model predicts (24) and not (40) at leading order is due to the accidental invariance of the $m_{\nu U}$ in (35) under the $Z_2^{\mu\tau}$ transformation in (41), which (39) does not respect. This invariance is not associated to the RFS, as $Z_2 \times Z_2^{\mu\tau}$ is not a subgroup of $A_4$. Rather, the accidental invariance of (35) under (41) is due to the absence of additional operators in (37), which is a consequence of additional symmetries unrelated to the RFS, which we will now discuss.
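The distinction between the RFS and the accidental $\mu$-$\tau$ invariance can be made concrete numerically. The sketch below is a simplified illustration under stated assumptions: the $Z_2$ generator `S` and the exchange matrix `A_MUTAU` are taken in the standard Altarelli-Feruglio basis, the CP phase $\delta_{13}$ is set to zero so that everything is real, and the masses and angle are hypothetical numbers. It builds a mass matrix diagonalized by the TBM matrix times a rotation in the degenerate (1,3) sector and tests both symmetries.

```python
import math


def matmul(a, b):
    """Multiply two 3x3 real matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


r2, r3, r6 = math.sqrt(2), math.sqrt(3), math.sqrt(6)
U_TBM = [
    [2 / r6, 1 / r3, 0],
    [-1 / r6, 1 / r3, -1 / r2],
    [-1 / r6, 1 / r3, 1 / r2],
]
# Assumed Z2 RFS generator and mu-tau exchange in the same basis.
S = [[-1 / 3, 2 / 3, 2 / 3], [2 / 3, -1 / 3, 2 / 3], [2 / 3, 2 / 3, -1 / 3]]
A_MUTAU = [[1, 0, 0], [0, 0, 1], [0, 1, 0]]


def rot13(theta):
    """Real rotation in the (1,3) sector, where S is degenerate."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, 0, s], [0, 1, 0], [-s, 0, c]]


def mass_matrix(masses, theta):
    """Symmetric matrix m = U D U^T with U = U_TBM * R13(theta)."""
    U = matmul(U_TBM, rot13(theta))
    D = [[masses[i] if i == j else 0.0 for j in range(3)] for i in range(3)]
    return matmul(matmul(U, D), transpose(U))


def breaking(sym, m):
    """Largest entry of sym^T m sym - m; zero when m is invariant."""
    t = matmul(matmul(transpose(sym), m), sym)
    return max(abs(t[i][j] - m[i][j]) for i in range(3) for j in range(3))


masses = (1.0, 2.0, 3.0)  # hypothetical eigenvalues
m_tbm = mass_matrix(masses, 0.0)
m_generic = mass_matrix(masses, 0.15)

print(breaking(S, m_tbm), breaking(A_MUTAU, m_tbm))
print(breaking(S, m_generic), breaking(A_MUTAU, m_generic))
```

Both matrices respect the $Z_2$ generated by `S`, but only the $\theta = 0$ case is also $\mu$-$\tau$ symmetric, which is why the extra invariance has to come from somewhere other than the RFS.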

Further Symmetries and Fields
The astute reader will notice that (37) does not contain the most generic set of operators invariant under the A 4 flavour symmetry and SM gauge symmetries. For example, a term of the form (briefly restoring Higgs fields to maintain clarity) is also allowed, as are the four operators corresponding to (37) but with φ ν ↔ φ l . Indeed, these additional contributions to the LO Lagrangian are forbidden by a Z 3 shaping symmetry, which limits contact interactions between certain fields. As the RFS have nothing to do with these shaping symmetries, in what follows we will simply assume that either they are not needed or, more commonly, that they can always be found such that only desired operators in the 1/Λ EFT expansion are recovered.
We also ignored all of the dynamics required to obtain the VEV derived in (26). As mentioned in the introduction, flavon VEV can be realized via the minimization of an appropriate scalar potential. SUSY is assumed for the Altarelli-Feruglio model, such that (37) is understood as one part of the overall superpotential, whilst yet another flavon ξ̃ that breaks A 4 is introduced alongside additional 'driving' superfields φ l 0 , φ ν 0 and ξ 0 . All fields are then further charged under a traditional R symmetry U(1) R that distinguishes matter, symmetry-breaking, and driving/alignment fields. We again ignore all such discussion in upcoming models, as we simply assume that the required VEV alignment can be achieved.
Finally, we mentioned that the RFS do not constrain mass eigenvalues. That means that mass hierarchies must be understood with some other mechanism. In the case of [39], this is achieved with an additional Froggatt-Nielsen [42] U(1) FN , under which the τ R , µ R , and e R generations are assigned charges 0, q and 2q, and additional flavons θ are introduced whose VEV create hierarchical mass suppressions ∼ λ ≡ ⟨θ⟩/Λ: c e ≈ O(1), b e ≈ O(λ q ), and a e ≈ O(λ 2q ). Again, such symmetries can always be imposed in addition to the core flavour symmetries yielding the RFS of interest. We therefore do not mention them further in what follows.
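As a rough numerical illustration of this power counting, the sketch below evaluates the suppression factors λ^(nq) for the charge assignment quoted above. The function name `fn_suppressions` and the values λ = 0.05 and q = 1 are hypothetical choices made only for illustration; the O(1) coefficients multiplying each factor are ignored.

```python
def fn_suppressions(lam, q, charge_multiples=(2, 1, 0)):
    """Return the factors lam**(n*q) for (e_R, mu_R, tau_R), i.e. (a_e, b_e, c_e)."""
    return tuple(lam ** (n * q) for n in charge_multiples)

# Hypothetical expansion parameter and charge unit, chosen only for illustration.
a_e, b_e, c_e = fn_suppressions(lam=0.05, q=1)
print(a_e, b_e, c_e)
```

With these inputs the three couplings are separated by successive powers of λ, which is the hierarchy a e << b e << c e described in the text.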

Higher-order Operators
We have only reconstructed the LO Yukawa Lagrangian in the 1/Λ EFT expansion. Higher-order terms associated to more SM or flavon field insertions (but which are still invariant under all assumed symmetries) can of course be found, and these will generate small corrections to the phenomenological conclusions of the LO Lagrangian. In the Altarelli-Feruglio model, the leading such terms are given by for the charged leptons, and for the neutrinos. These operators will add small corrections to the predictions for the associated mixing matrices, which can bring them closer to experiment. However, in general, they may softly break the RFS preserved at LO, 10 and so studying them in generality is again beyond our scope in what follows.
We now consider a simple model based on the finite group (Z 14 × Z 2 ) ⋊ Z 2 that makes predictions for CKM quark mixing. The symmetry-breaking pattern to the RFS of the down and up quark sectors is illustrated by (46), with G u,d generated by (47). The model predicts the LO CKM mixing to be of the Cabibbo form (48), which, while insufficient to fully reproduce the known three-dimensional structure of the CKM, does capture the dominant mixing in the (1, 2) sector, i.e. the Cabibbo angle θ C . Further corrections are highly suppressed, ∝ O (θ 2 C , θ 3 C ), and will be briefly mentioned below. As in Section 3.1, we can immediately construct the flavour-basis generators, under the (common) assumption that the down quarks are already diagonal, such that the entirety of the CKM mixing is encoded in the up sector. We immediately find (49). In principle, we can use (49) to proceed with the algorithm as described in Section 2. However, in what follows we find it convenient to work in a different basis where the non-trivial entries of T uU are in the (2, 3) sector. 11 To that end, we consider the unitary transformation P on the weak-eigenstate generators given in (50). Applying P with T u,d = P † T uU,dU P , we get the expressions (51) for the RFS generators in the model basis. 12 We now want to identify these matrices with (Z 14 × Z 2 ) ⋊ Z 2 generating elements in certain irreducible representations. However, we have not found sources available that catalogue the properties of this group.
12 The previous expressions (51) indicate the use of a singlet and doublet representation. Even if the second and third generation are part of the doublet, we remind the reader that these generations are not the actual flavour states. In principle it is possible to build the model in the flavour basis, but the charge assignments would be rather inconvenient. Finally, let us emphasize that the actual flavour-state charges do not depend on the choice of group basis.
For this reason we derived the relevant product rules and group information ourselves, and have provided them in Appendix A.2. There we see that T u,d can be easily expressed in terms of the group generators a, b and c, as in (52). Critically, we observe that (51)-(52) indicate that 2 3+− is the appropriate (Z 14 × Z 2 ) ⋊ Z 2 charge for the two flavons φ u,d that we introduce according to the algorithm in Section 2, and we can use (51) to work out the expressions for these doublet VEV, finding (53). Finally, one derives that in the absence of mixing ambiguities, the model-basis mass matrices are given by (54), where m A i are the associated mass eigenvalues, and where we have used dagger combinations for the charged fermions to remove the dependence on RH transformations. Of course, in deriving (54), we have been careful to keep track of the additional basis change implied by operating with P in (50).
The results in (51)-(54) strongly indicate that the second and third generations of LH quarks should transform as a (Z 14 × Z 2 ) ⋊ Z 2 doublet, while the first generation transforms as a non-trivial singlet. Similarly, the second and third generations of RH up and down quarks should transform as a non-trivial singlet, while the first generation of both types of quark transforms trivially. Furthermore, (52) already indicated that the flavons φ d,u associated to these sectors should transform as a 2 3+− , a fact that helped us derive (53). This information is summarized in Table 2.
Assuming a shaping symmetry to prevent φ i from coupling to undesirable sectors, one can straightforwardly build up the Yukawa sector for the quarks in the model basis using Table 2, where we have omitted Higgs fields and scale suppressions. Using the VEV from (53) and product rules from Appendix A.2, we get the following Yukawa matrices: Assembling these into their Hermitian combinations, one immediately finds which directly map to (54) with and analogous relations for the mapping of Y d , up to VEV factors and multiplicative constants.
We have therefore reconstructed a successful top-down model, as it exhibits the required symmetry breaking in (46) and recovers the CKM mixing prediction in (48).
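The use of Hermitian ("dagger") combinations to remove the dependence on RH transformations can be illustrated with a short numerical check. This is a minimal sketch under the assumption that the Yukawa matrix acts as Q̄ L Y q R , so that an RH basis change acts as Y → Y V R and the LH-only combination is Y Y † . The matrix `Y` and the rotation `V_R` below are arbitrary illustrative inputs, not the model's Yukawa matrices.

```python
import cmath
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(3)] for i in range(3)]

def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(3) for j in range(3))

def lh_combination(y):
    """Hermitian combination Y Y^dagger, sensitive only to LH transformations."""
    return matmul(y, dagger(y))

def rh_rotation(theta, phase):
    """A unitary rotation in the (2,3) block, used as an example RH basis change."""
    c, s = math.cos(theta), math.sin(theta)
    e = cmath.exp(1j * phase)
    return [[1, 0, 0], [0, c, s * e], [0, -s * e.conjugate(), c]]

# Arbitrary illustrative Yukawa matrix (not taken from the text).
Y = [[0.1 + 0.2j, 0.0, 0.3],
     [0.5, 0.7j, 0.0],
     [0.0, 0.2, 1.0 - 0.4j]]
V_R = rh_rotation(0.8, 0.3)

unchanged = close(lh_combination(Y), lh_combination(matmul(Y, V_R)))
print(unchanged)
```

Because V R V R † = 1, the product Y Y † is blind to the RH rotation, which is why only the LH generators need to be matched when mapping the reconstructed Yukawa matrices onto the RFS-invariant forms.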

Application to Models of Flavourful Leptoquarks
As a particularly relevant extension of the field content of the SM, we now apply our algorithm to a class of flavoured leptoquark models defined in [31,32], which we will briefly review for completeness. Leptoquarks have become popular in the recent literature due to their ability to resolve (potential) anomalies in heavy meson decay observables like R K observed by LHCb [45,46], as well as other potentially anomalous measurements sensitive to muon physics (see e.g. [47][48][49][50]). Here we allow the SM to be augmented by one of the following bosons, denoted the 'scalar triplet,' 'vector singlet,' and 'vector triplet,' whose charge assignments under the SM gauge group are respectively given in (58). These leptoquarks are easily motivated in the UV by Grand Unified constructions, or in models with new gauge interactions (see e.g. [51,52]), and all can successfully account for R K(*) < 1 [53]. The SM-gauge invariant operators they source are given by (59), with {i, j} denoting flavour indices, {a, b} denoting SU(2) indices, and k = 1, 2, 3 for the Pauli matrices. 13 Following [54] and redefining the components of the scalar triplet state according to (60), 14 contracting the SU(2) indices of (59), and ignoring the diquark operator of ∆ 3 (for simplicity, although RFS can control it, see [31], and it can also be controlled with other symmetries [55,56]), one then finds that the Yukawa/mass sector of the SM is enhanced to (61), with the novel leptoquark couplings λ QL normalized to the first term, λ dl , which in the mass-eigenstate basis of the SM fermions is generically parameterized by (62). The other couplings in (61) are related to λ dl via SU(2) relations, and are given by (63).
13 The physics of leptoquarks is thoroughly reviewed in [54].
14 We will write the following equations explicitly for the scalar triplet, although analogous expressions are easily derived for the other two leptoquark states of (58). Superscripts on the LHS denote electric charges.
Given these new flavoured couplings, we defined multiple classes of simplified models based on the RFS formalism in [31,32]. In particular, we assumed that the natural RFS of the SM (cf. (4)) also hold in the new leptoquark terms of (61). This allowed us to constrain the λ QL couplings via RFS invariances of the form (64), with Q, L respectively representing arbitrary quark (d, u) and lepton (l, ν) families, and the transposed 'T ' (daggered ' †') T Q corresponding to scalar (vector) leptoquark(s). Critically, different phase relationships amongst RFS generators T Q,L correspond to different textures in λ QL , and the extent to which free parameters remain in (62)-(63) is a function of the amount of symmetry present in any given term. Precision flavour data from (e.g.) B − B̄ mixing, lepton-flavour-violating (LFV) observables like µ → eγ, and the anomalous R ratios also constrain the viable textures (and hence also the phenomenologically viable RFS relationships) in (62). For example, in [31] we insisted that RFS hold in all lepton and quark sectors of the SM and leptoquark couplings, and this led to only O(10) viable textures for λ dl with only a single real parameter, once all experimental and symmetry constraints were applied. 15 Then, in [32], we relaxed the symmetry assumptions and enforced an RFS invariance in some or all of the SM mass terms, but only in the d − l leptoquark coupling, where either or both T d,l were allowed to act; in this symmetry environment, invariance in λ dν,ul,uν is inherited via SU(2) relationships as in (63).
These two types of simplified models were labeled 'SE1' and 'SE2' respectively, with the former likely requiring more intricate model building to account for the fact the RFS distinguishes members of SU(2) doublets in individual leptoquark terms after EWSB. The SE2 models, on the other hand, represent highly natural relaxations of the SE1 constructions, and can easily be realized with single flavon EFTs as per our algorithm in Section 2, which we will now show.
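How phase relationships among the RFS generators translate into textures of λ QL can be sketched in a few lines. The example assumes that, in the mass-eigenstate basis, the generators are diagonal phase matrices and that the invariance condition reads T Q ^T λ T L = λ for scalar leptoquarks and T Q ^† λ T L = λ for vector leptoquarks, as described above. The function `allowed_texture` and the phase assignments are hypothetical, with phases expressed as exact fractions of 2π.

```python
from fractions import Fraction

def allowed_texture(q_phases, l_phases, scalar=True):
    """Return a 3x3 mask: 1 where an entry of lambda_QL may be non-zero.

    Phases are given in units of 2*pi. For a scalar leptoquark the entry (i, j)
    picks up exp(2*pi*i*(q_i + l_j)); for a vector leptoquark exp(2*pi*i*(-q_i + l_j)).
    The entry survives only if this total phase is trivial.
    """
    sign = 1 if scalar else -1
    return [[1 if (sign * q + l) % 1 == 0 else 0 for l in l_phases] for q in q_phases]

# Hypothetical phases: the second lepton phase is minus the (degenerate) second
# and third quark phases, so only one lepton column survives.
quark_phases = (Fraction(1, 5), Fraction(1, 3), Fraction(1, 3))
lepton_phases = (Fraction(1, 2), Fraction(-1, 3), Fraction(1, 4))

scalar_mask = allowed_texture(quark_phases, lepton_phases, scalar=True)
for row in scalar_mask:
    print(row)
```

For these illustrative phases the mask has non-zero entries only in a single lepton column and in the second and third quark rows, which is the kind of lepton-isolation texture discussed in the text. Changing `scalar` to `False` flips the sign of the quark phases in the condition, so the same generators generally select a different texture.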

The Leptoflavour Basis
Viable NADS that can realize the SE2 symmetry predictions for U PMNS,CKM and λ QL must be uncovered in order to apply our algorithm (cf. Section 2.2), and to that end a GAP scan was performed in [32]. We performed that scan in the 'leptoflavour basis', where information about all relevant physical mixings could be extracted, and which we now review.
Recall that, in the mass-eigenstate basis, the RFS generators (and therefore G F,L,Q ) do not know about U PMNS,CKM , even in the case of only SM field content. However, we can go to a special form of the 'flavour basis' (cf. (8)) by performing no change of basis (or a trivial change of basis with the identity matrix) on the LH d, l, and a change of basis via the CKM for the LH u and the PMNS for the LH ν. With leptoquarks present, we simultaneously want to encode the additional information present in λ QL together with the SM mixing matrices. We therefore choose the leptoflavour basis to be the one where the SM mixing is all in the mixing matrices, so that the charged current (CC) is diagonal, as in the flavour basis, but where additionally λ dl is diagonal. To diagonalize λ dl , we require a further non-trivial change of basis in the LH d, l, which must be cancelled by rotations in the u and ν respectively, in order to keep the CC diagonal. So, starting from the mass basis (where λ dl is defined as in (62)), d changes by Λ d , l by Λ l , and u changes by both the CKM and Λ d (cancelling the presence of Λ d in the CC and making it diagonal), while ν changes by both the PMNS and Λ l (cancelling the presence of Λ l in the CC, and making it diagonal). Finally, the case without leptoquarks appears correctly as the limit with Λ d = Λ l = 1.
To see this explicitly, we apply the following 'leptoflavour' basis transformations on the mass eigenstates: This yields the leptoflavour basis Lagrangian, which is invariant under the following LH RFS generators: and the following RH RFS generators: where T R holds only in the case of Dirac neutrinos. As mentioned above, in the limit where Λ l,d → 1, (67) reduces to the SM-only flavour-basis generating set! We also point out that, in the absence of a RH leptoquark coupling as present in (e.g.) the vector singlet case, one has some freedom to choose the RH Λ E,R,D,U transformations since their shapes are not dictated by the requirement of diagonalizing a particular coupling (we can always form totally LH combinations of the SM mass matrices, e.g.). Given the leptoflavour basis, bottom-up scans as described in Section 2.2 were then performed in [32] with (67), so that parent family groups were closed according to (14)-(17). As it turns out, many NADS were discovered, including members of the popular S N , A N , ∆(3N 2 ), ∆(6N 2 ), Σ(3N 2 ), Σ(3N 3 ), and D N finite group series. As a final preparation for the reconstruction of the RFS-invariant Lagrangian in these extended leptoquark scenarios, we allow for the possibility that an additional basis change will be amenable for manipulating the group product rules of the NADS discovered in [32]. Hence we rotate via a generic matrix P (which can be set to the identity matrix in the event it is unnecessary), a → P a , and so the effective mass terms are now given by where we have assumed that A R also transforms with P . The mass matrices in this basis are labeled by m , and are clearly non-diagonal. The remaining leptoquark terms of (61) are similarly given in this basis by These operators already reveal the natural form of their respective RFS generators. A summary of all basis changes on (e.g.) the neutrino field and associated changes in m ν and T ν are given in Table 3, tracking all the way from the mass-eigenstate basis to the model basis of (70)-(71). Hence, given a specific NADS, its RFS, and the associated predictions for U CKM,PMNS and λ dl , one can use (70)-(71) to reconstruct the UV EFT. We will now consider two such models, one based on the ∆(96) group and one based on the D 15 member of the Dihedral series D N .
All of the relevant bottom-up information for these groups is given in Table 4, and the parameters x e,µ are defined in the following textures: which are the consequence of special relationships amongst RFS generators -λ dl corresponds to −β l = β d = γ d . 16 By construction these couplings are diagonalized with Λ d,l in the combination Λ d λ dl Λ † l , with The quark matrix Λ d in (73) left-diagonalizes both patterns in (72), whereas Λ e l right-diagonalizes λ dl . On the other hand, the parameters t θµτ ≡ tan θ µτ and θ C are defined in the following LO PMNS and CKM textures: which were specified as the SM mixing to be recovered in [32]. 17 While the exact forms of (74) and (75) [32] in order to reflect the conventions of [72].
'reactor angle,' θ l 13 , and its free parameter θ µτ can be fit to many well-studied textures like the tri-bimaximal [58], bi-maximal [59], golden ratio [60,61], and hexagonal forms [62,63]: (76) Furthermore, corrections to (74) can naturally be realized by higher-order terms in the EFT expansion beyond (20), which can softly break the RFS embedded in the LO contribution, or also through renormalization group flow (RGE) [64][65][66][67][68][69][70][71] between the scale at which G F is broken and the IR, where global fits are performed. Similarly, (75) provides an excellent description of the dominant Cabibbo mixing of the CKM matrix. Unlike the PMNS, the CKM is extremely hierarchical, with mixings in the (2,3) and (1,3) sectors suppressed by one to two orders of magnitude with respect to the Cabibbo sector. This suppression again hints at further contributions to (75) from higher-order terms in (20) and/or RGE corrections.
Of course, precisely calculating the corrections expected to (74)-(75) depends on the complete UV flavour model, including not only the full field and symmetry content, but also the presence or lack thereof of supersymmetry. Specifying this is well beyond the scope of our present paper, and so we consider (74)-(75) sufficiently accurate to develop our approach to reconstructing effective Lagrangians from RFS.
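For orientation, the leading-order solar-angle predictions of some of the textures named above can be evaluated directly. The sketch below assumes that θ µτ plays the role of the solar angle θ 12 , as suggested by the tri-bimaximal value arctan(1/√2) quoted in the text, and uses the standard values θ = π/4 for bi-maximal and θ = π/6 for hexagonal mixing. The dictionary `TEXTURES` is a hypothetical container for these inputs.

```python
import math

# Leading-order angles for three well-known textures (standard values, assumed
# here to correspond to the free parameter theta_mutau of the mu-tau form).
TEXTURES = {
    "tri-bimaximal": math.atan(1 / math.sqrt(2)),
    "bi-maximal": math.pi / 4,
    "hexagonal": math.pi / 6,
}

def sin_squared(theta):
    return math.sin(theta) ** 2

for name, theta in TEXTURES.items():
    print(f"{name}: sin^2(theta) = {sin_squared(theta):.4f}")
```

These leading-order numbers are the quantities that higher-order operators or renormalization group running would then have to correct when comparing to global fits.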

∆(96) Model for U P M N S and Leptoquarks
As a first example incorporating leptoquarks we construct a ∆(96) flavour model from the scan results in [32], which are repeated in the first row of Table 4. This model predicts tri-bimaximal mixing U PMNS = U µτ (arctan 1/ √ 2) and the electron isolation pattern λ dl for the leptoquark coupling. The muon isolation pattern is phenomenologically preferred over the electron one, since it can also account for the other anomalous muon observables, whereas the electron isolation pattern can only explain the deviation in R K(*) . We nevertheless focus on electron isolation here, as this rather simple group can predict it unambiguously. We highlight the fact that other symmetries, such as A 4 or ∆(75), obtained using the procedure described in [32], are capable of reproducing the muon isolation pattern.
The symmetry breaking to RFS is illustrated in (77). As evident from the fact that T ν generates G ν ∼ = Z 4 , this model features Dirac rather than Majorana neutrino masses. In Table 4 one can read off the specific charges of the mass-eigenstate RFS generators T l and T ν . We note that their basis-independent traces are respectively 0 and 1, which will soon help us identify the conjugacy class to which they belong within ∆(96). Table 4 gives all the information required to move to the leptoflavour basis, where the RFS generators take the forms (78), where ω 4 = e i2π/4 = i. We want to identify these generators with group elements of ∆(96), and to do so we use the catalogue in [74], repeating the relevant product rules in Appendix A.3. As before, we find it convenient to perform a P transformation on the leptoflavour basis, so that we go to a basis where the combination of ∆(96) generators a^2 c d in the 3 1 representation, found in [74], is diagonal. We note that we use the same naming of the generators as in [74], only differentiating them with boldface to avoid confusion with our naming for the coefficients (in this and in other sections). The P matrix we use is (79), which leads to (80). With this change of basis, we are able to match T l with the diagonal a^2 c d element of the 3̄ 1 representation of ∆(96), the conjugate representation to 3 1 , which as expected has zero trace and lies within conjugacy class C 6 [74]. According to the character of T ν , it could be within C 5 or C 9 in the same 3̄ 1 representation, and indeed we found it to match the element a^2 b c^2 d^3 .
With this information we introduce two flavons φ l,ν , for which we use (80) to derive the candidate VEV in the triplet directions (81), which are invariant under T l for 3̄ 1 (and for 3̄ 1 ) and T ν for 3̄ 1 , respectively. Finally, the RFS-invariant mass combinations in this basis are given by (82), which we will build below. Note that there are no mixing ambiguities associated to these matrices.

The Lepton Sector
We now build the model by assigning L L as a 3̄ 1 , and the RH leptons as combinations of a ∆(96) trivial singlet and a doublet, which we designate as 1 + 2, e.g. E 3 R ∼ 1, E 12 R ∼ 2, and similarly for ν 3 R ∼ 1 and ν 12 R ∼ 2. In this case the ∆(96) invariant Yukawa terms for charged leptons and for neutrinos are very similar, of the type where f stands for either charged leptons or neutrinos and the Higgs field is omitted for simplicity. For the neutrino sector we find that the (1, 1, 1) direction gives rise to which combines into A simplified version of this could be obtained through a shaping symmetry removing the coupling either to ν 12 R (b ν = 0) or to ν 3 R (a ν = 0). This matrix embeds the correct PMNS matrix predicted by the RFS framework, but with two massless neutrinos. Given that the charged lepton invariants are very similar, we can quickly construct the respective Yukawa matrix for this sector as well: leading to Hence the invariant operators give rise to a diagonal Yukawa coupling but with two degenerate charged lepton masses, which is clearly unphysical.
In order to make the model realistic, we first note that the directions φ ν ∼ {(1, 1, 1), (−1, 1, 0), (−1, −1, 2)} T are the eigenvectors of T ν with eigenvalues ê ν = {1, −i, i}, respectively. While we initially selected the first eigensystem in (81) with ê ν = 1, according to (21) we are free to choose any of them, noting that while this does not actually preserve T ν as a residual symmetry, the resulting mass matrices will still lead to a successful Y † ν Y ν in the sense that we obtain U PMNS = U µτ (arctan 1/ √ 2) as intended. More details on this type of situation can be found in Appendix B. Taking either (−1, 1, 0) or (−1, −1, 2) for the orientation of an additional flavon φ ν2 allows one to generate further non-zero masses in m ν .
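The eigen-system quoted above is enough to check the stated group-theoretic properties of T ν numerically. The sketch below builds a matrix from the three (mutually orthogonal) eigenvectors and eigenvalues {1, −i, i} via its spectral decomposition; this is an illustrative reconstruction under that assumption, not a copy of the explicit generator in (80). The helper names are hypothetical.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(3) for j in range(3))

IDENTITY = [[1 if i == j else 0 for j in range(3)] for i in range(3)]
EIGENVECTORS = [(1, 1, 1), (-1, 1, 0), (-1, -1, 2)]
EIGENVALUES = [1, -1j, 1j]

def generator_from_eigensystem(vectors, values):
    """T = sum_k e_k v_k v_k^T / |v_k|^2 for real, mutually orthogonal v_k."""
    t = [[0j] * 3 for _ in range(3)]
    for v, e in zip(vectors, values):
        norm2 = sum(x * x for x in v)
        for i in range(3):
            for j in range(3):
                t[i][j] += e * v[i] * v[j] / norm2
    return t

def group_order(t, max_order=50):
    """Smallest n >= 1 with T^n = 1, or None if not found below max_order."""
    power = t
    for n in range(1, max_order + 1):
        if close(power, IDENTITY):
            return n
        power = matmul(power, t)
    return None

T_NU = generator_from_eigensystem(EIGENVECTORS, EIGENVALUES)
trace = sum(T_NU[i][i] for i in range(3))
print(group_order(T_NU), trace)
```

The matrix obtained this way has order four and unit trace, consistent with T ν generating a Z 4 and with the trace quoted for it when identifying its conjugacy class.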
At the same time, for the charged leptons, it is possible to break the mass degeneracy by having an additional triplet flavon φ l2 in the 3̄ 1 representation, aligned in the same direction as φ l . In summary, with the invariant terms the degeneracy of the eigenvalues is lifted. Explicitly, we aim for a normal mass hierarchy by picking (−1, 1, 0) as the additional direction, with a shaping symmetry which distinguishes the neutrino flavons such that each only couples to one of the right-handed neutrino fields. Taking v ν = v ν2 , the Yukawa term in the neutrino sector then corresponds to and therefore we have with a ν from the contraction of (1, 1, 1) with the ∆(96) doublet and b ν from the contraction of (−1, 1, 0) with the singlet right-handed neutrino, respectively. These map to (82) with again up to constant prefactors and VEV, thereby realizing the desired shapes.

The Leptoquark Sector
As seen in Table 4, from the bottom-up perspective of the scans in [32], one does not have control over the coupling x e in (72) when only lepton symmetries are active. The T l symmetry controls the overall shape of the term (electron isolation), but not the quantization of the ratio λ se /λ be . This can be seen practically by observing that ∆(96) is generated by ∆(96) ∼ = {T l , T ν }, and neither of these RFS generators knows about x e . Hence, one derives that in the model basis the generic RFS-invariant leptoquark coupling is given by (90), where we observe that, thanks to the Hermitian combination we have constructed, the appearance of x e in this relationship is not due to the mixing matrix Λ d , but instead the mass-eigenstate isolation pattern itself (cf. (72)). This term can now be easily built using one of the charged lepton flavons, taking the leptoquark field to transform either as 1 (selects φ l ) or 1 (selects φ l2 ). For simplicity we consider the trivial singlet option: As in the A 4 models described in [73], contracting [L L φ l1 ] 1 gives one of the lepton isolation cases, ensuring that the leptoquark couples only to one lepton flavour. In this case the VEV in the model-building basis is (0, 0, 1), leading to This is written in an unknown quark basis where the third row corresponds to the specific combination of the three components of Q̄ L and a ∆ which is the appropriate function of the three a i ∆ coefficients. In the mass-eigenstate basis, the model yields the electron isolation pattern as expected from the results in our previous paper [32]. To be more precise, we can sum over the uncertainty of the quark sector that we are not controlling with the symmetry. As λ dl only has entries in the third column, the resulting λ † dl λ dl combination only has a non-zero (3,3) entry proportional to the modulus of the third column vector, and therefore the model is indeed predicting the structure in (90).

D 15 Model for U CKM , U P M N S , and Leptoquarks
We now consider a D 15 model 18 that makes predictions for both CKM and PMNS mixing alongside the ratio of leptoquark couplings denoted by x µ . The scan result from [32] is repeated in Table 4, whose first column reveals that a hexagonal PMNS matrix U HM is predicted alongside Cabibbo mixing with θ C = π/15 for the CKM matrix, while the second through fourth columns reveal the following symmetry-breaking pattern: From Table 4 we can immediately construct the leptoflavour-basis RFS generators with (67), finding that the neutrino and up-quark matrices have non-trivial structure in all three matrix sectors.
As above, we attempt to find a basis within which it is easy to manipulate the relevant D 15 group product rules. To that end, we consider the following unitary transformation that block diagonalizes the leptoflavour-basis generators: Finally, one derives that in the absence of mixing ambiguities, the model-basis mass matrices are given by (98), where m A i are the associated mass eigenvalues, and where we have used dagger combinations for the charged fermions to remove the dependence on RH transformations. However, unlike the ∆(96) model of Section 4.1, we see from Table 4 that all of the RFS generators have degenerate eigenvalues, and hence there are again freedoms in the associated mass and mixing matrices thanks to (13). We will discuss these when they become relevant below.
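The size of the Cabibbo mixing implied by θ C = π/15 is easy to quantify. The following sketch assumes the standard parameterization of a Cabibbo-type matrix as a real rotation in the (1, 2) sector; the function name `cabibbo_matrix` is hypothetical.

```python
import math

THETA_C = math.pi / 15  # value quoted for the D_15 model

def cabibbo_matrix(theta):
    """Real rotation in the (1,2) sector, with the third generation decoupled."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

V = cabibbo_matrix(THETA_C)
# Orthogonality check: V V^T should be the identity.
VVt = [[sum(V[i][k] * V[j][k] for k in range(3)) for j in range(3)] for i in range(3)]

print(f"|V_us| at leading order: {V[0][1]:.4f}")
```

The leading-order value sin(π/15) ≈ 0.208 lies somewhat below the measured Cabibbo mixing of roughly 0.22, leaving room for corrections from higher-order terms or running, as discussed earlier for (75).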

The Quark Sector
The results in (95)-(98) strongly indicate that the second and third generations of LH quarks should transform as a D 15 doublet, while the first generation of up quarks transforms as a nontrivial singlet. Similarly, the first and third generations of RH up and down quarks should transform as a non-trivial singlet, while the second generation of both families transforms trivially. Furthermore, (96) indicates that the flavons φ d,u associated to these sectors should transform as a 2 1 under D 15 , a fact that helped us derive (97). This information is summarized in Table 6.
With an appropriate shaping symmetry preventing φ d,u from coupling to undesirable sectors as well as distinguishing u 1 R from u 3 R and d 1 R from d 3 R , 19 one can quickly obtain the model-basis Yukawa sector for the quarks in (99) using Table 6, where Higgs fields and scale suppressions are again omitted. Using the VEV from (97) and product rules from Appendix A.4, we get the following Yukawa matrices Assembling these into their Hermitian combinations, one arrives at which maps, up to prefactors and VEV, to (98) with and analogous relations for the mapping of Y d .
19 However, we will soon see that off-diagonal entries in Y † d will become desirable once we begin to discuss the leptoquark sector below, and therefore the implied shaping symmetry present in (99) will be modified to allow such additional operators.

The Lepton Sector
Similarly, the matrices in (95)-(98) suggest that the second and third LH generations of SU(2) doublet leptons transform as a 2 5 D 15 doublet, along with the associated flavons φ ν,l . The L 1 L and first and third generations of E R are to be charged as non-trivial singlets, while E 2 R transforms trivially. Using this information, assembled in Table 7, one reconstructs the LO Lagrangian as
We quickly derive the following terms for neutrino masses and charged lepton Yukawas which gives While the neutrino mass maps directly to (98), the charged lepton term apparently does not without additional fine tuning of the parameters. For example, setting d e = e and c e = a e , one can recover the corresponding term in (98), and only then will a diagonal matrix of eigenvalues be returned for Y † l Y l upon (un)rotating (104) to the mass-eigenstate basis with Λ l and P . However, we simultaneously observe that the coupling Y † l Y l still respects the required RFS invariance, Are these claims contradictory? After all, we argued that successfully mapping to (98) is a sufficient condition for ensuring that the EFT yields the desired IR phenomenology and RFS symmetry-breaking patterns, and while it appears that (104) cannot do so without unappealing assumptions, the RFS invariance still holds in (105). The solution to this puzzle resides in the fact that, as in the A 4 Altarelli-Feruglio case, (98) does not, in fact, represent the most generic set of RFS-invariant mass matrices, and we will now show that the apparent fine-tuning required in mapping (104) to (98) can be understood as a top-down manifestation of (13).
We begin by recalling that the T l generator cannot distinguish between Λ l and Λ̃ l ≡ Λ l · R 23 , where our tilde notation indicates that this can be understood as a basis change on the charged lepton field, such that (starting from the mass-eigenstate basis) we are now operating with (107), and so the most generic mass matrix invariant under the T̃ l = T l generator that our algorithm to find D 15 knows about is given by (109), where {δ, θ} 23 denote the free parameters our formalism has no control over. However, (107) also implies that a diagonal charged current in this basis (we have not applied a change on any other field) implies a modified PMNS matrix in the (physical) mass-eigenstate basis, (110). This is consistent with the well-known fact that, in a more generic flavour basis, a degeneracy in T l should translate to a free parameter in U l and therefore also U PMNS . Now, one might be tempted to conclude that we should also translate ν L with a compensating factor of R 23 , so that the definition of the PMNS is preserved à la (111). However, this corresponds to a generic neutrino mass matrix given by (112), which is left invariant under (113). That is, the neutrino generator knows about this basis change, which differs from our original observation that D 15 does not know the difference between T l and T̃ l . Indeed, we have checked that (at least at certain values of δ 23 and θ 23 ) the group generated by G F ∼ = {T l , T̃ ν , T u , T d } is not D 15 , and may not even be finite! 20 So making this compensating change in (111) is inconsistent with the starting point of our analysis, and cannot be done.
In conclusion, moving to the tilde basis requires no more work from the model building side, but implies a generalized RFS-invariant charged lepton mass matrix, and therefore a generalized prediction for the physical PMNS matrix given by U PMNS = R † 23 · U HM . The parameters of R 23 are therefore functions of the (unspecified) coupling strengths of the EFT operators, since when solving the system of equations implied by mapping (109) to (104), one easily sees that m 2 l 1 ↔ |b e | 2 , m 2 l 2 ↔ |d e | 2 + | e | 2 + |a e | 2 + |c e | 2 + | e | 2 + |c e | 2 − |a e | 2 − |d e | 2 csc 2θ 23 sec δ 23 , m 2 l 3 ↔ |d e | 2 + | e | 2 + |a e | 2 + |c e | 2 + |d e | 2 + |a e | 2 − |c e | 2 − | e | 2 csc 2θ 23 sec δ 23 , up to VEV and prefactors, with (115). Hence, while no fine-tuning is required in achieving this map, the model's prediction for U PMNS is ambiguous up to the quantization of the couplings {a e , ..., e }, which may result from a higher UV symmetry (e.g. a GUT) that relates the otherwise independent operators of the EFT. Such an attempt is obviously well beyond our scope in this paper. However, thinking from a more phenomenological perspective, one can instead fit the parameters {θ 23 , δ 23 } to available experimental data for the PMNS matrix, which then implies relationships amongst the top-down model's parameters, according to (114)-(115). Regardless, we see clearly that the ignorance of R 23 in the bottom-up RFS generation of G F has consistently manifested itself in a certain lack of predictivity in the top-down EFT.
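The statement that a generator with degenerate eigenvalues cannot distinguish Λ l from Λ l · R 23 can be verified in a few lines. This is a minimal sketch under stated assumptions: the generator is taken to be T = Λ D Λ † with D a diagonal matrix of phases, R 23 is parameterized as a generic unitary rotation with one angle and one phase, and the matrix `LAMBDA` and the eigenvalue sets are arbitrary illustrative choices rather than the D 15 data of Table 4.

```python
import cmath
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(3)] for i in range(3)]

def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(3) for j in range(3))

def diag(values):
    return [[values[i] if i == j else 0 for j in range(3)] for i in range(3)]

def r12(theta):
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0], [-s, c, 0], [0, 0, 1]]

def r23(theta, delta):
    """Unitary rotation in the (2,3) sector with angle theta and phase delta (assumed form)."""
    c, s = math.cos(theta), math.sin(theta)
    phase = cmath.exp(-1j * delta)
    return [[1, 0, 0], [0, c, s * phase], [0, -s * phase.conjugate(), c]]

def generator(lam, eigenvalues):
    """T = Lambda D Lambda^dagger."""
    return matmul(matmul(lam, diag(eigenvalues)), dagger(lam))

LAMBDA = r12(0.3)                             # hypothetical mixing matrix
LAMBDA_TILDE = matmul(LAMBDA, r23(0.7, 0.4))  # Lambda times a free (2,3) rotation
OMEGA = cmath.exp(2j * math.pi / 3)

DEGENERATE = [1, -1, -1]           # second and third eigenvalues coincide
DISTINCT = [1, OMEGA, OMEGA ** 2]  # all three eigenvalues differ

same_when_degenerate = close(generator(LAMBDA, DEGENERATE), generator(LAMBDA_TILDE, DEGENERATE))
same_when_distinct = close(generator(LAMBDA, DISTINCT), generator(LAMBDA_TILDE, DISTINCT))
print(same_when_degenerate, same_when_distinct)
```

When the second and third eigenvalues coincide, D commutes with R 23 and the two generators are identical, so θ 23 and δ 23 remain free; with three distinct eigenvalues the generators differ and the mixing is fixed, as in the ∆(96) example.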

The Leptoquark Sector
The naive RFS-invariant leptoquark coupling for the $d$-$l$ operator expected in the model basis is given by This can be achieved using the following Lagrangian but again only with an additional tuning of the parameters, However, following the above discussion for charged leptons, (116) is modified when we consider the free $R_{23}$ rotation yielding (109), and results in the matrix whose first and third columns, corresponding to lepton generations, are now distinguished.$^{21}$ Hence the need to fine-tune parameters between them disappears, although the otherwise independent couplings $\{a_\Delta, b_\Delta, c_\Delta, d_\Delta\}$ and $\{a_e, b_e, c_e, d_e\}$ are linked through (115): they need to be simultaneously fit to functions of the same physical PMNS parameters, and hence are strongly correlated. Continuing, we also note that the symmetry structure exposed in [32] (cf. Table 4) permits a rotation in the (2,3) sector of $T_d$, and the discussion above regarding the corresponding basis change on the charged-lepton field applies equally to the down quark, resulting in a further modified (119) which also distinguishes rows (quark generations), contributions in the (1, 3) sector of (122). However, we recall that the implicit assumption in building (99) (and all other effective $\mathcal{L}_Y$ in this paper) is that unspecified shaping symmetries forbid undesirable operators from contributing to the Yukawa. So, once (122) is the appropriate term to be recovered in the Lagrangian of (99), one can then assume a different shaping symmetry, such that further operators contribute in a way that allows for a one-to-one mapping. For example, one can easily obtain the modified quark Lagrangian as the one in (99) with additional down-quark terms which are in fact invariant under $D_{15}$ (given that $d^1_R$ and $d^3_R$ are not in fact distinguished by $D_{15}$). The down sector would then be where the $a'_d$ and $c'_d$ terms are clearly the terms with the unprimed couplings after undergoing a swap of $d^1_R$ and $d^3_R$. It is simple to see that they create entries in the mass matrix that allow a successful map to (122): In this scenario one loses some predictivity over the CKM mixing, since (123) leaves $\{\theta^q_{23}, \delta^q_{23}\}$ unquantized; they become functions of the free operator couplings in a manner analogous to (115). On the other hand, the fine-tuning issue in $\tilde{\lambda}_{dl}$ is resolved, and (123) in any case better approximates global fits to the experimental CKM matrix than the original Cabibbo form we predicted. Hence $\theta^q_{23}$ can be fit to the data, which then leads to more precise EFT predictions for the (unmeasured) leptoquark coupling of (120).

Further Comments on the Appearance of Free Parameters in Effective Models
We have seen that the simple equivalence evident in (13) can be important when building $\mathcal{L}_Y$ realizing family-symmetry breaking of the form $G_{F,L,Q} \to G_a \cong \{T_a\}$. While this phenomenological ambiguity was discussed from a bottom-up perspective in [32] and multiple prior references from other authors, its consequences from a top-down model-building perspective have, to our knowledge, not been appreciated. We now see that, in the absence of a proper accounting of (13), the implied RFS-invariant mass/coupling shapes are unnecessarily restrictive, possibly leading to the erroneous conclusion that fine-tunings of model parameters are required. Upon considering the full implications of (13) on these shapes, these fine-tunings are resolved in favor of one-to-one mappings between model and physical parameters, albeit at the expense of the EFT's predictivity. In short, the bottom-up mathematical ambiguity of (13) can consistently manifest itself as a top-down phenomenological ambiguity in a given model's IR mass and mixing spectrum.
However, our $D_{15}$ analysis still leaves some questions unanswered. For example, why was no tuning required in the neutrino or up-quark mass matrices, where we also only attempted a map to the naive RFS-invariant mass matrices, but where Table 4 clearly indicates that free parameters can be introduced into these sectors as well? While it is beyond our present scope to answer this question conclusively, we suspect that the answer lies in the group product rules at hand, which as a function of the group closure will (at least in the basis we consider) likely be driven by the CKM and PMNS structures entirely embedded in the up and neutrino sectors. After all, $D_{15}$ is only armed with a handful of doublets from which we can form invariants according to (20), whereas a larger group that contains, e.g., triplet representations might allow a broader and more diverse set of invariants from which we can form the naive RFS-invariant shapes of (98). This suspicion is at least consistent with the fact that, while studying $D_{15}$, we also attempted to build the final model presented in Table 4, based on the same symmetry $(Z_{14} \times Z_2) \rtimes Z_2$ that we used for the CKM prediction in Section 3.2. There we again found that no tuning was required until we tried to model $m_l^{\dagger} m_l$ and the subsequent leptoquark coupling $\lambda_{dl}$, where the need became apparent exactly as in $D_{15}$. As conjectured, $(Z_{14} \times Z_2) \rtimes Z_2$ also only has doublet and singlet irreducible representations. Unfortunately, we did not find a candidate symmetry in [32] that allows us to test this hypothesis, and so it may be interesting to perform a similar group theory scan while allowing for larger finite groups to pass the self-imposed cuts. On the other hand, the simultaneous introduction of the inherently 2D Cabibbo form of (75) alongside the inherently 3D $\mu$-$\tau$ symmetric form of (74) into the scans may inevitably lead to results similar to those in [32].
We leave the resolution of these questions to future study; our method, as demonstrated in Sections 2-4, works regardless of their conclusions.

Summary and Outlook
We have shown how to use the RFS of the Yukawa sector of an IR Lagrangian, i.e. one where electroweak- and family-symmetry breaking has already occurred, to systematically reconstruct a UV effective Lagrangian that respects SM gauge symmetries and non-Abelian flavour symmetries $G_F$ which contain such RFS as subgroups. Our method is thus complementary to prior scans of family groups performed in order to identify phenomenologically viable $G_F$ and symmetry-breaking patterns without specifying concrete UV Lagrangians; that is, we can use bottom-up, model-independent information to algorithmically construct top-down models with an explicit field and symmetry content. We have shown four such examples, two where only SM fermionic mixing (CKM or PMNS matrices) is controlled, and two where SM mixing matrices and flavoured leptoquark couplings are structured with the RFS. We thus provide 'proof-in-principle' routes to EFT descriptions for the simplified models outlined in [31,32]. Our study has also helped to clarify commentary in prior literature regarding the role of eigenvalue degeneracies in RFS generators and associated mixing ambiguities in top-down flavour models.
Furthermore, leptoquark extensions of the SM represent but one of many BSM scenarios with non-trivial flavour structure that can be studied within the RFS paradigm, which bypasses potentially unfalsifiable aspects of model building and offers a mechanism for identifying classes of simplified models and their phenomenological implications. Our results in this paper indicate that analogous, model-independent RFS applications to (e.g.) multi-Higgs-doublet models (cf. [35]) or softly-broken SUSY can also be readily 'completed' if deemed necessary by a particular experimental signature, and can therefore be confidently studied in the meantime without reference to UV dynamics. We leave these possible extensions to future work.

where x 1 y 1 x 2 y 2 2 2ρσ for n = 1 x 2 y 2 x 1 y 1 2 3σρ for n = 2 x 2 y 2 x 1 y 1 2 1σρ for n = 3 .
On the other hand, for the singlet representations we have that One can work out the Kronecker products for these different representations. In our case, the relevant ones for contracting two doublets will be given by while two non-trivial singlets contract to a trivial singlet.
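Kronecker products of this type can be cross-checked with characters. The following is a minimal sketch assuming the group is a dihedral group $D_n$ of order $2n$ with odd $n$ (for instance $D_{15}$ of order 30, if that is the group intended here); the irrep labels `1+`, `1-`, `2_k` and the function names are our own illustrative conventions and need not match the labelling used in this Appendix.

```python
import numpy as np

def dihedral_characters(n):
    """Characters of D_n (order 2n, n odd) on the elements r^0..r^(n-1), s r^0..s r^(n-1)."""
    assert n % 2 == 1, "this sketch only treats odd n"
    j = np.arange(n)
    chars = {
        "1+": np.concatenate([np.ones(n), np.ones(n)]),    # trivial singlet
        "1-": np.concatenate([np.ones(n), -np.ones(n)]),   # non-trivial singlet
    }
    for k in range(1, (n - 1) // 2 + 1):
        rotations = 2 * np.cos(2 * np.pi * j * k / n)      # trace of a rotation by 2 pi j k / n
        chars[f"2_{k}"] = np.concatenate([rotations, np.zeros(n)])  # reflections are traceless
    return chars

def decompose(chi, chars):
    """Multiplicity of each irrep in a (real) character chi, via the character inner product."""
    order = len(chi)
    result = {}
    for name, irrep in chars.items():
        mult = int(round(np.dot(chi, irrep) / order))
        if mult:
            result[name] = mult
    return result

chars = dihedral_characters(15)
doublet_squared = decompose(chars["2_1"] * chars["2_1"], chars)
mixed_doublets = decompose(chars["2_3"] * chars["2_5"], chars)
singlet_squared = decompose(chars["1-"] * chars["1-"], chars)
print(doublet_squared, mixed_doublets, singlet_squared)
```

For odd $n$ this group has only singlet and doublet irreducible representations, and in this sketch the product of the non-trivial singlet with itself returns the trivial singlet, in line with the statement above.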

B Details on Flavon VEV Condition
In this Appendix we present a derivation of the core RFS-preserving condition on the flavon VEV we impose in (21). While this condition has been known since as early as [5], we now show our own approach for clarity and completeness. We start with the generic $G$-invariant term that will lead to a mass matrix, where $F^{\rho_F}_A$ and $f^{\rho_f}_a$ are respectively $SU(2)_L$ doublets and singlets with flavour representations $\rho_F$ and $\rho_f$, and $\phi^{\rho_\phi}_\alpha$ is the flavon field with representation $\rho_\phi$. Finally, $C_{Aa\alpha} = C^{\rho_F \times \rho_f \times \rho_\phi \to \mathbf{1}}_{Aa\alpha}$ stands for the Clebsch-Gordan matrix that pins down the product representation to a specific flavour singlet.
Let us act on the term with an element $g \in G$, which leaves the singlet invariant: In the broken phase, where $G \to H$ (with $H \subset G$) by $\langle \phi_\alpha \rangle$ with the condition the mass matrix $m_{Aa} = C_{Aa\alpha} \langle \phi_\alpha \rangle$ exhibits an invariance under generic elements $h \in H$, Moreover, when we consider the broken element transformations $T(g')$ where $g' \notin H$, the VEV instead transforms as $T_{\alpha\beta} \langle \phi_\beta \rangle = \langle \phi'_\alpha \rangle \Rightarrow m \to m'$. This leads to the following equality: which explicitly shows the non-invariance of the mass matrix under $g'$. The invariance relation is also easily extended to the combination $m m^{\dagger}$, which is the more general framework that we consider in the paper. Starting from the relation we end up with In that case $T^{\rho_F}_{AB}(h) \to T^{\rho_F}_{AB}(h) \, e^{i\theta}$ leaves the condition invariant; therefore the VEV-preserving condition for the combination $m m^{\dagger}$ is simply given by Here one then clearly sees the origin of the condition $\hat{e}_a \cdot \hat{e}_a \overset{!}{=} 1$ in (21). Finally, the corresponding constraint for Majorana mass terms can be easily derived from the same procedure, and in that case one finds that no additional phases are present.
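The logic of this derivation can be summarized in a short worked sketch. The index placement, the suppression of the Higgs field and of overall couplings, and the convention that the fields transform as $F \to T^{\rho_F} F$, $f \to T^{\rho_f} f$, $\phi \to T^{\rho_\phi} \phi$ are assumptions made here for illustration; they are not necessarily the conventions of the displayed equations in this Appendix.

```latex
\begin{align}
% Assumed schematic invariant (Higgs field and overall coupling suppressed)
\mathcal{L} &\supset \bar{F}_A \, C_{Aa\alpha} \, \phi_\alpha \, f_a \,, \\
% Invariance under g in G for arbitrary field values
\left[T^{\rho_F}(g)^{\dagger}\right]_{BA} C_{Aa\alpha} \,
  T^{\rho_\phi}_{\alpha\beta}(g) \, T^{\rho_f}_{ab}(g) &= C_{Bb\beta} \,, \\
% Contract with the VEV, allowing a phase for h in H
T^{\rho_\phi}_{\alpha\beta}(h) \langle \phi_\beta \rangle
  = e^{i\theta} \langle \phi_\alpha \rangle
  \;&\Rightarrow\;
  e^{i\theta} \, T^{\rho_F}(h)^{\dagger} \, m \, T^{\rho_f}(h) = m \,,
  \qquad m_{Aa} = C_{Aa\alpha} \langle \phi_\alpha \rangle \,, \\
% The phase and the right-handed transformation drop out of m m^dagger
T^{\rho_F}(h)^{\dagger} \, m m^{\dagger} \, T^{\rho_F}(h) &= m m^{\dagger} \,, \\
% Broken elements map the VEV to a different one
T^{\rho_\phi}_{\alpha\beta}(g') \langle \phi_\beta \rangle
  = \langle \phi'_\alpha \rangle
  \;&\Rightarrow\;
  T^{\rho_F}(g')^{\dagger} \, m' \, T^{\rho_f}(g') = m \,,
  \qquad m'_{Aa} = C_{Aa\alpha} \langle \phi'_\alpha \rangle \,.
\end{align}
```

In this sketch the unitarity of $T^{\rho_f}(h)$ removes the right-handed transformation from $m m^{\dagger}$, and the phase $e^{i\theta}$ cancels against its conjugate, which is why the VEV only needs to be preserved up to a phase when the invariance of $m m^{\dagger}$ is all that is required.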