The \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta $$\end{document}η transition form factor from space- and time-like experimental data

The \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta $$\end{document}η transition form factor is analyzed for the first time in both space- and time-like regions at low and intermediate energies in a model-independent approach through the use of rational approximants. The \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta \rightarrow e^+e^-\gamma $$\end{document}η→e+e-γ experimental data provided by the A2 Collaboration in the very low-energy region of the dielectron invariant mass distribution allows for the extraction of the most precise up-to-date slope and curvature parameters of the form factors as well as their values at zero and infinity. The impact of these new results on the mixing parameters of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta $$\end{document}η–\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta ^\prime $$\end{document}η′ system, together with the role played by renormalization dependent effects, and on the determination of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$VP\gamma $$\end{document}VPγ couplings from \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V\rightarrow P\gamma $$\end{document}V→Pγ and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P\rightarrow V\gamma $$\end{document}P→Vγ radiative decays is also discussed.


Introduction
The pseudoscalar transition form factors (TFFs) describe the effect of the strong interaction on the γ * γ * P vertex, where P = π 0 , η, η , η c . . ., and is represented by F Pγ * γ * (q 2 1 , q 2 2 ), a function of the photon virtualities q 2 1 , and q 2 2 . From the experimental point of view, one can study such TFFs from both space-and time-like energy regions. The time-like region of the TFF can be accessed at meson facilities either through the double Dalitz decay processes P → l + l − l + l − , which give access to both photon virtualities (q 2 1 , q 2 2 ) in the range 4m 2 l < (q 2 1 , q 2 2 ) < (m P − 2m l ) 2 , or the single Dalitz decay processes P → l + l − γ , which contains a single virtual photon with transferred momentum in the range 4m 2 l < q 2 1 < m 2 P , thus simplifying the TFF to F Pγ * γ * (q 2 1 , 0) ≡ F Pγ * γ (q 2 ). To complete the time-like region, e + e − colliders access the values q 2 > m 2 P through the e + e − → Pγ annia e-mail: rescriba@ifae.es b e-mail: masjuan@kph.uni-mainz.de c e-mail: sanchezp@kph.uni-mainz.de hilation processes. The space-like region of the TFFs are accessed in e + e − colliders by the two-photon-fusion reaction e + e − → e + e − P, where at the moment the measurement of both virtualities is still an experimental challenge. The common practice is then to extract the TFF when one of the outgoing leptons is tagged and the other is not, that is, the single-tag method. The tagged lepton emits a highly off-shell photon with transferred momentum q 2 1 ≡ −Q 2 and is detected, while the other, untagged, is scattered at a small angle with q 2 2 0. The form factor extracted from the singletag experiment is then F Pγ * γ * (−Q 2 , 0) ≡ F Pγ * γ (Q 2 ).
At low-momentum transfer, the TFF can be described by the expansion where F Pγ γ (0) is the normalization, the low-energy parameters (LEPs) b P , c P , and d P are the slope, the curvature, and the third derivative of the TFF, respectively, and m P is the pseudoscalar meson mass. F Pγ γ (0) can be obtained either from the measured two-photon partial width of the meson P, or, in the case of π 0 , η, and η , from the prediction of the axial anomaly in the chiral limit of QCD.
Several attempts to describe the η TFF are available in the literature at present [3,5,7,[15][16][17][18][19][20][21][22][23][24][25][26][27][28][29] but none of them tries for a unique description of both space-and time-like experimental data, specially at low energies. In Ref. [30], it was suggested for the π 0 case that a model-independent approach to the space-like TFF can be achieved using a sequence of rational functions, the Padé approximants (PAs), to fit the data. Later on, in Ref. [4], the same method was applied to the η and η TFFs. More recently, the A2 Collaboration reported a new measurement of the η → e + e − γ Dalitz decay process with the best statistical accuracy up to date [14]. A comparison with different theoretical approaches was also performed. In particular, the results from Ref. [4], based on space-like data, were extrapolated to the time-like region and agreed perfectly with their measurement. Triggered by these new A2 results, we explore in the present work a combined description of both space-and time-like regions of the η TFF within our method of rational approximants. This will provide, for the first time, a determination of the energy dependence of the η TFF in both regions together with a unified extraction of its LEPs.
Our approach makes use of PAs as fitting functions to all the experimental data. PAs are rational functions P N M (Q 2 ) (ratio of a polynomial T N (Q 2 ) of order N and a polynomial R M (Q 2 ) of order M) constructed in such a way that they have the same Taylor expansion as the function to be approximated up to order O(Q 2 ) N +M+1 [31]. Since PAs are built in our case from the unknown low-energy parameters (LEPs) of the TFF, once the fit to the experimental data is done, the reexpansion of the PAs yields the desired coefficients. Being rational functions the PAs are analytic everywhere except where the poles are located. Branch cuts cannot in principle be described by PAs, however, if the function to be approximated is of a certain kind, for instance a Stieltjes function, it can be proven mathematically that an infinite-order PA is able to reproduce the cut [31]. 1 Another interesting issue is the implementation of chiral logarithms of the kind log(Q 2 /M 2 ), appearing for instance in chiral expansions at next-to-leading order, in the PAs method. These chiral logs admit a Taylor expansion which can be seen as an infinite-order diagonal PA and is convergent for any value of Q 2 > 0. Therefore, in the case the approximated function includes chiral logs their effects are incorporated in the PAs to a good extent (more precise as the order of the PA increases). 2 The advantage of PAs over Taylor expansions is their ability to enlarge the domain of convergence. However, to prove the convergence of a given PA sequence is a difficult task and only for certain classes of functions this can be done rigorously. In practice, the success of PAs in the description of experimental data can only be seen a posteriori in the sense that the pattern of convergence can be shown but unfortunately not proven mathematically. We refer the interested reader to Refs. [32,33] for details on this technique. In this work, we resume our method [4] for fitting the η TFF experimental data after including all the recent available time-like measurements from η → l + l − γ decays (l = e, μ). Besides recapitulating the main features of the method we will address the following issues: • A reevaluation of the systematic errors considered in our previous work is demanded by the inclusion of time-like data at these low energies. This new set of data being more precise than the space-like one, its incorporation will allow for an improved systematic error associated with each element of a given PA sequence and the increase in order of the sequence itself. • The better description of the low-energy region of the TFF allows for an improved determination of its value at zero momentum transfer, which is related to the two-photon decay width of the η. The impact of the recent measurement of this width by the KLOE collaboration [34] and of older measurements based on Primakoff techniques is commented. • The role played by high-energy space-like data in view of the fact that in such region only BABAR data is available. Related to this, the existing puzzle between the precise mixing scenario derived from the TFF in contrast to the measured time-like cross section by BABAR at q 2 = 112 GeV 2 [35] is discussed. The possibility for the Belle Collaboration to measure the time-like η TFF is also mentioned; • The extraction of the η-η mixing parameters from the TFFs and the two-photon decays after discussing the role of the renormalization scale dependence of the singlet decay constant F 0 . The new results are much better constrained with the inclusion of time-like experimental data and turn out to be competitive with standard determinations, such as for instance the analysis of V → Pγ (V = ρ, ω, φ) and P → V γ decays [36]. • The determination of these V Pγ coupling constants from the former mixing parameters and its comparison with current experimental values. The effect of OZI-violating parameters and higher-order effects is also discussed.
The paper is organized as follows. In Sect. 2, a reanalysis of the systematic error related to our method when taking into account both space-and time-like experimental data is performed. In Sect. 3, a brief description of the general method for extracting the low-energy parameters of the η TFF using rational approximants is presented and then the impact on them of both the η → γ γ latest measurement and the highenergy space-like data are discussed. In Sect. 4, the implications of our new results for the determination of the η-η mixing parameters, the understanding of the BABAR puzzle, and the prediction of the V Pγ couplings are examined. Finally, in Sect. 5 the conclusions of the present analysis are given.

A new systematic error
In the context of Padé approximants, by systematic error is meant the difference between the function to be approximated and the highest approximant reached after the fit procedure. If there is seen convergence, the larger the PA order, the smaller the systematic error. Therefore, any finite-order PA should have a definite systematic error. In this section, we discuss how to obtain such an error for a scenario containing both time-and space-like data.
In order to illustrate the utility of the PA as fitting functions, Ref. [30] simulates the real situation of the experimental data on the space-like region by generating with different models a set of pseudodata. Such data were then fitted with a P L 1 (Q 2 ) (single-pole approximants) sequence and the LEPs where extracted. This exercise was twofold: first it was meant to show the ability of the PA sequence to extract the LEPs and, second, also provided a systematic error for the extraction of each LEP at each value of L. In Ref. [4], more examples were worked out and further discussed and we refer the interested reader to such references.
Dealing now with a larger set of data, such systematic errors should be reanalyzed, specially because the amount of time-like data, which covers the lowest-energy regionand is most important for LEPs extraction-is larger than the space-like one.
Following the strategy presented in Refs. [4,30], we simulate with an holographic model (see Appendix B for details on the model together with on the simulation) the situation of the experimental data from both space-and time-like data; see, respectively, [9,10,37] and [11,12,14]. The results obtained with the holographic model described in Appendix B are collected in Table 1 where the relative errors for the first three derivatives for each element on the P L 1 (Q 2 ) sequence are reported. These results are model dependent. Using, instead, the quark model considered in Ref. [30], we find faster convergence and we reach systematic errors one order of magnitude better for the higher PA of the sequence than the holographic model. We chose to use the results with the holographic one to be on the conservative side.
The strategy is then to generate pseudodata for both regions trying to emulate the real experimental situation. In the space-like region, we evaluated the model at 10 points in the region 0.6 ≤ Q 2 ≤ 2.2 GeV 2 , 15 points in the region 2.7 ≤ Q 2 ≤ 7.5 GeV 2 , and 9 more points in the region 9 ≤ Q 2 ≤ 34 GeV 2 . In the time-like region, the model is evaluated at 8 points in the region (0.045) 2 ≤ Q 2 ≤ (0.100) 2 GeV 2 , 15 points in the region (0.115) 2 ≤ Q 2 ≤ (0.220) 2 GeV 2 , and 31 more points in the region (0.230) 2 ≤ Q 2 ≤ (0.470) 2 GeV 2 . On top of these set of data points we add the value of F ηγ γ (0, 0). All these data points have zero error because we want to obtain a pure systematic error on our fitting functions. Notice that the majority of points lie in the low-energy region. This simple exercise also prevents us against over-fitting problems. The very same study can be performed to evaluate the P N N (Q 2 ) sequence. The results are, however, an order of magnitude better than for the P L 1 (Q 2 ) one (see the comparison in Appendix B). From now on, we consider only the systematic errors from the latter to be on the conservative side.
The forthcoming BESIII data on the η TFF at space-like region below 9 GeV 2 might demand a reanalysis of our systematic errors, although we think that including them would not really modify our percentages beyond the precision we are reporting them in Table 1. Their data will be, nevertheless, crucial to reduce our statistical errors which is by now the dominant source.
In passing, we also study what would be the systematic error done by a VMD fit to only the time-like data set. From the three models considered in Refs. [4,30], the most conservative systematic error found is around 5 % (details are presented in Appendix B). Notice that when fitting spacelike data with a VMD such an error is around 40 %. The reason of such a difference is simple because available timelike data is much closer to the origin of energies than the space-like one and less sensible to higher-order effects.

η transition form factor: a space-and time-like description
To extract the η TFF low-energy parameters b η , c η , and d η (slope, curvature, and third derivative respectively) from the available data, we start with a P L 1 (Q 2 ) sequence. However, according to Ref. [38], the pseudoscalar TFFs behave as 1/Q 2 for Q 2 → ∞, which means that, for any value of L, one will obtain in principle a good fit only up to a finite value of Q 2 but not for Q 2 → ∞. Therefore, it would be desirable to incorporate this asymptotic-limit information in the fits to Q 2 F ηγ * γ (Q 2 ) by considering also a P N N (Q 2 ) sequence. This method, which makes use of experimental data and theoretical framework for fitting them, cannot access the second Riemann sheet where the resonance poles are supposed to be located [39]. One cannot extract resonance poles parameters with such methods, and that poses a word of caution on the interpretation of fits such as Eq. (3) to relate its pole parameters with effective masses. Our method does not contain a branch cut and all the analytical structure is built to reproduce only the first Riemann sheet. The effective pole we obtain should lie outside the range where data are. The main advantage of the method of PAs is indeed to provide the Q 2 dependence of the TFF over the whole space-and time-like region up to the first resonance in an easy and systematic way, without the need of a model for the resonance poles appearing in the amplitude [30,32]. For how to extract resonance pole parameters using PA, see Refs. [40,41].

A remark on experimental systematic errors
When comparing time-like data results from different collaborations it is common to report, together with the experimental data, the result of a fit with a single-pole function Eq. (3). Although such data contain only statistical errors, systematic errors are incorporated in the result of the fit. When using these data in our fits one must incorporate the systematic error information into the fitted data. 3 The A2 Collaboration reported in 2011 on the Dalitz decay η → e + e − γ [12]. Their fit yielded −2 = (1.92±0.35 stat ± 0.13 syst ) GeV −2 . Combining both statistical and systematic error one obtains −2 = (1.92 ± 0.39 comb ) GeV −2 . In order to obtain the combined error from a direct fit to the published data one can include a new source of error defined in the following way For the A2 2011 data we find that = 6.8 % will allow us to reproduce, with Eq.
Published space-like data contains both error sources separately. The exception is the CELLO Collaboration which does not report a systematic error for each bin of data. Only a 12 % for the two-photon η-decay channel is reported. Accounting for all the different systematic sources we could find in their publication, we ascribe a 12 % of systematic error for the hadronic η decay which leads to a 6 % error for the global number of events (implying a 12 % of systematic error for each bin). We expect that the forthcoming space-like measurements at BES-III will provide the accurate description of such energy region and the role of the unknown systematic effects in the CELLO data would not be important.

Results
After defining the set of data we will use, we report on our results. We start fitting with a P L 1 (Q 2 ) sequence. We reach L = 7 and we show it in Fig. 1 as a green-dashed line. The smaller plot in Fig. 1 is a zoom into the time-like region. The obtained LEPs are collected in Table 2 and shown in Fig. 2 together with our previous results (empty orange) when only space-like data were included in our fits [4]. The stability observed for the LEPs with the P L 1 (Q 2 ) sequence is remarkable, and the impact of the inclusion of time-like data is clear since not only allows us to reach higher precision on each PA but also to enlarge our PA sequence by 2 elements. The stability of the result is also clearer and reached earlier, reduces our systematic error, and shows the ability of our method to extract, for the first time, the LEPs from a combined fit to all the available data. The coefficients of the best fitted P L 1 (Q 2 ) can be found in Appendix A. Fig. 1 η-TFF best fits. Green-dashed line shows our best P L 1 (Q 2 ) fit and black line our best P N N (Q 2 ) fit. Experimental data points in the space-like region are from CELLO (red circles) [9], CLEO (purple triangles) [10], and BABAR (orange squares) [37] Collaborations. Experimental data points in the time-like region are from NA60 (blue stars) [11], A2 2011 (dark-green squares) [12], and A2 2013 (empty-green circles) [14]. The inner plot shows a zoom into the time-like region Table 2 Low-energy parameters for the η TFF obtained from the PA fits to experimental data The first column indicates the type of sequence used for the fit and N is its highest order. The last row shows the weighted average result for each LEP. We also present the quality of the fits in terms of χ 2 /DOF (degrees of freedom). Errors are only statistical and symmetrical To reproduce the asymptotic behavior of the TFF, we have also considered the P N N (Q 2 ) sequence (second row in Table  2). The results obtained are in very nice agreement with our previous determinations. The best fit is shown as black-solid line in Fig. 1. We reach N = 2. Since these approximants contain the correct high-energy behavior built-in, they can be extrapolated up to infinity (black-dashed line in Fig. 1) and then predict the leading 1/Q 2 coefficient: This prediction, although larger than in our previous work [4], still cannot be satisfactorily compared with the BABAR time-like measurement at q 2 = 112 GeV 2 , F ηγ * γ (112 GeV 2 ) = 0.229 (30)(8) GeV [35]. The impact of such a discrepancy on η-η mixing is discussed in the next section.
Our combined weighted average results from Table 2, taking into account both types of PA sequences, give where the second error is systematic (around 0.7, 1.5, and 9 % for b P , c P , and d P , respectively, from Table 1).
Equation (5) can be compared with b η = 0.60(6) stat (3) sys , c η = 0.37(10) stat (7) sys using space-like data exclusively [4]. As expected, not only statistical results have been improved but also systematics, both by an order of magnitude, yielding the most precise slope determination ever.
One should notice that all the previous collaborations used a VMD model fit to extract the slope. In order to be consistent when comparing with our results, a systematic error of about 40 % should be added to the experimental determinations based on space-like data [4,30], and a systematic error of about 5 % should be added to the experimental determinations based on time-like data (see Appendix B for further details).
When comparing different theoretical extractions of the slope of the η TFF with our result in Fig. 3, we find a pretty good agreement with the exception of the results in Ref. [3] that reported b η = 0.546(9) and b η = 0.521(2) using resonance chiral theory with one-or two-octet ansätze. The disagreement is between 2 and 5 standard deviations. Reference [3] uses resonance chiral theory, which is based on large-N c arguments, to extract LEPs. Going from large-N c to N c = 3 imposes a systematic error [33,[44][45][46]. Since Ref. [3] considered two approximations for fitting the η TFF  (with one and two octets), one could consider the difference between them as a way to estimate such an error [4,40,47]. In such a way, the η TFF slope would read b η = 0.53(1), at 2.5 standard deviation from our result.
Eventually, we want to comment on the effective singlepole mass determination P from Eq. (3). Using b P = m 2 P / 2 P and the values in Eq. (5), we obtain η = 0.722(7) GeV or −2 η = 1.919(39) GeV −2 . The fits shown in Fig. 1 use the experimental value of the two-photon decay width as an experimental datum to be fitted. Such a fit could be repeated without includ-ing that decay. In such a way, we reach again a P 7 1 (Q 2 ) and a P 2 2 (Q 2 ) as our best PA with the advantage now that the value F ηγ γ (0) is a prediction of our fits. We find F ηγ γ (0)| fit = 0.250 (38) GeV −1 for the P 7 1 (Q 2 ) and F ηγ γ (0)| fit = 0.248 (28) GeV −1 for the P 2 2 (Q 2 ), which translates into ηγ γ | fit = 0.4 (13) keV and ηγ γ | fit = 0.42 (10) keV, respectively. Comparing with the experimental value ηγ γ | exp = 0.516 (18) keV such predictions are at 0.66 and 0.94 standard deviation each.

The impact of η → γ γ measurements
Our results in Eq. (5) are, by far, the most precise to date. Particularly, we believe that the precision achieved for b η will be hard to improve even if new data becomes available. Nevertheless, the values obtained mildly depend on ηγ γ . For instance, if we would have used the value measured through the Primakoff mechanism omitted in the PDG average [42] (i.e., Primakoff ηγ γ = 0.476(62) keV [48]), we would find b η = 0.570 (13) for the SL+TL extraction with a 20 % larger error, still in nice agreement with our aforementioned results. Notice that this result is pretty similar to the one obtained by our fits when the decay into two photons is not used in the data set. This fact does not lead to a puzzle, everything seems to agree within uncertainties, but it may suggest to look again for a Primakoff measurement.  Low-energy parameters are defined at zero momentum transfer. When extracting them from our fits, one would expect the low-energy data to dominate. We noticed, however, that in order to reach large PA sequences (leading to more precise extractions), the high-energy data is also important as can be seen in Fig. 1. From 5 to 35 GeV 2 data are basically dominated by the BABAR measurement [37] and that has a clear implication on the extraction of the asymptotic lim Q 2 →∞ Q 2 F ηγ * γ (Q 2 ) value as can be seen in Table 3, where the role of data from each collaboration is reported. Indeed, a fit exclusively to BABAR data yields similar results for both slope and asymptotic value than when considering the full set of space-like data. However, a fit to the data from the CELLO [9] Collaboration which range only up to 2.23 GeV 2 yields much larger asymptotic value (although statistically compatible). Considering only data from the CLEO [10] Collaboration, which ranges up to 12.74 GeV 2 , reduces the asymptotic value by about 20 % compared to CELLO. The role of BABAR data is then twofold, allowing first to reach N = 2 in the P N N (Q 2 ) sequence and determining basically the asymptotic value.
In view of the puzzle of the π 0 TFF between BABAR [49] and Belle [50] results, a second experimental measurement covering the high-energy region would be very welcome here. We find the Belle Collaboration suited for such purpose and we would like to encourage them to go ahead with such measurements.
On the other side, time-like data can also be used to predict the asymptotic value, even though the range of data is much shorter and much closer to Q 2 = 0. From the three sets of time-like data used in our fits, A2-11 [12] and A2-13 [14] are based on the η → e + e − γ and covers larger range of phase space. The NA60 [11] Collaboration, based on the η → μ + μ − γ , covers a shorter range but in the higherenergy region. The asymptotic values extracted from the difference time-like sets of data agree rather well but disagree with the results obtained from the space-like data (although overlapping within errors). Whatever the combination of different data sets selected, BABAR data always decides on the asymptotic value. In passing, we notice that any of the configurations considered so far agrees with the results of the η TFF measurement from BABAR [35].

η transition form factor: applications
As stated in the introduction, TFF are not also interesting by themselves but also for the range of scenarios where they play a crucial role. In this section we consider a few of such applications.

Reanalysis of the η-η mixing parameters
In this subsection we briefly summarize the main elements to extract the mixing parameters exclusively from our fits to the form factor data.
As was done in Ref. [4], we analyze η-η mixing using the quark-flavor basis. In this basis, the η and η decay constants are parametrized as where F q,s are the light-quark and strange pseudoscalar decay constants, respectively, and φ q,s the related mixing angles. Several phenomenological analyses find φ q φ s , which is also supported by large-N c ChPT calculations where the difference between these two angles is seen to be proportional to an OZI-rule violating parameter and hence small [36,51].
Within this approximation, the asymptotic limits of the TFFs take the form and their normalization at zero (from the chiral anomaly and Eq. (2)) withĉ q = 5/3 andĉ s = √ 2/3. Experimental information provides |F ηγ γ (0)| exp = 0.274 (5) GeV −1 and |F η γ γ (0)| exp = 0.344 (6) GeV −1 and for the asymptotic value of the η TFF we take the value shown in Eq. (4) with symmetrical errors, lim Q 2 →∞ Q 2 F ηγ * γ (Q 2 ) = 0.177 (15) GeV. With these values, the mixing parameters are predicted to be with F π = 92.21 (14) MeV [42]. The uncertainties are dominated by the error from the asymptotic value prediction. One can translate the mixing parameters obtained in the flavor bases into the octet-singlet one by the following recipe [52]: where represents the admixture of the η and η decay constants in terms of the octet and singlet one. These relations, Eqs. (10), are very useful since, as observed in Ref. [53] and recently discussed in [54], the singlet decay constant F 0 is renormalization-scale dependent: with β 0 = 11N c 3 − 2 3 N F , N c the number of colors, N F the number of active flavors at each scale, and μ the renormalization scale, with μ 0 = 1 GeV a reference point close to the η mass.
To include this effect in our results it is convenient to work it out in the singlet-octet basis for later on translate it into the flavor one using Eq. (10). As such, the asymptotic behavior equations (7) shift to = 2(ĉ q (1 + 4δ/5)F q sin φ +ĉ s (1 + 2δ)F s cos φ). (13) Assuming asymptotic freedom for α s (μ), the phenomenological input α s (M z ) = 0.1185 [42], and the renormalization group equation for α s (μ), we determine α s (μ 0 = 1 GeV) = 0.48, including up to four loop corrections and threshold effects for its running 5 . With such values and Eq. (12) we determine δ = −0.17. Using (10) to go back to the flavor basis we obtain as our final mixing parameters, representing one of the main results of this work: when taking the η(η ) asymptotic behavior, respectively, as part of the subset of equations to be solved (8,13). We stress that corrections from δ are bigger for the η case, as the singlet admixture is more important there. Comparing to our old results (Sect. III in [4]), we find a better agreement among both solutions in (14). As explained in Ref. [4], the mixing equations are not independent, there is a relation between them: where in the last step we have used the exact expressions (8,13). Numerically, using our δ = −0.17, one would obtain 0.85 3 2π 2 for the r.h.s. of (15). Our numerical predictions for the asymptotic form factors together with their experimental normalization yield 0.89(3) 3 2π 2 for its l.h.s., in nice agreement, but this would not be the case without the δ correction.
This result contrast with BABAR determinations, which, taking the running from μ 0 up to their scale Q 2 = 112 GeV 2 instead of at ∞ (resulting in δ B AB AR = −0.09), yields using again the two subsets of equations in (13). Equation (15), l.h.s., would read 0.98 (7) 3 2π 2 where we are neglecting any 1/Q 2 dependence in it. In other words, assuming that Q 2 = 112 GeV 2 plays the role of ∞. The r.h.s. of (15) using δ BABAR results in 0.92 3 2π 2 , then compatible with its l.h.s. This comparison is somewhat false since we do not assume δ BABAR to be at ∞, otherwise we would have used δ = −0.17, and then we would have found a contradiction between l.h.s. and r.h.s. of (15). This discrepancy is what we call the BABAR puzzle; it is depicted in Fig. 4. This figure includes both our mixing results and the BABAR determination at Q 2 = 112GeV 2 with the corresponding mixing parameters obtained using Eqs. (8) and (13).
We remark that our results are tied to α 2 s corrections in Eq. (12) and non-negligible systematic effects for the η asymptotic behavior from our fits. Since this assertion relies on the high-energy TFF-behavior, where only BABAR data are available, a second measurement by Belle Collaboration would be a very useful crosscheck.
The mixing parameters obtained with our fits are precise enough to be competitive with the standard approaches with the advantage of using much less input information. Figure 5 compares our results from Eq. (14) (blue squares) with wellestablished phenomenological determinations, the one from Feldmann, Kroll, and Stech (FKS) from Ref. [51,52], and the one from Escribano and Frere (EF) from Ref. [36] (updated in Ref. [4]). The agreement among the three approaches for both F q and φ is impressive. Less agreement is found for  Fig. 4 Mixing parameters as a function of the η TFF asymptotic value when the errors coming from the normalization of the TFFs is set to zero. For comparison, we show the mixing parameters extracted from the measurement of the time-like η TFF at q 2 = 112 GeV 2 by BABAR Collaboration [35]. This figure exemplifies the puzzle between the standard mixing parameters and the BABAR measurement F s . This parameter is more sensible to meson decays where the strange quark plays an important role, such as the φmeson decays. In fact, such decays where included in the EF approach but not in the FKS or in the present work. Figure 6 compares different singlet octet determinations. In this basis, our results turn out to be F 8 /F π = 1.22 (11), F 0 /F π = 1.15 (6), and they can be compared with the FKS and EF as before together with the results by Leutwyler (L) from Ref. [53] (no errors were given), and the results from Benayoun, Del-Buono, and O'Connell (BDO) [55]. Again, the agreement between our results and FKS and EF are remarkable, and also in agreement with the results of Leutwyler. Our results slightly disagree for θ 0 and F 0 with BDO. The reason is because in the BDO the OZI-violating piece is not set to zero. Since such piece mixes with the singlet component of the mixing, their θ 0 and F 0 are slightly shifted.

A comment on the BABAR high-energy time-like measurement
Our mixing parameters in Eq. (14) disagree with the ones obtained using BABAR time-like result as shown in Fig. 4 which, including the results in Figs. 5 and 6 suggests a puzzle between the BABAR measurement and the standard phenomenology [4]. Assuming that the measurement is correct, the difference could be explained if the assumption of duality between time-and space-like regions at high energy would not be yet valid at 112 GeV 2 or that the asymptotic limit of Brodsky and Lepage is not yet reached at such energies and a systematic error is done in assuming duality [35]. In Ref. [35], BABAR collaboration studied the process e + e − → γ * → η ( ) γ at the center-of-mass energy √ s = 10.58 GeV. They measured its cross section and using its relation with the TFF, obtained the absolute value of the timelike TFF at Q 2 = −s = −112 GeV 2 , |Q 2 F ηγ * γ (Q 2 )| = (0.229±0.031) GeV and |Q 2 F η γ * γ (Q 2 )| = (0.251±0.021) GeV, where statical and systematic uncertainties are added in quadrature.
A kinematic factor K 3 P with K P = 1 − M 2 P s (see [56]) was missing in BABAR expressions. This correction leaves BABAR published results almost untouched. This small shift together with the duality argument [35] results in a prediction of the TFF at Q 2 = 112 GeV 2 : One is tempted to include these time-like measurements transformed into space-like predictions Eq. (18) into our fits, after assuming that at this high momentum transfer, the duality between time-and space-like holds and no extra error should be included. For the η TFF fits, its inclusion will mainly modify the asymptotic prediction growing up its value up to lim Q 2 →∞ Q 2 F ηγ * γ (Q 2 ) = 0.247 GeV, higher than the BABAR result, with a good reduced χ 2 < 1. This, by itself, already indicates that at Q 2 = 112 GeV 2 the asymptotic regime is not yet reached. Curiously enough, the value of the fit function at Q 2 = −112 GeV 2 is |Q 2 F ηγ * γ (Q 2 )| Q 2 =+112 GeV 2 = 0.219 GeV, below (18). Even worse is the prediction of our fit function for the time-like counterpart, i.e., |Q 2 F ηγ * γ (Q 2 ) | Q 2 =−112 GeV 2 = 0.307 GeV. This exercise shows that the assumption of asymptotic regime at 112 GeV 2 has an error of about 15 % in our fits, a theoretical error that should be added to BABAR results when used in the space-like region. A recent analysis of the pseudoscalar TFF based on perturbative corrections [54] concludes that the difference between the timeand space-like form factors at |Q 2 | = 112 GeV 2 can be of the order of 5-13 % for different pseudoscalar distribution amplitudes, and can be enhanced by Sudakov-type corrections (see [57] for details). The Regge model defined in Ref. [4] also suggests a departure from duality of about 15 to 20 % at |Q 2 | = 112 GeV 2 . It is, however, difficult to calculate that error and hence difficult to ascribe it to the BABAR determination.
Interestingly enough, to check the eventual departure arising from duality violations, one could artificially enhance BABAR error just to cross-check its order of magnitude. Increasing the error in Eq. (18) from 0.031 GeV 2 to 0.051 GeV 2 (adding in quadrature a 1.3σ ) and refitting again, we obtain, with somewhat better χ 2 , the result that the asymptotic predicted value would then be lim Q 2 →∞ Q 2 F ηγ * γ (Q 2 ) = 0.193 GeV, the fit value at Q 2 = −112 GeV 2 would read 0.187 GeV, but also our time-like prediction at Q 2 = 112 GeV 2 would read 0.199 GeV, essentially satisfying the initial assumption that time-and space-like TFF coincide at 112 GeV 2 . The error we had to artificially add to reach at that conclusion is around a 20 %, which agrees with our previous statements and also with [54]. Of course, adding this 20 % error in Eq. (16) solve what we call BABAR puzzle.
A 15 % departure from the asymptotic limit may seem too large for that high momentum transfer. Notice [54,57] that due to its nature, TFF are a convolution of a perturbative hard-scattering amplitude and a gauge-invariant meson distribution amplitude (DA) [58] which incorporates the nonperturbative dynamics of the QCD bound-state [38]. That means that even for large Q 2 well inside the asymptotic region, soft scales coming from the Fock decomposition can enhance the TFF. These soft corrections depend on the broadness of the DA. At low energies, our fits suggest the typical hadronic scale for the η TFF to be lower than the η counterpart. Being the η more contaminated by ss content (and less from other Fock states), one would expect its hadronic scale to be close to the φ meson mass, around 1 GeV. This is in fact what we find, and indicates a narrower DA for the η , dominated by a qq state, explaining at once why the duality arguments hold better than in the η case. This argument complements the one discussed in [54] from the perturbative study of the TFFs.
Even larger error should be added to duality arguments at lower energies, such as the measurement of the CLEO Collaboration of the same cross section but at √ s = 3.773 GeV, and forthcoming measurements by the BES-III Collaboration at √ s = 4.26 GeV. For all these reasons, we chose not to use these BABAR measurements in the time-like region in our fits.

A prediction for the V Pγ couplings
In this subsection, we extend our analysis to the vectorpseudoscalar electromagnetic form factors. In particular, we are interested in the couplings of the radiative decays of lowest-lying vector mesons into η or η , i.e., V → (η, η )γ , and of the radiative decays η → V γ , with V = ρ, ω, φ.
We follow closely the method presented in Refs. [36,59], and we make use of the equations in Appendix A in Ref. [36] to relate the form factors with the mixing angle and the decay constants in the flavor basis. To account for the φ-ω mixing we use φ V = 3.4 • . The form factors, saturated with the lowest-lying resonance and then assuming VMD, can be expressed by where g V Pγ are the couplings we are interested in, and f V are the leptonic decay constants of the vector mesons and are determined from the experimental decay rates via with c V an electric charge factor of the quarks that make up the vector, . Experimentally we find using (ρ → e + e − ) = 7.04(6) keV, (ω → e + e − ) = 0.60(2) keV, and (φ → e + e − ) = 1.27(4) keV from [42].
The couplings in this flavor basis are where we have assumed φ q = φ s = φ. Table 4 collects our predictions in its second column. Corrections due to φ q = φ s to these formulas can be found in Appendix A, Eq. (A.5) of Ref. [36].
The decay widths of P → V γ and V → Pγ are  (55) 4.67 (20) Experimental determinations are from Ref. [42] ( The experimental decay widths from [42] allow us to extract an experimental value for g V Pγ , which are collected in the last column on Table 4.
Our predictions compare well with the experimental determinations, see Table 4, specially considering the simplicity of the approach. The differences are always below 2 standard deviations, excepting the ω couplings. Our prediction for the ratio of J/ decays is in that respect remarkable.
The observed deviations hint toward a somehow oversimplified approach. Even though our goal is just to show the relevance of TFF in other decays, and we do not pretend an exhaustive study of higher-order contributions in our scheme, we still want to remark two possible ways to improve our approach.
On the one hand, the fact that F q departs from F π in Eq. (14) may imply a correction through an OZI-violating parameter 1 that appears at next-to-leading order in the Lagrangian of χ PT Large-N c used to define the mixing equations, F q = F π (1 + 1 /3) [52,60] which in turns imply φ q = φ s , since φ q − φ s ∼ 1 /3. With the result in Eq. (14), we estimate 1 ∼ 0.2, in agreement with the naive 1/N c counting (i.e., 1 ∼ 1/N c ∼ 0.3), and then φ q − φ s ∼ 3.8 • .
The ratio R J/ provides direct information on the angle φ q since with M J/ the J/ meson mass. The experimental R J/ ratio defined in last the row in Table 4, results in φ q = (38.1 ± 0.6) • , which implies φ s = (38.1 + 3.8 ± 1.6) • = (41.9 ± 1.6) • with the error coming from our determination of φ in Eq. (14). Even though both angles are distinguishable, their impact on the g V Pγ is a shift of the form g V Pγ → g V Pγ /(cos φ q cos φ s + sin φ q sin φ s ) [36]. For the φ q = φ s limit, such a shift is exactly 1. Using the 3.8 • differ-ence, such a shift translates into 0.998, a 2 per mil effectnegligible. Our assumption φ q = φ s = φ is supported phenomenologically.
On the other hand, as discussed in detail in Ref. [61], in the flavor singlet channel one has to allow for another OZI-rule violating correction, which essentially corresponds to replacing F 0 → F 0 /(1+ 3 ). This shifts both the P → γ γ decays and the formulas for g V Pγ predictions [60]. The parameter 3 is, however, still unknown, although expected to be ∼ 1/N c ∼ 0.3. We can make use of Eq. (15) to estimate it. The shift on F 0 can be translated into a shift in F q,s recalling that both are related to F π following Eqs. (17) and (9), respectively, and find F q,s → F q,s /(1 ± 3 ) as well. Going then to Eq.
Then Eq. (15) transforms into which, after expanding and reorganizing in such a way that in the l.h.s. remain only experimental quantities, results in We recall that l.h.s., experimentally, reads 0.89 3 2π 2 , and δ = −0.17. With (26) we find 3 = 0.05, smaller than expected and with positive sign.
The VPγ couplings are also shifted by 3 . The expressions can be found in Eq. (42) in Ref. [60] which, after expanding, can be expressed as a shift on the couplings in our Eq. (22): g V ηγ → (g V ηγ + |g V ηγ | 3 /2) and g V η γ → (g V η γ + |g V η γ | 3 ), always increasing the coupling. For some of them, the 3 correction goes on the right direction (the ρ case), but for others it is not conclusive (the φ case where for η goes well and for η wrong). The result of the shift is, then, ambiguous.
Discarding OZI-violating effects, Padé approximants can then be the avenue to follow since the vector mass that should be used in Eq. (19) it should not correspond to a physical observable, but an effective scale provided by the pole of a PA assuming the philosophy of the present work. For the η TFF, the 2 η from Eq. (3) is smaller than the VMD mediator. If the same would happen with the ρ, ω form factors, one would expect, then, different g V Pγ couplings. Since this study is beyond the scope of the present analysis, we postpone it for future work. A naive estimate of these effects could be accounted for within the half-width-rule [46], i.e., instead of using m V in Eq. (19), we use m V ± V /2, with the full width of the vector. This provides a way to assess the error of neglecting the width of the resonance in using m V . For example, for the ρ case, within the half-width-rule, the errors of the g ρ Pγ would be enlarged by a factor 3, well compatible with the experimental determinations.
Further studies along these lines are postponed for future work.

Conclusions
In the present work, the η transition form factor has been analyzed for the first time in both space-and time-like regions at low and intermediate energies making use of a modelindependent approach based on the use of rational approximants of Padé type. The model independence of our approach is achieved trough a detailed and conservative evaluation of the systematic error associated to it. The new set of experimental data on the η → e + e − γ reaction provided by the A2 Collaboration in the very low-energy part of the time-like region allows for a much better determination of the slope and curvature parameters of the form factor, as compared to the predictions obtained in our previous work only using spacelike data, which constitute the most precise values up-to-date of these low-energy parameters. Our method is also able to predict for the first time the third derivative of the form factor. In addition, the new analysis has served to further constrain its values at zero momentum transfer and infinity. We have seen that our results, in particular for the case of the slope parameter, are quite insensitive to the values used in the fits for the two-photon decay width of the η, thus showing that the collection of space-and time-like experimental data is more than enough to fix a value for the normalization of the form factor compatible with current measurements. We have also seen that the role played by the high-energy space-like data is crucial to get accurate predictions for the low-energy parameters of the form factor and its asymptotic value. As a consequence of these new results, we have fully reanalyzed the η-η mixing parameters this time also considering renormalization-scale dependent effects of the singlet decay constant F 0 . The new values obtained are already competitive with standard results having the advantage of requiring much less input information. Related to this, we have also obtained predictions for the V Pγ couplings which are in the ballpark of present-day determinations.
In summary, the method of Padé approximants has been shown to be very powerful for fixing the low-energy properties of the η transition form factor making their predictions more accurate and well established. This fact opens the door to a more exhaustive analysis of the single Dalitz decay processes P → l + l − γ , with P = π 0 , η, η and l = e, μ, the double Dalitz ones P → l + l − l + l − (in all possible kinematically allowed configurations) [62], and the rare leptonpair decays P → l + l − -see the π 0 → e + e − application in Ref. [63], which are usually discussed only in terms of monopole approximations. Indeed, when this work was being concluded the BESIII Collaboration reported a first observation of the η → e + e − γ process measuring the branching ratio and extracting the η transition form factor [64]. This new measurement may put our approach with its back to the wall. However, a very preliminary analysis of this recent data in comparison with our prediction for this form factor in the time-like region exhibits a nice agreement but reveals the necessity of going beyond the VMD model used in the experimental analysis [65].  Table 5.
With the coefficients in Table 5, one can extract the slope of the TFF by expanding (A.1) and normalizing the result as in Eq. with m η = 0.547853 GeV, to be compared with the third column in Table 2.

Appendix B: Convergence of the Padé approximant sequence
To test how fast the convergence of our PA sequence is we analyze here a simple holographic confining model presented in [18] (and also explored in Ref. [17]), based on lightfront holographic QCD where the correct small Q 2 behavior (in order to simulate confinement) is introduced using the dressed current (see [18] for details). 7 In this context, the TFF is defined as (assuming, for simplicity, η ∼ η 8 ) where P qq is the probability of finding the qq component in the η light-front wave function, and we impose P qq = 0.5 for numerics. Once the model is defined, by generating a set of pseudodata as is done is Sect. 2 we can test how fast the PA sequence converge to Q 2 F ηγ * γ (Q 2 ). We fit these pseudodata with both P L 1 (Q 2 ) and P N N (Q 2 ) sequences going up to L = 8 and N = 4 and we show the convergence pattern for the value of the η TFF at the origin, F ηγ γ (0), and its three first derivatives (b η , c η , and d η ) in Fig. 7 (the black square shows the corresponding value from Eq. (A.3)). This exercises complements the one studied in the appendix of Ref. [4]. The first element of the sequence, the P 1 1 (Q 2 ) contains only two parameters and then only F ηγ γ (0) and b η can be directly extracted from the fit. By reexpanding it one could predict all the other derivatives but we only show fit outputs. The same applies for the second element of the sequence, the P 1 1 (Q 2 ) and its third derivative. The reader should notice the hierarchy pattern of convergence. While F ηγ γ (0) is approach very fast and with the P 2 1 (Q 2 ) the error is about a 2 %, the derivatives are worse predict, being the third one the worst. Comparing each prediction with its counterpart from the model gives and idea of the systematic error done by such procedure (see Ref. [4] for details about the systematic error from our method). The P N N (Q 2 ) sequence converges much faster, reaching the 10 −3 with its second element. The inconvenience is its growing two-by-two inputs for each new element, making the sequence more difficult to be performed in fits to real data. These results are model dependent and other models may yield different systematic errors. The trend shown in Fig. 7 is, however, general although models such us the logarithmic one studied in Ref. [30] converge faster (reaching smaller errors) even though the systematic error for the first elements on the P L 1 (Q 2 ) are larger.