How much do charm sea quarks affect the charmonium spectrum?

The properties of charmonium states are or will be intensively studied by the B-factories Belle II and BESIII, the LHCb and PANDA experiments and at a future Super-c-τ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} Factory. Precise lattice calculations provide valuable input and several results have been obtained by simulating up, down and strange quarks in the sea. We investigate the impact of a charm quark in the sea on the charmonium spectrum, the renormalization group invariant charm–quark mass Mc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M_{\mathrm{c}}$$\end{document} and the scalar charm–quark content of charmonium. The latter is obtained by the direct computation of the mass-derivatives of the charmonium masses. We do this investigation in a model, QCD with two degenerate charm quarks. The absence of light quarks allows us to reach very small lattice spacings down to 0.023fm\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.023~\hbox {fm}$$\end{document}. By comparing to pure gauge theory we find that charm quarks in the sea affect the hyperfine splitting at a level around 2%. The most significant effects are 5% in Mc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M_c$$\end{document} and 3% in the value of the charm quark content of the ηc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta _c$$\end{document} meson. Given that we simulate two charm quarks these estimates are upper bounds for the contribution of a single charm quark. We show that lattice spacings <0.06fm\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<0.06~\hbox {fm}$$\end{document} are needed for safe continuum extrapolations of the charmonium spectrum with O(a) improved Wilson quarks. A useful relation for the projection to the desired parity of operators in two-point functions computed with twisted mass fermions is proven.


Introduction
The charmonium system is frequently characterized as the "hydrogen atom" of meson spectroscopy owing to the fact that it is non-relativistic enough to be reasonably well described by certain potential models [1]. It is a perfect testing ground for a comparison of theory with experiment. Over the last years, there has been a renewed interest in spectral calculations with charmonia because of the experimental discovery of many states which are not predicted by potential models [2], e.g. the so-called X, Y, Z states, like the X (3872) state [3] or the P c pentaquark candidates [4]. More exciting experimental data are expected from the B-factories Belle II [5] and BESIII [6], the LHCb experiment [7], the PANDA experiment at FAIR [8] and at a future Super-c-τ Factory [9].
States above the open charm thresholds decay strongly and multi-hadron channels need to be included for a full treatment. The masses of these resonances can be computed in the approximation that they are treated as stable and are accurate up to the hadronic width [10,11].
For the computation of the charmonium spectrum the relevant quarks to include in the lattice simulations are u, d, s, and c. The question which we address in this work is the necessity to include the charm quark c in the sea, i.e. as a dynamical quark which contributes through loops and not only as a valence quark. QCD with N f = 2 + 1 (u, d, s) dynamical quarks is cheaper to simulate than QCD with N f = 2 + 1 + 1 (u, d, s, c) dynamical quarks. Adding a dynamical charm quark requires finer lattices than they are needed for the lighter quarks and complicates the tuning of the parameters.
For processes at energies E which are much smaller than the charm-quark mass M c the charm quark decouples [17,18]. It can be integrated out and its effects are absorbed in the renormalization of the gauge coupling and light quark masses, and in small corrections proportional to inverse powers of M c . In [19][20][21] the decoupling of the charm quark at low energies was studied in a model, namely QCD with N f = 2 degenerate heavy quarks of mass 1.2 M c M M c /8. Simulations at very small lattice spacings down to a = 0.023 fm in physical volumes comparable to those used in the Yang-Mills theory allow to control the continuum limit. Concerning the renormalization the model study confirmed that treating decoupling in perturbation theory only introduces small non-perturbative corrections which can be estimated through the model calculation. Denoting a low energy scale of mass dimension one by S, the massscaling function defined by where M is the renormalization group invariant mass of the heavy quark, is universal (i.e. it does not depend on the specific scale chosen) up to non-perturbative 1/M 2 corrections η M NP . In [21] the conclusion was that η M NP < 0.014 for the charm quark in QCD. We emphasize that Eq. (1.1) corresponds to the charm quark content of the nucleon and is needed to compute the cross-section of the scalar interaction of dark matter with nucleons.
In this work we extend the model study of charm loop effects to observables which explicitly depend on a valence charm quark. The paper is organized as follows. In Sect. 2 we introduce the model. Section 3 explains our lattice setup based on twisted mass fermions at maximal twist, the observables and the computation of their derivative with respect to the quark mass. In Sect. 4 we present our results for the charm loop effects, specifically in the charmonium spectrum and the renormalized charm-quark mass. We also compute the generalization of the mass-scaling function in Eq. (1.1) to describe the charm-quark mass-dependence of the charmonium states. All our results are evaluated after continuum extrapolation and we discuss the size of lattice artifacts. Section 5 contains the summary of this work. Appendix A shows how to construct two-point functions which project to definite parity states with twisted mass fermions. In Appendix B the charmonium masses obtained on our ensembles are listed.

Model
Consider QCD with quarks q i , i = {u, d, s, c}. We denote their Dirac operators by D i . Our goal is to estimate the contribution of charm-quark loops in physical observables A[q i , U ], where U represents the gauge field. The expectation value of the observable is The charm-quark loop effects 1 stem from the determinant det D c . Quenching the charm, i.e. setting det D c = 1 means neglecting the charm loops. This approximation is made in the computations of the charmonium spectrum of Refs. [10][11][12][13][14]. In order to assess how good this approximation is, one would need a comparison in the continuum limit with simulations where a dynamical charm quark is added. Assuming this was possible, the comparison would be superfluous since one would stick with the more complete theory anyhow. But adding a dynamical charm quark means a significant increase in the complexity and costs of the simulations. This is so because of the additional tuning of the charm quark mass and the combination of small lattice spacings, which are required by the large charm-quark mass and the large physical volumes, which are needed to accommodate the light mesons. So the really interesting question is if it is possible to decide whether a dynamical charm quark is necessary before doing the simulations. This is why the study of a model, QCD with just N f = 2 degenerate charm quarks, is appealing. Observables in this model are defined in terms of a doublet of charm quarks q c = (c 1 , c 2 ) and their expectation value is After matching this theory with a Yang-Mills (or pure gauge) theory, the difference in physical observable will be a direct measure of the effects of charm-quark loops. There are two differences with respect to a comparison between QCD with four (u, d, s, and c) and three (u, d, s) quarks: in the model we miss the effects of the light quarks and we double the number of sea charm quarks. Since what we are interested in is a comparison of a theory with and without charm quarks in the sea we do not expect the light quarks to affect the difference of the same quantity computed in the two theories much. The extra charm quark in the sea will make the effects larger. For a low energy quantity, where the theory of decoupling applies, the effects scale proportionally to the number of quarks [21], so they are overestimated by a factor of two in the model. For a quantity with a valence charm quark decoupling does not apply in the obvious way 2 and we consider the effects computed with two charm quarks in the sea as an upper bound for those with only one charm quark.

Simulations
In this section we introduce the lattice setup used for this work and all the observables under investigation. We mainly focus on quantities with an explicit charm-quark dependence, like charmonium masses, the hyperfine splitting and the renormalization group invariant quark mass.

Actions and algorithms
We use relatively simple and theoretically well understood lattice actions for our simulations. For the N f = 0 ensembles the standard Wilson plaquette action [22] is employed. In the N f = 2 case, a doublet of twisted mass Wilson fermions is added [23,24]. In massless schemes, theories with standard and with twisted-mass fermions share the same renormalization factors, as long as other details of the action are the same. We therefore also include a clover term [25] in our action.
Although not necessary for O(a) improvement of physical quantities [24] (at maximal twist), it has been shown to reduce O(a 2 ) artifacts in some cases [26], and more importantly gives us access to the wide range of renormalization factors that have been determined non-perturbatively in the past. In particular we benefit from the knowledge of the critical mass m cr [27,28] and the axial current and pseudoscalar density renormalization factors Z A [29][30][31] and Z P [27,32]. Since one of our goals is a detailed understanding of charm related lattice artifacts, we simulate also at very fine lattice spacings, much finer than what is currently feasible in simulations that include light quarks. Problems related to deficient sampling of topological sectors are avoided by the implementation of open boundary conditions in the time directions [33]. The spatial dimensions are kept periodic.
To summarize, our action is S = S g + S f , with gauge action where the summation is over all oriented plaquettes p on the lattice, weighted by w( p) which is one everywhere except for spatial plaquettes on the temporal boundary time-slices, where it is 1/2. U ( p) is the product of four SU(3) gauge fields U μ (x) around the elementary plaquette p. Gauge fields are periodic in spatial directions and absent on temporal links sticking out of the lattice (i.e. open boundaries). The free parameter of the gauge action is the bare coupling g 2 0 ≡ 6/β. In case of the N f = 2 simulations, a fermionic action is added where χ = (c 1 , c 2 ) is a flavor doublet of quarks and the Dirac operator is with bare mass m 0 and twisted bare mass μ. The third Pauli matrix τ 3 in the twisted mass term acts in flavor-space, all other terms of the operator are flavor diagonal.
is the massless Wilson operator, containing the usual covariant forward and backward finite difference operators ∇ μ χ(x) . Finally, the operator in the Sheikholeslami-Wohlert term acts as A symmetric discretization of the field strength tensorF μν , as e.g. in [34], is used. The fermionic fields are periodic in spatial directions and satisfy Dχ = 0 on the first and last time-slice of the lattice. The fermionic part of the action has dimensionless simulation parameters κ ≡ 1 2am 0 +8 , aμ and c sw . The above choice for the actions corresponds to setting the gluonic and fermionic boundary improvement terms to their tree level values.
Both N f = 0 and N f = 2 theories are simulated with a Hybrid Monte Carlo (HMC) [35] algorithm. The molecular dynamics equations are integrated using a fourth order Omelyan-Mryglod-Folk integrator. In the case with fermions a multi-level variant is employed, with fermionic Table 1 Simulation parameters of our ensembles. The columns show the lattice sizes, the gauge coupling β = 6/g 2 0 , the critical hopping parameter, the twisted mass parameter μ, the pseudoscalar mass in t 0 units, the hadronic scale t 0 /a 2 defined in [40] and the total statistics in molecular dynamics units. Notice that even though the number of sites in the temporal direction is even, the temporal extent T is an odd multiple of a due to the open boundaries. The links pointing out of the lattice volume are absent forces being integrated with a coarser step size than the forces deriving from the gauge action. In addition the quark determinant is factorized into two factors which are then represented by two separate path integrals over pseudo fermion fields [36]. The costs are dominated by solutions of the Dirac equation. The relatively high quark masses in our simulations, mean that a standard conjugate gradient algorithm is often more efficient than more complicated preconditioned variants. On the finer lattices however, SAP preconditioning [37] of the equations involving the light Hasenbusch mass is beneficial. Our simulations are carried out using a variant of openQCD [38]. A minor change allows us to choose a different twisted mass parameter in the SAP preconditioner than in the simulation [21,39]. Table 1 summarizes our simulation parameters.
The simulation algorithm performs very well. In particular no increased critical slowing down due to deficient sampling of topological sectors can be observed. The scaling of the exponential auto-correlation time with the lattice spacing is compatible with the expected τ exp ∝ a −2 behavior [33]. The expected scaling of autocorrelation times with open boundary conditions has been shown in Fig. 8 of Ref. [21].

t 0
The Wilson-flow equation [40,41] is a gauge action of the smeared fields, in our case the Wilson plaquette action, and the link differential operator ∂ x,μ is defined in the usual way [40,42]. It has been shown that correlators constructed from gauge fields at t > 0 are automatically renormalized [43]. Among other things, this allows to define the low-energy length scale t 0 [40] as the flow time t at which In this equation E(t) denotes the Yang-Mills action density at flow-time t, away from the temporal boundaries. A different discretization than the one used in the simulations may be used. We follow [34] and use a symmetrized clover definition where G a μν (x, t) are the Lie algebra components of the lattice field strength tensor.

Isovector meson masses
We study mesons that are ground states in the channels that are excited by operatorsψ τ a ψ. Twisted mass fermions at maximal twist,χ and χ , are related to the fields in the physical basis by (3.10) Table 2 Typical interpolators for meson states and relations between physical and twisted basis. The particle name is the closest relative in nature State J PC Particle Physical basis Twisted basis 2 χ a The notation refers to the γ -structure of the operator This means that some operators take an unusual form. For flavor components τ 1 and τ 2 , the relations are summarized in Table 2.
Meson masses can be extracted from zero momentum correlation functions of the form with various choices of the operators O i . We work with definite flavor assignments, e.g.
Then, integrating over the fermions leaves us with a sin- where i are 4 × 4 matrices related to the operators in the correlation function, and S 1 (S 2 ) is the inverse Dirac operator with positive (negative) twisted mass term. Spatial translation invariance could be exploited to eliminate one of the sums, which would allow to compute a correlator at the cost of 12 solutions of the Dirac equation per choice of y 0 . The signal however is highly improved, by keeping the two sums. The trace can then be efficiently estimated stochastically. We use time-dilution with 16 U (1) noise sources per time-slice, which amounts to 16 inversions per y 0 value and Dirac structure. An improved signal and exact symmetries are achieved by defining the averages 14) (3.16) Enforcing the continuum time reflection symmetries prevents opposite parity operators from mixing, as explained in Appendix A. From the exponential decay of these correlators at x 0 a, meson masses are extracted. First, effective masses are computed , (3.17) and the meson mass is then given as a weighted plateau average The start of the plateau, t low , is chosen such that excited state contributions are completely negligible, and the weights w are given by the inverse squared errors of the corresponding effective masses. All masses that we extract are those of iso-vector mesons. In the light sector these would be called . However, since both our quarks have the mass of a charm-quark, the meson masses that we obtain are more comparable to the charmonia masses η c , J/ψ,. . . respectively. The difference being, that these are iso-scalars and the determination of their masses would require the computation of disconnected (charm annihilation) diagrams.

PCAC mass
Partial conservation of the axial current is an operator relation On the lattice it holds up to lattice artifacts, when inserted into any correlation function, as long as A and P are at a different positions than all other operators in the correlator. These lattice artifacts depend on the exact choice of correlation function, and can be quite large. We extract the bare PCAC quark mass from where∂ μ denotes the symmetric finite difference operator. The lattice artifacts in this quantity increase, when the correlators are evaluated close to the boundary (small x 0 ). We form an average value from the time-slices in the plateau region away from boundaries. For us the main use of the PCAC mass is to find the critical value of the bare mass m 0 , i.e. the maximal twist condition. It is given by the value at which m PCAC = 0. Instead of determining it ourselves, we use very precise critical masses obtained in [27,28]. These were computed from slightly different correlation functions in a finite volume, and differ from ours by an O(a) lattice artifact. We thus do not expect the PCAC masses that we determine to be zero, but to be small and to vanish when the continuum limit is approached. By computing them, we put this expectation to a test. Figure 1 demonstrates that indeed, up to lattice artifacts, we are at maximal twist. If we consider the usual definition of the twist angle the largest deviation from maximal twist (ω = π/2) that we encounter in our simulations is around 6% in the ensemble E, whilst the smallest deviation is around 2% in the ensemble W.

RGI quark mass
At maximal twist a renormalized quark mass is given by m = Z −1 P μ, and depends on the scale and scheme in which Z P was computed. Away from maximal the more general relation holds. We neglect the (very small) contribution due to nonvanishing m PCAC in our determination, after verifying that it is compatible with being of O(a). The axial current renormalization factor Z A is scale independent. It has been determined non-perturbatively in the N f = 0 theory with our action, by exploiting current algebra relations in a massless Schrödinger functional [29]. The same technique has also been applied to the N f = 2 theory [30]. In this case also a more precise determination based on universal relations between correlators in a chirally rotated Schrödinger functional exists [31], and these are the values that we use here. The pseudoscalar renormalization factor Z P depends on the renormalization scheme and scale. It is known non-perturbatively in the SF scheme in both N f = 0 [32] and N f = 2 [27] theories for a wide range of bare couplings, albeit at slightly different scales. The renormalized charm quark mass can thus be computed in the continuum limit, in this particular scheme. To be able to compare the two theories, also the scales should match. We go one step further and compute directly the RGI masses, which are scale and scheme independent. The necessary relations between renormalized and RGI masses are well known for the scales and schemes used above, namely M/m = 1.157 (12) in the N f = 0 theory [32], and M/m = 1.308 (16) in the theory with two dynamical quarks [27].

Twisted mass derivatives
We also computed the derivatives of all the observables above, with respect to the twisted mass parameter μ. The twisted mass derivative of a primary observable A is given by Most quantities we are interested in, are non-linear functions of various primary observables (e.g. m P , which depends on the correlatorf P P at various distances in the plateau region). For these the chain rule dictates None of the observables that we consider have an explicit μ dependence, so the last term in Eq. (3.23) is absent. The derivative of the action is dS/dμ = xχ iγ 5 τ 3 χ , and this is all that is needed to compute the twisted-mass derivatives of purely gluonic observables. More precisely, The last line is a consequence of the twisted-mass relation D 1 − D 2 = 2iγ 5 μ and allows for a more precise stochastic determination of the trace [44]. We found that 64 U (1) noise vectors are enough for the errors in the determination of the derivative to be dominated by gauge-noise, rather than the noise from the stochastic trace evaluation. If the observables depend on fermionic fields too, the first term of Eq. (3.23) gives rise to new contractions that have to be computed. These are different for every fermionic observable. In the case of our two-point functions Eq. (3.11) we find contractions of the form −a 10 x,y,z tr[ 1 S 2 (x, y) 2 S 1 (y, x)]tr[γ 5 (S 1 (z, z) − S 2 (z, z))] gauge , that can be immediately computed because both traces have already been Fig. 1 The standard mass contribution to the renormalized quark mass, which vanishes in the continuum limit estimated for the evaluation of the correlator and of dS/dμ respectively, and new terms ia 10 x,y,z that require some attention. When evaluated stochastically together with the correlator itself, the number of necessary inversions is increased by a factor of 3. While the dS/dμ terms quantify the dependence on the sea-quarks, this last term gives the valance quark mass dependence of the correlator, which is generally much stronger -especially with heavy quarks.

Mass scaling functions
At last, we also investigate the mass scaling functions where m x denotes the mass of a meson in a generic x channel (scalar, pseudoscalar, vector, axial vector, tensor) and M is the renormalization group invariant quark mass. Note that η x is a renormalized quantity and its continuum limit can be easily extracted from the measurements performed at different lattice spacings, without the need of any renormalization factor. Notice that by the Hellman-Feynman-Theorem [45], η x is proportional to the scalar charm quark density between meson states x, i.e. the σ -term Once the twisted mass derivatives of the meson correlators are known, the determination of η x amounts to the evaluation of Eq. (3.24) with a particular function f . Since the action of N f = 0 QCD does not depend on μ, the calculation is greatly simplified in this case. Eq. (3.23) receives a single contribution of the form dÃ[D −1 , U ]/dμ . In the N f = 2 theory on the other hand, also the μ-derivative of the action must be taken into account.

Parameters, tuning and mis-tuning corrections
Apart from the lattice size, the bare parameters of the N f = 2 simulations are the inverse bare coupling β, the bare mass am 0 and the bare twisted mass aμ. The choice of β corresponds to a choice of the lattice spacing. We choose to simulate at β ∈ {5.3, 5.5, 5.6, 5.7, 5.88, 6.0} which spans a wide range of lattice spacings, see Table 5 and allows for very controlled continuum extrapolations.
The bare mass is set to its critical value m 0 = m cr . To achieve this, the values in [27] are fitted to a Padé function, as described in [21]. This puts us to maximal twist, up to O(a 2 ). In this situation the physical quark mass is given by the twisted mass parameter m = Z −1 P μ. On our finest lattice at β = 6.0 we choose where the ratio of the RGI charm quark mass and the two flavor parameter is set to 4.87, the pseudoscalar renormalization factor at scale L −1 1 and β = 6.0 in the SF scheme is Z P = 0.5184 (33) [27], the relation between a renormalized quark mass in the SF-scheme at scale L −1 1 and the RGI quark mass M is known in the continuum M/m(L −1 1 ) = 1.308 (16) [27], and MS L 1 = 0.649 (45) [46]. Finally L 1 in lattice units is obtained by an interpolation of L 1 /a vs β data from [27] to β = 6.0. A quadratic fit of log(L 1 /a) as a function of β, describes the data very well and yields L 1 /a β=6.0 = 17.27 (70). The quite substantial errors mean, that our simulated mass corresponds to the charm quark mass only up to about 10%. This is however fully sufficient for us, as long as the relative mass differences between the different ensembles are under better control. To achieve this, we do not use Eq. (3.31) at the other lattice spacings. Instead we proceed as follows: the dimensionless, renormalized quantity √ t 0 m P = 1.807463 (3.32) is determined on the ensemble with the finest lattice spacing. On the coarser lattices aμ is tuned such that the same value of √ t 0 m P is obtained. This condition determines the bare twisted mass parameter very precisely and ensures that all ensembles have the same renormalized quark mass up to O(a 2 ). Finally, the clover coefficient c sw is set to its nonperturbatively determined value [47].
The tuning of the twisted mass parameter can be only carried out to a limited precision -at most to within the statistical errors. To account for the mis-tuning, a correction is applied to all observables, based on the computed twisted mass derivatives. First a target tuning point μ is determined and afterwards all quantities, denoted by below, are corrected The error of the tuning point μ * is propagated to the value of (μ * ) taking all correlations into account. It is assumed that the initial tuning was precise enough for the omitted quadratic terms to be negligible, compared to the statistical precision. Figure 2 demonstrates the procedure. A comparison with direct simulations indicates that even for large shifts of ≈ 15% in aμ the linear approximation works well. The true shifts, that are needed are all much smaller, at most 5.40%. Note that the μ-shifts could also be computed using the mass reweighting, as explained in [48,49].
The N f = 0 simulations are carried out at β ∈ {6.1, 6.34, 6.672, 6.9}. The valence quarks have m 0 = m cr [50], non-perturbative c sw from [50] and three values of the twisted mass parameter, chosen such that a short interpolation to the value of √ t 0 m P given in Eq. (3.32) can be performed. An example of this procedure is shown in Fig. 3. Since decoupling applies to t 0 , the condition Eq. (3.32) means that the quark mass in the N f = 0 theory is the same as in the theory with two flavors, up to O(a 2 ) and tiny O( 2 /M 2 c ) power corrections [20,21].   Table 1). The horizontal line depicts the tuning point Eq. (3.32). The vertical lines are the resulting interpolated twisted mass parameter aμ and its statistical error. The measured vector, scalar and tensor mesons masses (diamonds, triangles and squares respectively) can then be interpolated to the tuning point, resulting in the corresponding solid markers. In their error bars all the correlations among the data have been taken into account

Data analysis
We use the -method [51] for the determination of statistical uncertainties. Observables like the effective mass Eq. (3.17) are non-linear functions of "primary observables", and their errors are determined as described in [52]. When incorporating the mis-tuning corrections of Sect. 3.3 the necessary nonlinear functions can become quite unwieldy. For instance, the vector meson mass at μ depends on the vector correlator in the plateau region, but also on the pseudoscalar correlator in its plateau region, to determine how big a shift in μ is required. Furthermore, the vector mass depends on the μderivatives of these correlators, on the μ-derivative of the action and on the μ-derivative of the action times the correlators. Combinations like √ t 0 m V depend on even more primary data.

Raw results
We measured all observables described in the previous section on all ensembles, except of m T and m S which were measured only on a subset and the mass derivatives, which were not measured on the W ensemble. A somewhat delicate issue is the proper choice of the plateau regions over which the effective masses are averaged. The leading correction to a constant effective mass is given by where 1 ( 2 ) is the distance between m and the first (second) excited state. In a first preliminary fit we determine 1 and c. We are then in the position to choose the plateau region such, that the influence of the excited states on the plateau average Eq. (3.18) is negligible compared to its statistical uncertainty. The thus determined plateau regions are collected in Table 3. Figure 4 demonstrates the procedure for the case of ensemble W . The effective masses in the axialvector channel become too noisy, before a clean plateau is reached and are hence excluded from the tables.
The results for the plateau averages are summarized in Table 6 in Appendix B, which shows the results at the simulated parameters, as well as the values corrected for small mis-tunings in the twisted mass parameter.

Continuum extrapolations
We perform continuum extrapolations of dimensionless quantities. These are either ratios of meson masses, namely m V /m P , m S /m P and m T /m P , or the mass-scaling functions η P and η V . One last quantity is the renormalized quark mass. We take the continuum limit of the dimensionless ratio of m and m P . All fits are restricted to a region where the data can be well described by the expected leading scaling violations Table 3 The meson masses are determined from effective masses in the region t low < x 0 + a/2 < t high . The  Fig. 4 The effective masses on the W ensemble for the pseudoscalar (circles), vector (diamonds), scalar (triangles) and tensor (squares) channels are displayed, together with the plateau average and its error band. The fit to the leading correction Eq. (4.1) is also shown of order a 2 . This means, neglecting data with lattice spacings coarser than a 2 /t 0 > 0.25. Figures 5, 6 and 7 and Table 4 summarize our findings. The data entering the fits are collected in Table 7.
The results in the continuum limit are collected in Table 4.

Dynamical charm effects
The comparison of continuum results in the N f = 2 theory with those in the N f = 0 theory directly quantifies the typical size of the effects, that the inclusion of dynamical charm quarks have on observables with valence charm quarks. Although they were determined very precisely, no significant effect can be seen in the meson mass spectrum. The most significant deviations of around 1.6σ are found in the ratios m V /m P and m S /m P . The relative differences between the central values of the first ratio are  A clearer difference between the N f = 0 and N f = 2 theories can be observed in the mass-scaling functions and in the RGI quark mass. The values of η P and the quark mass differ by almost 3σ . The relative differences are 1.8)%. An even larger (but less significant) difference is found in η V .

Lattice artifacts
Having access to very fine lattice spacings is crucial for reliable continuum extrapolations. Although our fermionic  action, i.e. twisted mass fermions with an additional clover term, is known to have relatively mild lattice artifacts, the continuum value of e.g. m V /m P would be significantly underestimated if we had access only to our two coarsest lattices (E and N). The finer of the two has a lattice spacing of a ≈ 0.049 fm, which is comparable to the finest lattice spacings typically achievable in large-volume simulations with light quarks. The situation is depicted in Fig. 8. The presence of large lattice artifacts of O((aμ) 2 ) not only affects observables like m V /m P , but also the value of the lattice spacing a itself. Since it is obtained by determining some hadronic length scale L had /a in lattice units at finite lattice spacing and dividing it by the continuum value in fm, i.e. a = a/L had × L had,cont , its value depends on the lattice artifacts present in L had . In our case one possibility to compute the lattice spacings is through the scale L had,cont,1 ≡ L 1 = 0.40 (1) fm. Its values in lattice units are known for our bare couplings and the resulting lattice spacings are between a L 1 = 0.023 fm on ensemble W and a L 1 = 0.066 fm on ensemble E. Alternatively, one could determine the lattice spacing through L had,cont,2 ≡ √ t 0 (M) = 0.1131 (38) fm [21]. While the two lattice spacing determinations agree well on the fine ensembles, the dif-  ference is quite substantial on the coarsest one, where we find a t 0 ≈ 0.1 fm, i.e. we observe a 37% lattice artifact in a! Since t 0 /a 2 is also determined on the quenched ensembles, we can determine their lattice spacings using the decoupling relation . Note that lattice spacings determined by using the N f = 0 theory as an effective theory for our massive two flavor theory differ from those determined by using it as an (uncontrolled) approximation to full QCD. In particular these lattice spacings depend on the value of M in the fundamental theory. This is also the reason why our value of M c does not agree with other quenched determinations, like e.g. [53]. Table 5 summarizes our scale setting.

Conclusions
In this work we presented a determination of the effects of charm quarks in the sea based on a simulation of a model, QCD with N f = 2 charm quarks. By comparing to the N f = 0 pure gauge theory at the matching point defined in Eq. (3.32) we can compute the size of these effects. We find that they are below 2% for the hyperfine splitting of charmonium. These are good news for lattice QCD computations of charmonium based on simulations of N f = 2 + 1 light quarks in the sea. We also demonstrate in Fig. 8 that lattice spacings a < 0.06 fm are needed for safe continuum extrapolations of the charmonium spectrum when using O(a) improved Wilson quarks. We also computed the effects of sea charm quarks in the mass-scaling function η of the charmonium masses Eq. (3.29) and in the renormalization group invariant charm-quark mass M c . Table 4 lists the comparison in the continuum limit of these quantities in the N f = 2 and N f = 0 theory. The effects of the charm sea quarks are clearly resolved and their size is 3% for η P and 5% for M c . We notice that our results are upper bounds for the effects of a charm sea quark in QCD since in our model we have doubled their number.
Further analysis to compute charm loop effects in decay constants and finestructure of B c mesons is in progress. So far the disconnected contributions due to charm annihilation [54] have been neglected since we computed isovector charmonium masses in our model. Work on these contributions is under way.

Appendix A: Parity and time-reflection symmetries
Following transformations can be considered as a change of variables in the lattice path integral: Parity P : They are symmetries of the twisted mass action only if μ = 0. In general these transformations lead to relations between expectation values in theories with positive and with negative twisted masses. E.g. for the two-point functions like Eq. (3.11) one finds

Appendix B: Tables
See Tables 6 and 7.