Functional equations and the Cauchy mean value theorem

The aim of this note is to characterize all pairs of sufficiently smooth functions for which the mean value in the Cauchy mean value theorem is taken at a point which has a well-determined position in the interval. As an application of this result, a partial answer is given to a question posed by Sahoo and Riedel.


Introduction
Given two differentiable functions F, G : R → R, the Cauchy mean value theorem (MVT) states that for any interval [ (1) Here, and in the rest of the paper we will use the "lower case" notations for the derivates f = F and g = G . A particular situation is the Lagrange MVT when G(x) = x is the identity function, in which case (1) reads as The problem to be investigated in this note can be formulated as follows.
for all a, b ∈ R, where f = F , g = G , α, β ∈ (0, 1) are fixed and α + β = 1. For the case of the Lagrange MVT with c = a+b 2 , this problem was considered first by Haruki [5] and independently by Aczél [1], proving that the quadratic functions are the only solutions to (2). This problem can serve as a starting point for various functional equations [9]. More general functional equations have been considered even in the abstract setting of groups by several authors including Kannappan [6], Ebanks [3], Fechner-Gselmann [4]. On the other hand, the result of Aczél and Haruki has been generalized for higher order Taylor expansion by Sablik [8].
For the more general case of the Cauchy MVT much less is known. We mention Aumann [2] illustrating the geometrical significance of this equation and the recent contribution of Páles [7] providing the solution of a related equation under additional assumptions. In this note we provide a different approach to the Cauchy MVT. As it will turn out, the most challenging situation corresponds to c = a+b 2 in which case our main result is the following: Theorem 2. Assume that F, G : R → R are three times differentiable functions with derivatives F = f , G = g such that for all a, b ∈ R. Then one of the following possibilities holds: (d) there exists a non-zero real number μ such that The paper is organized as follows. In Sect. 2 we consider the problem first for the known case of the Lagrange MVT as an illustration of our method. In Sect. 3 we provide a preliminary result that will allow passing local information to a global one about the pairs of differentiable functions (F, G) satisfying (3). In Sects. 4, 5 we consider the asymmetric (α = β) and symmetric (α = β = 1/2) cases, respectively. Section 6 is for final remarks. Here we also provide a partial result to an open problem by Sahoo and Riedel which corresponds to a more general version of (3).

The Lagrange MVT with fixed mean value
Note that every c ∈ (a, b) can be written uniquely as c = αa + βb for some α, β ∈ (0, 1) with α + β = 1. It is easy to check that (2) holds for all a, b ∈ R with fixed α = 1/2 if F is a linear function, and with α = 1/2 if F is a Vol. 90 (2016) Functional equations and the Cauchy mean value theorem 685 quadratic function. We claim that the converse of this statement is also true. As mentioned earlier, there are various proofs of the latter in the literature, see for example [1,5,9]. Nevertheless, we give here a short and self-contained argument mainly to illustrate our approach to the more general case of the Cauchy MVT.
Then the following statements hold: Proof. Let us put αb + βa = x and b − a = h. Then (5) reads as From this equation it is apparent that f = F is differentiable as a linear combination of two differentiable functions and thus F is twice differentiable. By induction, it follows that F is infinitely differentiable. Differentiating (6) with respect to h, we obtain the relation Again, we differentiate (7) with respect to h and find that Since f is continuous, letting h 0, we obtain If α = 1/2, this implies that f = 0 identically. Therefore f is constant and thus F is a linear function, proving the first statement. If α = 1/2, then (7) reads as and twice differentiation with respect to h leads to Now letting h 0, we get f (x) = 0 for all x ∈ R, so f is linear and F is a quadratic function, proving the second statement.

The Cauchy MVT with fixed mean value
Let us introduce the sets and also their complements Z f := R\U g and Z g : Proof. By assumption, there is a non-empty interval (p, q) ⊂ U g such that Denoting x + αh by y for x ∈ [p, q] and h > 0, we get F (y) − F (y − h) = 0 if (h, y) lies within the semi-strip (cf. Fig. 1) Then, for y > p choosing h > 0 such that (h, y) ∈ L, we have ∂ ∂y for y > p. However, by (10), we have F (q + αh) = F (q − βh) and thus F (y) is the same constant for all y < q. Therefore, f (y) = F (y) = 0 for all y ∈ R.
Proposition 4 shows that the condition U f ∩ U g = ∅ holds only if at least one of the sets U f and U g is empty. Then we have the simple cases described in the beginning of the section.
and consider the representation (9). If {F, G, 1} are linearly dependent as functions on I σ for every σ ∈ Σ, then {F, G, 1} are linearly dependent on R.
Proof. For σ 1 , σ 2 ∈ Σ with σ 1 = σ 2 , consider the intervals I σ1 := (p 1 , q 1 ), and assume that {F, G, 1} are linearly dependent on I σ1 and I σ2 . Then it follows that there are constants With the changing of variables Inserting this value into (15), we obtain Put y = x + αh, then x − βh = y − h, and (17) means that (h, y) lies within the parallelogram (cf. Fig. 2) Since β ∈ (0, 1), (12) guarantees that Π = ∅, and (16) implies Therefore, at any point of Π, we have (17), so g(y − h) = 0 and thus So far our analysis says nothing about B 1 , B 2 in (13), (14) but since σ 1 , σ 2 ∈ Σ were arbitrary, (18) together with (13) and (14) imply On the other hand, by changing the roles of F and G in the above analysis, we come to the conclusion that By (11) there is a point x 0 ∈ U g ∩ U f so AK = 1 and these coefficients are by trivial reasons (all these values are zeros) so with (19) and (20) these identities are valid on the entire

The Cauchy MVT with fixed asymmetric mean value
In this section we consider the asymmetric case, i.e. in (3) we take The following proposition describes all pairs (F, G) of two times continuously differentiable functions satisfying (3) under the assumption (21) on α, β in the intervals where g = G does not vanish.
if x ∈ I and h > 0 are such that x + αh, x − βh ∈ I. The latter condition yields that (22) holds if (h, x) lies within the open triangle (cf. Fig. 3) By differentiating both sides of (22) with respect to h twice, we obtain the following relation in T All the functions are continuous so the latter holds on the closure T as well, in particular, on the interval {h = 0, p < x < q}. Therefore, with β 2 − α 2 = 1 − 2α = 0 by (21), we get f (x)g(x) = g (x)f (x) for all x ∈ I = (p, q). We can divide both sides by g 2 (x) and conclude that (f/g) = 0 on I. This implies that f/g = A for some constant A ∈ R, and F (x) = f (x) = Ag(x) = AG (x), x ∈ I. After integration we get F (x) = AG(x) + B(x), x ∈ I. The following theorem is the main result of this section.
Proof. Consider the following cases: In this case G is a constant on R and (3) holds for any differentiable function F . Hence (24) holds, for example, with A = 0, B = 1, C = −G and thus {F, G, 1} are linearly dependent on R.
In this case Proposition 4 yields that F is a constant on R and (3) holds for any differentiable function G. Hence (24) holds, for example, with A = 1, B = 0, C = −F and thus {F, G, 1} are again linearly dependent on R.
In this case Propositions 5 and 6 immediately imply that {F, G, 1} are linearly dependent on R.

The Cauchy MVT with symmetric mean value
In this section we consider the problem of describing all pairs (F, G) of smooth functions for which the mean value in (3) is taken at the midpoint of the interval. Our first result gives a necessary (and also sufficient in case {1, F, G} are not linearly dependent) condition on such pairs in the intervals where g = G does not vanish. Proposition 8. Assume that F, G : R → R are three times differentiable functions with derivatives F = f , G = g. Let I ⊂ R be such an interval that g = 0 for all x ∈ I and (4) holds for all a, b ∈ I. Then there exist constants A, K ∈ R and x 0 ∈ I such that

Moreover, if (25) holds with K = 0, then (4) holds if and only if
for all x, h ∈ R such that x, x + h, x − h ∈ I. Proof. With the changing of variables x = a+b 2 , h = b−a 2 , we can rewrite (4) as for all x, h ∈ R with the property that x, x + h, x − h ∈ I. By differentiating this equality three times with respect to h, we get Setting h = 0, we obtain for all x ∈ I, x ∈ I, and integration over (x 0 , x) with any x 0 ∈ I yields (25). Now assume (25) holds with a nonzero constant K. Then we have By comparing the last two relations, it is easy to see that (26) is equivalent to (4).
The following example illustrates that there are non-trivial functions satisfying (26) (and hence (4)) on R. , and consequently, We invite the interested reader to verify directly that the pair (F, G) in (28) satisfies the relation (4), giving a non-trivial example of such pairs. Now we assume that K = 0 and analyze the property (26) for all x, h ∈ R such that x, x + h, x − h ∈ I. Differentiating it with respect to h, we obtain Differentiation two more times with respect to h gives for all x ∈ I and h ∈ R such that x, x + h, x − h ∈ I. Setting h = x − x 0 in these two equations, we obtain and for all x ∈ I with 2x − x 0 ∈ I. Since 2x − x 0 ∈ I and g has no zeros in I, both sides of (29) do not vanish. By comparing (30) and (29), we get for all x ∈ I such that 2x−x 0 ∈ I. Putting y(x) := g(2x−x 0 ) and λ := 4g (x0) g(x0) , (31) yields the second order differential equation y − λy = 0, whose general real-valued solution (depending on the sign of λ), has the following form where P , Q are real constants. Hence G has one of the following forms where A, B, C are real constants.
Remark 10. Altogether, we come to the following conclusion: on every interval I ⊂ R on which G = 0, either {F, G, 1} are linearly dependent, or G and thus also F , cf. (25), has one of the forms described in (32)-(34).

Vol. 90 (2016)
Functional equations and the Cauchy mean value theorem 693 In the sequel, we call a function G (resp. the pair (F, G)) to be of quadratic, exponential or trigonometric type on I if G has (resp. both of F and G have) the form (32), (33) or (34), respectively. Consider the set U g and its representation, cf. (8), (9). The following lemma plays a crucial role in the analysis of the equation (4).
Proof. g(p) = 0 by (9) so by Remark 10, it is sufficient to consider the following cases. Case 1: G is of quadratic type on (p, q).
Then F is also of quadratic type on (p, q), and since f (p) = g(p) = 0, we have F, G ∈ span 1, (x − p) 2 . Thus {F, G, 1} are linearly dependent on (p, q). Case 2: G is of either exponential or trigonometric type on (p, q).
First suppose that G is of exponential type on (p, q). Then so is F and since the set of functions satisfying (4) is invariant with respect to the addition of constant functions, we can assume, without loss of generality, that F, G ∈ span e μ(x−p) , e −μ(x−p) for some μ = 0. Hence there are real constants u, v such that − p)). The same argument for G explains that G(x) = 2w cosh(μ(x − p)) for some real w, and consequently F and G are multiples of the same function cosh(μ (x − p)).
If G is of trigonometric type, then in the same way as above, we can conclude that F and G are multiples of the same function cos(μ(x − p)), implying that {F, G, 1} are linearly dependent on [p, q).
Proof of Theorem 2. Consider the set U g defined in (8). If U g = ∅, then g ≡ 0 on R, and thus G is identically constant on R. In this case F can be an arbitrary differentiable function on R and thus {1, F, G} are linearly dependent on R. If U g = R, then it follows (cf. Remark 10) that either {1, F, G} are linearly dependent or G has one of the forms (32)-(34) on the whole of R. Moreover, we get the same conclusion if U g ∩ U f = ∅ (cf. Proposition 4).
Next, let us assume that U g ∩ U f = ∅ and U g is a proper subset of R. Consider the representation (9). It is clear (cf. Remark 10) that the index set Σ can be split into disjoint subsets as Σ = Σ lr ∪ Σ q ∪ Σ t ∪ Σ e , where Proof. Assume Σ lr is a proper subset of Σ. Then there exists σ 2 ∈ Σ such that σ 2 / ∈ Σ lr . Since Σ lr = ∅, there is σ 1 ∈ Σ lr and A 1 ∈ R such that f (x) = A 1 g(x) on x ∈ I σ1 . Consider all x, h ∈ R such that x + h ∈ I σ2 and x ∈ I σ1 . Using (4) for a = x − h and b = x + h, and recalling that g = 0 on I σ1 , we get Therefore, x ∈ I σ1 . From this it follows that F and G are in linear relationship on I σ2 , that is, σ 2 ∈ Σ lr , which leads to a contradiction. Claim 2. If Σ lr = ∅, then only one of the index sets Σ q , Σ t , Σ e is non-empty. Proof. Let σ ∈ Σ and I σ = (p, q). Since U g is a proper subset of R, one of p, q is finite. We can assume p > −∞. Then g(p) = 0, and Lemma 11 yields f (p) = 0. Hence using (4) for a = p − h and b = p + h we get so the graph of G is symmetric with respect to the vertical line y = p.
If σ ∈ Σ q or σ ∈ Σ e , then q = +∞ since the functions of quadratic type have exactly one and the functions of exponential type have at most one critical point. Therefore, if σ ∈ Σ q , then G ∈ span{1, (x − p) 2 }, x ∈ R and Σ = Σ q . Similarly, it follows from (36) that if σ ∈ Σ e , then Σ = Σ e .
Next, assume Σ lr = Σ q = Σ e = ∅. Then Σ = Σ t and let σ ∈ Σ t . Since G is of trigonometric type on I σ = (p, q), we must have q < +∞. So g(p) = g(q) = 0 and it follows as in the proof of Lemma 11 that there are real constants u, v such that Using (36) we obtain that (37) holds on the whole of R.
Since U g = ∅, at least one of Σ lr , Σ q , Σ t , Σ e is non-empty. If Σ lr = ∅, then Claim 1 and Proposition 5 imply that {F, G, 1} are linearly dependent on R. If Σ lr = ∅, then Claim 2 yields that one of the possibilities (b)-(d) holds.

Final remarks
As a consequence of our main result we can give a partial answer to the following still open question of Sahoo and Riedel (cf. [9, Section 2.7] for an equivalent formulation). for all x, y ∈ R.
We provide a partial solution to this problem under certain assumptions on the unknown functions. Proof. Let f, g be the derivatives of F, G, respectively and the sets U g , U f (resp. Z g , Z f ) be defined as in Sect. 3. Without loss of generality, assume that φ does not vanish on R. By differentiating (39) with respect to t and setting t = 0 in the resulting equation, we get f (s)φ(s) = g(s)ψ(s), s ∈ R.
For any s ∈ U g and t ∈ R, by (39)