Abstract
There are a lot of papers dealing with applications of the so-called cyclic refinement of the discrete Jensen’s inequality. A significant generalization of the cyclic refinement, based on combinatorial considerations, has recently been discovered by the author. In the present paper we give the integral versions of these results. On the one hand, a new method to refine the integral Jensen’s inequality is developed. On the other hand, the result contains some recent refinements of the integral Jensen’s inequality as elementary cases. Finally some applications to the Fejér inequality (especially the Hermite–Hadamard inequality), quasi-arithmetic means, and f-divergences are presented.
Similar content being viewed by others
1 Introduction
The significance of convex functions is rightly due to Jensen’s inequality. A real function f defined on an interval \(C\subset \mathbb{R}\) is called convex if it satisfies
for all \(t_{1},t_{2}\in C\) and all \(\alpha \in [ 0,1 ] \).
Let the set I denote either \(\{ 1,\ldots,n \} \) for some \(n\geq 1\) or \(\mathbb{N}_{+}\). We say that the numbers \(( p_{i} ) _{i\in I}\) represent a discrete probability distribution if \(p_{i}\geq 0\) (\(i\in I\)) and \(\sum_{i\in I}p_{i}=1\). It is called positive if \(p_{i}>0\) (\(i\in I\)). A permutation π of I refers to a bijection from I onto itself.
The following discrete and integral versions of Jensen’s inequality are well known.
Theorem 1
(discrete Jensen’s inequalities, see [16] and [17])
(a) Let C be a convex subset of a real vector space V, and let \(f:C\rightarrow \mathbb{R}\) be a convex function. If \(p_{1},\ldots,p_{n}\) represent a discrete probability distribution and \(v_{1},\ldots,v_{n}\in C\), then
(b) Let C be a closed convex subset of a real Banach space V, and let \(f:C\rightarrow \mathbb{R}\) be a convex function. If \(p_{1},p_{2},\ldots \) represent a discrete probability distribution and \(v_{1},v_{2},\ldots \in C\) such that the series \(\sum_{i=1}^{\infty }p_{i}v_{i}\) and \(\sum_{i=1}^{\infty }p_{i}f ( v_{i} ) \) are absolutely convergent, then
Theorem 2
(integral Jensen’s inequality, see [16])
Let φ be an integrable function on a probability space \(( X,\mathcal{A},\mu ) \) taking values in an interval \(C\subset \mathbb{R}\). Then \({\int _{X}} \varphi \,d\mu \) lies in C. If f is a convex function on C such that \(f \circ \varphi \) is μ-integrable, then
Remark 3
It follows that Theorem 1 (b) can be generalized in case of \(V=\mathbb{R}\): if \(C\subset \mathbb{R}\) is an interval (not necessarily closed) and the other conditions of the statement are satisfied, then \(\sum_{i=1}^{\infty }p_{i}v_{i}\) lies in C and (2) holds.
There are many papers dealing with refinements of discrete and integral Jensen’s inequalities (see the book [12] and the references therein).
In papers [2] and [13] there are special refinements of the discrete Jensen’s inequality of the form in Theorem 1 (a) (so-called cyclic refinements). These led the author to the following refinement of the discrete Jensen’s inequality which is a significant generalization of the previously mentioned results.
Theorem 4
(see [11])
(a) Let \(k,n\geq 2\) be integers, and let \(p_{1},\ldots,p_{n}\) and \(\lambda _{1},\ldots,\lambda _{k}\) represent positive probability distributions. For each \(j=1,\ldots,k\), let \(\pi _{j}\) be a permutation of the set \(\{ 1,\ldots,n \} \). If C is a convex subset of a real vector space V, \(f:C\rightarrow \mathbb{R}\) is a convex function, and \(v_{1},\ldots,v_{n}\in C\), then
(b) Let the set J denote either \(\{ 1,\ldots,k \} \) for some \(k\geq 2\) or \(\mathbb{N}_{+}\). Let \(p_{1},p_{2},\ldots \) and \(( \lambda _{j} ) _{j\in J}\) represent positive probability distributions. For each \(j\in J\), let \(\pi _{j}\) be a permutation of the set \(\mathbb{N}_{+}\). If C is a closed convex subset of a real Banach space \(( V, \Vert \cdot \Vert ) \), \(f:C\rightarrow \mathbb{R}\) is a convex function, and \(v_{1},v_{2},\ldots \in C\) such that the series \(\sum_{i=1}^{\infty }p_{i}v_{i}\) and \(\sum_{i=1}^{\infty }p_{i}f ( v_{i} ) \) are absolutely convergent, then
In the paper [13] we obtain refinements of the integral Jensen’s inequality by using cyclic refinements of the discrete Jensen’s inequality, but these results are not natural obverses of the discrete one’s.
In this paper we give the integral version of Theorem 4 when \(V=\mathbb{R}\). On the one hand, a new method to refine the integral Jensen’s inequality is developed (totally different from earlier techniques, see e.g. [10] and [19]) and our result contains Theorem 4 when \(V=\mathbb{R}\). On the other hand, we can have from it some recent refinements of the integral Jensen’s inequality (see [7], [5], [6]) as elementary cases. Finally, some applications to the Fejér inequality (especially the Hermite–Hadamard inequality), quasi-arithmetic means, and f-divergences are presented.
2 Preliminary result
We give an extension of Theorem 4 if \(V=\mathbb{R}\).
Proposition 5
Let the index set I denote either \(\{ 1,\ldots,n \} \) for some \(n\geq 1\) or \(\mathbb{N}_{+}\). Let the index set J denote either \(\{ 1,\ldots,k \} \) for some \(k\geq 1\) or \(\mathbb{N}_{+}\). For each \(j\in J\), let \(\pi _{j}\) be a permutation of the set I. Let \(( p_{i} ) _{i\in I}\) and \(( \lambda _{j} ) _{j\in J}\) represent positive probability distributions. If C is an interval in \(\mathbb{R}\), \(f:C\rightarrow \mathbb{R}\) is a convex function, and \(( v_{i} ) _{i\in I}\) is a sequence from C such that the series \(\sum_{i\in I}p_{i}v_{i}\) and \(\sum_{i\in I}p_{i}f ( v_{i} ) \) are absolutely convergent, then
Proof
By using Remark 3, we can copy the proof of Theorem 4 in [11]. □
The positive part \(f^{+}\) and the negative part \(f^{-}\) of a real-valued function f are defined in the usual way.
We need another result about integrability.
Lemma 6
Let φ be an integrable function on a probability space \(( X,\mathcal{A},\mu ) \) taking values in an interval \(C\subset \mathbb{R}\). If f is a convex function on C such that \(f\circ \varphi \) is μ-integrable, then there exists a convex function g on C such that \(\vert f \vert \leq g\) and \(g\circ \varphi \) is μ-integrable too.
Proof
Along with the function f, the function \(f^{+}\) is also convex. Since \(f\circ \varphi \) is μ-integrable, \(f^{+}\circ \varphi \) is also μ-integrable.
The convexity of f on C shows that there is an affine function \(l:C\rightarrow \mathbb{R}\), \(l ( t ) =at+b\) for which
Then
Since the function φ is a μ-integrable function, \(\vert l \vert \circ \varphi \) is also μ-integrable.
Using that the function \(\vert l \vert \) is convex, \(\vert f \vert =f^{+}+f^{-}\), and the sum of two convex functions is also convex, it follows from the above that \(g:=f^{+}+ \vert l \vert \) can be chosen.
The proof is complete. □
We shall use the following Fubini theorem for double series.
Theorem 7
(see [20])
Let \(a ( i,j ) \in \mathbb{R}\) (\(( i,j ) \in \mathbb{N}_{+}\times \mathbb{N}_{+} \)). If any of the two sums
is finite, then both of the series
are absolutely convergent and both have the same sum.
3 Main results
We need the following hypotheses.
- (\(\mathrm{H}_{1}\)):
-
Let \(( X,\mathcal{A},\mu ) \) be a probability space.
- (\(\mathrm{H}_{2}\)):
-
Let the index set I denote either \(\{ 1,\ldots,n \} \) for some \(n\geq 1\) or \(\mathbb{N}_{+}\). Let the index set J denote either \(\{ 1,\ldots,k \} \) for some \(k\geq 1\) or \(\mathbb{N}_{+}\).
- (\(\mathrm{H}_{3}\)):
-
Let \(( \lambda _{j} ) _{j\in J}\) represent a positive probability distribution. For each \(j\in J\), let \(\pi _{j}\) be a permutation of the set I.
- (\(\mathrm{H}_{4}\)):
-
Suppose that we are given a sequence \(\mathfrak{M}_{I}= ( \mu _{i} ) _{i\in I}\) of measures on \(\mathcal{A}\) with \(\mu _{i} ( X ) >0\) for all \(i\in I\) and \(\sum_{i\in I}\mu _{i}=\mu \).
- (\(\mathrm{H}_{5}\)):
-
Suppose that we are given a sequence \(\mathfrak{S}_{I}= ( A_{i} ) _{i\in I}\) of pairwise disjoint sets \(A_{i}\in \mathcal{A}\) with \(\mu ( A_{i} ) >0\) for all \(i\in I\) and \(\bigcup_{i\in I}A_{i}=X\).
Theorem 8
Assume (\(\mathrm{H}_{1}\)–\(\mathrm{H}_{3}\)) and (\(\mathrm{H}_{4}\)). Let \(C\subset \mathbb{R}\) be an interval, and \(f:C\rightarrow \mathbb{R}\) be a convex function. Let φ be a μ-integrable function on X taking values in C such that \(f\circ \varphi \) is also μ-integrable on X. Then
Proof
This can be obtained by an application of Proposition 5 to the parameters
Really, \(( p_{i} ) _{i\in I}\) represents a positive probability distribution, and by the integral Jensen’s inequality, \(v_{i}\in C\) (\(i\in I\)).
Next we show that the series \(\sum_{i\in I}p_{i}v_{i}\) and \(\sum_{i\in I}p_{i}f ( v_{i} ) \) are absolutely convergent.
Since φ is a μ-integrable function on X and \(\sum_{i\in I}\mu _{i}=\mu \),
By Lemma 6, there exists a convex function g on C such that
Another application of the integral Jensen’s inequality and \(\sum_{i\in I}\mu _{i}=\mu \) now show that
We can see that the conditions of Proposition 5 hold, and therefore, by applying it, we obtain
As a final step, we can apply the integral Jensen’s inequality in (5).
The proof is complete. □
A useful consequence of the previous theorem is the next result.
Corollary 9
Assume (\(\mathrm{H}_{1}\)–\(\mathrm{H}_{3}\)) and (\(\mathrm{H}_{5}\)). Let \(C\subset \mathbb{R}\) be an interval and \(f:C\rightarrow \mathbb{R}\) be a convex function. Let φ be a μ-integrable function on X taking values in C such that \(f\circ \varphi \) is also μ-integrable on X. Then
Proof
Let the measure \(\mu _{i}\) (\(i\in I\)) be defined on \(\mathcal{A} \) by
and then apply Theorem 8.
The proof is complete. □
4 Discussion
First we study the relationship between Theorem 8 and Proposition 5.
Assume (\(\mathrm{H}_{2}\)) and (\(\mathrm{H}_{3}\)), and let \(( p_{i} ) _{i\in I}\) represent a positive probability distribution. Define the measure μ on the power set \(P ( I ) \) of I by
where \(\varepsilon _{i}\) (\(i\in I\)) is the unit mass at i on \(P ( I ) \), and use the measure space \(( I,P ( I ) ,\mu ) \) in (\(\mathrm{H}_{1}\)). Let \(C\subset \mathbb{R}\) be an interval, \(f:C\rightarrow \mathbb{R}\) be a convex function, and \(( v_{i} ) _{i\in I}\) be a sequence in \(\mathbb{R}\) such that the series \(\sum_{i\in I}p_{i}v_{i}\) and \(\sum_{i\in I}p_{i}f ( v_{i} ) \) are absolutely convergent. Define the function φ on I by
Finally, choose \(A_{i}:= \{ i \} \) (\(i\in I\)).
It is easy to check that under these conditions Corollary 9 is equivalent to Proposition 5.
Now we compare our main result with some recent refinement of the integral Jensen’s inequality.
Let \(( X,\mathcal{A},\nu ) \) be a measure space with \(\nu ( X ) \in \mathopen] 0,\infty \mathclose] \). For the ν-integrable positive ν-a.e. weight w, consider the Lebesgue space
For the ν-integrable positive ν-a.e. weight w and given \(n\geq 2\), we consider the set \(\mathfrak{B}_{k} ( w ) \) of all possible n-tuples of ν-integrable positive ν-a.e. weights \(\overline{w}= ( w_{1},\ldots,w_{n} ) \) with the property that \(\sum_{i=1}^{n}w_{j}=w\).
The next result can be found in [5].
Theorem 10
(see [5], Theorem 2.1)
Let \(f: [ m,M ] \rightarrow \mathbb{R}\) be a convex function, \(\varphi :X\rightarrow [ m,M ] \) be a ν-measurable function such that φ, \(f\circ \varphi \in L_{w} ( X,\nu ) \). Then, for any \(\overline{w}\in \mathfrak{B}_{k} ( w ) \), we have
where \(n\geq 2\).
Remark 11
Let the measure \(\mu _{i}\) (\(i\in I\)) and μ be defined on \(\mathcal{A}\) by
and by
Then \(\mu _{i} ( X ) >0\) (\(i\in I\)) and \(\mu =\sum_{i=1}^{n}\mu _{i}\). By choosing \(k=1\) (thus \(\lambda _{1}=1\)), we can see that Theorem 10 is a simple consequence of Theorem 8.
We say that the family of measurable sets \(F_{n} ( X ) = \{ A_{i} \} _{i=1,\ldots,n}\) is an n-division for X if \(X=\bigcup_{i=1}^{n}A_{i}\), \(A_{i}\cap A_{j}=\emptyset \) for any \(i,j\in \{ 1,\ldots,n \} \) with \(i\neq j\) and \(\nu ( A_{i} ) >0\) for any \(i\in \{ 1,\ldots,n \} \). For given \(n\geq 2\), we denote by \(\mathfrak{D}_{n} ( X ) \) the set of all n-divisions of X.
The following result appears in [6].
Theorem 12
(see [6], Theorem 2.1)
Let \(f: [ m,M ] \rightarrow \mathbb{R}\) be a convex function, \(\varphi :X\rightarrow [ m,M ] \) be a ν-measurable function such that φ, \(f\circ \varphi \in L_{w} ( X,\nu ) \). Then, for any \(F_{n} ( X ) \in \mathfrak{D}_{n} ( X ) \), we have
where \(n\geq 2\).
Remark 13
(a) Define the measure μ on \(\mathcal{A}\) by (7). By choosing \(k=1\) (thus \(\lambda _{1}=1\)), we can see that Theorem 12 is a simple consequence of Corollary 9. Moreover, it follows that Theorem 12 is contained in Theorem 10.
(b) The main result Theorem 2.1 in [7] is the special case of Corollary 9 when \(n=2\) and \(k=1\).
5 Applications
Let \([ a,b ] \subset \mathbb{R}\) (\(a< b\)). The σ-algebra of Lebesgue-measurable subsets of \(\mathbb{R}\) is denoted by \(\mathcal{L}\). λ means the Lebesgue measure on \(\mathcal{L}\). Assume that \(f: [ a,b ] \rightarrow \mathbb{R}\) is a convex function and \(g: [ a,b ] \rightarrow \mathopen[ 0,\infty \mathclose[ \) is a Lebesgue-integrable function which is symmetric to \(\frac{a+b}{2}\). The classical Fejér inequality (see [8]) says
This is a weighted generalization of the Hermite–Hadamard inequality (see [9]) which has the form
By applying our main result, we can obtain a refinement of the left-hand side of the Fejér inequality. There are refinements of the Fejér inequality (see e.g. [21]), but the next result provides a totally different refinement.
Proposition 14
Assume (\(\mathrm{H}_{2}\)–\(\mathrm{H}_{3}\)). Let \([ a,b ] \subset \mathbb{R}\) (\(a< b\)), and consider the measure space \(( [ a,b ] ,\mathcal{L},\lambda ) \) in (\(\mathrm{H}_{1}\)). Let \(f: [ a,b ] \rightarrow \mathbb{R}\) be a convex function and \(g: [ a,b ] \rightarrow \mathopen[ 0,\infty \mathclose[ \) be a Lebesgue-integrable function which is symmetric to \(\frac{a+b}{2}\).
(a) If \(\mu :=g\lambda \), where gλ is the measure on \(\mathcal{L}\) having density g with respect to λ, and \(\mu _{i}\) (\(i\in I\)) are measures on \(\mathcal{L}\) such that \(\mu _{i} ( [ a,b ] ) >0\) for all \(i\in I\) and \(\sum_{i\in I}\mu _{i}=\mu \), then
(b) If \(g_{i}: [ a,b ] \rightarrow \mathopen[ 0,\infty \mathclose[ \) (\(i\in I\)) is a Lebesgue-integrable function which is symmetric to \(\frac{a+b}{2}\), and \(\sum_{i\in I}g_{i}=g\), then
Proof
(a) The result follows immediately from Theorem 8 by choosing \(\varphi : [ a,b ] \rightarrow \mathbb{R}\), \(\varphi ( t ) =t\).
(b) It comes from (a) by some easy calculations.
The proof is complete. □
Remark 15
It is worth to mention that in part (a) the measures may not be symmetric to \(\frac{a+b}{2}\).
By applying the previous result, we obtain refinements of the Hermite–Hadamard inequality.
Proposition 16
Assume (\(\mathrm{H}_{2}\)–\(\mathrm{H}_{3}\)). Let \(C:= [ a,b ] \subset \mathbb{R}\) (\(a< b\)), and consider the measure space \(( [ a,b ] ,\mathcal{L},\lambda ) \) in (\(\mathrm{H}_{1}\)). Let \(f: [ a,b ] \rightarrow \mathbb{R}\) be a convex function.
(a) If \(\mu _{i}\) (\(i\in I\)) is a measure on \(\mathcal{L}\) such that \(\mu _{i} ( [ a,b ] ) >0\) for all \(i\in I\) and \(\sum_{i\in I}\mu _{i}=\lambda \), then
(b) Let \(( A_{i} ) _{i\in I}\) be a sequence of pairwise disjoint sets \(A_{i}\in \mathcal{A}\) with \(\lambda ( A_{i} ) >0\) for all \(i\in I\) and \(\bigcup_{i\in I}A_{i}= [ a,b ] \). Then
(c) If \(a=x_{0}< x_{1}<\cdots <x_{n}=b\) is a partition of \([ a,b ] \), then
Proof
(a) This is a special case of Proposition 14 (a).
(b) Let the measure \(\mu _{i}\) (\(i\in I\)) be defined on \(\mathcal{L}\) by
and then apply (a).
(c) Let \(A_{i}:=\mathopen [ x_{i-1},x_{i}\mathclose[ \) (\(i=1,\ldots,n-1\)) and \(A_{n}:= [ x_{n-1},x_{n} ] \) in (b).
The proof is complete. □
The second application concerns quasi-arithmetic means.
Let \(C\subset \mathbb{R}\) be an interval, and let \(q:C\rightarrow \mathbb{R}\) be a continuous and strictly monotone function. If \(( X,\mathcal{A},\mu ) \) is a probability space, and \(\varphi :X\rightarrow C\) is a function such that \(q\circ \varphi \) is μ-integrable on X, then
is called the quasi-arithmetic mean (integral q-mean) of φ.
Now we introduce some new quasi-arithmetic means related to the formula \(C_{mes}\).
Definition 17
Assume (\(\mathrm{H}_{1}\)–\(\mathrm{H}_{4}\)). Let \(C\subset \mathbb{R}\) be an interval, let \(q,r:C\rightarrow \mathbb{R}\) be continuous and strictly monotone functions, and \(\varphi :X\rightarrow C\) be a μ-integrable function for which \(q\circ \varphi \) and \(r\circ \varphi \) are also μ-integrable functions. Then we define the following quasi-arithmetic mean of φ with respect to \(C_{mes}\):
In the next result the introduced means are compared.
Proposition 18
Assume that the conditions in Definition 17hold. If either \({q\circ }r{{^{-1}}}\) is convex and q is strictly increasing, or \({q\circ }r{{^{-1}}}\) is concave and q is strictly decreasing, then
while if either \({r\circ }q{{^{-1}}}\) is convex and r is strictly decreasing, or \({r\circ }q{{^{-1}}}\) is concave and r is strictly increasing, then
Proof
We consider only the case when \({q\circ }r{{^{-1}}}\) is convex and q is strictly increasing. By applying Theorem 8 with \(f:=q\circ r^{-1}\) and with \(r\circ \varphi \) instead of φ, we obtain
and this implies the result since \(q{{^{-1}}}\) is strictly increasing.
The proof is complete. □
Finally, some applications to information theory are presented.
Throughout the rest of the paper probability measures P and Q are defined on a fixed measurable space \(( X,\mathcal{A} ) \). It is also assumed that P and Q are absolutely continuous with respect to a σ-finite measure ν on \(\mathcal{A}\). The densities (or Radon–Nikodym derivatives) of P and Q with respect to ν are denoted by p and q, respectively. These densities are ν-almost everywhere uniquely determined.
Introduce the set of functions
and define, for every \(f\in F\), the function
If \(f\in F\), then either f is monotonic or there exists a point \(t_{0}\in \mathopen] 0,\infty \mathclose[ \) such that f is decreasing on \(\mathopen] 0,t_{0}\mathclose[ \). This implies that the limit
exists in \(\mathopen] -\infty ,\infty \mathclose] \), and
extends f into a convex function on \(\mathopen[ 0,\infty \mathclose[ \).
It is well known that, for every \(f\in F\), the function \(f^{\ast }\) also belongs to F, and therefore
The important notions of f-divergence were introduced in [3], [4], and independently in [1].
Definition 19
(a) For every \(f\in F\), we define the f-divergence of P and Q by
where the following conventions are used:
(b) Let \(f\in F\) be a positive convex function, and let \(\mathbf{p}:= ( p_{1},\ldots,p_{n} ) \) and \(\mathbf{q}:= ( q_{1},\ldots,q_{n} ) \) represent positive discrete probability distributions. The f-divergence functional of p and q is
It is possible to use nonnegative discrete probability distributions in the f-divergence functional by defining
Remark 20
(a) For every \(f\in F\), the perspective \(\hat{f}:\mathopen] 0,\infty \mathclose[ \times \mathopen] 0,\infty \mathclose[ \rightarrow \mathbb{R}\) of f is defined by
Then (see [18]) f̂ is also a convex function. Vajda [22] proved that (10) is the unique rule leading to convex and lower semicontinuous extension of f̂ to the set
(b) Since \(f^{\ast } ( 0 ) \in \mathopen ] -\infty ,\infty \mathclose ] \), Lemma 2.8 in [14] shows that \(D_{f} ( P,Q ) \) exists in \(\mathopen ] -\infty ,\infty \mathclose ] \) and
It follows that if P is absolutely continuous with respect to Q, then
The basic inequality (see [15])
is one of the key properties of f-divergences. In the next result a refinement of this inequality is obtained.
Proposition 21
Assume (\(\mathrm{H}_{2}\)) and (\(\mathrm{H}_{3}\)). Suppose that we are given a sequence \(\mathfrak{M}_{I}= ( Q_{i} ) _{i\in I}\) of measures on \(\mathcal{A}\) with \(Q_{i} ( X ) >0\) for all \(i\in I\) and \(\sum_{i\in I}Q_{i}=Q\). Then
Proof
Since Q is absolutely continuous with respect to ν, it follows from \(\sum_{i\in I}Q_{i}=Q\) that \(Q_{i}\) is also absolutely continuous with respect to ν for all \(i\in I\). By denoting the density (or Radon–Nikodym derivative) of \(Q_{i}\) with respect to ν by \(q_{i}\) (\(i\in I\)), we also obtain that \(\sum_{i\in I}q_{i}=q\) ν-a.e.
If \(D_{f} ( P,Q ) =\infty \), then (13) is obvious.
If \(D_{f} ( P,Q ) \in \mathbb{R}\), then the integral
is finite, and therefore either \(Q ( p=0 ) =0\) or \(Q ( p=0 ) >0\) and \(f ( 0 ) \) is finite. It can be seen that the integral Jensen’s inequality can be applied to this integral, and we have from Theorem 8 that
It follows from the proof of Theorem 2.12 (a) in [14] that
The proof is complete. □
Remark 22
(a) By using density functions, (13) can be rewritten in the following form:
(b) Define
and
By applying (\(\mathrm{H}_{3}\)), (\(\mathrm{H}_{4}\)), Theorem 7, and \(\sum_{i\in I}Q_{i}=Q\), we have
and hence \(( q_{i} ) _{i\in I}\) and q̂ represent a discrete probability distribution q.
Similarly, it follows from (\(\mathrm{H}_{3}\)), (\(\mathrm{H}_{4}\)), Theorem 7, and \(\sum_{i\in I}Q_{i}=Q\), that
and therefore \(( p_{i} ) _{i\in I}\) and p̂ also represent a discrete probability distribution p. Since
we can see that the second term in (13) can be considered as the f-divergence functional of p and q (if \(I=\mathbb{N}_{+}\), then we can say that \(I_{f} ( \mathbf{p},\mathbf{q} ) \) is a generalized f-divergence functional). It is interesting that inequality (12) for “continuous” f-divergence can be refined by “discrete” f-divergence.
Availability of data and materials
Not applicable.
References
Ali, M.S., Silvey, D.: A general class of coefficients of divergence of one distribution from another. J. R. Stat. Soc. Ser. B 28, 131–140 (1966)
Brnetić, I., Khan, K.A., Pečarić, J.: Refinement of Jensen’s inequality with applications to cyclic mixed symmetric means and Cauchy means. J. Math. Inequal. 9(4), 1309–1321 (2015)
Csiszár, I.: Eine Informationstheoretische Ungleichung und ihre Anwendung auf den Beweis der Ergodizität on Markoffschen Ketten. Publ. Math. Inst. Hung. Acad. Sci. Ser. A 8, 84–108 (1963)
Csiszár, I.: Information-type measures of difference of probability distributions and indirect observations. Studia Sci. Math. Hung. 2, 299–318 (1967)
Dragomir, S.S.: Refining Jensen’s integral inequality for partitions of weights. Libertas Math. 37, 25–44 (2017)
Dragomir, S.S.: Refining Jensen’s integral inequality for divisions of measurable space. J. Math. Ext. 12(2), 87–106 (2018)
Dragomir, S.S., Khan, M.A., Abathun, A.: Refinement of the Jensen integral inequality. Open Math. 14, 221–228 (2016)
Fejér, L.: Über die Fourierreihen, II. Math. Naturwiss. Anz Ungar. Akad. Wiss. 24, 369–390 (1906) (in Hungarian)
Hadamard, J.: Étude sur les propriétés des fonctions entiéres en particulier d’une fonction considérée par Riemann. J. Math. Pures Appl. 58, 171–215 (1893)
Horváth, L.: A refinement of the integral form of Jensen’s inequality. J. Inequal. Appl. 2012, Article ID 178 (2012)
Horváth, L.: New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations. Aequ. Math. https://doi.org/10.1007/s00010-019-00696-z
Horváth, L., Khan, K.A., Pečarić, J.: Combinatorial Improvements of Jensen’s Inequality. Element, Zagreb (2014)
Horváth, L., Khan, K.A., Pečarić, J.: Cyclic refinements of the discrete and integral form of Jensen’s inequality with applications. Analysis 36(4), 253–262 (2016)
Horváth, L., Pečarić, D., Pečarić, J.: A refinement and an exact equality condition for the basic inequality of f-divergences. Filomat 32(12), 4263–4273 (2018)
Liese, F., Vajda, I.: On divergences and informations in statistics and information theory. IEEE Trans. Inf. Theory 52, 4394–4412 (2006)
Niculescu, C., Persson, L.E.: Convex Functions and Their Applications. A Contemporary Approach. Springer, Berlin (2006)
Perlman, M.D.: Jensen’s inequality for a convex vector-valued function on an infinite-dimensional space. J. Multivar. Anal. 4, 52–65 (1974)
Rockafellar, T.: Convex Analysis. Princeton University Press, Princeton (1970)
Rooin, J.: A refinement of Jensen’s inequality. J. Inequal. Pure Appl. Math. 6(2), Article ID 38 (2005)
Rudin, W.: Real and Complex Analysis, 3rd edn. McGraw-Hill, New York (1987)
Tseng, K.-L., Hwang, S.-R., Dragomir, S.S.: Refinements of Fejér’s inequality for convex functions. Period. Math. Hung. 65, 17–28 (2012)
Vajda, I.: Theory of Statistical Inference and Information. Kluwer Academic, Boston (1989)
Acknowledgements
Not applicable.
Funding
The research of the author was supported by Hungarian Scientific Research Fund Grant No. K101217 and Széchenyi 2020 under EFOP-3.6.1-16-2016-00015.
Author information
Authors and Affiliations
Contributions
The author carried out the results and read and approved the current version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The author declares that they have no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Horváth, L. Refinements of the integral Jensen’s inequality generated by finite or infinite permutations. J Inequal Appl 2021, 12 (2021). https://doi.org/10.1186/s13660-021-02545-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13660-021-02545-7