New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations

Horváth, László

doi:10.1007/s00010-019-00696-z

New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations

Open access
Published: 14 December 2019

Volume 94, pages 1109–1121, (2020)
Cite this article

Download PDF

You have full access to this open access article

Aequationes mathematicae Aims and scope Submit manuscript

New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations

Download PDF

László Horváth ORCID: orcid.org/0000-0003-0564-4991¹

2369 Accesses
7 Citations
Explore all metrics

Abstract

In this paper some new refinements of the discrete Jensen’s inequality are obtained in real vector spaces. The idea comes from some former refinements determined by cyclic permutations. We essentially generalize and extend these results by using permutations of finite sets and bijections of the set of positive numbers. We get refinements of the discrete Jensen’s inequality for infinite convex combinations in Banach spaces. Similar results are rare. Finally, some applications are given on different topics.

The A-integral for Riemann-measurable vector-valued functions

Article 28 May 2024

Riesz–Zygmund Means and Approximation in Variable Exponent Grand Spaces

Article 16 June 2024

Cyclic nearly invariant subspaces for semigroups of isometries

Article 21 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Different variants of Jensen’s inequality and other inequalities have their origin in the notion of convexity. A real function f defined on a convex subset C of a real vector space is called convex if it satisfies

$$\begin{aligned} f\left( \alpha v_{1}+\left( 1-\alpha \right) v_{2}\right) \le \alpha f\left( v_{1}\right) +\left( 1-\alpha \right) f\left( v_{2}\right) \end{aligned}$$

for all $v_{1},v_{2}\in C$ and all $\alpha \in \left[ 0,1\right] $.

The set of positive integers will be denoted by $\mathbb {N}_{+}$.

The following versions of Jensen’s inequality are well known.

Theorem 1.1

(discrete Jensen’s inequalities, see [11] and [13]) (a) Let C be a convex subset of a real vector space V, and let $f:C\rightarrow \mathbb {R}$ be a convex function. If $p_{1},\ldots ,p_{n}$ are nonnegative numbers with $\sum \nolimits _{i=1}^{n}p_{i}=1$, and $v_{1},\ldots ,v_{n}\in C$, then

$$\begin{aligned} f\left( \sum \limits _{i=1}^{n}p_{i}v_{i}\right) \le \sum \limits _{i=1} ^{n}p_{i}f\left( v_{i}\right) . \end{aligned}$$

(1.1)

(b) Let C be a closed convex subset of a real Banach space V, and let $f:C\rightarrow \mathbb {R}$ be a convex function. If $p_{1},p_{2},\ldots $ are nonnegative numbers with $\sum \nolimits _{i=1}^{\infty }p_{i}=1$, and $v_{1},v_{2},\ldots \in C$ such that the series $\sum \nolimits _{i=1}^{\infty }p_{i}v_{i} $ and $\sum \nolimits _{i=1}^{\infty }p_{i}f\left( v_{i}\right) $ are absolutely convergent, then

$$\begin{aligned} f\left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right) \le \sum \limits _{i=1} ^{\infty }p_{i}f\left( v_{i}\right) . \end{aligned}$$

(1.2)

To give refinements of the discrete Jensen’s inequality (1.1) is an extensively investigated theme with numerous methods and results (see e.g. the book [8] and references therein), and applications (see e.g. [5] and [6]). Then again, to the best of my knowledge, there are no refinements of the discrete Jensen’s inequality (1.2) in such generality. There are some refinements of (1.2) when C is an interval of $\mathbb {R}$: either one estimates formulas in (1.2) in a suitable way (see [12]) or one can obtain results from refinements of integral Jensen’s inequality (see [7]).

The following refinement of (1.1) can be found in [9] (see also [1]).

Theorem 1.2

Let $2\le k\le n$ be integers, and let $p_{1},\ldots ,p_{n}$ and $\lambda _{1},\ldots ,\lambda _{k}$ be positive numbers with $\sum \nolimits _{i=1}^{n}p_{i}=1$ and $\sum \nolimits _{i=1}^{k}\lambda _{i}=1$. If C is a convex subset of a real vector space V, $f:C\rightarrow \mathbb {R}$ is a convex function, and $v_{1},\ldots ,v_{n}\in C$, then

$$\begin{aligned} f\left( \sum \limits _{i=1}^{n}p_{i}v_{i}\right)\le & {} C_{dis}=C_{dis}\left( f,\mathbf {v,p},{{\varvec{\lambda }}}\right) \nonumber \\ := & {} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=0}^{k-1}\lambda _{j+1} p_{i+j}\right) f\left( \frac{\sum \nolimits _{j=0}^{k-1}\lambda _{j+1} p_{i+j}v_{i+j}}{\sum \nolimits _{j=0}^{k-1}\lambda _{j+1}p_{i+j}}\right) \nonumber \\\le & {} \sum \limits _{i=1}^{n}p_{i}f\left( v_{i}\right) , \end{aligned}$$

(1.3)

where $i+j$ means $i+j-n$ in case of $i+j>n$.

It is easy to think that the previous result cannot be generalized for infinite sums, but we can observe that the middle term $C_{dis}$ in (1.3) can be rewritten in the following form

$$\begin{aligned} C_{dis}=\sum \nolimits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j} p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1} ^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) } }{\sum \nolimits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) , \end{aligned}$$

(1.4)

where $\pi _{j}$ $\left( j=1,\ldots ,k\right) $ is the $\left( j-1\right) $-cyclic permutation of the set $\left\{ 1,\ldots ,n\right\} $ to the right (all elements are moved to the right $j-1$ times with elements overflowing from the right being inserted to the left).

In this paper we show that formulas like (1.4) refine both (1.1) and (1.2) by using either permutations of the set $\left\{ 1,\ldots ,n\right\} $ or bijections from $\mathbb {N}_{+}$ onto itself. On the one hand, an essential generalization of Theorem 1.2 is given, on the other hand, refinements of (1.2) are developed without assuming that V is a special Banach space. Finally, we give some applications concerning information theory, the norm function, Hölder’s inequality and the inequality of arithmetic and geometric means.

2 Main results

The positive part $f^{+}$ and the negative part $f^{-}$ of a real valued function f are defined in the usual way.

Let the set I denote either $\left\{ 1,\ldots ,n\right\} $ for some $n\ge 1$ or $\mathbb {N}_{+}$. We say that the numbers $\left( p_{i}\right) _{i\in I}$ represent a (positive) discrete probability distribution if $\left( p_{i}>0\right) $ $p_{i}\ge 0$ $\left( i\in I\right) $ and $\sum \nolimits _{i\in I}p_{i}=1$. A permutation $\pi $ of I refers to a bijection from I onto itself.

We need the following hypotheses which are partitioned into two classes:

($\hbox {H}_{{1}}$):: Let $k,n\ge 2$ be integers, and let $p_{1},\ldots ,p_{n}$ and $\lambda _{1},\ldots ,\lambda _{k}$ represent positive probability distributions.
($\hbox {H}_{{2}}$):: For each $j=1,\ldots ,k$ let $\pi _{j}$ be a permutation of the set $\left\{ 1,\ldots ,n\right\} $.
($\hbox {H}_{{3}}$):: Let C be a convex subset of a real vector space V, and $f:C\rightarrow \mathbb {R}$ be a convex function.
($\hbox {C}_{{1}}$):: Let the set J denote either $\left\{ 1,\ldots ,k\right\} $ for some $k\ge 2$ or $\mathbb {N}_{+}$. Let $p_{1},p_{2},\ldots $ and $\left( \lambda _{j}\right) _{j\in J}$ represent positive probability distributions.
($\hbox {C}_{{2}}$):: For each $j\in J$ let $\pi _{j}$ be a permutation of the set $\mathbb {N}_{+}$.
($\hbox {C}_{{3}}$):: Let C be a closed convex subset of a real Banach space $\left( V,\left\| \cdot \right\| \right) $, and $f:C\rightarrow \mathbb {R}$ be a convex function.

Theorem 2.1

(a)
Assume ($\hbox {H}_{{1}}$), ($\hbox {H}_{{2}}$) and ($\hbox {H}_{{3}}$). If $v_{1},\ldots ,v_{n}\in C$, then
$$\begin{aligned} f\left( \sum \limits _{i=1}^{n}p_{i}v_{i}\right)\le & {} C_{per}=C_{per}\left( f,\mathbf {v,p},{{\varvec{\lambda }},{\varvec{\pi }}}\right) \nonumber \\ := & {} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1}^{k} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \nonumber \\\le & {} \sum \limits _{i=1}^{n}p_{i}f\left( v_{i}\right) . \end{aligned}$$
(2.1)
(b)
Assume ($\hbox {C}_{{1}}$), ($\hbox {C}_{{2}}$) and ($\hbox {C}_{{3}}$). If $v_{1},v_{2} ,\ldots \in C$ such that the series $\sum \nolimits _{i=1}^{\infty } p_{i}v_{i}$ and $\sum \nolimits _{i=1}^{\infty }p_{i}f\left( v_{i}\right) $ are absolutely convergent, then
$$\begin{aligned} f\left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right)\le & {} C_{per} =C_{per}\left( f,\mathbf {v,p},{{\varvec{\lambda }},{\varvec{\pi }}}\right) \nonumber \\ := & {} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \nonumber \\\le & {} \sum \limits _{i=1}^{\infty }p_{i}f\left( v_{i}\right) . \end{aligned}$$
(2.2)

Proof

(a) By using Theorem 1.1 (a) and the fact that $\pi _{j}$ is a permutation of the set $\left\{ 1,\ldots ,n\right\} $,

$$\begin{aligned} C_{per}\le & {} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j} p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) =\sum \limits _{j=1}^{k}\lambda _{j}\left( \sum \limits _{i=1}^{n} p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) \\= & {} \left( \sum \limits _{j=1}^{k}\lambda _{j}\right) \left( \sum \limits _{i=1} ^{n}p_{i}f\left( v_{i}\right) \right) =\sum \limits _{i=1}^{n}p_{i}f\left( v_{i}\right) . \end{aligned}$$

The left hand side inequality can be proved similarly. Since

$$\begin{aligned} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) =\left( \sum \limits _{j=1}^{k}\lambda _{j}\right) \left( \sum \limits _{i=1}^{n}p_{i}\right) =1, \end{aligned}$$

the discrete Jensen’s inequality implies that

$$\begin{aligned} C_{per}\ge f\left( \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }\right) \right) =f\left( \sum \limits _{i=1}^{n}p_{i}v_{i}\right) . \end{aligned}$$

(b) The proof is divided into four parts.

I. We first prove that the series

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| \right) \end{aligned}$$

(2.3)

and

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\left| f\left( v_{\pi _{j}\left( i\right) }\right) \right| \right) \end{aligned}$$

(2.4)

are convergent and

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }\right) =\sum \limits _{i=1}^{\infty }p_{i}v_{i}, \end{aligned}$$

(2.5)

and

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) =\sum \limits _{i=1}^{\infty }p_{i}f\left( v_{i}\right) . \end{aligned}$$

For each $j\in J$ the series

$$\begin{aligned} \sum \limits _{i=1}^{\infty }p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) } \end{aligned}$$

(2.6)

is a rearrangement of the absolutely convergent series $\sum \nolimits _{i=1} ^{\infty }p_{i}v_{i}$, and hence it is also absolutely convergent and

$$\begin{aligned} \sum \limits _{i=1}^{\infty }p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }=\sum \limits _{i=1}^{\infty }p_{i}v_{i}. \end{aligned}$$

(2.7)

(i) If $J=\left\{ 1,\ldots ,k\right\} $, then it follows trivially from (2.7) that

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }\right) =\sum \limits _{i=1}^{\infty }p_{i}v_{i}. \end{aligned}$$

(ii) Assume $J=\mathbb {N}_{+}$. The property of absolute convergence of (2.6) implies that

$$\begin{aligned} \sum \limits _{j=1}^{\infty }\left( \sum \limits _{i=1}^{\infty }\lambda _{j} p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| \right)= & {} \sum \limits _{j=1}^{\infty }\lambda _{j}\left( \sum \limits _{i=1}^{\infty }p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| \right) \\= & {} \sum \limits _{i=1}^{\infty }p_{i}\left\| v_{i}\right\| <\infty . \end{aligned}$$

Therefore, as it is well known, the order of summation can be interchanged in the double sum

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j=1}^{\infty }\lambda _{j} p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) , \end{aligned}$$

and hence

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j=1}^{\infty }\lambda _{j} p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) =\sum \limits _{j=1}^{\infty }\left( \sum \limits _{i=1}^{\infty } \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }\right) =\sum \limits _{i=1}^{\infty }p_{i}v_{i}. \end{aligned}$$

The series in (2.4) can be handled in a similar way.

II. (i) If $J=\left\{ 1,\ldots ,k\right\} $, then Theorem 1.1 (a) gives us that for every $n\in \mathbb {N}_{+}$

$$\begin{aligned} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1} ^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \le \sum \limits _{i=1} ^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) . \end{aligned}$$

(2.8)

(ii) Assume $J=\mathbb {N}_{+}$. For each $i\in \mathbb {N}_{+}$ the series $\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }$ is obviously convergent and

$$\begin{aligned} \frac{\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| }{\sum \nolimits _{j=1} ^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }}\le & {} \frac{1}{\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }}\sum \nolimits _{j=1}^{\infty }p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| \\= & {} \frac{1}{\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) } }\sum \limits _{i=1}^{\infty }p_{i}\left\| v_{i}\right\| <\infty , \end{aligned}$$

and hence the series

$$\begin{aligned} \frac{\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }\left\| v_{\pi _{j}\left( i\right) }\right\| }{\sum \nolimits _{j=1} ^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }} \end{aligned}$$

is absolutely convergent. Further, we know from part I that the series

$$\begin{aligned} \sum \limits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) ,\quad i\in \mathbb {N}_{+} \end{aligned}$$

is also absolutely convergent.

Consequently, Theorem 1.1 (b) shows that

$$\begin{aligned} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \limits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1}^{\infty }\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \le \sum \limits _{i=1}^{n}\left( \sum \nolimits _{j=1}^{\infty }\lambda _{j} p_{\pi _{j}\left( i\right) }f\left( v_{\pi _{j}\left( i\right) }\right) \right) . \end{aligned}$$

(2.9)

We have seen in part I that the series (2.4) is convergent and its sum is $\sum \nolimits _{i=1}^{\infty }p_{i}f\left( v_{i}\right) $. Thus by (2.8) or by (2.9), the second inequality in (2.2) will be proved if we succeed in showing that the series

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \end{aligned}$$

(2.10)

is convergent.

It is known that the positive part of f is also convex. The convergence of (2.4) implies the convergence of the series

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }f^{+}\left( v_{\pi _{j}\left( i\right) }\right) \right) . \end{aligned}$$

It now follows that we can copy the proofs of (2.8) and (2.9) with $f^{+}$ instead of f. Taking account of the nonnegativity of $f^{+}$, we obtain

$$\begin{aligned}&\sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f^{+}\left( \frac{\sum \nolimits _{j\in J} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \nonumber \\&\quad \le \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j} p_{\pi _{j}\left( i\right) }f^{+}\left( v_{\pi _{j}\left( i\right) }\right) \right) <\infty . \end{aligned}$$

(2.11)

From this it follows that the series (2.10) will be convergent (absolutely) if and only if

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f^{-}\left( \frac{\sum \nolimits _{j\in J} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) <\infty . \end{aligned}$$

(2.12)

III. In this step we show that the series (2.12) is convergent and the first inequality in (2.2) holds assuming f is bounded below, that is $f\left( v\right) \ge c$ $\left( v\in C\right) $ for some nonpositive number c.

Since

$$\begin{aligned}&\sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f^{-}\left( \frac{\sum \nolimits _{j\in J} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \le -c\sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j} p_{\pi _{j}\left( i\right) }\right) \\&\quad =-c\sum \limits _{j\in J}\lambda _{j}\left( \sum \limits _{i=1}^{\infty }p_{\pi _{j}\left( i\right) }\right) =-c, \end{aligned}$$

the series (2.12) is convergent.

According to (2.11) and (2.12) the series (2.10) is absolutely convergent.

Since the series (2.3) and (2.10) are absolutely convergent and (2.5) holds, we can apply Theorem 1.1 (b), and obtain the first inequality in (2.2).

IV. At this point we abandon the lower boundedness hypothesis on f.

Let the function $f_{n}:C\rightarrow \mathbb {R}$ be defined by

$$\begin{aligned} f_{n}\left( v\right) =\max \left( f\left( v\right) ,-n\right) ,\quad n\in \mathbb {N}_{+}. \end{aligned}$$

Then $f_{n}$ $\left( n\in \mathbb {N}_{+}\right) $ is convex and bounded below, and $f_{n}\ge f$ $\left( n\in \mathbb {N}_{+}\right) $. From this and from the results of part III, we get that for each $n\in \mathbb {N}_{+}$ the series

$$\begin{aligned} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f_{n}\left( \frac{\sum \nolimits _{j\in J} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \end{aligned}$$

is absolutely convergent and

$$\begin{aligned} f\left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right) \le f_{n}\left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right) \le \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) f_{n}\left( \frac{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) . \end{aligned}$$

Since the sequence $\left( f_{n}\right) _{n\in \mathbb {N}_{+}}$ is decreasing and $\lim \nolimits _{n\rightarrow \infty }f_{n}=f$ pointwise, the previous two assertions imply that B. Levi’s theorem can be applied, and it gives that the series (2.10) is absolutely convergent and the first inequality in (2.2) holds.

The proof is complete. $\square $

Remark 2.2

It can be seen that Theorem 2.1 (a) contains Theorem 1.2 as a special case.

3 Applications

We begin with some inequalities corresponding to information theory.

The following notion was introduced by Csiszár in [2] and [3].

Definition 3.1

Let $f:\left] 0,\infty \right[ \rightarrow \left] 0,\infty \right[ $ be a convex function, and let $\mathbf {r}:=\left( r_{1},\ldots ,r_{n}\right) \in \left] 0,\infty \right[ ^{n}$ and $\mathbf {q}:=\left( q_{1},\ldots ,q_{n}\right) \in \left] 0,\infty \right[ ^{n}$. The f-divergence functional is

$$\begin{aligned} I_{f}(\mathbf {r},\mathbf {q}):= {\displaystyle \sum \limits _{i=1}^{n}} q_{i}f\left( \frac{r_{i}}{q_{i}}\right) . \end{aligned}$$

Based on this concept, we have introduced a new functional in [10], and this functional can be further generalized:

Definition 3.2

Let C be a convex subset of a real vector space V, and $f:C\rightarrow \mathbb {R}$ be a convex function. If $\mathbf {w}:=\left( w_{1},\ldots ,w_{n}\right) \in V^{n}$ and $\mathbf {q}:=\left( q_{1} ,\ldots ,q_{n}\right) \in \left] 0,\infty \right[ ^{n}$ such that

$$\begin{aligned} \frac{w_{i}}{q_{i}}\in C,\quad i=1,\ldots ,n, \end{aligned}$$

(3.1)

then define

$$\begin{aligned} I_{f}(\mathbf {w},\mathbf {q}):= {\displaystyle \sum \limits _{i=1}^{n}} q_{i}f\left( \frac{w_{i}}{q_{i}}\right) . \end{aligned}$$

Proposition 3.3

Let $k,n\ge 2$ be integers, and let $\lambda _{1},\ldots ,\lambda _{k}$ represent a positive probability distribution. Assume ($\hbox {H}_{{2}}$) and ($\hbox {H}_{{3}}$). If $\mathbf {w}:=\left( w_{1},\ldots ,w_{n}\right) \in V^{n}$ and $\mathbf {q}:=\left( q_{1},\ldots ,q_{n}\right) \in \left] 0,\infty \right[ ^{n}$ such that

$$\begin{aligned} \frac{w_{i}}{q_{i}}\in C,\quad i=1,\ldots ,n, \end{aligned}$$

then

$$\begin{aligned} I_{f}(\mathbf {w},\mathbf {q})= & {} {\displaystyle \sum \limits _{i=1}^{n}} q_{i}f\left( \frac{w_{i}}{q_{i}}\right) \nonumber \\\ge & {} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1}^{k} \lambda _{j}w_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }}\right) \nonumber \\\ge & {} \left( {\displaystyle \sum \limits _{i=1}^{n}} q_{i}\right) \cdot f\left( \frac{ {\displaystyle \sum \nolimits _{i=1}^{n}} w_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\right) . \end{aligned}$$

(3.2)

Proof

By applying Theorem 2.1 (a) with

$$\begin{aligned} p_{i}:=\frac{q_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}},\quad v_{i}:=\frac{w_{i}}{q_{i}},\quad i=1,\ldots ,n \end{aligned}$$

we have

$$\begin{aligned} {\displaystyle \sum \limits _{i=1}^{n}} q_{i}f\left( \frac{w_{i}}{q_{i}}\right)= & {} \left( {\displaystyle \sum \limits _{i=1}^{n}} q_{i}\right) \cdot {\displaystyle \sum \limits _{i=1}^{n}} \frac{q_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}f\left( \frac{w_{i}}{q_{i}}\right) \\\ge & {} \left( {\displaystyle \sum \limits _{i=1}^{n}} q_{i}\right) \cdot \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k} \lambda _{j}\frac{q_{\pi _{j}\left( i\right) }}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\right) f\left( \frac{\sum \limits _{j=1}^{k}\lambda _{j}\frac{q_{\pi _{j}\left( i\right) }}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\frac{w_{\pi _{j}\left( i\right) }}{q_{\pi _{j}\left( i\right) }} }{\sum \limits _{j=1}^{k}\lambda _{j}\frac{q_{\pi _{j}\left( i\right) }}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}}\right) \\= & {} \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1}^{k} \lambda _{j}w_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }}\right) \nonumber \\\ge & {} \left( {\displaystyle \sum \limits _{i=1}^{n}} q_{i}\right) \cdot f\left( \frac{ {\displaystyle \sum \nolimits _{i=1}^{n}} w_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\right) . \end{aligned}$$

The proof is complete. $\square $

Remark 3.4

(a) It was proved in [4] that if $f:\left] 0,\infty \right[ \rightarrow \left] 0,\infty \right[ $ is a convex function, and $\mathbf {r}:=\left( r_{1},\ldots ,r_{n}\right) \in \left] 0,\infty \right[ ^{n}$ and $\mathbf {q}:=\left( q_{1},\ldots ,q_{n}\right) \in \left] 0,\infty \right[ ^{n}$, then

$$\begin{aligned} I_{f}(\mathbf {r},\mathbf {q})\ge \sum \limits _{i=1}^{n}q_{i}f\left( \frac{ {\displaystyle \sum \nolimits _{i=1}^{n}} r_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\right) . \end{aligned}$$

From Proposition 3.3 we can obtain the following refinement of this inequality:

$$\begin{aligned} I_{f}(\mathbf {r},\mathbf {q})\ge \sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }\right) f\left( \frac{\sum \nolimits _{j=1}^{k}\lambda _{j}r_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }}\right) \ge \sum \limits _{i=1}^{n}q_{i}f\left( \frac{ {\displaystyle \sum \nolimits _{i=1}^{n}} r_{i}}{ {\displaystyle \sum \nolimits _{i=1}^{n}} q_{i}}\right) . \end{aligned}$$

(3.3)

(b) Let $f:=-\log $, where the base of $\log $ is greater than 1, $\mathbf {r}:=\left( 1,\ldots ,1\right) $, and $\mathbf {q}:=\left( q_{1},\ldots ,q_{n}\right) $ represent a positive probability distribution. Then (3.3) gives

$$\begin{aligned} H(\mathbf {q}):= & {} - {\displaystyle \sum \limits _{i=1}^{n}} q_{i}\log \left( q_{i}\right) \\\le & {} -\sum \limits _{i=1}^{n}\left( \sum \limits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }\right) \log \left( \sum \limits _{j=1}^{k}\lambda _{j}q_{\pi _{j}\left( i\right) }\right) \le \log \left( n\right) , \end{aligned}$$

which is a refinement of a remarkable inequality for the Shannon entropy.

Next we establish inequalities for the norm function.

Proposition 3.5

Assume ($\hbox {C}_{{1}}$) and ($\hbox {C}_{{2}}$), and assume $\left( V,\left\| \cdot \right\| \right) $ is a Banach space. If $v_{1},v_{2} ,\ldots \in V$ such that the series $\sum \nolimits _{i=1}^{\infty } p_{i}\left\| v_{i}\right\| ^{\alpha }$ is absolutely convergent for some $\alpha \ge 1$, then

$$\begin{aligned} \left\| \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right\| ^{\alpha }\le \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) \left\| \frac{\sum \nolimits _{j\in J} \lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right\| ^{\alpha }\le \sum \limits _{i=1}^{\infty }p_{i}\left\| v_{i}\right\| ^{\alpha }. \end{aligned}$$

Proof

Since the function $f:V\rightarrow \mathbb {R}$ defined by $f\left( v\right) =\left\| v\right\| ^{\alpha }$ is convex, and the series $\sum \nolimits _{i=1}^{\infty }p_{i}v_{i}$ is also absolutely convergent, the result follows from Theorem 2.1 (b). $\square $

Now we get a refinement of the discrete Hölder’s inequality.

Proposition 3.6

Let the set J denote either $\left\{ 1,\ldots ,k\right\} $ for some $k\ge 2$ or $\mathbb {N}_{+}$, and let $\left( \lambda _{j}\right) _{j\in J}$ represent a positive probability distribution. Let $\left( w_{i}\right) _{i=1}^{\infty }$ be a sequence of positive numbers, and let $\left( x_{i}\right) _{i=1}^{\infty }$ and $\left( y_{i}\right) _{i=1}^{\infty }$ be sequences of nonnegative numbers such that the series $\sum \nolimits _{i=1} ^{\infty }w_{i}x_{i}^{p}$ and $\sum \nolimits _{i=1}^{\infty }w_{i}y_{i}^{q}$ are convergent, where $p>1$ and $q>1$ are conjugate exponents that is $\frac{1}{p}+\frac{1}{q}=1$. Then

$$\begin{aligned} \sum \limits _{i=1}^{\infty }w_{i}x_{i}y_{i}\le & {} \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}w_{\pi _{j}\left( i\right) }x_{\pi _{j}\left( i\right) }^{p}\right) ^{\frac{1}{p}}\left( \sum \limits _{j\in J}\lambda _{j}w_{\pi _{j}\left( i\right) }y_{\pi _{j}\left( i\right) }^{q}\right) ^{\frac{1}{q}}\nonumber \\\le & {} \left( \sum \limits _{i=1}^{\infty }w_{i}x_{i}^{p}\right) ^{\frac{1}{p} }\left( \sum \limits _{i=1}^{\infty }w_{i}y_{i}^{q}\right) ^{\frac{1}{q} }. \end{aligned}$$

(3.4)

Proof

For each $s>1$ the power function

$$\begin{aligned} f_{s}:\left] 0,\infty \right[ \rightarrow \mathbb {R},\mathbb {\quad } f_{s}(x)=x^{s} \end{aligned}$$

(3.5)

is strictly convex. Let $v_{1},v_{2}\ldots $ be positive numbers such that the series $\sum \nolimits _{i=1}^{\infty }p_{i}v_{i}^{s}$ is convergent. Theorem 2.1 (b) can be applied to the function $f_{s}$ and to the positive numbers $v_{1},v_{2}\ldots $, and it yields that

$$\begin{aligned} \left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right) ^{s}\le \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }\right) ^{1-s}\left( \sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }\right) ^{s}\le \sum \limits _{i=1}^{\infty }p_{i}v_{i}^{s}. \end{aligned}$$

(3.6)

If $\sum \nolimits _{i=1}^{\infty }w_{i}y_{i}^{q}=0$, then (3.4) is obvious.

Otherwise, from the inequality (3.6) with the choices

$$\begin{aligned} s=\frac{1}{p},\quad p_{i}=\frac{w_{i}y_{i}^{q}}{\sum \nolimits _{l=1}^{\infty }w_{l}y_{l}^{q}},\quad v_{i}=x_{i}^{p}y_{i}^{-q},\quad i\in \mathbb {N}_{+} \end{aligned}$$

($-f_{1/p}$ is convex) we obtain

$$\begin{aligned}&\left( \frac{1}{\sum \nolimits _{l=1}^{\infty }w_{l}y_{l}^{q}}\right) ^{\frac{1}{p}}\left( \sum \limits _{i=1}^{\infty }w_{i}x_{i}^{p}\right) ^{\frac{1}{p} }\ge \frac{1}{\sum \nolimits _{l=1}^{\infty }w_{l}y_{l}^{q}} \\&\quad \cdot \sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j} w_{\pi _{j}\left( i\right) }y_{\pi _{j}\left( i\right) }^{q}\right) ^{\frac{1}{q}}\left( \sum \limits _{j\in J}\lambda _{j}w_{\pi _{j}\left( i\right) }x_{\pi _{j}\left( i\right) }^{p}\right) ^{\frac{1}{p}} \\&\qquad \ge \frac{1}{\sum \nolimits _{l=1}^{\infty }w_{l}y_{l}^{q}}\sum \limits _{i=1} ^{n}w_{i}x_{i}y_{i}, \end{aligned}$$

and this delivers the desired conclusion.

The proof is complete. $\square $

Finally, we apply our results to get a refinement of the inequality of generalized arithmetic and geometric means.

Proposition 3.7

Assume ($\hbox {C}_{{1}}$) and ($\hbox {C}_{{2}}$). If $v_{1},v_{2},\ldots $ are positive numbers such that the series $\sum \nolimits _{i=1}^{\infty } p_{i}v_{i}$ and $\sum \nolimits _{i=1}^{\infty }p_{i}\ln \left( v_{i}\right) $ are absolutely convergent, then

$$\begin{aligned} \sum \limits _{i=1}^{\infty }p_{i}v_{i}\ge \prod \limits _{i=1}^{\infty }\left( \frac{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) ^{\sum \limits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\ge \prod \limits _{i=1}^{\infty }v_{i}^{p_{i}}. \end{aligned}$$

(3.7)

Proof

By applying Theorem 2.1 (b) to the convex function $-\ln $, we obtain

$$\begin{aligned}&-\ln \left( \sum \limits _{i=1}^{\infty }p_{i}v_{i}\right) \\&\quad \le -\sum \limits _{i=1}^{\infty }\left( \sum \limits _{j\in J}\lambda _{j} p_{\pi _{j}\left( i\right) }\right) \ln \left( \frac{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }v_{\pi _{j}\left( i\right) }}{\sum \nolimits _{j\in J}\lambda _{j}p_{\pi _{j}\left( i\right) }}\right) \le -\sum \limits _{i=1}^{\infty }p_{i}\ln \left( v_{i}\right) , \end{aligned}$$

and this is equivalent to (3.7). $\square $

References

Brnetić, I., Khan, K.A., Pečarić, J.: Refinement of Jensen’s inequality with applications to cyclic mixed symmetric means and Cauchy means. J. Math. Inequal. 9(4), 1309–1321 (2015)
Article MathSciNet Google Scholar
Csiszár, I.: Information measures: a critical survey, Trans. 7th Prague Conference on Info. Th., Statist. Decis. Funct., Random Processes and 8th European Meeting of Statistics, Volume B, Academia Prague, pp. 73–86 (1978)
Csiszár, I.: Information-type measures of difference of probability distributions and indirect observations. Studia Sci. Math. Hung. 2, 299–318 (1967)
MathSciNet MATH Google Scholar
Csiszár, I., Körner, J.: Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, New York (1981)
MATH Google Scholar
Dragomir, S.S.: A refinement of Jensen’s inequality with applications for f-divergence measures. Taiwan. J. Math. 14, 153–164 (2010)
Article MathSciNet Google Scholar
Dragomir, S.S.: A new refinement of Jensen’s inequality in linear spaces with applications. Math. Comput. Model. 52, 1497–1505 (2010)
Article MathSciNet Google Scholar
Horváth, L.: A refinement of the integral form of Jensen’s inequality. J. Inequal. Appl. 2012, 178 (2012)
Article MathSciNet Google Scholar
Horváth, L., Khan, K.A., Pečarić, J.: Combinatorial improvements of Jensen’s inequality. Classical and new refinements of Jensen’s inequality with application. In: Monographs in Inequalities 8. Element, Zagreb, Croatia (2014)
Horváth, L., Khan, K.A., Pečarić, J.: Cyclic refinements of the discrete and integral form of Jensen’s inequality with applications. Analysis 36(4), 253–262 (2016)
Article MathSciNet Google Scholar
Horváth, L., Pečarić, D., Pečarić, J.: Estimations of $f$- and Rényi divergences by using a cyclic refinement of the Jensen’s inequality. Bull. Malays. Math. Sci. Soc. 42(3), 933–946 (2019)
Article MathSciNet Google Scholar
Niculescu, C., Persson, L.E.: Convex Functions and Their Applications. A Contemporary Approach. Springer, Berlin (2006)
Book Google Scholar
Pavić, Z.: Refinements of Jensen’s inequality for infinite convex combinations. Turk. J. Inequal. 2(2), 44–53 (2018)
MathSciNet Google Scholar
Perlman, M.D.: Jensen’s inequality for a convex vector-valued function on an infinite-dimensional space. J. Multivar. Anal. 4, 52–65 (1974)
Article MathSciNet Google Scholar

Download references

Acknowledgements

Open access funding provided by University of Pannonia (PE). The research of the author has been supported by Hungarian National Foundations for Scientific Research Grant No. K120186.

Author information

Authors and Affiliations

Department of Mathematics, University of Pannonia, Egyetem u. 10., Veszprém, 8200, Hungary
László Horváth

Authors

László Horváth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to László Horváth.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Horváth, L. New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations. Aequat. Math. 94, 1109–1121 (2020). https://doi.org/10.1007/s00010-019-00696-z

Download citation

Received: 12 August 2019
Revised: 30 November 2019
Published: 14 December 2019
Issue Date: December 2020
DOI: https://doi.org/10.1007/s00010-019-00696-z

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations

Abstract

Similar content being viewed by others

The A-integral for Riemann-measurable vector-valued functions

Riesz–Zygmund Means and Approximation in Variable Exponent Grand Spaces

Cyclic nearly invariant subspaces for semigroups of isometries

1 Introduction

Theorem 1.1

Theorem 1.2

2 Main results

Theorem 2.1

Proof

Remark 2.2

3 Applications

Definition 3.1

Definition 3.2

Proposition 3.3

Proof

Remark 3.4

Proposition 3.5

Proof

Proposition 3.6

Proof

Proposition 3.7

Proof

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

New refinements of the discrete Jensen’s inequality generated by finite or infinite permutations

Abstract

Similar content being viewed by others

The A-integral for Riemann-measurable vector-valued functions

Riesz–Zygmund Means and Approximation in Variable Exponent Grand Spaces

Cyclic nearly invariant subspaces for semigroups of isometries

1 Introduction

Theorem 1.1

Theorem 1.2

2 Main results

Theorem 2.1

Proof

Remark 2.2

3 Applications

Definition 3.1

Definition 3.2

Proposition 3.3

Proof

Remark 3.4

Proposition 3.5

Proof

Proposition 3.6

Proof

Proposition 3.7

Proof

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation