1 Introduction

The convexity of functions has been frequently used in various fields of pure and applied mathematics, for instance in function theory, mathematical analysis, functional analysis, probability theory, optimization theory, operational research, information theory. In short, convex functions entail a strong and elegant interaction between analysis and geometry. The simple generalization to a convex function extensively widens our scope for analysis.

Inequalities are frequently used in solving several problems of the applied sciences. Some recent work on the applications of mathematical inequalities can be found in [7, 12] and [24].

The Jensen inequality for convex functions plays a pivotal role in the theory of inequalities because of the fact that various other inequalities, for instance the Holder and Minkowski inequalities and the arithmetic mean–geometric mean inequality can be obtained as particular cases.

Some useful results on Jensen’s inequality for strongly convex functions and generalized majorization inequalities can be found in [20] and [13].

For \(\psi :[a,b]\to \mathbb {R}\) where \(\psi ^{(n-1)}\) is absolutely continuous, the renowned Taylor formula \(\forall z \in [a,b]\) at the point \(c\in [a,b]\) is

$$ \psi (z)=\sum_{w=0}^{n-1} \frac{\psi ^{(w)}(c)}{w!}(z-c)^{w}+ \frac{1}{(n-1)!} \int _{c}^{z}{\psi ^{(n)}(\xi ) (z-\xi )^{n-1}}\,d\xi . $$
(1)

Taking \(v=1, \ldots , 5\), the Green functions \(G_{v}:[a , b] \times [a,b] \rightarrow \mathbb {R}\) are defined as

$$\begin{aligned}& G_{1}(z, \xi )= \textstyle\begin{cases} \frac{(b-z)(a-\xi )}{b-a},& a\le \xi \leq z, \\ \frac{(b-\xi )(a-z)}{b-a} ,& z \leq \xi \le b , \end{cases}\displaystyle \end{aligned}$$
(2)
$$\begin{aligned}& G_{2}(z,\xi ) = \textstyle\begin{cases} a-\xi , & a\le \xi \le z, \\ a-z, & z\le \xi \le b, \end{cases}\displaystyle \end{aligned}$$
(3)
$$\begin{aligned}& G_{3}(z,\xi ) = \textstyle\begin{cases} z-b, & a\le \xi \le z, \\ \xi -b, & z\le \xi \le b, \end{cases}\displaystyle \end{aligned}$$
(4)
$$\begin{aligned}& G_{4}(z,\xi ) = \textstyle\begin{cases} z-a, & a\le \xi \le z, \\ \xi -a, & z\le \xi \le b, \end{cases}\displaystyle \end{aligned}$$
(5)
$$\begin{aligned}& G_{5}(z,\xi ) = \textstyle\begin{cases} b-\xi , & a\le \xi \le z, \\ b-z, & z\le \xi \le b. \end{cases}\displaystyle \end{aligned}$$
(6)

Each of these functions is convex and continuous with respect to the two variables z and ξ.

Lemma 1

([19])

Suppose \(\psi \in C^{2}[a,b]\), then the following identities are valid:

$$\begin{aligned}& {\psi }(z)=\frac{a-z}{b-a}{\psi }(a)+\frac{z-a}{b-a}{ \psi }(b)+ \int _{a}^{b }G_{1}(z, \xi ){\psi }''(\xi )\,d\xi , \end{aligned}$$
(7)
$$\begin{aligned}& \psi (z)=\psi (a)+(z-a)\psi ^{\prime }(b)+ \int _{a}^{b}{G_{2}(z,\xi ) \psi ^{\prime \prime }(\xi )}\,d\xi , \end{aligned}$$
(8)
$$\begin{aligned}& \psi (z)=\psi (b)+(b-z)\psi ^{\prime }(a)+ \int _{a}^{b}{G_{3}(z,\xi ) \psi ^{\prime \prime }(\xi )}\,d\xi , \end{aligned}$$
(9)
$$\begin{aligned}& \psi (z)=\psi (b)-(b-a)\psi ^{\prime }(b)+(z-a)\psi ^{\prime }(a)+ \int _{a} ^{b}{G_{4}(z,\xi )\psi ^{\prime \prime }(\xi )}\,d\xi , \end{aligned}$$
(10)
$$\begin{aligned}& \psi (z)=\psi (a)+(b-a)\psi ^{\prime }(a)-(b-z)\psi ^{\prime }(b)+ \int _{a} ^{b}{G_{5}(z,\xi )\psi ^{\prime \prime }(\xi )}\,d\xi . \end{aligned}$$
(11)

Proof

Consider the integral

$$ \int _{a}^{b}{G_{v}(z, r)\psi ''(r)}\,dr= \int _{a}^{z}G_{v}(z,r) \psi ''(r)\,dr+ \int _{z}^{b}G_{v}(z,r)\psi ''(r)\,dr. $$

For fixed \(v=1, \ldots ,5\), after integrating for a particular value of the Green functions, (7)–(11) are obtained. □

In order to state some recent investigations of Jensen’s inequality, we shall use the following assumptions:

(H1):

Suppose \(2\leq k\leq m\) are integers and probability distributions \(\mathbf{p}, \boldsymbol {\lambda } \in \mathbb{R}_{+}^{m}\) so that \(\sum_{i}{p_{i}}=1\) and \(\sum_{k}\lambda _{k}=1\).

(H2):

Suppose \(( Z,\mathcal{B},\mu ) \) represents probability space.

Suppose for a fixed integer \(l\geq 2\) and for \(m=1, \ldots , l \), \(\mathcal{B}^{l}\) denotes the σ-algebra in \(Z^{l}\) which is generated by the projection mappings \(pr_{m}:Z^{l}\rightarrow Z\) given as

$$ pr_{m} ( z_{1},\ldots ,z_{l} ) :=z_{m}, $$

where \(\mu ^{l}\) represents a product measure on \(\mathcal{B}^{l}\). This measure is uniquely (μ is σ-finite) specified by

$$ \mu ^{l} ( B_{1}\times \cdots \times B_{l} ) :=\mu ( B _{1} ) \cdots \mu ( B_{l} ) , \quad B_{m}\in \mathcal{B}, m=1,\ldots ,l. $$
(H3):

g is μ-integrable on Z with values in \(I\subset \mathbb{R}\).

(H4):

Suppose ψ is convex on I with \(\psi \circ g\) being μ-integrable on Z.

Considering the assumptions (H1)–(H4), we now define the following.

Theorem 1

([8])

Assume (H1) with \(\psi :C\to \mathbb {R}\) being a convex function where C is a convex subset of real vector space Z. If \(z_{1},\ldots ,z_{m}\in C\), then

$$\begin{aligned} \psi \Biggl( \sum_{i=1}^{m}p_{i}z_{i} \Biggr) \leq& C_{ \mathrm{dis}}=C_{\mathrm{dis}} ( \psi ,{z,\mathbf{p}}, \boldsymbol { \lambda } ) \\ :=&\sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}p _{i+j} \Biggr) \psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}z_{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}} \biggr) \leq \sum _{i=1}^{m}p_{i}\psi ( z_{i} ), \end{aligned}$$
(12)

while \(i+j\) means \(i+j\) (modm).

Theorem 2

([8])

Suppose (H1)(H4) hold. Then

$$ \psi \biggl( \int _{Z}g\,d\mu \biggr) \leq C_{\mathrm{par}} ( t ) \leq C_{\mathrm{int}}\leq \int _{Z}\psi \circ g\,d\mu , \quad t\in [ 0,1 ], $$

where

$$\begin{aligned} C_{\mathrm{int}} =&C_{\mathrm{int}} ( \psi ,g,\mu , \mathbf{p}, \boldsymbol {\lambda }) \\ :=&\sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}p _{i+j} \Biggr) \int _{Z^{m}}\psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}g ( z_{i+j} ) }{ \sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}} \biggr) \,d\mu ^{m} ( z_{1},\ldots ,z_{m} ) , \end{aligned}$$
(13)

and for \(t\in [ 0,1 ] \)

$$\begin{aligned} C_{\mathrm{par}} ( t ) =&C_{\mathrm{par}} ( t,\psi ,g, \mu , \mathbf{p}, \boldsymbol {\lambda } ) \\ :=&\sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}p_{i+j} \Biggr) \\ &{}\cdot \int _{Z^{m}}\psi \biggl( t\frac{\sum_{j=0} ^{k-1}\lambda _{j+1}p_{i+j}g ( z_{i+j} ) }{\sum_{j=0} ^{k-1}\lambda _{j+1}p_{i+j}}+ ( 1-t ) \int _{Z}g\,d\mu \biggr) \,d\mu ^{m} ( z_{1},\ldots ,z_{m} ) , \end{aligned}$$
(14)

while \(i+j\) means \(i+j\) (modm).

For the purpose of achieving our targets, we construct non-negative linear functionals from the above theorem.

Remark 1

Under the assumptions of Theorem 1 and Theorem 2 for ψ to be a convex function, we have

$$\begin{aligned}& {J _{1}}(\psi ) = {J _{1}}(\mathbf{{z}},\mathbf{{p}}, \boldsymbol {{\lambda }};\psi ): = \sum_{i = 1}^{m} {{p_{i}}} \psi ( {{z_{i}}} ) - {C_{\mathrm{dis}}} ( { \psi , \mathbf{{z}},\mathbf{{p}},\boldsymbol {{\lambda }}} )\ge 0, \\& {J _{2}}(\psi ) = {J _{1}}(\mathbf{{z}},\mathbf{{p}}, \boldsymbol {{\lambda }};\psi ): = {C_{\mathrm{dis}}} ( {\psi , \mathbf{{z}}, \mathbf{{p}},\boldsymbol {{\lambda }}} ) - \psi \Biggl( {\sum _{i = 1}^{m} {{p_{i}}} {z_{i}}} \Biggr)\ge 0, \\& {J _{3}}(\psi ) = {J _{3}}(\psi ,g,\mu ,\mathbf{{p}}, \boldsymbol {{\lambda }}): = \int _{X} \psi \circ g\,d\mu - {C_{\mathrm{int}}} ( {\psi ,g,\mu ,\mathbf{{p}}, \boldsymbol {{\lambda }}} )\ge 0, \\& {J _{4}}(\psi ) = {J _{4}}(t,\psi ,g,\mu , \mathbf{{p}}, \boldsymbol {{\lambda }}): = \int _{X} \psi \circ g\,d\mu - {C_{\mathrm{par}}} ( {t, \psi ,g,\mu ,\mathbf{{p}}, \boldsymbol {{\lambda }}} ) \ge 0; \quad t \in [ {0,1} ], \\& {J _{5}}(\psi ) = {J _{5}}(t,\psi ,g,\mu , \mathbf{{p}}, \boldsymbol {{\lambda }}): = {C_{\mathrm{int}}} ( {\psi ,g,\mu , \mathbf{{p}},\boldsymbol {{\lambda }}} ) - {C_{\mathrm{par}}} ( {t,\psi ,g,\mu , \mathbf{{p}},\boldsymbol {{\lambda }}} ) \ge 0; \quad t \in [ {0,1} ], \\& {J _{6}}(\psi ) = {J _{6}}(t,\psi ,g,\mu , \mathbf{{p}}, \boldsymbol {{\lambda }}): = {C_{\mathrm{par}}} ( {t,\psi ,g,\mu , \mathbf{{p}},\boldsymbol {{\lambda }}} ) - \psi \biggl( { \int _{X} g \,d\mu } \biggr) \ge 0; \quad t \in [ {0,1} ]. \end{aligned}$$

2 Main results

To start for real weights, we need the following assumptions:

(A1):

For \(J_{u} (\cdot )\) (\(u=1, 2\)), suppose \(\frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}z_{i+j}}{\sum_{j=0}^{k-1} \lambda _{j+1}p_{i+j}} \in [a,b]\) for \(i=1, \ldots, m\).

(A2):

For the linear functionals \(J_{u} (\cdot )\) (\(u=3, \ldots ,6\)), suppose (H2)–(H4) are satisfied and

$$ \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}g ( z_{i+j} ) }{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}}\in [a,b]\quad \mbox{for }i=1, \ldots, m. $$

We take cyclic refinements of Jensen’s inequality in discrete and continuous versions and form the following identities with real weights by using Taylor’s formula.

Theorem 3

Suppose \(m,k \in \mathbb{N}\), \(p_{1},\ldots ,p_{m}\) and \(\lambda _{1}, \ldots ,\lambda _{k}\) are real tuples for \(2\leq k\leq m\), such that \(\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}\neq 0\) for \(i=1, \ldots, m\) with \(\sum_{i=1}^{m} p_{i} =1\) and \(\sum_{j=1}^{k} {\lambda }_{j} =1\). Also let \(z\in [a,b]\subset \mathbb{R}\) and \(\mathbf{z} \in [a,b]^{m}\). Consider the function \(\psi :[a,b]\to \mathbb {R}\) such that \(\psi ^{(n-1)}\) is absolutely continuous and \(G_{v}\) (\(v=1, \ldots ,5\)) are same as given in (2)(6), respectively. Then for (\(u=1, \ldots , 6\)) along with the assumptions (A1) and (A2), we have the following generalized identities:

  1. (a)
    $$ J_{u} (\psi ) = \sum_{w=1}^{n-1} \frac{\psi ^{(w)}(a)}{w!}J_{u}\bigl((z-a)^{w}\bigr)+ \frac{1}{(n-1)!} \int _{a}^{b}{\psi ^{(n)}(\xi )J_{u}\bigl((z-\xi )_{+} ^{n-1}\bigr)}\,d\xi , $$
    (15)
  2. (b)
    $$ J_{u} (\psi ) = \sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!}J_{u}\bigl((b-z)^{w}\bigr)- \frac{(-1)^{n-1}}{(n-1)!} \int _{a}^{b}{\psi ^{(n)}(\xi )J_{u}\bigl(( \xi -z)_{+}^{n-1}\bigr)}\,d\xi , $$
    (16)
  3. (c)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= \biggl( \frac{{\psi }(b)-{\psi }(a)}{b-a} \biggr) J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{ \psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{\xi } ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (17)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &=\psi '(b)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G_{2}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{\xi } ^{b} {J_{u} \bigl(G_{2}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (18)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= -\psi '(a)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{3}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{\xi } ^{b} {J_{u} \bigl(G_{3}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (19)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= \psi '(a)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{4}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{\xi } ^{b} {J_{u} \bigl(G_{4}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (20)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= \psi '(b)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{5}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{\xi } ^{b} {J_{u} \bigl(G_{5}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (21)
  4. (d)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= \biggl( \frac{{\psi }(b)-{\psi }(a)}{b-a} \biggr) J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{ \psi ^{(w)}(a)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}-\frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{a} ^{\xi } {J_{u} \bigl(G_{1}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (22)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &=\psi '(b)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G_{2}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}-\frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{a} ^{\xi } {J_{u} \bigl(G_{2}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (23)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &= -\psi '(a)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G _{3}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}-\frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{a} ^{\xi } {J_{u} \bigl(G_{3}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (24)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &=\psi '(a)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G_{4}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}-\frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{a} ^{\xi } {J_{u} \bigl(G_{4}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (25)
    $$\begin{aligned}& \begin{aligned}[b] J_{u} (\psi ) &=\psi '(b)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G_{5}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &\quad {}-\frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {{\psi ^{(n)}}} (\xi ) \biggl( \int _{a} ^{\xi } {J_{u} \bigl(G_{5}(z,r)\bigr)} (r- \xi )^{n-3}\,dr \biggr)\,d\xi , \end{aligned} \end{aligned}$$
    (26)

    where \((z-\xi )_{+}=\max (z-\xi ,0)\).

Proof

Fix \(u=1, \ldots , 6\).

(a) Applying Taylor’s formula (1) at point a, we get

$$ \psi (z)=\sum_{w=0}^{n-1} \frac{\psi ^{(w)}(a)}{w!}(z-a)^{w}+ \frac{1}{(n-1)!} \int _{a}^{b}{\psi ^{(n)}(\xi ) (z- \xi )_{+}^{n-1}}\,d\xi . $$
(27)

Since each \(J_{u} (\cdot )\) is a linear functional, applying it to the Taylor expression (27) and using the constancy property of the \(J_{u}(\cdot )\), we get (15).

(b) Applying Taylor’s formula (1) at point b, we get

$$ \psi (z)=\sum_{w=0}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!}(z-b)^{w}- \frac{(-1)^{n-1}}{(n-1)!} \int _{a}^{b}{\psi ^{(n)}(\xi ) (\xi -z)_{+} ^{n-1}}\,d\xi , $$
(28)

and following similar steps we get (16).

(c) For fixed \(v=1\), applying (7) to \(J_{u} ( \cdot )\) and using the linearity of \(J_{u} (\cdot )\), we have

$$\begin{aligned} J_{u} (\psi ) =&{\psi }(a)J_{u} \biggl( \frac{b-z}{b-a} \biggr)+{\psi }(b) J _{u} \biggl( \frac{z-a}{b-a} \biggr)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr)} \psi ^{\prime \prime }(r)\,dr \\ =&{\psi }(a)\frac{J_{u}(b-z)}{b-a}+{\psi }(b) \frac{J_{u}(z-a)}{b-a}+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr)} \psi ^{\prime \prime }(r)\,dr \\ =&\frac{1}{b-a} \bigl({\psi }(a)J_{u}(b)-{\psi }(a)J_{u}(z)+{\psi }(b) J _{u}(z)-{\psi }(b)J_{u}(a) \bigr) \\ &{}+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr)} \psi ^{\prime \prime }(r)\,dr \\ =&\frac{1}{b-a} \bigl({\psi }(b) J_{u}(z)-{\psi }(a)J_{u}(z) \bigr)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr)} \psi ^{\prime \prime }(r) \,dr. \end{aligned}$$
(29)

Differentiating (1) twice and putting \(c=a\) and replacing n by \((n-2)\) or utilizing (1) on the function \(\psi ''\) at the point a, we get

$$ \psi ''(r) = \sum _{w = 2}^{n - 1} {\frac{{{\psi ^{(w)}}(a )}}{ {(w - 2)!}}} {(r - a )^{w - 2}} + \frac{1}{{(n - 3)!}} \int _{a} ^{r} {{\psi ^{(n)}}(\xi ){{(r - \xi )}^{n - 3}}\,d\xi }. $$
(30)

Now, using (30) in (29), we get

$$\begin{aligned} J_{u}(\psi ) =& \biggl(\frac{{\psi }(b)-{\psi }(a)}{b-a} \biggr) J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{ \psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr \\ &{}+ \frac{1}{{ ( {n - 3} )!}} \int _{a} ^{b} {J_{u}\bigl(G _{1}(z,r)\bigr)} \biggl( \int _{a} ^{r} {{\psi ^{(n)}}} (\xi ) (r-\xi )^{n-3}\,d\xi \biggr)\,dr. \end{aligned}$$

Now applying Fubini’s theorem to the last term gives (17), respectively, for \(v=1\) and \(u=1,\ldots , 6\). The results for \(v=2,3,4,5\) can be obtained in the same way.

(d) Differentiating (1) twice and now taking \(c=b\), we get

$$ \psi ''(r) = \sum _{w = 2}^{n - 1} {\frac{{{\psi ^{(w)}}(b )}}{ {(w - 2)!}}} {(r - b )^{w - 2}} - \frac{1}{{(n - 3)!}} \int _{r} ^{b} {{\psi ^{(n)}}(\xi ){{(r - \xi )}^{n - 3}}\,d\xi }. $$
(31)

Analogously, putting (31) in (29) and using Fubini’s theorem gives (22), respectively, for \(v=1\) and \(u=1,\ldots , 6\). The results for \(v=2,3,4,5\) can be obtained in the same way. □

The following theorem gives a key criterion to test the n-convexity of a function ψ (see [23, p. 16]).

Theorem 4

If \(\psi ^{ (n )}\) exists, then ψ is n-convex if and only if \(\psi ^{ (n )}\geq 0\).

We now use Theorem 4 to give generalizations of cyclic Jensen type linear functionals for real weights.

Corollary 1

Under the assumptions of Theorem 3, if ψ is n-convex, we obtain the following results:

  1. (a)

    If, for all \(u=1,\ldots , 6\),

    $$ J_{u}\bigl((z-\xi )_{+}^{n-1} \bigr)\ge 0, \quad \xi \in [a,b], $$
    (32)

    holds, then we have

    $$ J_{u} (\psi )\ge \sum _{w=1}^{n-1}\frac{\psi ^{(w)}(a)}{w!}J_{u} \bigl((z-a)^{w}\bigr) $$
    (33)

    for \(u=1,\ldots , 6\).

  2. (b)

    If, for all \(u=1,\ldots , 6\),

    $$ (-1)^{n-1}J_{u}\bigl((\xi -z)_{+}^{n-1}\bigr)\le 0, \quad \xi \in [a,b], $$
    (34)

    holds, then we have

    $$ J_{u}(\psi )\ge \sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!}J_{u}\bigl((b-z)^{w}\bigr) $$
    (35)

    for \(u=1,\ldots , 6\).

  3. (c)

    If, for all \(u=1,\ldots , 6\) and \(v=1,\ldots ,5\),

    $$ \int _{\xi } ^{b} {J_{u} \bigl(G_{v}(z,r)\bigr)} (r-\xi )^{n-3}\,dr\ge 0, \quad \xi \in [a,b], $$
    (36)

    holds, then we have

    $$\begin{aligned}& J_{u} (\psi )\ge \biggl(\frac{{\psi }(b)-{\psi }(a)}{b-a} \biggr) J_{u}(z) \\& \hphantom{J_{u} (\psi )\ge{}}{}+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{ \psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (37)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(b)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G _{2}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (38)
    $$\begin{aligned}& J_{u} (\psi )\ge -\psi '(a)J_{u}(z) + \int _{a} ^{b} {J_{u} \bigl(G _{3}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (39)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(a)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{4}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (40)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(b)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G _{5}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(a)(r-a)^{w-2}}{(w-2)!} \Biggr)} \,dr. \end{aligned}$$
    (41)
  4. (d)

    If, for all \(u=1,\ldots , 6\) and \(v=1,\ldots ,5\),

    $$ \int _{a} ^{\xi } {J_{u} \bigl(G_{v}(z,r)\bigr)} (r-\xi )^{n-3}\,dr\le 0, \quad \xi \in [a,b], $$
    (42)

    holds, then we have

    $$\begin{aligned}& J_{u} (\psi )\ge \biggl(\frac{{\psi }(b)-{\psi }(a)}{b-a} \biggr) J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{1}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{ \psi ^{(w)}(b)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (43)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(b)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G _{2}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(b)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (44)
    $$\begin{aligned}& J_{u} (\psi )\ge -\psi '(a)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G _{3}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(b)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (45)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(a)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G _{4}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(b)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr, \end{aligned}$$
    (46)
    $$\begin{aligned}& J_{u} (\psi )\ge \psi '(b)J_{u}(z)+ \int _{a} ^{b} {J_{u} \bigl(G_{5}(z,r)\bigr) \Biggl( \sum_{w=2}^{n-1} \frac{\psi ^{(w)}(b)(r-b)^{w-2}}{(w-2)!} \Biggr)} \,dr. \end{aligned}$$
    (47)

Proof

We begin with the proof of (a) and its assumed conditions. Fix \(u=1,\ldots , 6\).

By our assumption \(\psi ^{(n-1)}\) is absolutely continuous on \([a,b]\), as a result \(\psi ^{(n)}\) exists almost everywhere. Moreover, ψ is supposed to be n-convex, so by Theorem 4, \(\psi ^{(n)}(z)\ge 0\) almost everywhere on \([a,b]\). Therefore by applying Theorem 3, we get (33).

Similarly, the rest of the inequalities can be proved. □

We now give the final results of the present section.

Theorem 5

Under the assumptions of Theorem 3 if \(p_{1},\ldots ,p _{m}\) and \(\lambda _{1},\ldots ,\lambda _{k}\) are non-negative tuples such that \(\sum_{i=1}^{m} p_{i}=1\) and \(\sum_{j=1}^{k} {\lambda } _{j}=1\), then for ψ being n-convex, we have:

  1. (a)

    Inequality (33) is valid when \(n\ge 3\). Besides, for the function

    $$ F_{1}(z):=\sum_{w=1}^{n-1} \frac{\psi ^{(w)}(a)}{w!}(z-a)^{w} $$
    (48)

    to be convex, the right side of (33) is non-negative, meaning

    $$ J_{u} ({\psi } )\ge 0, \quad u=1, \ldots , 6. $$
    (49)
  2. (b)

    For n even, (35) holds. Furthermore, for the function

    $$ F_{2}(z):=\sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!}(b-z)^{w} $$
    (50)

    to be convex, the right hand side of (35) is non-negative, particularly (49) holds.

  3. (c)

    Inequalities (37)(41) hold for all \(n\ge 3\). Moreover, let (37)(41) be valid and

    $$ {\sum_{w = 2}^{n - 1} { \frac{{{\psi ^{(w)}}(a ){{(r - a )}^{w-2}}}}{ {(w-2)!}}} }\ge 0, $$
    (51)

    then we get (49) for every \(u=1,\ldots , 6\) and \(v =1,\ldots , 5\).

  4. (d)

    If n is even, then (43)(47) hold. Moreover, let (43)(47) be valid and

    $$ {\sum_{w = 2}^{n - 1} { \frac{{{\psi ^{(w)}}(b ){{(r - a)}^{w-2}}}}{ {(w-2)!}}} }\ge 0, $$
    (52)

    then we get (49) for \(u=1,\ldots , 6\) and for \(v=1,\ldots , 5\).

Proof

(a) For fixed \(u=1,\ldots , 6\).

For (\(n\ge 3\)), \(x\mapsto ((x-t)_{+})^{n-1}\) is a convex function, so (32) holds by virtue of Remark 1 on account of the given weights being positive. Hence (33) is established by taking into account Corollary 1(a). Moreover, we can write the R.H.S. of (33) in the form \(J_{u}(F_{1})\) for (\(u=1,\ldots , 6\)), after reorganizing this side. By using Remark 1 we can ensure the non-negativity of the R.H.S. of (33), especially (49) is established.

(b) Similar to the proof of (a).

(c) Fix \(u=1,\ldots , 6\).

Considering the positive weights and \(v=1,\ldots , 5\), \(G_{v}(z,r)\) is convex. Hence using Remark 1, \(J_{u} (G_{v}(z,r))\ge 0\). Since ψ is n-convex, by using Corollary 1(c), we get (37)–(41). Moreover, the linear function z is convex (concave), therefore considering the positive weights, Remark 1 shows that \(0\le J_{u}(z)\ge 0\) implying \(J_{u}(z)=0\). Finally using the positivity of \(J_{u} (G_{v}(z,r))\) and (51), (49) is obtained.

(d) Similar to the proof of (c). □

3 Applications to monotonic functionals

In [22], Pečarić et al. proposed a more generalized class of n-convex functions, that is, n-convex functions at a point (see also [1]).

Definition 1

Consider the function \(\psi :I \rightarrow \mathbb {R}\), d is a point in the interior of I. ψ is called \((n+1)\)-convex at d if there exists a constant C so that the function

$$ F(z)=\psi (z)-\frac{\mathrm{C}}{n!}z^{n} $$
(53)

is n-concave on the interval \(I \cap (-\infty ,d]\) and n-convex on the interval \(I \cap [d,\infty )\). The function ψ is called \((n+1)\)-concave at d if −ψ is \((n+1)\)-convex at d.

A property that explains the name of this class is that a function is n-convex on an interval if and only if it is n-convex at every point of the interval (see [22]). Witkowski et al. in [22] deduced the conditions which are necessary and sufficient on two linear functionals,

$$ \varGamma :C\bigl([a,d]\bigr)\rightarrow \mathbb {R}$$

and

$$ \varUpsilon :C\bigl([d, b ]\bigr)\rightarrow \mathbb {R}$$

so that \(\varGamma (\psi )\le \varUpsilon (\psi )\) is valid for each ψ which is n-convex at d. We shall apply this extended class of n-convex functions to obtaining monotonic linear functionals of Jensen type. Suppose \(\zeta ^{u}\) represents the monomials \(\zeta ^{u}(x)=x ^{u}\), \(u\in \mathbb{N}_{0}\). For the remaining part of this section, \(\varGamma _{u}(\psi _{[a,d]})\) and \(\varUpsilon _{u}(\psi _{[d,b]})\) will represent the linear functionals obtained as the difference of the L.H.S. and R.H.S. of inequality (33), where \(u=1,\ldots ,6\), applied to the intervals \([a,d]\) and \([d,b]\), respectively, i.e., for \(\mathbf{z}\in [a,d]^{m}\) and \(\mathbf{y} \in [d,b]^{{m}}\). Let

$$\begin{aligned}& \varGamma _{u} (\psi _{[a,d]}): = \varGamma _{u} (\mathbf{{z}},\mathbf{{p}}, {\boldsymbol {\lambda }};\psi _{[a,d]})= J_{u}(\psi )- \sum_{w=1}^{n-1} \frac{ \psi ^{(w)}(a)}{w!}J_{u}\bigl((z-a)^{w}\bigr), \end{aligned}$$
(54)
$$\begin{aligned}& \varUpsilon _{u} (\psi _{[d,b]}): = \varUpsilon _{u}(\mathbf{{y}},\mathbf{{q}}, \tilde{\boldsymbol {\lambda }}; \psi _{[d,b]})= J_{u}(\psi )- \sum _{w=1} ^{n-1}\frac{\psi ^{(w)}(d)}{w!}J_{u} \bigl((y-d)^{w}\bigr). \end{aligned}$$
(55)

In a similar vein, by using the inequality (35), for \(u=1,\ldots ,6\), we define linear functionals as

$$\begin{aligned}& {\hat{\varGamma }_{u}(\psi _{[a,d]}):= \hat{\varGamma }(\mathbf{{z}}, \mathbf{{p}},{\boldsymbol {\lambda }};\psi _{[a,d]})=J_{u} (\psi )-\sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(d)}{w!}J_{u}\bigl((d-z)^{w}\bigr)}, \end{aligned}$$
(56)
$$\begin{aligned}& {\hat{\varUpsilon }_{u}(\psi _{[d,b]}):= \hat{\varUpsilon }_{u}(\mathbf{{y}}, \mathbf{{q}},\tilde{\boldsymbol {\lambda }};\psi _{[d,b]})=J_{u}(\psi )-\sum _{w=1}^{n-1}\frac{(-1)^{w}\psi ^{(w)}(b)}{w!}J_{u} \bigl((b-y)^{w}\bigr)}. \end{aligned}$$
(57)

By constructing new linear functionals \(\varGamma _{u}(\psi _{[a,d]})\) and \(\varUpsilon _{u}(\psi _{[d,b]})\) using identity (15) for \(u=1,\ldots ,6\), on \([a,d]\) and \([d,b]\) we obtain

$$\begin{aligned}& \varGamma _{u}(\psi _{[a,d]})= \frac{1}{(n-1)!} \int _{a}^{d}{\psi ^{(n)}( \xi )J_{u}\bigl((z-\xi )_{+}^{n-1}\bigr)}\,d\xi , \end{aligned}$$
(58)
$$\begin{aligned}& \varUpsilon _{u}(\psi _{[d,b]})= \frac{1}{(n-1)!} \int _{d}^{b}{\psi ^{(n)}(\xi )J_{u}\bigl((y-\xi )_{+}^{n-1}\bigr)}\,d\xi . \end{aligned}$$
(59)

By applying identity (16) for \(u=1,\ldots ,6\) to the intervals \([a,d]\) and \([d,b]\) we get

$$\begin{aligned}& \hat{\varGamma }_{u}(\psi _{[a,d]})=- \frac{(-1)^{n-1}}{(n-1)!} \int _{a}^{d}{\psi ^{(n)}(\xi )J_{u}\bigl((\xi -z)_{+}^{n-1}\bigr)}\,d\xi , \end{aligned}$$
(60)
$$\begin{aligned}& \hat{\varUpsilon }_{u}(\psi _{[d,b]})=- \frac{(-1)^{n-1}}{(n-1)!} \int _{d}^{b}{\psi ^{(n)}(\xi )J_{u}\bigl((\xi -y)_{+}^{n-1}\bigr)}\,d\xi . \end{aligned}$$
(61)

Immediately, we can state the first theorem of the present section.

Theorem 6

Consider \(\mathbf{z}\in [a,d]^{m}\) and \(\mathbf{y}\in [d,b]^{{m}}\) where we have the following.

  1. (i)

    For \(u=1,\ldots ,6\), consider

    $$\begin{aligned}& J_{u}\bigl((z-\xi )_{+}^{n-1} \bigr)\ge 0, \quad \xi \in [a,d], \end{aligned}$$
    (62)
    $$\begin{aligned}& J_{u}\bigl((y-\xi )_{+}^{n-1} \bigr)\ge 0, \quad \xi \in [d,b], \end{aligned}$$
    (63)
    $$\begin{aligned}& \int _{a}^{d}{J_{u}\bigl((z-\xi )_{+}^{n-1}\bigr)}\,d\xi = \int _{d} ^{b}{J_{u}\bigl((y-\xi )_{+}^{n-1}\bigr)}\,d\xi , \end{aligned}$$
    (64)

    and let \(\varGamma _{u}(\psi _{[a,d]})\) and \(\varUpsilon _{u}(\psi _{[d,b]})\) be the linear functionals introduced in (54) and (55). If \(\psi :[a,b]\to \mathbb {R}\) is \((n+1)\)-convex at the point d, then

    $$ \varGamma _{u}(\psi _{[a,d]})\le \varUpsilon _{u}(\psi _{[d,b]}) \quad \textit{for } u=1, \ldots ,6. $$
    (65)
  2. (ii)

    Similarly, for \(u=1,\ldots ,6\), suppose

    $$\begin{aligned}& (-1)^{n-1}J_{u}\bigl((\xi -z)_{+}^{n-1}\bigr)\le 0, \quad \xi \in [a,d], \end{aligned}$$
    (66)
    $$\begin{aligned}& (-1)^{n-1}J_{u}\bigl((\xi -y)_{+}^{n-1}\bigr)\le 0, \quad \xi \in [d,b], \end{aligned}$$
    (67)
    $$\begin{aligned}& \int _{a}^{d}{J_{u}\bigl((\xi -z)_{+}^{n-1}\bigr)}\,d\xi = \int _{d} ^{b}{J_{u}\bigl((\xi -y)_{+}^{n-1}\bigr)}\,d\xi , \end{aligned}$$
    (68)

    and let \(\hat{\varGamma }_{u}(\psi _{[a,d]})\) and \(\hat{\varUpsilon }_{u}( \psi _{[d,b]})\) be the linear functionals which are given by (56) and (57). If \(\psi :[a,b]\to \mathbb {R}\) is \((n+1)\)-convex at the point d, then

    $$ \hat{\varGamma }_{u}(\psi _{[a,d]})\le \hat{\varUpsilon }_{u}(\psi _{[d,b]}) \quad \textit{for } u=1,\ldots ,6. $$
    (69)

Proof

(i) Fix \(u=1,\ldots ,6\). Using Definition 1, construct the function \(F=\psi -\frac{\mathrm{C}}{n!}\zeta ^{n}\) such that F is n-concave on \([a,d]\) and is n-convex on \([d,b]\). Now by applying Corollary 1 to F in the interval \([a,d]\), we get

$$ 0\ge \varGamma _{u}(\varPsi )=\varGamma _{u}(\psi _{[a,d]})- \frac{\mathrm{C}}{(n)!}\varGamma _{u}\bigl(\zeta ^{n}_{[a,d]}\bigr). $$
(70)

Similarly, applying Corollary 1 to F in \([d,b]\), we have

$$ 0\le \varUpsilon _{u}(\varPsi )=\varUpsilon _{u}(\psi _{[d,b]})- \frac{\mathrm{C}}{(n)!}\varUpsilon _{u}\bigl(\zeta ^{n}_{[d,a]}\bigr). $$
(71)

Moreover, utilizing the monomials \(\zeta ^{n}(\cdot )\) in (58) and (59) we get

$$\begin{aligned}& \varGamma _{u}\bigl(\zeta ^{n}_{[a,d]} \bigr)=n \int _{a}^{d} J_{u}\bigl((z-\xi )_{+} ^{n-1}\bigr)\,d\xi , \end{aligned}$$
(72)
$$\begin{aligned}& \varUpsilon _{u}\bigl(\zeta ^{n}_{[d,b]} \bigr)=n \int _{d}^{b} J_{u}\bigl((y-\xi )_{+} ^{n-1}\bigr)\,d\xi . \end{aligned}$$
(73)

Therefore assumption (64) is equivalent to

$$ \varGamma _{u}\bigl(\zeta ^{n}_{[a,d]}\bigr)= \varUpsilon _{u}\bigl(\zeta ^{n}_{[d,b]} \bigr). $$

So from (70) and (71), one can get

$$ \varGamma _{u}(\psi _{[a,d]})\le \frac{\mathrm{C}}{(n)!}\varGamma _{u}\bigl(\zeta ^{n}_{[a,d]} \bigr)=\frac{\mathrm{C}}{(n)!}\varUpsilon _{u}\bigl(\zeta ^{n}_{[d,b]}\bigr) \le \varUpsilon _{u}(\psi _{[d,b]}). $$
(74)

So (65) is obtained for \(u=1,\ldots ,6\).

(ii) A method similar to the above can be employed by using the identities (60) and (61). Hence by deducing supposition (68), we have (69) for \(u=1,\ldots ,6\). □

We conclude with the following remarks.

Remark 2

Note that inequality (65) and (69) are also valid on replacing assumptions (64) and (68) with the weaker assumptions that

$$\begin{aligned}& \mathrm{C} \bigl(\varUpsilon _{u}\bigl(\zeta ^{n}_{[d,b]} \bigr)- \varGamma _{u}\bigl(\zeta ^{n}_{[a,d]} \bigr) \bigr)\ge 0 \quad \mbox{and} \\& \mathrm{C} \bigl(\hat{\varUpsilon }_{u}\bigl(\zeta ^{n}_{[d,b]}\bigr)-\hat{\varGamma } _{u}\bigl( \zeta ^{n}_{[a,d]}\bigr) \bigr)\ge 0 \end{aligned}$$

for \(u=1,\ldots ,6\), respectively.

Remark 3

Similar results can also be formulated by employing Theorem 6 for the inequalities (37) to (41) and (43) to (47) in Corollary 1.

4 New entropic bounds

In fields like probability theory, mathematical statistics and information theory, measures of dissimilarity between probability distributions play a pivotal role. Various divergence measures have been introduced for this purpose. For instance, we have the f-divergence, some particular cases of which are the Kullback–Leibler divergence and the Jensen–Shannon divergence. Entropies are used to quantify the uncertainty, diversity and the randomness of a system. The idea is frequently used in several scientific disciplines. Some new estimations for the Shannon and Zipf–Mandelbrot entropies can be found in [14].

In the current section we will work in discrete space, i.e., with discrete probability distributions.

We first introduce some important definitions and results used for the rest of this section. Csiszár in [5] introduced the following notion of a divergence functional.

Definition 2

Let \(\psi : ( 0,\infty ) \rightarrow ( 0,\infty ) \) be a convex function and let \(\mathbf{p}:= ( p_{1},\ldots ,p_{m} ) \) and \(\mathbf{q}:= ( q_{1},\ldots ,q _{m} ) \) be positive probability distributions. The ψ-divergence functional is

$$ I_{\psi }(\mathbf{p},\mathbf{q}):={ \sum_{i=1}^{n}} q_{i}\psi \biggl( \frac{p_{i}}{q_{i}} \biggr) . $$

It is possible to use non-negative probability distributions in the ψ-divergence functional, by defining

$$ \psi ( 0 ) :=\lim_{t\rightarrow 0+}\psi ( t ) ; \qquad 0\psi \biggl( \frac{0}{0} \biggr) :=0;\qquad 0\psi \biggl( \frac{a}{0} \biggr) :=\lim_{t\rightarrow 0+}t\psi \biggl( \frac{a}{t} \biggr) , \quad a>0. $$

Based on the previous definition, we introduce a new functional.

Definition 3

Suppose \(\psi : I\rightarrow \mathbb {R}\) with I being an interval in \(\mathbb {R}\). Suppose \(\mathbf{p}:= ( p_{1},\ldots ,p_{m} ) \in \mathbb {R}^{m}\) and \(\mathbf{q}:= ( q_{1},\ldots ,q_{m} )\in (0,\infty )^{m}\) such that

$$ \frac{p_{i}}{q_{i}}\in I, \quad i=1,\ldots ,m. $$

Then let

$$ \widetilde{I}_{\psi }(\mathbf{{p}},\mathbf{{q}})= \sum _{i= 1} ^{m} {{q_{i}}\psi \biggl( { \frac{p_{i}}{q_{i}}} \biggr)}. $$
(75)

Remark 4

As a consequences of Corollary 1 for the case \(u=1\), we consider the explicit form of our generalized Jensen inequalities for n-convex functions, from (33) and (35):

$$\begin{aligned}& \sum_{i= 1}^{m} {{p_{i}}} \psi ( {{z_{i}}} )- \sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j} \Biggr) \psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}z_{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}} \biggr) \\& \quad \ge\sum_{w=1}^{n-1} \frac{\psi ^{(w)}(a)}{w!} \Biggl(\sum_{i = 1}^{m} {{p_{i}}} (z_{i}-a)^{w}-\sum _{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}p_{i+j} \Biggr) \biggl( \frac{ \sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}z_{i+j}}{\sum_{j=0} ^{k-1}\lambda _{j+1}p_{i+j}}-a \biggr)^{w} \Biggr), \end{aligned}$$
(76)
$$\begin{aligned}& \sum_{i= 1}^{m} {{p_{i}}} \psi ( {{z_{i}}} )- \sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j} \Biggr) \psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}z_{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}} \biggr) \\& \quad \ge\sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!} \Biggl(\sum_{i = 1}^{m} {{p_{i}}} (b-z_{i})^{w}-\sum _{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}p_{i+j} \Biggr) \biggl(b- \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}z_{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}} \biggr)^{w} \Biggr). \end{aligned}$$
(77)

For the rest of this section, we will use the following assumptions:

(A3):

Suppose for \(m,k \in \mathbb{N}\) (\(2\leq k\leq m\)), \(\lambda _{1},\ldots ,\lambda _{k}\) are positive probability distributions.

(A4):

Let \(\mathbf{q}:= ( q_{1},\ldots ,q_{m} )\in (0, \infty )^{m}\).

Theorem 7

Considering (A3) and (A4), let \(\mathbf{p}:= ( p_{1}, \ldots ,p_{m} ) \in \mathbb {R}^{m}\) such that

$$ \frac{p_{i}}{q_{i}}\in [a,b], \quad i=1,\ldots ,m. $$

Let \(\psi : [a,b]\rightarrow \mathbb {R}\) where \(\psi ^{(n-1)}\) is absolutely continuous and ψ is n-convex, then the following inequalities hold:

  1. (a)
    $$\begin{aligned} \widetilde{I}_{\psi }(\mathbf{{p}},\mathbf{{q}}) \ge& \sum _{i=1} ^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}}{ \sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}} \biggr)+\sum_{w=1}^{n-1} \frac{\psi ^{(w)}(a)}{w!} \\ &{}\cdot \Biggl(\sum_{i = 1}^{m} {{q_{i}}} \biggl(\frac{p_{i}}{q_{i}}-a \biggr)^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{i+1}q _{i+j} \Biggr) \biggl( \frac{\sum_{i=0}^{k-1}\lambda _{i+1}p _{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}}-a \biggr)^{w} \Biggr), \end{aligned}$$
    (78)
  2. (b)
    $$\begin{aligned} \widetilde{I}_{\psi }(\mathbf{{p}},\mathbf{{q}}) \ge& \sum _{i=1} ^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \psi \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p_{i+j}}{ \sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}} \biggr)+\sum_{w=1}^{n-1} \frac{(-1)^{w}\psi ^{(w)}(b)}{w!} \\ &{} \cdot\Biggl(\sum_{i = 1}^{m} {{q_{i}}} \biggl(b-\frac{p_{i}}{q_{i}} \biggr) ^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q _{i+j} \Biggr) \biggl(b- \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}} \biggr)^{w} \Biggr). \end{aligned}$$
    (79)

Proof

Replacing \(p_{i}\) with \(q_{i}\) and \(z_{i}\) with \(\frac{p_{i}}{q_{i}}\) for (\(i=1,\ldots ,m\)) in (76) and (77), we get (78) and (79), respectively. □

We now present two significant applications of the previous results.

Shannon’s entropy, also known as a measure of uncertainty, plays a pivotal role in information theory. It is also frequently applied in fields like population genetics, molecular ecology and dynamical systems (see [18]). Some recent bounds for Shannon entropy can be found in [9, 15]. For a positive m-tuple \(\mathbf{{p}} = ( {{p_{1}},\ldots,{p_{m}}} )\) such that \(\sum_{i= 1}^{m} {{p_{i}}} = 1\), the Shannon entropy is defined by

$$ S(\mathbf{{p}}) = - \sum_{i = 1}^{m} {{p_{i}}\ln {p_{i}}} . $$
(80)

Corollary 2

Suppose (A3) holds.

  1. (a)

    Considering (A4), if n is even, we get

    $$\begin{aligned} \sum_{i=1}^{m}q_{i}\ln q_{i} \ge& \sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \ln \Biggl({\sum _{j=0}^{k-1}\lambda _{j+1}q_{i+j}} \Biggr)+\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \\ &{}\cdot \Biggl(\sum_{i = 1}^{m} {{q_{i}}} \biggl(\frac{1}{q_{i}}-a \biggr)^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q _{i+j} \Biggr) \biggl( \frac{1}{\sum_{j=0}^{k-1}\lambda _{j+1}q _{i+j}}-a \biggr)^{w} \Biggr). \end{aligned}$$
    (81)
  2. (b)

    If \(\mathbf{q }:= (q_{1}, \ldots , q_{m})\) is a positive probability distribution, while n is even, we obtain the bounds for the Shannon entropy of q,

    $$\begin{aligned} S(\mathbf{{q}}) \le& -\sum_{i=1}^{m} \Biggl( \sum_{j=0} ^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \ln \Biggl({\sum _{j=0}^{k-1} \lambda _{j+1}q_{i+j}} \Biggr)- \sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \\ &{}\cdot \Biggl(\sum_{i = 1}^{m} {{q_{i}}} \biggl(\frac{1}{q_{i}}-a \biggr)^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q _{i+j} \Biggr) \biggl( \frac{1}{\sum_{j=0}^{k-1}\lambda _{j+1}q _{i+j}}-a \biggr)^{w} \Biggr). \end{aligned}$$
    (82)

    For n odd, (81) and (82) are satisfied in the reverse directions.

Proof

(a) Substituting \(\psi (t):=-\ln t\), and \(\mathbf{p}:= (1,1, \ldots , 1)\) in Theorem 7(a), we obtain the desired results.

(b) This part is a special case of (a). □

The Kullback–Leibler measure, also known as the relative entropy, is a famous non-parametric measure. It is also one of the most famous distance functions used in signal processing, information theory and mathematical statistics. Some recent bounds for the relative entropy can be found in [9, 15]. The Kullback–Leibler distance [17] between the positive probability distributions \(\mathbf{p}=(p_{1},\ldots,p_{m})\) and \(\mathbf{q}=(q_{1},\ldots,q_{m})\) is given as

$$ D(\mathbf{{q}}\parallel {\mathbf{{p}}}) = \sum _{i= 1}^{m} {{q_{i}}\ln \biggl( { \frac{{{q_{i}}}}{{{p_{i}}}}} \biggr)} . $$
(83)

Corollary 3

Suppose (A3) holds.

  1. (a)

    If \(\mathbf{q }:= (q_{1}, \ldots , q_{m})\), \(\mathbf{p }:= (p _{1}, \ldots , p_{m})\in (0,\infty )^{m}\). Then, for even n, we get

    $$\begin{aligned} \sum_{i = 1}^{m} {{q_{i}} \ln \biggl( {\frac{{{q_{i}}}}{{{p_{i}}}}} \biggr)} \ge& \sum _{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \ln \biggl( \frac{ \sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}}{\sum_{j=0}^{k-1} \lambda _{j+1}p_{i+j}} \biggr)+\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \\ &{}\cdot \Biggl(\sum_{i = 1}^{m} {{q_{i}}} \biggl(\frac{p_{i}}{q_{i}}-a \biggr) ^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q _{i+j} \Biggr) \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}}-a \biggr)^{w} \Biggr). \\ & \end{aligned}$$
    (84)
  2. (b)

    If \(\mathbf{q }:= (q_{1}, \ldots , q_{m})\), \(\mathbf{p }:= (p _{1}, \ldots , p_{m})\) are positive probability distributions with n even, we obtain

    $$\begin{aligned} D(\mathbf{{q}}\parallel {\mathbf{{p}}}) \ge& \sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j} \Biggr) \ln \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}}{\sum_{j=0} ^{k-1}\lambda _{j+1}p_{i+j}} \biggr)+\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \\ &{}\cdot \Biggl(\sum_{i= 1}^{m} {{q_{i}}} \biggl(\frac{p_{i}}{q_{i}}-a \biggr)^{w}- \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\lambda _{j+1}q _{i+j} \Biggr) \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}p _{i+j}}{\sum_{j=0}^{k-1}\lambda _{j+1}q_{i+j}}-a \biggr)^{w} \Biggr). \end{aligned}$$
    (85)

    In the case n is odd, (84) and (85) are satisfied in the reverse directions.

Proof

(a) Setting \(\psi (t):=-\ln t\) in Theorem 7(a) gives the required results.

(b) This part is a special case of (a). □

Philologist George Kingsley Zipf (1902–1950) studied statistical occurrences of words in different languages. It was one of the first academic studies of word frequency and was originally prescribed only for linguistics. It was only later that many other disciplines took credit of it: the Pareto law in economy reveals another aspect of it and the “Zipfian distribution” is present in other fields as well: information science, bibliometrics, social sciences etc.

Benoit Mandelbrot (1924–2010), in 1966, generalized the Zipf law and improved for counting the low-rank words. It has various applications in art, in ecological field studies and is also used in information sciences [6, p. 294]. Some of the recent studies regarding the Zipf–Mandelbrot law can be found in the listed references (see [9, 10, 15, 16]).

Hence if we rank (r) word in accordance with the frequency of the word occurrence (ψ), the product is a constant (C):

$$ C = r \psi . $$

For \(m\in \{1, 2, \ldots \}\), \(t\ge 0\) and \(s>0\) the Zipf–Mandelbrot law is given as

$$ \psi (i;m,t,s)=\frac{1}{((i+t)^{s}H_{t,s}^{m})},\quad i=(1,2,\ldots ,m), $$
(86)

where

$$ H_{t,s}^{m}=\sum_{k=1}^{m} \frac{1}{(k+t)^{s}}. $$

In the formula, i represents the rank of the data, t and s are parameters of the distribution.

Suppose \(m\in \{1, 2, \ldots \}\), \(t \ge 0\), \(s > 0\), then the Zipf–Mandelbrot entropy is given as

$$ Z(H,t,s)=\frac{s}{H_{t,s}^{m}}\sum _{i=1}^{m} \frac{\ln (i+t)}{(i+t)^{s}}+\ln \bigl(H_{t,s}^{m}\bigr). $$
(87)

Consider

$$ q_{i}=\psi (i;m,t,s)=\frac{1}{((i+t)^{s}H_{t,s}^{m})}. $$
(88)

The following results give bounds for the Zipf–Mandelbrot entropy.

Theorem 8

Suppose (A3) holds and q is the same as given in (88) by the Zipf–Mandelbrot law with parameters \(m\in \{1, 2, \ldots \}\), \(t \ge 0\), \(s > 0\). For n even, we have

$$\begin{aligned} S(\mathbf{{q}}) =&Z(H,t,s) \\ \le& -\sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1}\frac{\lambda _{j+1}}{((i+j+t)^{s}H_{t,s}^{m})} \Biggr) \ln \Biggl(\frac{1}{H_{t,s}^{m}}{\sum_{j=0}^{k-1} \frac{ \lambda _{j+1}}{((i+j+t)^{s})}} \Biggr) \\ &{}-\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \Biggl(\sum_{i = 1}^{m} {{\frac{1}{((i+t)^{s}H_{t,s}^{m})}}} \bigl(\bigl((i+t)^{s}H _{t,s}^{m}\bigr)-a \bigr)^{w} \Biggr) \\ &{}+\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \Biggl(\sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1} \frac{\lambda _{j+1}}{((i+j+t)^{s}H _{t,s}^{m})} \Biggr) \biggl( \frac{1}{\sum_{j=0}^{k-1}\frac{ \lambda _{j+1}}{((i+j+t)^{s}H_{t,s}^{m})}}-a \biggr)^{w} \Biggr). \end{aligned}$$
(89)

Equation (89) holds in the opposite direction for n odd.

Proof

Substituting this \(q_{i}=\frac{1}{((i+t)^{s}H_{t,s}^{m})}\) in Corollary 2(b) gives the required result. Also note that \(\sum_{i=1}^{m}q_{i}=1\). Moreover, using the above \(q_{i}\) in the Shannon entropy (80), we get the Mandelbrot entropy (87),

$$\begin{aligned} S(q) =&-q_{i}\ln q_{i} \\ =&-\sum_{i=1}^{m} \frac{1}{((i+t)^{s}H_{t,s} ^{m})}\ln \frac{1}{((i+t)^{s}H_{t,s}^{m})} \\ =&\frac{-1}{H_{t,s}^{m}}\sum_{i=1}^{m} \frac{1}{(i+t)^{s}} \ln \frac{1}{(i+t)^{s}H_{t,s}^{m}} \\ =&\frac{-1}{H_{t,s}^{m}}\sum_{i=1}^{m} \frac{1}{(i+t)^{s}} \bigl(\ln (1)-s\ln (u+t)-\ln \bigl(H_{t,s}^{m} \bigr) \bigr) \\ =&\frac{1}{H_{t,s}^{m}}\sum_{i=1}^{m} \frac{1}{(i+t)^{s}} \bigl(s \ln (i+t)+\ln \bigl(H_{t,s}^{m} \bigr) \bigr) \\ =&\frac{s}{H_{t,s}^{m}}\sum_{i=1}^{m} \frac{\ln (i+t)}{(i+t)^{s}}+\ln \bigl(H_{t,s}^{m} \bigr). \end{aligned}$$
(90)

 □

Corollary 4

Suppose (A3) holds and for \(t_{1}, t_{2} \in [0,\infty )\), \(s_{1}, s_{2} >0\), let \(H_{t_{1},s_{1}}^{m}=\sum_{k=1}^{m}\frac{1}{(k+t _{1})^{s_{1}}}\) and \(H_{t_{2},s_{2}}^{m}=\sum_{k=1}^{m}\frac{1}{(k+t _{2})^{s_{2}}}\). Now using \({q_{i}} = \frac{1}{{{{(i + t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}}\) and \({p_{i}} = \frac{1}{{{{(i + t_{2})}^{s_{2}}}{H_{t_{2},s_{2}}^{m}}}}\) in Corollary 3(b), with n even, we have

$$\begin{aligned} D(\mathbf{{q}}\parallel {\mathbf{{p}}}) =& \sum _{i = 1}^{m} {{\frac{1}{{{{(i+ t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}}}\ln \biggl( \frac{{{(i + t_{2})}^{s_{2}}}{H_{t_{2},s_{2}}^{m}}}{{{{(i + t_{1})} ^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}} \biggr)} \\ \ge& \sum_{i=1}^{m} \Biggl( \sum _{j=0}^{k-1} \frac{ \lambda _{j+1}}{{{{(i +j+ t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}} \Biggr) \ln \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}\frac{1}{{{{(i +j+ t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}}}{\sum_{j=0}^{k-1} \lambda _{j+1} \frac{1}{{{{(i + j+t_{2})}^{s_{2}}}{H_{t_{2},s_{2}}^{m}}}}} \biggr) \\ &{}+\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \Biggl(\sum_{i = 1}^{m} {{\frac{1}{{{{(i+ t_{1})}^{s_{1}}}{H_{t_{1},s_{1}} ^{m}}}}}} \biggl(\frac{{{(i + t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}{ {{{(i+ t_{2})}^{s_{2}}}{H_{t_{2},s_{2}}^{m}}}}-a \biggr)^{w} \Biggr) \\ &{}-\sum_{w=1}^{n-1} \frac{(-1)^{w}}{w.(a)^{w}} \\ &{}\cdot \Biggl(\sum_{i=1}^{m} \Biggl( \sum_{j=0}^{k-1} \frac{\lambda _{j+1}}{{{{(i+j+ t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}} \Biggr) \biggl( \frac{\sum_{j=0}^{k-1}\lambda _{j+1}\frac{1}{{{{(i+j+ t_{2})}^{s_{2}}} {H_{t_{2},s_{2}}^{m}}}}}{\sum_{j=0}^{k-1}\lambda _{j+1}\frac{1}{ {{{(i+ j+t_{1})}^{s_{1}}}{H_{t_{1},s_{1}}^{m}}}}}-a \biggr)^{w} \Biggr). \end{aligned}$$
(91)

If n is odd, (91) holds in the reverse direction.

5 Concluding remarks

New upper bounds for Grïss and Ostrowski type inequalities can be formulated (see [2]) as a direct application of the results obtained by Dragomir et al. in [4] from new generalized inequalities for n-convex functions obtained in the first section. We can also construct several functionals from the inequalities introduced in Theorem 1 and give mean value theorems of Lagrange and Cauchy type for n-convex functions. Moreover, considering the n-exponentially method introduced by Pečarić et al. in [11] and [21] (see also [3]), we can establish a novel collection of non-trivial examples of functions which are n-exponentially and exponentially convex. Finally we can also construct monotonic Cauchy means.