
Stochastic Gradient Algorithm Under (h,φ)-Entropy Criterion

Circuits, Systems & Signal Processing

Abstract

Motivated by the work of Erdogmus and Principe, we use the error (h,φ)-entropy as the supervised adaptation criterion. Several properties of the (h,φ)-entropy criterion, and its connections with traditional error criteria, are investigated. Using a kernel estimation approach, we obtain a nonparametric estimator of the instantaneous (h,φ)-entropy. We then develop the general stochastic information gradient algorithm and derive an approximate upper bound on the step size for adaptive linear neuron training. Moreover, the (h,φ) pair is optimized to improve the performance of the proposed algorithm. For finite impulse response identification with white Gaussian input and noise, the exact optimum φ function is derived. Finally, simulation experiments verify the results and demonstrate the noticeable performance improvement that may be achieved by the optimum (h,φ)-entropy criterion.
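To illustrate the kind of algorithm the abstract describes, the following is a minimal sketch of the stochastic information gradient for the Shannon-entropy special case (h the identity, φ(t) = −t log t), applied to FIR identification with white Gaussian input and noise. It is not the paper's optimized (h,φ) algorithm: the kernel bandwidth `sigma`, window length `L`, and step size `mu` are illustrative choices, and the instantaneous entropy is estimated over the last L errors with a Parzen (Gaussian-kernel) estimate, as in Erdogmus, Hild, and Principe's stochastic information gradient.

```python
import numpy as np

rng = np.random.default_rng(0)

def gaussian_kernel(z, sigma):
    """Gaussian kernel for the Parzen density estimate."""
    return np.exp(-0.5 * (z / sigma) ** 2) / (np.sqrt(2.0 * np.pi) * sigma)

def sig_adapt(x, d, n_taps, L=8, mu=0.05, sigma=1.0):
    """Adapt an FIR filter by stochastic gradient descent on a kernel
    estimate of the instantaneous Shannon error entropy:
        H_k = -log( (1/L) * sum_i kappa_sigma(e_k - e_{k-i}) ).
    """
    w = np.zeros(n_taps)
    errs, regs = [], []                        # last L errors and regressors
    for k in range(n_taps - 1, len(x)):
        u = x[k - n_taps + 1:k + 1][::-1]      # regressor [x_k, ..., x_{k-n+1}]
        e = d[k] - w @ u                       # a priori error
        if len(errs) == L:
            diff = e - np.array(errs)          # e_k - e_{k-i}, i = 1..L
            kern = gaussian_kernel(diff, sigma)
            # dH_k/dw = sum_i kappa(diff_i)*diff_i*(u_{k-i}-u_k)
            #           / (sigma^2 * sum_i kappa(diff_i))
            grad = (kern * diff) @ (np.array(regs) - u) \
                   / (sigma ** 2 * (kern.sum() + 1e-12))
            w = w - mu * grad
        errs.append(e)
        regs.append(u)
        if len(errs) > L:
            errs.pop(0)
            regs.pop(0)
    return w

# Demo: identify a 3-tap FIR system from white Gaussian input and noise.
x = rng.standard_normal(8000)
w_true = np.array([1.0, 0.5, -0.3])
d = np.convolve(x, w_true)[:len(x)] + 0.05 * rng.standard_normal(len(x))
w_est = sig_adapt(x, d, n_taps=3)
```

Note that entropy is invariant to a shift of the error distribution, so entropy minimization alone does not constrain the error mean; for this zero-mean white-input identification setup the minimum nevertheless sits at the true weights.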


References

  1. Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, Chichester (1991)

  2. Douglas, S.C., Meng, H.Y.: Stochastic gradient adaptation under general error criteria. IEEE Trans. Signal Process. 42, 1335–1351 (1994)

  3. Erdogmus, D., Principe, J.C.: Comparison of entropy and mean square error criteria in adaptive system training using higher order statistics. In: Second International Workshop on Independent Component Analysis and Blind Signal Separation, pp. 75–80 (2000)

  4. Erdogmus, D., Principe, J.C.: Generalized information potential criterion for adaptive system training. IEEE Trans. Neural Netw. 13, 1035–1044 (2002)

  5. Erdogmus, D., Principe, J.C.: Convergence properties and data efficiency of the minimum error entropy criterion in Adaline training. IEEE Trans. Signal Process. 51, 1966–1978 (2003)

  6. Erdogmus, D., Hild, K.E. II, Principe, J.C.: Online entropy manipulation: stochastic information gradient. IEEE Signal Process. Lett. 10, 242–245 (2003)

  7. Feng, X., Loparo, K.A., Fang, Y.: Optimal state estimation for stochastic systems: an information theoretic approach. IEEE Trans. Autom. Control 42(6), 771–785 (1997)

  8. Gibson, J.D., Gray, S.D.: MVSE adaptive filtering subject to a constraint on MSE. IEEE Trans. Circuits Syst. 35(5), 603–608 (1988)

  9. Haykin, S.: Adaptive Filter Theory, 3rd edn. Prentice-Hall, Upper Saddle River (1996)

  10. Kaplan, D., Glass, L.: Understanding Nonlinear Dynamics. Springer, New York (1995)

  11. Lo, J.T., Wanner, T.: Existence and uniqueness of risk-sensitivity estimates. IEEE Trans. Autom. Control 47(11), 1945–1948 (2002)

  12. Menendez, M.L., Pardo, J.A., Pardo, M.C.: Estimators based on sample quantiles using (h,φ)-entropy measures. Appl. Math. Lett. 11, 99–104 (1998)

  13. Pardo, L.: Statistical Inference Based on Divergence Measures. Chapman & Hall/CRC, Boca Raton (2006)

  14. Salicru, M., Menendez, M.L., Morales, D., Pardo, L.: Asymptotic distribution of (h,φ)-entropies. Commun. Stat. Theory Methods 22, 2015–2031 (1993)

  15. Sherman, S.: Non-mean-square error criteria. IRE Trans. Inf. Theory IT-4, 125–126 (1958)

  16. Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman & Hall, New York (1986)

  17. Walach, E., Widrow, B.: The least mean fourth (LMF) adaptive algorithm and its family. IEEE Trans. Inf. Theory IT-30(2), 275–283 (1984)

Corresponding author

Correspondence to B. Chen.

Cite this article

Chen, B., Hu, J., Pu, L. et al. Stochastic Gradient Algorithm Under (h,φ)-Entropy Criterion. Circuits Syst Signal Process 26, 941–960 (2007). https://doi.org/10.1007/s00034-007-9004-9
