Skip to main content
Log in

Face hallucination with K-means++ dictionary learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Interested face regions have the low-resolution problem in the video surveillance because the distance between face and camera is far. Thus, the high-resolution (HR) faces need to be reconstructed from low-resolution (LR) faces for further processing. Typical face hallucination based on patch-wise sparse coding can achieve better results but have very high complexity for training. In order to reduce the complexity, this paper proposes a method which uses K-means++ clustering instead of sparse coding to obtain an over-complete dictionary pair. Then, the least angle regression (LARS) algorithm is utilized to calculate the coefficients and reconstruct the high-resolution faces. The experimental results show that proposed algorithm can effectively reduce complexity in condition of irregular LR faces. In addition, the comparisons also prove that the proposed method can improve the value of PSNR and SSIM in the same database.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Abbreviations

HR:

high-resolution

LR:

low-resolution

LARS:

least angle regression

ScDL:

semi-coupled dictionary learning

SVD:

singular value decomposition

ScSR:

sparse coding superresolution

ZCA:

Zero Components Analysis

PSNR:

Peak Signal to Noise Ratio

SSIM:

Structural Similarity

References

  1. Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. In: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, pp 1027–1035

  2. Baker S, Kanade T (2000) Hallucinating faces. fg 2000:83–88

  3. Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE transactions on pattern analysis & machine intelligence, no 7, pp 711–720

  4. Buchta C, Kober M, Feinerer I, Hornik K (2012) Spherical k-means clustering. J Stat Softw 50(10):1–22

    Google Scholar 

  5. Coates A, Ng AY (2012) Learning feature representations with kmeans. In: Neural networks: tricks of the trade. Springer, Berlin, pp 561–580

    Chapter  Google Scholar 

  6. Dong C, Loy CC, He K, Tang X (2015) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307

    Article  Google Scholar 

  7. Dong W, Fu F, Shi G, Cao X, Wu J, Li G, Li X (2016) Hyperspectral image super-resolution via non-negative structured sparse representation. IEEE Trans Image Process 25(5):2337–2352

    Article  MathSciNet  Google Scholar 

  8. Donoho DL (2006) For most large underdetermined systems of linear equations the minimal ??1-norm solution is also the sparsest solution. Commun Pure Appl Math 59(6):797–829

    Article  MathSciNet  Google Scholar 

  9. Efron B, Hastie T, Johnstone I, Tibshirani R et al (2004) Least angle regression. Ann Stat 32(2):407–499

    Article  MathSciNet  Google Scholar 

  10. Eskicioglu AM, Fisher P (1995) Image quality measures and their performance. IEEE Trans Commun 43(12):2959–2965

    Article  Google Scholar 

  11. Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2. IEEE, pp 524–531

  12. Fookes C, Lin F, Chandran V, Sridharan S (2012) Evaluation of image resolution and super-resolution on face recognition performance. J Vis Commun Image Represent 23(1):75–93

    Article  Google Scholar 

  13. Gosling C (2010) Encyclopedia of distances. Reference Reviews

  14. He L, Qi H, Zaretzki R (2013) Beta process joint dictionary learning for coupled feature spaces with application to single image superresolution. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 345–352

  15. Huang K, Hu R, Han Z, Lu T, Jiang J, Wang F (2012) Face hallucination via k-selection mean constrained sparse representation. In: Proceedings of the 21st international conference on pattern recognition (ICPR2012). IEEE, pp 882–885

  16. Jian M, Lam K-M (2015) Simultaneous hallucination and recognition of low-resolution faces based on singular value decomposition. IEEE Trans Circuits Syst Video Technol 25(11):1761–1772

  17. Kim J, Kwon Lee J, Mu Lee K (2016) Deeply-recursive convolutional network for image super-resolution. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1637–1645

  18. Kim J, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In: IEEE conference on computer vision and pattern recognition

  19. Ledig C, Theis L, Huszar F, Caballero J, Cunningham A, Acosta A, Aitken A, Tejani A, Totz J, Wang Z, et al (2017) Photo-realistic single image super-resolution using a generative adversarial network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4681–4690

  20. Lee H, Battle A, Raina R, Ng AY (2007) Advances in Neural Information Processing Systems 20: 21st Annual Conference on Neural Information Processing Systems 2007. Desc: Proceedings of a meeting held 3-6 December 2007, Vancouver, British Columbia, Canada pp 801–808

  21. Liong VE, Lu J, Wang G (2013, 2013) Face recognition using deep PCA. In: 9th international conference on information, communications & signal processing. IEEE, pp 1–5

  22. MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, no 14. Oakland, pp 281–297

  23. Mairal J, Bach F, Ponce J et al (2014) Sparse modeling for image and vision processing. Found Trends R Comput Graph Vision 8(2–3):85–283

    Article  Google Scholar 

  24. Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol 58(1):267–288

    MathSciNet  MATH  Google Scholar 

  25. Wang Z, Bovik AC (2002) A universal image quality index. IEEE Signal Process Lett 9(3):81–84

    Article  Google Scholar 

  26. Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612

    Article  Google Scholar 

  27. Wang S, Zhang L, Liang Y, Pan Q (2012) Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 2216–2223

  28. Wang Z, Miao Z, Wu QJ, Wan Y, Tang Z (2014) Low-resolution face recognition: a review. Vis Comput 30(4):359–386

    Article  Google Scholar 

  29. Wang N, Tao D, Gao X, Li X, Li J (2014) A comprehensive survey to face hallucination. Int J Comput Vis 106(1):9–30

    Article  Google Scholar 

  30. Wang Q, Chen M, Nie F, Li X (2018) Detecting coherent groups in crowd scenes by multiview clustering. In: IEEE transactions on pattern analysis and machine intelligence, vol PP, no 99, pp 1–1

  31. Wang Q, Qin Z, Nie F, Li X (2019) Spectral embedded adaptive neighbors clustering. IEEE Trans Neural Netw Learn Syst 30(4):1265–1271

    Article  Google Scholar 

  32. Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Philip SY et al (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37

    Article  Google Scholar 

  33. Yang J, Wright J, Huang TS, Ma Y (2010) Image super-resolution via sparse representation. IEEE Trans Image Process 19(11):2861–2873

    Article  MathSciNet  Google Scholar 

  34. Yuan Y, Ma D, Wang Q (2019) Hyperspectral anomaly detection via sparse dictionary learning method of capped norm. IEEE Access 7(99):16132–16144

    Article  Google Scholar 

Download references

Acknowledgments

The author would like to thank Mengxue Wang from Shandong University for revising this paper. This work was supported in part by the National Key R&D Program of China (2019YFB1311001), in part by the National Natural Science Foundation of China (61203261 and 61876099), in part by the Scientific and Technological Development Project of Shandong Province (2019GSF111002), in part by the Shenzhen Science and Technology Research and Development Funds (JCYJ20180305164401921), in part by the Foundation of Ministry of Education Key Laboratory of System Control and Information Processing (Scip201801), in part by the Foundation of Key Laboratory of Intelligent Computing & Information Processing of Ministry of Education (2018ICIP03), and in part by the Foundation of State Key Laboratory of Integrated Services Networks (ISN20-06).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenxue Chen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, Z., Li, J. & Liu, C. Face hallucination with K-means++ dictionary learning. Multimed Tools Appl 79, 11685–11698 (2020). https://doi.org/10.1007/s11042-019-08505-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-08505-0

Keywords

Navigation