Skip to main content
Log in

PSO based combined kernel learning framework for recognition of first-person activity in a video

  • Special Issue
  • Published:
Evolutionary Intelligence Aims and scope Submit manuscript


This paper presents human activity recognition problem from first-person view-point (ego-centric video). The task is to understand the activities of a person by an observer (wearable camera or robot) from real-time video data. An efficient human activity recognition system demands the choice of useful traits and the suitable kernels for those traits. In this work, we have proposed a combined kernel learning (CKL) framework using PSO as optimization algorithm for first-person activity recognition in a video. This framework does appropriate feature selection and combines those features from their respective kernels from the video data in a productive way. The proposed algorithm learns an optimal composite kernel from the combination of the basis kernel constructed from different motion-related features of the first-person video. To determine both basis kernel and their combination, this method can optimize a data-dependent kernel evaluation measure. The performance of the proposed CKL is evaluated by combining different types of motion features from the first-person video (JPL-interaction dataset). The result shows a comparatively better rate of accuracy than that of other state-of-the-art human activity recognition methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others


  1. Ryoo MS, Aggarwal J (2009) Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities. In: ICCV.

  2. Ryoo MS, Aggarwal JK (2011) Stochastic representation and recognition of high-level group activities. Int J Comput Vision 93(2):183–200

    Article  MathSciNet  Google Scholar 

  3. Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: ICPR

  4. Ryoo MS, Matthies L (2013) First-person activity recognition: what are they doing to me? In: IEEE conference on computer vision and pattern recognition

  5. Iwashita Y, Takamine A, Kurazume R, Ryoo MS (2014) First-person animal activity recognition. In: Int. Conf. Pattern Recognition (ICPR)

  6. Gori I, Aggarwal JK, Matthies L, Ryoo MS (2016) Multitype activity recognition in robot-centric scenarios. IEEE Robot Autom Lett 1(1):593–600

    Article  Google Scholar 

  7. Ozkan F, Ali M, Surer E, Temizel A (2017) Boosted multiple kernel learning for first-person activity recognition. In: Proceedings of the 25th European signal processing conference

  8. Yeh Y-R, Lin T-C, Chung Y-Y, Wang Y-CF (2012) A novel multiple kernel learning framework for heterogeneous feature fusion and variable selection. IEEE Trans Multimedia 14(3):563–574

    Article  Google Scholar 

  9. Rakotomamonjy A, Bach F, Canu S, Grandvalet Y (2008) Simple CKL. J Mach Learn Res 9:2491–2521

    MathSciNet  Google Scholar 

  10. G¨onen M, Alpaydın E (2011) Multiple kernel learning algorithms. J Mach Learn Res 12:2211–2268

    MathSciNet  MATH  Google Scholar 

  11. Bucak SS, Jin R, Jain AK (2014) Multiple kernel learning for visual object recognition: a review. IEEE Trans Pattern Anal Mach Intell 36(7):1354–1369

    Article  Google Scholar 

  12. Bobick AF, Davis JW (2001) The recognition of human movement using temporal templates. IEEE Trans Pattern Anal Mach Intell 23(3):257–267

    Article  Google Scholar 

  13. Farnebäck G (2003) Two-frame motion estimation based on polynomial expansion. In: Bigun J, Gustavsson T (eds) Image analysis. SCIA 2003. Lecture notes in computer science, vol 2749. Springer, Berlin, Heidelberg

    Google Scholar 

  14. Dollar P, Rabaud V, Cottrell G, Belongie S (2005) Behaviorrecognition via sparse spatio-temporal features. In: IEEEWorkshop on VS-PETS

  15. Human Action Recognition Using Histogram of Oriented Gradient of Motion History Image (2011) In: International Conference on Instrumentation, Measurement, Computer, Communication and Control

  16. Cristianini N, Kandola J, Elisseeff A, Shawe-Taylor J (2002) On kernel target alignment. In: Proc. 14th Int. Conf. adv. neural Inf. process. Syst. vol. 14, pp. 367–374

  17. Baram Y (2005) Learning by kernel polarization. Neural Comput 17:1264–1275

    Article  MathSciNet  Google Scholar 

  18. Gu Y, Wang C, You D, Zhang Y, Wang S, Zhang Y (2012) Representative multiple kernel learning for classification in hyperspectral imagery. IEEE Trans Geosci Remote Sens 50(7):2852–2865

    Article  Google Scholar 

  19. Kennedy J (2011) Particle swarm optimization. Encyclopedia of machine learning. Springer, New York, pp 760–766

    Google Scholar 

  20. Elbeltagi E, Hegazy T, Grierson D (2005) Comparison among five evolutionary-based optimization algorithms. Adv Eng Inform 19:43–53

    Article  Google Scholar 

  21. Hassan R, Cohanim B, De Weck O, Venter G (2005) A comparison of particle swarm optimization and the genetic algorithm. In: Proc. 1st AIAA multidisciplinary des. optimization spec. conf, pp. 18–21

  22. Wang T, Tian S, Huang H, Deng D (2009) Learning by local kernel polarization. Neurocomputing 72:3077–3084

    Article  Google Scholar 

  23. Igel C, Glasmachers T, Mersch B, Pfeifer N, Meinicke P (2007) Gradientbased optimization of kernel-target alignment for sequence kernels applied to bacterial gene start detection. IEEE/ACM Trans Comput Biol Bioinf 4(2):216–226

    Article  Google Scholar 

  24. Bansal JC, Singh P, Saraswat M, Verma A, Jadon SS, Abraham A (2011) Inertia weight strategies in particle swarm optimization. In: Proc. 3rd world congr. nature biologically inspired comput, pp. 633–640

  25. Trelea IC (2003) The particle swarm optimization algorithm: Convergence analysis and parameter selection. Inf Process Lett 85:317–325

    Article  MathSciNet  Google Scholar 

  26. Kim D-W, Lee K, Lee D, Lee KH (2005) Rapid and brief communication evaluation of the performance of clustering algorithms in kernelinduced feature space. Pattern Recognit 38:607–611

    Article  Google Scholar 

  27. Mishra SR, Mishra TK, Sanyal G, Sarkar A (2018) Human gesture recognition in still images using GMM approach. In: Bhateja V, Coello Coello C, Satapathy S, Pattnaik P (eds) Intelligent engineering informatics. Advances in intelligent systems and computing, vol 695. Springer, Singapore

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Soumya Ranjan Mishra.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mishra, S.R., Mishra, T.K., Sarkar, A. et al. PSO based combined kernel learning framework for recognition of first-person activity in a video. Evol. Intel. 14, 273–279 (2021).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: