Abstract
Intelligent Systems for autonomous vehicles including drone, robot vision, and video surveillance, need to distinguish pedestrian from other object. Pedestrian detection is an essential and significant research topic due to its diverse applications. In this paper, a new visual distinctiveness detection method for pedestrian is proposed based on the statistically weighting probabilistic latent semantic analysis. We detect the distinctiveness by integrating three steps as follows: first representing the co-ocurrence matrix of images, which were vectorized using the bag of visual words (BoVW) framework; then calculating the weights through the histograms of visual words of each class; and finally applying the weights to the test images as the distinctiveness of visual words. The probabilistic latent semantic analysis (PLSA) was used as classification method in our system. We extracted the weighted visual words by sampling the patches from the current image. The proposed method was compared to the PLSA using the Caltech 256 datasets. The classes used include pedestrians, cars, motorbikes, airplanes and horses. The results of the experiment show that the proposed method outperforms current methods in predicting pedestrians and transportation objects.
Similar content being viewed by others
References
S. Zhang, R. Benenson, M. Omran, J. Hosang, and B. Schiele, “How far are we from solving pedestrian detection?” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, pp. 1259–1267, 2016.
R. Benenson, M. Omran, J. Hosang, and B. Schiele, “Ten years of pedestrian detection, what have we learned?” Proc. of the European Conf. Computer Vision, pp. 613–627, 2014.
H. Han, Q. Han, X. Li, and J. Gu, “Hierarchical spatial pyramid max pooling based on sift features and sparse coding for image classification,” IET Computer Vision, vol. 7, no. 2, pp. 144–150, 2013. [click]
Z. Ji, “Decoupling sparse coding with fusion of fisher vectors and scalable svms for large-scale visual recognition,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, pp. 450–457, 2013.
O. M. Parkhi, A. Vedaldi, A. Zisserman, and C. Jawahar, “Cats and dogs,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, pp. 3498–3505, 2012.
R. Fergus, L. Fei-Fei, P. Perona, and A. Zisserman, “Learning object categories from google’s image search,” Proc. of the IEEE Conf. Computer Vision, vol. 2, pp. 1816–1823, 2005.
D. G. Lowe, “Object recognition from local scale-invariant features,” Proc. of the IEEE Conf. Computer Vision, vol. 2, pp. 1150–1157, 1999.
P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. I–I, 2001.
T. Q. Bui, T. T. Vu, and K.-S. Hong, “Extraction of sparse features of color images in recognizing objects,” International Journal of Control, Automation and Systems, vol. 14, no. 2, pp. 616–627, 2016. [click]
J. Kim, G. H. Lee, J. J. Jung, and K. N. Choi, “Real-time head pose estimation framework for mobile devices,” Mobile Networks and Applications, vol. 22, no. 4, pp. 634–641, 2017. [click]
S. H. Chang, D.-S. Shim, H.-Y. Kim, and K.-N. Choi, “Object motion tracking using a moving direction estimate and color updates,” International Journal of Control, Automation and Systems, vol. 10, no. 1, pp. 136–142, 2012. [click]
N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 886–893, 2005. [click]
H. Bay, T. Tuytelaars, and L. Van Gool, “Surf: Speeded up robust features,” Proc. of the European Conf. Computer Vision, pp. 404–417, 2006. [click]
Y. Mu, S. Yan, Y. Liu, T. Huang, and B. Zhou, “Discriminative local binary patterns for human detection in personal album,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, pp. 1–8, 2008. [click]
T. Hofmann, “Probabilistic latent semantic indexing,” Proc. of the International ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 50–57, 1999. [click]
D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” Journal of Machine Learning Research, vol. 3, no. Jan, pp. 993–1022, 2003.
C. Zhong and Z. Miao, “Modeling correlation between multi-modal continuous words for plsa-based video classification,” Proc. of the International Conf. Image Processing, pp. 4304–4308, 2014.
K. Pliakos and C. Kotropoulos, “Plsa driven image annotation, classification, and tourism recommendation,” Proc. of the International Conf. Image Processing, pp. 3003–3007, 2014.
L. Duan, C. Wu, J. Miao, L. Qing, and Y. Fu, “Visual saliency detection by spatially weighted dissimilarity,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, pp. 473–480, 2011.
P. Pinoli, D. Chicco, and M. Masseroli, “Enhanced probabilistic latent semantic analysis with weighting schemes to predict genomic annotations,” Proc. of the IEEE Conf. Bioinformatics and Bioengineering, pp. 1–4, 2013.
L. Xie, J. Wang, B. Zhang, and Q. Tian, “Fine-grained image search,” IEEE Transactions on Multimedia, vol. 17, no. 5, pp. 636–647, 2015.
R. Fergus, “Visual object category recognition,” 2005.
H. J. Choi, Y. S. Lee, D. S. Shim, C. G. Lee, and K.N. Choi, “Effective pedestrian detection using deformable part model based on human model,” International Journal of Control, Automation and Systems, vol. 14, no. 6, pp. 1618–1625, 2016. [click]
J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman, “Discovering object categories in image collections,” Proc. of the IEEE Conf. Computer Vision, 2005.
A. Bosch, A. Zisserman, and X. Muñoz, “Scene classification via plsa,” Proc. of the European Conf. Computer Vision, pp. 517–530, 2006.
L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories,” Computer vision and Image understanding, vol. 106, no. 1, pp. 59–70, 2007. [click]
G. Griffin, A. Holub, and P. Perona, “Caltech-256 object category dataset,” California Institute of Technology, 2007.
C. Harris and M. Stephens, “A combined corner and edge detector.” Alvey Vision Conference, vol. 15, no. 50, pp. 10–5244, 1988.
D. G. Lowe, “Local feature view clustering for 3d object recognition,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. I–I, 2001.
G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, “Visual categorization with bags of keypoints,” Proc. of the European Conf. Computer Vision, vol. 1, no. 1-22, pp. 1–2, 2004.
L. Fei-Fei and P. Perona, “A bayesian hierarchical model for learning natural scene categories,” Proc. of the IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 524–531, 2005.
J. Sivic and A. Zisserman, “Video google: A text retrieval approach to object matching in videos.” Proc. of the IEEE Conf. Computer Vision, vol. 2, no. 1470, pp. 1470–1477, 2003.
S. Deerwester, S. T. Dumais, G.W. Furnas, T. K. Landauer, and R. Harshman, “Indexing by latent semantic analysis,” Journal of the American society for Information Science, vol. 41, no. 6, p. 391, 1990.
T. K. Landauer, P. W. Foltz, and D. Laham, “An introduction to latent semantic analysis,” Discourse Processes, vol. 25, no. 2-3, pp. 259–284, 1998. [click]
G. Overett, L. Petersson, N. Brewer, L. Andersson, and N. Pettersson, “A new pedestrian dataset for supervised learning,” Proc. of the IEEE Conf. Intelligent Vehicles Symposium, pp. 373–378, 2008.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay, “Scikitlearn: machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
Author information
Authors and Affiliations
Corresponding author
Additional information
Recommended by Associate Editor Sung Jin Yoo under the direction of Editor Euntai Kim. This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF-2010-0025512). This research was supported by the Chung-Ang University Research Scholarship Grants in 2017.
Hyun Chul Song is a Ph.D. candidate of the School of Computer Science and Engineering, Chung-Ang University, Seoul, Korea. His research interests include image classification, image captioning, and generative adversarial networks.
Gyun Hyuk Lee is a M.S. student of the School of Computer Science and Engineering, Chung-Ang University, Seoul, Korea. His research interests include haze removal recognition.
Duk-Sun Shim is a professor at the School of Electrical and Electronics Engineering in Chung-Ang University, Seoul, Korea. He received his B.S. and M.S. degrees from the Department of Control and Instrumentation Engineering in Seoul National University, Seoul, Korea, in 1984 and 1986, respectively. He received his Ph.D. degree in Aerospace Engineering from the University of Michigan, Ann Arbor, USA, in 1993. He served at the Department of Electrical Engineering and Computer Science in the University of Michigan as a postdoc from January 1994 to January 1995. Since March 1995, he has been with the School of Electrical and Electronics Engineering in Chung-Ang University, Seoul, Korea, where he is currently a Professor. His research interests include robust control, robot SLAM, GNSS and inertial navigation systems, fault detection and isolation, and computer vision.
Kwang Nam Choi is a professor at the School of Computer Science and Engineering, Chung-Ang University, Seoul, Korea. He received his B.S. and M.S. degrees from the Department of Computer Science in Chung-Ang University, Korea, in 1988 and 1990, respectively. He received his Ph.D. degree in Computer Science from the University of York, U.K. in 2002. Since 2002, he has been with the School of Computer Science and Engineering in Chung-Ang University, Seoul, Korea where he is currently a Professor. His research interests include motion tracking, object categorization, and 3D image recognition.
Rights and permissions
About this article
Cite this article
Song, H.C., Lee, G.H., Shim, DS. et al. Visual Distinctiveness Detection of Pedestrian based on Statistically Weighting PLSA for Intelligent Systems. Int. J. Control Autom. Syst. 16, 815–822 (2018). https://doi.org/10.1007/s12555-017-0253-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12555-017-0253-5