Engineering the Perception of Recognition Through Interactive Raw Primal Sketch by HNFGS and CNN-MRF

Das, Apurba; Ajithkumar, Nitin

doi:10.1007/978-981-10-7895-8_18

Apurba Das¹⁷ &
Nitin Ajithkumar¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 703))

621 Accesses
2 Citations

Abstract

The impression of a scene on human brain, specifically the primary visual cortex, is still a far-reached goal by the computer vision research community. This work is a proposal of a novel system to engineer the human perception of recognizing a subject of interest. This end-to-end solution implements all the stages from entropy-based unbiased cognitive interview to the final reconstruction of human perception in terms of machine sketch in the framework of forensic sketch of suspects. The lower mid-level vision as designed behaviorally in primary visual cortex honoring the scale-space concept of object identification has been modeled by hierarchical 2D filters, namely hierarchical neuro-visually inspired figure-ground segregation (HNFGS) for interactive sketch rendering. The aforementioned human–machine interaction is twofold: in gross structural design layer and finer/granular modification of the pre-realized digital perception. Pre-realized sketches are formed learning the characteristics of human artists while sketching an object through integrated framework of deep convolutional neural network (D-CNN) and Markov Random field (MRF). After few iterations of interactive fine-tuning of the sketch, a psycho-visual experiment has been designed and performed to evaluate the feasibility and effectiveness of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 199.00; Price excludes VAT (USA)

Softcover Book: USD 259.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ullman, S.: High Level Vision MIT Press, Cambridge, Massachussets, 1996.
Google Scholar
Paterson, A., Squad, C. I., Police, V.: Computerised facial construction and reconstruction. Proceedings of the Asia Pacific Police Technology Conference, 135–144 (1991)
Google Scholar
Frowd, C. D., Hancock, P. J., Carson, D.: EvoFIT: A holistic, evolutionary facial imaging technique for creating composites. ACM Transactions on applied perception (TAP)1, no. 1, 19–39 (2004)
Google Scholar
Laughery, K. R., Fowler, R. H.: Sketch artist and Identi-kit procedures for recalling faces. Journal of Applied Psychology 65, no. 3, 307 (1980)
Google Scholar
Willis, G. B.: Cognitive interviewing and questionnaire design: a training manual. US Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics(1994)
Google Scholar
Zhang, L., Lin, L., Wu, X., Ding, S., Zhang, L. : End-to-end photo-sketch generation via fully convolutional representation learning. 5th ACM on International Conference on Multimedia Retrieval, 627–634 (2015)
Google Scholar
Whitbeck, M.,Guo, H.: Multiple Landmark Warping Using Thin-plate Splines. IPCV, 6, 256–263 (2006)
Google Scholar
Das, A.: Digital Communication: Principles and system modelling. Springer Science and Business Media, 169–172 (2010)
Google Scholar
Parua, S; Das, A; Mazumdar D.; Mitra S.: Determination of Feature Hierarchy from Gabor and SIFT Features for Face Recognition. Second International Conference on Emerging Applications of Information Technology, 2011, pp. 257–260
Google Scholar
Wise, R. A., Fishman, C. S., Safer, M. A.: How to analyze the accuracy of eyewitness testimony in a criminal case., Conn. L. Rev., 42, 435 (2009)
Google Scholar
Marr, D.: Vision: A computational investigation into the human representation and processing of visual information. MIT press, 2010.
Google Scholar
D. Marr and E. Hildreth. Theory of edge detection. In Proceedings of the Royal Society of London, 1980, 207, 187217.
Google Scholar
R. W. Rodieck and J. Stone. Analysis of receptive fields of cat retinal ganglion cells. Journal of Neurophysiology, 1965, 28:833849.
Google Scholar
Ikeda, H. and Wright, J. H.: Functional organization of the periphery effect in retinal ganglion cells Vision Research, 1972, 12, 1857–1879
Google Scholar
Ghosh, K., Roy, A.: Neuro-visually inspired figure-ground segregation. International Conference on Image Information Processing (ICIIP), 1–6 (2011)
Google Scholar
Lindeberg: Scale-space theory: A basic tool for analyzing structures at different scales. Journal of Applied Statistics, 21(2):224270, 1994.
Google Scholar
Das, A. and Ghosh, K.: Enhancing face matching in a suitable binary environment. International Conference on Image Information Processing (ICIIP), 1–6 (2011)
Google Scholar
Das, A., Roy, A., and Ghosh, K.: Proposing a CNN Based Architecture of Mid-Level Vision for Feeding the WHERE and WHAT Pathways in the Brain, FANCCO, LNCS 7076/2011, pp. 559–568.
Google Scholar
Wu, Z., Lin, D., Tang, X.: Deep Markov Random Field for Image Modeling. 14th European Conference on Computer Vision (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Embedded Innovation Lab., Tata Consultancy Services, Bengaluru, India
Apurba Das
Amrita School of Engineering, Amrita Vishwavidyapeetham, Kollam, Kerala, India
Nitin Ajithkumar

Authors

Apurba Das
View author publications
You can also search for this author in PubMed Google Scholar
Nitin Ajithkumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Apurba Das .

Editor information

Editors and Affiliations

Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India
Bidyut B. Chaudhuri
School of Computing, National University of Singapore, Singapore, Singapore
Mohan S. Kankanhalli
Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Balasubramanian Raman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, A., Ajithkumar, N. (2018). Engineering the Perception of Recognition Through Interactive Raw Primal Sketch by HNFGS and CNN-MRF. In: Chaudhuri, B., Kankanhalli, M., Raman, B. (eds) Proceedings of 2nd International Conference on Computer Vision & Image Processing . Advances in Intelligent Systems and Computing, vol 703. Springer, Singapore. https://doi.org/10.1007/978-981-10-7895-8_18

Download citation

DOI: https://doi.org/10.1007/978-981-10-7895-8_18
Published: 12 April 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7894-1
Online ISBN: 978-981-10-7895-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics