Skip to main content

Computer Vision Overview

  • Chapter
  • First Online:
3-D Computer Vision

Abstract

Computer vision is an information subject/discipline that uses computers to realize the functions of human vision system (HVS). This book mainly introduces the high-level content of computer vision, which can be used as a textbook for in-depth study of computer vision. This chapter will introduce the characteristics of human vision, the brightness properties of vision, the spatial properties of vision, and the temporal properties of vision, as well as makes some discussions on visual perception. This chapter will discuss the research purpose, research tasks, and research methods of computer vision. It also introduces the visual computational theory proposed by Marr in more detail. Moreover, a combined presentation for some improvement ideas are provided. This chapter will give a general introduction to the 3-D vision system that obtains 3-D spatial information and realizes the understanding of the scene. It compares and discusses the layers of computer vision and image technology, leading to the main content of this book. Finally, this chapter will present the structure of the book and gives the brief summaries of each chapter in this book.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 89.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Davies E R. Computer and Machine Vision: Theory, Algorithms, Practicalities, 4th Ed. Elsevier. 2012.

    Google Scholar 

  2. Lu Y L, Fu C Y, Zeng S R, et al. College Physics Dictionary. Chemical Industry Press, 1991.

    Google Scholar 

  3. Aumont J. The Image. Translation: Pajackowska C. British Film Institute. 1994,

    Google Scholar 

  4. Hao B Y, Zhang H C, Chen S Y. Experimental Psychology. Beijing University Press, 1983.

    Google Scholar 

  5. Guo X Y, Yang Z L. Basic Experimental Psychology. Higher Education Press, 2005.

    Google Scholar 

  6. Qian J Y. Visual Psychology — Thinking and Communication of Visual Forms. Xue Lin Press, 2006.

    Google Scholar 

  7. Shapiro L, Stockman G. Computer Vision. Prentice Hall. 2001.

    Google Scholar 

  8. Jain A K, Dorai C. Practicing vision: integration, evaluation and applications. Pattern Recognition, 1997, 30(2): 183-196.

    Google Scholar 

  9. Marr D. Vision — A Computational Investigation into the Human Representation and Processing of Visual Information. W.H. Freeman. 1982.

    Google Scholar 

  10. Edelman S. Representation and Recognition in Vision. MIT Press. 1999.

    Google Scholar 

  11. Davies E R. Machine Vision: Theory, Algorithms, Practicalities. 3rd Ed. Elsevier. 2005.

    Google Scholar 

  12. Aloimonos Y (ed.). Special Issue on Purposive, Qualitative, Active Vision. CVGIP-IU, 1992, 56(1): 1-129.

    Google Scholar 

  13. Huang T, Stucki P (eds.). Special Section on 3-D Modeling in Image Analysis and Synthesis. IEEE-PAMI, 1993, 15(6): 529-616.

    Google Scholar 

  14. Forsyth D, Ponce J. Computer Vision: A Modern Approach. Prentice Hall. 2003.

    Google Scholar 

  15. Forsyth D, Ponce J. Computer Vision: A Modern Approach, 2nd Ed. Prentice Hall. 2012.

    Google Scholar 

  16. Zhang Y J. Image engineering and bibliography in China. Technical Digest of International Symposium on Information Science and Technology, 1996, 158-160.

    Google Scholar 

  17. Zhang Y-J. Statistics on image engineering literatures. Encyclopedia of Information Science and Technology, 3rd Ed. 2015, Chapter 595 (6030-6040).

    Google Scholar 

  18. Zhang Y-J. Development of image engineering in the last 20 years. Encyclopedia of Information Science and Technology, 4th Ed., 2018, Chapter 113 (1319-1330).

    Google Scholar 

  19. Zhang Y-J. Image Engineering, Vol.1: Image Processing. De Gruyter, 2017.

    Google Scholar 

  20. Zhang Y-J. Image Engineering, Vol.2: Image Analysis. De Gruyter, 2017.

    Google Scholar 

  21. Zhang Y-J. Image Engineering, Vol.3: Image Understanding. De Gruyter, 2017.

    Google Scholar 

  22. Zhang Y-J. Image Engineering in China: 1995. Journal of Image and Graphics, 1996, 1(1): 78-83.

    Google Scholar 

  23. Zhang Y-J. Image Engineering in China: 1995 (Complement). Journal of Image and Graphics, 1996, 1(2): 170-174.

    Google Scholar 

  24. Zhang Y-J. Image Engineering in China: 1996. Journal of Image and Graphics, 1997, 2(5): 336-344.

    Google Scholar 

  25. Zhang Y-J. Image Engineering in China: 1997. Journal of Image and Graphics, 1998, 3(5): 404-414.

    Google Scholar 

  26. Zhang Y-J. Image Engineering in China: 1998. Journal of Image and Graphics, 1999, 4A(5): 427-438.

    Google Scholar 

  27. Zhang Y-J. Image Engineering in China: 1999. Journal of Image and Graphics, 2000, 5A(5): 359-373.

    Google Scholar 

  28. Zhang Y-J. Image Engineering in China: 2000. Journal of Image and Graphics, 2001, 6A(5): 409-424.

    Google Scholar 

  29. Zhang Y-J. Image Engineering in China: 2001. Journal of Image and Graphics, 2002, 7A(5): 417-433.

    Google Scholar 

  30. Zhang Y-J. Image Engineering in China: 2002. Journal of Image and Graphics, 2003, 8A(5): 481-498.

    Google Scholar 

  31. Zhang Y-J. Image Engineering in China: 2003. Journal of Image and Graphics, 2004, 9A(5): 513-531.

    Google Scholar 

  32. Zhang Y-J. Image Engineering in China: 2004. Journal of Image and Graphics, 2005, 10A(5): 537-560.

    Google Scholar 

  33. Zhang Y-J. Image Engineering in China: 2005. Journal of Image and Graphics, 2006, 11(5): 601-623.

    Google Scholar 

  34. Zhang Y-J. Image Engineering in China: 2006. Journal of Image and Graphics, 2007, 12(5): 753-775.

    Google Scholar 

  35. Zhang Y-J. Image Engineering in China: 2007. Journal of Image and Graphics, 2008, 13(5): 825-852.

    Google Scholar 

  36. Zhang Y-J. Image Engineering in China: 2008. Journal of Image and Graphics, 2009, 14(5): 809-837.

    Google Scholar 

  37. Zhang Y-J. Image Engineering in China: 2009. Journal of Image and Graphics, 2010, 15(5): 689-722.

    Google Scholar 

  38. Zhang Y-J. Image Engineering in China: 2010. Journal of Image and Graphics, 2011, 16(5): 693-702.

    Google Scholar 

  39. Zhang Y-J. Image Engineering in China: 2011. Journal of Image and Graphics, 2012, 17(5): 603-612.

    Google Scholar 

  40. Zhang Y-J. Image Engineering in China: 2012. Journal of Image and Graphics, 2013, 18(5): 483-492.

    Google Scholar 

  41. Zhang Y-J. Image Engineering in China: 2013. Journal of Image and Graphics, 2014, 19(5): 649-658.

    Google Scholar 

  42. Zhang Y-J. Image Engineering in China: 2014. Journal of Image and Graphics, 2015, 20(5): 585-598.

    Google Scholar 

  43. Zhang Y-J. Image Engineering in China: 2015. Journal of Image and Graphics, 2016, 21(5): 533-543.

    Google Scholar 

  44. Zhang Y-J. Image Engineering in China: 2016. Journal of Image and Graphics, 2017, 22(5): 563-574.

    Google Scholar 

  45. Zhang Y-J. Image Engineering in China: 2017. Journal of Image and Graphics, 2018, 23(5): 617-629.

    Google Scholar 

  46. Zhang Y-J. Image Engineering in China: 2018. Journal of Image and Graphics, 2019, 24(5): 665-676.

    Google Scholar 

  47. Zhang Y-J. Image Engineering in China: 2019. Journal of Image and Graphics, 2020, 25(5): 864-878.

    Google Scholar 

  48. Zhang Y-J. Image Engineering in China: 2020. Journal of Image and Graphics, 2021, 26(5): 978-990.

    Google Scholar 

  49. Zhang Y-J. Image Engineering in China: 2021. Journal of Image and Graphics, 2022, 27(4): 1009-1022.

    Google Scholar 

  50. Zhang Y-J. Handbook of Image Engineering. Springer Nature, 2021.

    Google Scholar 

  51. Marques O. Practical Image and Video Processing Using MATLAB. Wiley Publishing, Inc. 2013.

    Google Scholar 

  52. Peters J F. Foundations of Computer Vision. Springer, 2017.

    Google Scholar 

  53. Zhang Y-J. Problem Analysis in Image Engineering. Tsinghua University Press, 2018.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Self-Test Questions

Self-Test Questions

The following questions include both single-choice questions and multiple-choice questions, so each option must be judged.

  1. 1.1

    Human Vision and Characteristics

    1. 1.1.1

      Compare vision and other related concepts (·).

      1. (a)

        Vision and computer vision both perceive the objective world subjectively.

      2. (b)

        Vision and image generation both generate images from the abstract description of the scene.

      3. (c)

        The computer vision system and the machine vision system are comparable to the human vision system.

      4. (d)

        The vision process and the computer vision process are completely deterministic and predictable.

    [Hint] Consider the difference between vision and other concepts.

    1. 1.1.2

      Mach band effect (·).

      1. (a)

        Can be explained by simultaneous contrast

      2. (b)

        Shows the same fact as simultaneous contrast

      3. (c)

        Depends on the brightness adaptation level of the human visual system

      4. (d)

        Indicates that the actual brightness distribution on the strip will be affected by the subjective brightness curve

    [Hint] The Mach band effect shows that the brightness that people perceive is not only related to the light intensity of the scene.

    1. 1.1.3

      Subjective brightness (·).

      1. (a)

        Is only related to scene brightness

      2. (b)

        Is proportional to the illuminance of the object

      3. (c)

        Is possible independent to the absolute value of the object brightness

      4. (d)

        Determines the overall sensitivity of the human visual system

    [Hint] Subjective brightness refers to the brightness of the observed object, which is judged by the human eyes according to the intensity of the light stimulation of the retina.

  2. 1.2

    Computer Vision Theory and Model

    1. 1.2.1

      Computer vision (·).

      1. (a)

        Whose goal is to uncover all the mysteries of the visual process.

      2. (b)

        The research method refers to the structural principles of the human visual system.

      3. (c)

        It is a means to explore the working mechanism of human brain vision.

      4. (d)

        It is realized with the help of the understanding of the human visual system.

    [Hint] Analyze according to the definition of computer vision.

    1. 1.2.2

      Marr’s visual computational theory believes that (·).

      1. (a)

        The visual process is far more complicated than human imagination

      2. (b)

        The key to solve visual problems is the representation and processing of information

      3. (c)

        To complete the visual task, all the works must be combined

      4. (d)

        All visual information problems can be computed with modern computers.

    [Hint] See the five points of Marr’s visual computational theory.

    1. 1.2.3

      In the improved visual computational framework shown in Fig. 1.7, (·).

      1. (a)

        The image acquisition module provides the basis for qualitative vision.

      2. (b)

        The image acquisition module provides the basis for selective vision.

      3. (c)

        The vision purpose module should be constructed based on the purpose of active vision.

      4. (d)

        The function of the high-level knowledge module is to feed back the later result information to the early processing.

    [Hint] Analyze the shortcomings of Marr’s theory.

  3. 1.3

    Three-Dimensional Vision System and Image Technology

    1. 1.3.1

      According to the 3-D vision system flow chart, (·).

      1. (a)

        The 3-D reconstruction must use motion information.

      2. (b)

        The objective analysis of the scene is based on the interpretation of the scene.

      3. (c)

        Decisions can only be made based on the interpretation and understanding of the scene.

      4. (d)

        To obtain motion information, video images must be collected.

    [Hint] Analyze the connection between each step.

    1. 1.3.2

      For image understanding, (·).

      1. (a)

        Its abstraction is high, its operand is the object, and its semantic level is high level.

      2. (b)

        Its abstraction is high, its operand is the symbol, and its semantic level is middle level.

      3. (c)

        Its abstraction is high, its amount of data is small, and its semantic level is high level.

      4. (d)

        Its abstraction is high, its amount of data is large, and its operand is the symbol.

    [Hint] Refer to Fig. 1.9.

    1. 1.3.3

      Which of the following image technique(s) is/are image understanding technologies? (·).

      1. (a)

        Image segmentation

      2. (b)

        Scene restoration

      3. (c)

        Image matching and fusion

      4. (d)

        Extraction and analysis of object characteristics

    [Hint] Consider the input and output of each technology.

  4. 1.4

    Overview of the Structure and Content of This Book

    1. 1.4.1

      In the following content, the five modules in this book in turn are (·).

      1. (a)

        3-D image acquisition, video and motion, binocular stereo vision, scenery matching, scene interpretation

      2. (b)

        3-D image acquisition, binocular stereo vision, monocular multi-image restoration, scene matching, scene interpretation

      3. (c)

        Camera calibration, moving object detection and tracking, binocular stereo vision, monocular and single image restoration, and spatial-temporal behavior understanding

      4. (d)

        Camera calibration, video and motion, monocular multi-image restoration, moving object detection and tracking, and spatial-temporal behavior understanding

    [Hint] Refer to Fig. 1.10.

    1. 1.4.2

      Among the following statements, the correct one/ones is/are (·).

      1. (a)

        3-D image is a kind of depth image

      2. (b)

        Background modeling is a technique for detecting and tracking moving objects in videos

      3. (c)

        Recovering the shape of the object from the tonal change of the object surface is a method of recovering the scene by using multiple monocular images

      4. (d)

        The bag-of-words/bag of feature model is a model of spatial-temporal behavior understanding

    [Hint] Consider the content discussed in each chapter separately.

    1. 1.4.3

      Among the following statements, the incorrect one/ones is/are (·).

      1. (a)

        The region-based binocular stereo matching technology is a relatively abstract matching technology

      2. (b)

        It is possible to use a non-linear camera model for camera calibration

      3. (c)

        The classification of actions is a technique for detecting and tracking moving objects

      4. (d)

        The method of obtaining structure from motion based on the optical flow field is a method of recovering the scene by using multiple monocular images

    [Hint] Refer to the introduction in the overview of each chapter.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Zhang, YJ. (2023). Computer Vision Overview. In: 3-D Computer Vision. Springer, Singapore. https://doi.org/10.1007/978-981-19-7580-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-7580-6_1

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-7579-0

  • Online ISBN: 978-981-19-7580-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics