Abstract
360° videos enable viewers to look freely in any direction, but this freedom inevitably means that helpful information outside the current view goes unnoticed. To mitigate this problem, picture-in-picture (PIP) guidance was proposed, using preview windows to show regions of interest (ROIs) outside the current view range. We identify several drawbacks of this representation and propose a new method for 360° film watching called AdaPIP. AdaPIP enhances traditional PIP by adaptively arranging preview windows with changeable view ranges and sizes. In addition, AdaPIP incorporates the advantage of arrow-based guidance by attaching arrows to circular preview windows, helping users locate the corresponding ROIs more efficiently. We also adapted AdaPIP and Outside-In to HMD-based immersive virtual reality environments to demonstrate the usability of PIP-guided approaches beyond 2D screens. Comprehensive user experiments on 2D screens as well as in VR environments indicate that AdaPIP is superior to alternative methods in terms of visual experience while maintaining a comparable degree of immersion.
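To make the guidance geometry concrete: the core decision behind any off-screen ROI indicator, including arrow-augmented previews like AdaPIP's, is whether an ROI falls inside the current viewport and, if not, in which 2D direction an attached arrow should point. The following is a minimal illustrative sketch, not the authors' implementation; the yaw/pitch parametrization, field-of-view values, and function names are all assumptions made for the example.

```python
import math

def angular_offset(view_yaw, view_pitch, roi_yaw, roi_pitch):
    """Yaw/pitch offset (degrees) of an ROI relative to the current view
    direction, with yaw wrapped to [-180, 180) for the shortest rotation."""
    d_yaw = (roi_yaw - view_yaw + 180.0) % 360.0 - 180.0
    d_pitch = roi_pitch - view_pitch
    return d_yaw, d_pitch

def guidance_for_roi(view_yaw, view_pitch, roi_yaw, roi_pitch,
                     fov_h=90.0, fov_v=60.0):
    """If the ROI lies outside the viewport, return the 2D arrow angle
    (radians; 0 = right, pi/2 = up) that a circular preview window could
    use to point toward it. Return None when the ROI is already visible."""
    d_yaw, d_pitch = angular_offset(view_yaw, view_pitch, roi_yaw, roi_pitch)
    if abs(d_yaw) <= fov_h / 2 and abs(d_pitch) <= fov_v / 2:
        return None  # ROI is on screen: no guidance needed
    return math.atan2(d_pitch, d_yaw)

# Example: the viewer faces yaw 0; an ROI sits 120 degrees to the left
# and slightly above the horizon, so the arrow points up-left.
print(guidance_for_roi(0.0, 0.0, -120.0, 10.0))
```

A real guidance system would additionally clamp the preview window to the screen border and adapt its size to the ROI, but the visibility test and arrow direction above are the geometric core.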
References
Rhee, T.; Petikam, L.; Allen, B.; Chalmers, A. MR360: Mixed reality rendering for 360° panoramic videos. IEEE Transactions on Visualization and Computer Graphics Vol. 23, No. 4, 1379–1388, 2017.
Lin, Y. C.; Chang, Y. J.; Hu, H. N.; Cheng, H. T.; Huang, C. W.; Sun, M. Tell me where to look: Investigating ways for assisting focus in 360° video. In: Proceedings of the CHI Conference on Human Factors in Computing Systems, 2535–2545, 2017.
Baudisch, P.; Rosenholtz, R. Halo: A technique for visualizing off-screen objects. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 481–488, 2003.
Gustafson, S. G.; Irani, P. P. Comparing visualizations for tracking off-screen moving targets. In: Proceedings of the CHI ’07 Extended Abstracts on Human Factors in Computing Systems, 2399–2404, 2007.
Gustafson, S.; Baudisch, P.; Gutwin, C.; Irani, P. Wedge: Clutter-free visualization of off-screen locations. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 787–796, 2008.
Pavel, A.; Hartmann, B.; Agrawala, M. Shot orientation controls for interactive cinematography with 360 video. In: Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, 289–297, 2017.
Liu, S. J.; Agrawala, M.; DiVerdi, S.; Hertzmann, A. View-dependent video textures for 360° video. In: Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology, 249–262, 2019.
Lin, Y. T.; Liao, Y. C.; Teng, S. Y.; Chung, Y. J.; Chan, L.; Chen, B. Y. Outside-In: Visualizing out-of-sight regions-of-interest in a 360° video using spatial picture-in-picture previews. In: Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, 255–265, 2017.
Google Spotlight Stories. 360 Google Spotlight Stories: Rain or Shine. 2016. Available at https://www.youtube.com/watch?v=QXF7uGfopnY
Adam Cosco. Knives. 2019. Available at https://youtu.be/IrAXKwEKVGA?si=y9gyhtBvxzFY1v-S
AutoNavi Information Technology Co. Ltd. AutoNavi. 2021. Available at https://mobile.amap.com/
Rothe, S.; Buschek, D.; Hußmann, H. Guidance in cinematic virtual reality-taxonomy, research status and challenges. Multimodal Technologies and Interaction Vol. 3, No. 1, 19, 2019.
Adcock, M.; Feng, D.; Thomas, B. Visualization of off-surface 3D viewpoint locations in spatial augmented reality. In: Proceedings of the 1st Symposium on Spatial User Interaction, 1–8, 2013.
Van den Broeck, M.; Kawsar, F.; Schöning, J. It’s all around you: Exploring 360° video viewing experiences on mobile devices. In: Proceedings of the 25th ACM International Conference on Multimedia, 762–768, 2017.
Fonseca, D.; Kraus, M. A comparison of head-mounted and hand-held displays for 360° videos with focus on attitude and behavior change. In: Proceedings of the 20th International Academic Mindtrek Conference, 287–296, 2016.
iNFINITE Production. Crowd-Sourced Data. 2020. Available at https://www.infinite.cz/projects/HMD-tester-virtual-reality-headset-database-utility
Larson, A. M.; Loschky, L. C. The contributions of central versus peripheral vision to scene gist recognition. Journal of Vision Vol. 9, No. 10, 6.1–6.16, 2009.
Millodot, M. Dictionary of Optometry and Visual Science E-Book. Butterworth-Heinemann, 2014.
Kit, D.; Katz, L.; Sullivan, B.; Snyder, K.; Ballard, D.; Hayhoe, M. Eye movements, visual search and scene memory, in an immersive virtual environment. PLoS One Vol. 9, No. 4, e94362, 2014.
Li, C. L.; Aivar, M. P.; Kit, D. M.; Tong, M. H.; Hayhoe, M. M. Memory and visual search in naturalistic 2D and 3D environments. Journal of Vision Vol. 16, No. 8, Article No. 9, 2016.
David, E.; Beitner, J.; Võ, M. L. H. Effects of transient loss of vision on head and eye movements during visual search in a virtual environment. Brain Sciences Vol. 10, No. 11, Article No. 841, 2020.
Nuthmann, A. On the visual span during object search in real-world scenes. Visual Cognition Vol. 21, No. 7, 803–837, 2013.
Cajar, A.; Engbert, R.; Laubrock, J. Spatial frequency processing in the central and peripheral visual field during scene viewing. Vision Research Vol. 127, 186–197, 2016.
David, E. J.; Lebranchu, P.; Perreira Da Silva, M.; Le Callet, P. Predicting artificial visual field losses: A gaze-based inference study. Journal of Vision Vol. 19, No. 14, Article No. 22, 2019.
Matsuzoe, S.; Jiang, S.; Ueki, M.; Okabayashi, K. Intuitive visualization method for locating off-screen objects inspired by motion perception in peripheral vision. In: Proceedings of the 8th Augmented Human International Conference, Article No. 29, 2017.
Kasahara, S.; Rekimoto, J. JackIn: Integrating first-person view with out-of-body vision generation for human-human augmentation. In: Proceedings of the 5th Augmented Human International Conference, Article No. 46, 2014.
Google Spotlight Stories. 360 Google Spotlight Stories: HELP. 2016. Available at https://www.youtube.com/watch?v=G-XZhKqQAHU
Corridor. 360 Wizard Battle. 2016. Available at https://youtu.be/bb5eETSspVI?si=Wayr9bbhRsVtrWSG
Iris. Invisible - Episode 5 - Into The Den. 2016. Available at https://youtu.be/qYxNCB678WQ?si=uJhsaetH-HytKyzY
The Rock. The Rock Presents: “Escape From Calypso Island” - A 360 VR Adventure. 2016. Available at https://youtu.be/G4w_MBMNMEQ?si=XGdQOCgb2-yy5XD8K
Google Spotlight Stories. Google Spotlight Stories: Special Delivery Trailer. 2015. Available at https://youtu.be/3QxZtQoAIOs?si=Wz2pRXtEvRwLr5E6
Google Spotlight Stories. 360 Google Doodles/Spotlight Stories: Back to the Moon. 2018. Available at https://youtu.be/BEePFpC9qG8?si=PxDQjkefXBOuUMd1
Sato, Y.; Sugano, Y.; Sugimoto, A.; Kuno, Y.; Koike, H. Sensing and controlling human gaze in daily living space for human-harmonized information environments. In: Human-Harmonized Information Technology, Volume 1. Nishida, T. Ed. Springer Tokyo, 199–237, 2016.
Tam, W. J.; Stelmach, L. B.; Corriveau, P. J. Psychovisual aspects of viewing stereoscopic video sequences. In: Proceedings of the SPIE 3295, Stereoscopic Displays and Virtual Reality Systems V, 226–235, 1998.
National Geographic. Lions 360°. 2017. Available at https://youtu.be/sPyAQQklc1s?si=ztk3XKDkXchZqTCn
Zhou, F.; Kang, S. B.; Cohen, M. F. Time-mapping using space-time saliency. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3358–3365, 2014.
Liu, C.; Yuen, J.; Torralba, A. SIFT flow: Dense correspondence across scenes and its applications. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 33, No. 5, 978–994, 2011.
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Project Number 62132012), the Beijing Science and Technology Program (Project Number Z221100007722001), and the Tsinghua–Tencent Joint Laboratory for Internet Innovation Technology.
Ethics declarations
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Yi-Xiao Li received her bachelor's degree in arts & design from Tsinghua University, Beijing, in 2020, and is currently pursuing her master's degree in the Academy of Arts & Design at the same university. Her research interests include human–computer interaction and virtual reality.
Guan Luo is currently a Ph.D. student in the Department of Computer Science and Technology, Tsinghua University, supervised by Song-Hai Zhang. His research interests include computer vision and virtual reality.
Yi-Ke Xu is currently an undergraduate student in the Department of Computer Science and Technology, Tsinghua University. His research interests include virtual reality and image/video processing.
Yu He received his doctoral degree from Zhejiang University of Technology in 2019. He completed his postdoctoral work in the Department of Computer Science and Technology at Tsinghua University in 2021. He is currently an assistant researcher at the Yanqi Lake Beijing Institute of Mathematical Sciences and Applications. His research interests include 3D vision and virtual reality.
Fang-Lue Zhang is currently a lecturer at Victoria University of Wellington, New Zealand. He received his bachelor's degree from Zhejiang University, Hangzhou, China, in 2009, and his doctoral degree from Tsinghua University, Beijing, China, in 2015. His research interests include image and video editing, computer vision, and computer graphics. He is a member of IEEE and ACM. He received the Victoria Early Career Research Excellence Award in 2019.
Song-Hai Zhang received his Ph.D. degree in computer science and technology from Tsinghua University, Beijing, in 2007. He is currently an associate professor in the Department of Computer Science and Technology at Tsinghua University. His research interests include virtual reality and image/video processing.
Electronic supplementary material
Supplementary material, approximately 18.0 MB.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.
About this article
Cite this article
Li, YX., Luo, G., Xu, YK. et al. AdaPIP: Adaptive picture-in-picture guidance for 360° film watching. Comp. Visual Media 10, 487–503 (2024). https://doi.org/10.1007/s41095-023-0347-3