Attention-embedding mesh saliency

Liu, Cheng-ming; Luan, Wan-na; Fu, Rong-hua; Pang, Hai-bo; Li, Ying-hao

doi:10.1007/s00371-022-02444-y

Attention-embedding mesh saliency

Original article
Published: 11 May 2022

Volume 39, pages 1783–1795, (2023)
Cite this article

The Visual Computer Aims and scope Submit manuscript

Cheng-ming Liu ORCID: orcid.org/0000-0002-8650-4271¹,
Wan-na Luan¹,
Rong-hua Fu¹,
Hai-bo Pang¹ &
…
Ying-hao Li¹

3 Citations
1 Altmetric
Explore all metrics

Abstract

Recently, the learning method is gradually penetrating into the field of 3D saliency, but the ground truth annotation is too insufficient to directly train a 3D saliency network. Here, we propose a novel attention-embedding strategy for 3D saliency estimation by directly applying the attention embedding scheme to 3D mesh. With this method, the network is trained in a weakly supervised manner, requiring no saliency annotations but generalizing well on different categories of objects, such as animals, furniture, cars and people. Experimental results show that our approach is comparable with existing state-of-the-art methods. We also apply saliency results to mesh simplification. Evaluations on simplified models show that the visually significant parts can be retained during saliency-aware simplification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 9

MeT: mesh transformer with an edge

Article 14 July 2023

Visual saliency guided textured model simplification

Article 16 May 2015

Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation

References

Ba, J., Mnih, V., Kavukcuoglu, K.: Multiple object recognition with visual attention. arXiv preprint arXiv:1412.7755 (2014)
Bronstein, M.M., Bruna, J., LeCun, Y., Szlam, A., Vandergheynst, P.: Geometric deep learning: going beyond Euclidean data. IEEE Signal Process. Mag. 34(4), 18–42 (2017)
Article Google Scholar
Castellani, U., Cristani, M., Fantoni, S., Murino, V.: Sparse points matching by combining 3d mesh saliency with statistical descriptors. In: Computer Graphics Forum, vol. 27, pp. 643–652. Wiley Online Library (2008)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. Computer Science, pp. 1–14 (2014)
Chen, L., Zhang, H., Xiao, J., Nie, L., Shao, J., Liu, W., Chua, T.S.: SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5659–5667 (2017)
Chen, S., Tan, X., Wang, B., Hu, X.: Reverse attention for salient object detection. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 234–250 (2018)
Chen, X., Ma, H., Wan, J., Li, B., Xia, T.: Multi-view 3d object detection network for autonomous driving. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1907–1915 (2017)
Chen, X., Saparov, A., Pang, B., Funkhouser, T.: Schelling points on 3d surface meshes. ACM Trans. Graph. 31(4CD), 29.1-29.12 (2012)
Google Scholar
Cignoni, P., Rocchini, C., Scopigno, R.: Metro: measuring error on simplified surfaces. Comput. Graph. Forum 17(2), 167–174 (2010)
Article Google Scholar
Cong, R., Lei, J., Zhang, C., Huang, Q., Cao, X., Hou, C.: Saliency detection for stereoscopic images based on depth confidence analysis and multiple cues fusion. IEEE Signal Process. Lett. 23(6), 819–823 (2016)
Article Google Scholar
Ding, X., Lin, W., Chen, Z., Zhang, X.: Point cloud saliency detection by local and global feature fusion. IEEE Trans. Image Process. 28(11), 5379–5393 (2019)
Article MathSciNet MATH Google Scholar
Dutagaci, H., Cheung, C.P., Godil, A.: Evaluation of 3d interest point detection techniques via human-generated ground truth. Vis. Comput. 28(9), 901–917 (2012)
Article Google Scholar
Engelmann, F., Kontogianni, T., Leibe, B.: Dilated point convolutions: on the receptive field size of point convolutions on 3d point clouds. In: International Conference on Robotics and Automation (ICRA), vol. 1 (2020)
Gal, R., Cohen-Or, D.: Salient geometric features for partial shape matching and similarity. ACM Trans. Graph. (TOG) 25(1), 130–150 (2006)
Article Google Scholar
Garland, M., Heckbert, P.S.: Surface simplification using quadric error metrics. ACM Siggraph Comput. Graph. 1997, 209–216 (1997)
Google Scholar
Guo, F., Shen, J., Li, X.: Learning to detect stereo saliency. In: 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2014)
Hamann, B.: A data reduction scheme for triangulated surfaces. Comput. Aided Geom. Des. 11(2), 197–214 (1994)
Article MathSciNet MATH Google Scholar
Hanocka, R., Hertz, A., Fish, N., Giryes, R., Fleishman, S., Cohen-Or, D.: Meshcnn: a network with an edge. ACM Trans. Graph. (TOG) 38(4), 1–12 (2019)
Article Google Scholar
Hoppe, H.: Mesh optimization. In: Conference on Computer Graphics & Interactive Techniques (1993)
Hou, T., Qin, H.: Admissible diffusion wavelets and their applications in space-frequency processing. IEEE Trans. Vis. Comput. Graph. 19(1), 3–15 (2012)
Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Hu, S., Liang, X., Shum, H.P., Li, F.W., Aslam, N.: Sparse metric-based mesh saliency. Neurocomputing 400, 11–23 (2020)
Article Google Scholar
Hua, B.S., Tran, M.K., Yeung, S.K.: Pointwise convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 984–993 (2018)
Huang, J., You, S.: Point cloud labeling using 3d convolutional neural network. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 2670–2675. IEEE (2016)
Jeong, S.W., Sim, J.Y.: Saliency detection for 3d surface geometry using semi-regular meshes. IEEE Trans. Multimed. 19(12), 2692–2705 (2017)
Article Google Scholar
Koch, C., Poggio, T.: Predicting the visual world: silence is golden. Nat. Neurosci. 2(1), 9–10 (1999)
Article Google Scholar
Komarichev, A., Zhong, Z., Hua, J.: A-CNN: annularly convolutional neural networks on point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7421–7430 (2019)
Lahav, A., Tal, A.: Meshwalker: deep mesh understanding by random walks. arXiv preprint arXiv:2006.05353 (2020)
Lan, S., Yu, R., Yu, G., Davis, L.S.: Modeling local geometric structure of 3d point clouds using geo-CNN. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019)
Lee, C.H., Varshney, A., Jacobs, D.W.: Mesh saliency. In: ACM SIGGRAPH 2005 Papers, pp. 659–666 (2005)
Leifman, G., Shtrom, E., Tal, A.: Surface regions of interest for viewpoint selection. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (2012)
Limper, M., Kuijper, A., Fellner, D.W.: Mesh saliency analysis via local curvature entropy. In: Eurographics (Short Papers), pp. 13–16 (2016)
Liu, F., Wen, Y., Zhang, D., Jiang, X., Xing, X., Meng, D.: Log2vec: a heterogeneous graph embedding based approach for detecting cyber threats within enterprise. In: Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, pp. 1777–1794 (2019)
Low, K.L., Tan, T.S.: Model simplification using vertex-clustering. In: Symposium on Interactive 3d Graphics (1997)
Mnih, V., Heess, N., Graves, A., Kavukcuoglu, K.: Recurrent models of visual attention. arXiv preprint arXiv:1406.6247 (2014)
Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: ICML (2010)
Nousias, S., Arvanitis, G., Lalos, A.S., Moustakas, K.: Mesh saliency detection using convolutional neural networks. In: 2020 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2020)
Papon, J., Abramov, A., Schoeler, M., Worgotter, F.: Voxel cloud connectivity segmentation-supervoxels for point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2027–2034 (2013)
Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 652–660 (2017)
Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems, pp. 5099–5108 (2017)
Schroeder, W.J., Zarge, J.A., Lorensen, W.E.: Decimation of triangle meshes. ACM Siggraph Comput. Graph. 26(2), 65–70 (1992)
Article Google Scholar
Sibson, R.: A brief description of natural neighbor interpolation. In: Barnett, V. (ed.) Interpreting Multivariate Data. Wiley, New York, pp. 21–36 (1981)
Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: Workshop at International Conference on Learning Representations. Citeseer (2014)
Song, R., Liu, Y., Martin, R.R., Echavarria, K.R.: Local-to-global mesh saliency. Vis. Comput. 34(3), 323–336 (2018)
Article Google Scholar
Song, R., Liu, Y., Martin, R.R., Rosin, P.L.: Mesh saliency via spectral processing. ACM Trans. Graph. (TOG) 33(1), 1–17 (2014)
Article MATH Google Scholar
Song, R., Liu, Y., Rosin, P.: Mesh saliency via weakly supervised classification-for-saliency CNN. IEEE Trans. Vis. Comput. Graph. 27(1), 151–164 (2019)
Article Google Scholar
Tao, P., Cao, J., Li, S., Liu, X., Liu, L.: Mesh saliency via ranking unsalient patches in a descriptor space. Comput. Graph. 46, 264–274 (2015)
Article Google Scholar
Tatarchenko, M., Park, J., Koltun, V., Zhou, Q.Y.: Tangent convolutions for dense prediction in 3d. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3887–3896 (2018)
Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., Guibas, L.J.: Kpconv: flexible and deformable convolution for point clouds. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 6411–6420 (2019)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Wang, Y., Sun, Y., Liu, Z., Sarma, S.E., Bronstein, M.M., Solomon, J.M.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graph. (TOG) 38(5), 1–12 (2019)
Article Google Scholar
Wei, N., Gao, K., Ji, R., Chen, P.: Surface saliency detection based on curvature co-occurrence histograms. IEEE Access 6, 54536–54541 (2018)
Article Google Scholar
Wolfe, J.M.: Guided search 2.0 a revised model of visual search. Psychon. Bull. Rev. 1(2), 202–238 (1994)
Article Google Scholar
Woo, S., Park, J., Lee, J.Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018)
Wu, J., Shen, X., Zhu, W., Liu, L.: Mesh saliency with global rarity. Graph. Models 75(5), 255–264 (2013)
Article Google Scholar
Wu, W., Qi, Z., Fuxin, L.: Pointconv: deep convolutional networks on 3d point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9621–9630 (2019)
Xi, W., Koch, S., Holmqvist, K., Alexa, M.: Tracking the gaze on objects in 3d: how do people really look at the bunny? In: SIGGRAPH Asia 2018 Technical Papers (2018)
Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. In: International Conference on Machine Learning, pp. 2048–2057. PMLR (2015)
Zhao, H., Jiang, L., Fu, C.W., Jia, J.: Pointweb: enhancing local neighborhood features for point cloud processing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5565–5573 (2019)
Zheng, T., Chen, C., Yuan, J., Li, B., Ren, K.: Pointcloud saliency maps. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1598–1606 (2019)

Download references

Acknowledgements

We thank all the anonymous reviewers for their valuable comments. This work was supported by the National Key Research and Development Program of China (2020YFB1712401, 2018YFC0824402).

Author information

Authors and Affiliations

School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou, 450002, China
Cheng-ming Liu, Wan-na Luan, Rong-hua Fu, Hai-bo Pang & Ying-hao Li

Authors

Cheng-ming Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wan-na Luan
View author publications
You can also search for this author in PubMed Google Scholar
Rong-hua Fu
View author publications
You can also search for this author in PubMed Google Scholar
Hai-bo Pang
View author publications
You can also search for this author in PubMed Google Scholar
Ying-hao Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hai-bo Pang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, Cm., Luan, Wn., Fu, Rh. et al. Attention-embedding mesh saliency. Vis Comput 39, 1783–1795 (2023). https://doi.org/10.1007/s00371-022-02444-y

Download citation

Accepted: 20 February 2022
Published: 11 May 2022
Issue Date: May 2023
DOI: https://doi.org/10.1007/s00371-022-02444-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Attention-embedding mesh saliency

Abstract

Access this article

Similar content being viewed by others

MeT: mesh transformer with an edge

Visual saliency guided textured model simplification

Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Attention-embedding mesh saliency

Abstract

Access this article

Similar content being viewed by others

MeT: mesh transformer with an edge

Visual saliency guided textured model simplification

Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation