Abstract
Hash learning has been highly successful in large-scale data retrieval because of its retrieval efficiency and low storage consumption. However, labels for large-scale data are difficult to obtain, so supervised hashing methods are often inapplicable. In this paper, we introduce Semi-Supervised Semantic Adaptive Cross-modal Hashing (S3ACH), which improves the performance of unsupervised hash retrieval by exploiting a small amount of available label information. Specifically, we first propose a higher-order dynamic-weight collaborative computing method for the common space, which balances the contributions of different modalities in the common latent space by introducing adaptive higher-order dynamic variables. Then, the limited label information available is used to enhance the semantics of the hash codes. Finally, we propose a discrete optimization strategy that addresses the quantization error introduced by relaxation and improves the accuracy of the generated hash codes. The results show that S3ACH outperforms current advanced unsupervised methods and is more widely applicable than existing cross-modal hashing methods while maintaining balanced performance.
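To make the terms in the abstract concrete, the following is a minimal sketch and not the S3ACH algorithm itself. It assumes a common multi-view formulation in which modality weights w_m minimize sum_m w_m^r * loss_m subject to sum_m w_m = 1 with r > 1; the abstract does not state the exact objective. It also shows sign binarization of relaxed codes and Hamming-distance ranking. All function names and parameters (`adaptive_weights`, `binarize`, `hamming_distance`, `r`) are hypothetical.

```python
import numpy as np

def adaptive_weights(losses, r=2.0):
    """Closed-form minimizer of sum_m w_m**r * loss_m with sum(w) = 1, w >= 0, r > 1.

    Assumed formulation for illustration; modalities with a smaller
    loss receive a larger weight.
    """
    losses = np.asarray(losses, dtype=float)
    inv = (1.0 / losses) ** (1.0 / (r - 1.0))
    return inv / inv.sum()

def binarize(real_codes):
    """Map relaxed real-valued codes to {-1, +1}; zeros are sent to +1."""
    return np.where(np.asarray(real_codes) >= 0, 1, -1).astype(np.int8)

def hamming_distance(query_code, database_codes):
    """Hamming distance between one {-1, +1} code and each row of a code matrix."""
    n_bits = database_codes.shape[1]
    inner = database_codes.astype(int) @ query_code.astype(int)
    return (n_bits - inner) // 2

# Hypothetical per-modality reconstruction losses (image, text).
weights = adaptive_weights([1.0, 3.0], r=2.0)

# Quantization error: the gap between relaxed codes and their binarized version.
relaxed = np.array([[0.9, -0.2, 0.1, -0.8]])
codes = binarize(relaxed)
quantization_error = np.sum((relaxed - codes) ** 2)

# Rank database items by Hamming distance to the query code.
database = np.array([[1, -1, 1, -1], [-1, 1, -1, 1], [1, 1, 1, -1]], dtype=np.int8)
distances = hamming_distance(codes[0], database)
```

Entries of `relaxed` that lie far from ±1 dominate `quantization_error`, which is the kind of gap a discrete optimization strategy avoids by solving for binary codes directly rather than rounding a relaxed solution.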
Acknowledgements
This work is supported in part by the National Natural Science Foundation of China under Grants No. 62202501, No. 62172451 and No. U2003208, in part by the National Key R&D Program of China under Grant No. 2021YFB3900902, in part by the Science and Technology Plan of Hunan Province under Grant No. 2022JJ40638, and in part by the Open Research Projects of Zhejiang Lab under Grant No. 2022KG0AB01.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yang, L., Zhang, K., Li, Y., Chen, Y., Long, J., Yang, Z. (2024). S3ACH: Semi-Supervised Semantic Adaptive Cross-Modal Hashing. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Lecture Notes in Computer Science, vol 14450. Springer, Singapore. https://doi.org/10.1007/978-981-99-8070-3_20
DOI: https://doi.org/10.1007/978-981-99-8070-3_20
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8069-7
Online ISBN: 978-981-99-8070-3
eBook Packages: Computer Science, Computer Science (R0)