Ensembling of text and images using Deep Convolutional Neural Networks for Intelligent Information Retrieval

Mahalakshmi, P.; Fatima, N. Sabiyath

doi:10.1007/s11277-021-08211-x

Published: 23 February 2021

Volume 127, pages 235–253, (2022)

Wireless Personal Communications


Abstract

Information retrieval (IR) is the process of searching an available resource pool and obtaining the information resources that match a specific information need. It is useful in several real-time application areas, such as digital libraries, healthcare, education, and internet browsing. Recently, deep learning (DL) models have become popular in image processing, object detection, and natural language processing. This paper therefore employs DL models to retrieve text and images efficiently, and presents an ensemble of DL-based IR models in which a separate model is developed for each modality. For image retrieval, a convolutional neural network, VGGNet-19, is used as a feature extractor, and similarity is measured with the Euclidean distance. For retrieval of textual documents, a bidirectional long short-term memory (BiLSTM) technique is applied. The presented BiLSTM model considers every word in a sentence sequentially, extracts its details, and embeds them into a semantic vector. In addition to feature extraction using deep learning techniques, the similarity measurement emphasizes the closeness of a document to the given query. The proposed retrieval system was tested on text and images for both the general domain and a specific domain (agriculture) using the Yahoo, Google, and Corel10K datasets. On these datasets, performance was computed with the standard measures of precision, recall, and F-score, and the proposed deep learning model produces better results than existing techniques. In the specific domain, the proposed model achieves 93% precision, 85% recall, and a 90% F-score, improving on the existing model.
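
The retrieval and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the feature vectors are hand-written toy values standing in for VGGNet-19 or BiLSTM outputs, and the names `rank_by_euclidean`, `precision_recall_f1`, `index`, and the `img_*` identifiers are hypothetical. The F-score is assumed here to be the balanced F1 (the harmonic mean of precision and recall), which the abstract does not specify.

```python
import math


def rank_by_euclidean(query_vec, index):
    """Return item ids sorted by ascending Euclidean distance to the query."""
    distances = {
        item_id: math.dist(query_vec, vec) for item_id, vec in index.items()
    }
    return sorted(distances, key=distances.get)


def precision_recall_f1(retrieved, relevant):
    """Compute set-based precision, recall, and balanced F1 for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy 3-dimensional feature vectors (assumed values, not real model output).
index = {
    "img_a": [0.9, 0.1, 0.0],
    "img_b": [0.8, 0.2, 0.1],
    "img_c": [0.0, 0.9, 0.4],
    "img_d": [0.1, 0.8, 0.5],
}
query = [1.0, 0.0, 0.0]

ranking = rank_by_euclidean(query, index)
top2 = ranking[:2]  # keep the two nearest items as the retrieved set

# Hypothetical ground truth for this query.
p, r, f = precision_recall_f1(top2, relevant={"img_a", "img_b", "img_d"})
print(ranking, p, r, f)
```

A smaller distance means a closer match, so the ranking is ascending. In a real system the vectors would come from the trained networks and the relevant set from labelled data.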

Author information

Authors and Affiliations

Department of Computer Science and Engineering, B.S.Abdur Rahman Crescent Institute of Science & Technology, Chennai, India
P. Mahalakshmi & N. Sabiyath Fatima
Department of Computer Science and Engineering, School of Computing, College of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, India
P. Mahalakshmi

Corresponding author

Correspondence to P. Mahalakshmi.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest regarding the submission and publication of this article in Wireless Personal Communications.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Mahalakshmi, P., Fatima, N.S. Ensembling of text and images using Deep Convolutional Neural Networks for Intelligent Information Retrieval. Wireless Pers Commun 127, 235–253 (2022). https://doi.org/10.1007/s11277-021-08211-x

Accepted: 29 January 2021
Published: 23 February 2021
Issue Date: November 2022
DOI: https://doi.org/10.1007/s11277-021-08211-x
