Introduction

Content-based medical image retrieval (CBMIR) is, according to recent studies, one of the most active research domains in medical image processing. This is driven by the expanding and accelerating use of medical imaging techniques such as ultrasound (US), MR, X-ray, and CT. CBMIR systems are built around image similarity: the user submits a query image, and the system retrieves database images ranked by similarity in descending order. The two fundamental steps of every CBMIR technique are feature extraction (offline phase) and similarity measurement (online phase) [1,2,3]. The main architecture of a CBMIR system is shown in Fig. 1. Many upgrades have been proposed to improve the effectiveness and retrieval performance of CBMIR systems, at either the pre-processing or the feature extraction stage [4, 5]. The extensive medical image retrieval literature shows that texture-based features are well accepted and popular among researchers worldwide [6,7,8,9,10]. However, medical imaging grows more sophisticated over time as it attempts to capture as much information about the patient's anatomy as possible, so a powerful CBMIR system based solely on texture is insufficient. It is therefore necessary to build a multi-dimensional medical image retrieval system that combines several kinds of information, for example, texture, edge, and shape. Comparing a query image against the database images to determine their similarity and find matching pairs is a fundamental component of any CBMIR system [11, 12].

Traditional methods rely on low-level features extracted from medical images, such as color, texture, shape, and spatial structure. These low-level features often do not accurately reflect the semantic notions in the images, and using them for retrieval usually yields unsatisfactory results. With the rapid advancement of deep learning, features from pre-trained deep convolutional neural network (DCNN) models have recently achieved superior performance and flexibility compared to classical descriptors in common vision applications such as image retrieval and object recognition. These features carry rich semantic information, which is crucial for improving the precision of image retrieval.

Fig. 1 The CBMIR main architecture

With all of this in mind, in this paper the RbQE approach is used to demonstrate an effective way to retrieve CT and MR images. RbQE expands the query image by reformulating it based on the mean value of the top-ranked images from each class; this expansion is fully automated. The RbQE method benefits from pre-trained DCNNs (AlexNet and VGG-19) as extractors of compact, high-level features that are robust to image noise, so that medical images are well represented and high accuracy is achieved. The main contributions of our paper are summarized as follows:

  1. We propose an efficient RbQE medical image retrieval method that expands the query in a new, fully automated way.

  2. We use pre-trained deep convolutional neural networks (AlexNet and VGG-19) as feature extractors that describe and represent medical images with complex, high-level features, which withstand external interferences such as changes in lighting, noise, rotation, and blurring.

  3. We carry out extensive tests comparing the performance of the proposed method (RbQE with DCNN features) with existing and modern methods, demonstrating that the proposed method exceeds all of them in retrieving medical images.

The remainder of the paper is arranged as follows. The literature review is given in the "Literature Review" section. The proposed method is described in the "Proposed Method" section. The "Experimental Framework" section describes the experimental framework used to evaluate the retrieval performance of the proposed and comparative methods. The experimental results of the proposed method and all comparative methods on four standard medical databases are presented in the "Experimental Results" section. Finally, conclusions are presented in the "Conclusion" section.

Literature Review

Content-based medical image retrieval (CBMIR) technology plays a very important role in medical image analysis, where existing CBMIR systems index and retrieve medical images using traditional visual indicators to represent all images in the database. Standard descriptors of visual content include texture, edge, color histogram, shape, and a large number of variants. Feature extraction has long been a critical aspect of accurate medical image retrieval, and CBMIR strives to remove redundant information by reducing the dimensionality of the image data [13, 14].

Medical images are available in different formats, such as CT and MRI images. The authors of [15] proposed a co-occurrence-based method for CT and MRI images built on local feature descriptors. The authors in [16] present a robust and fast MRI retrieval system for brain images. A powerful textural descriptor, the local binary pattern (LBP), was proposed in [17]; the LBP feature vector is created from the intensity of each pixel relative to its neighbors. The authors in [18] use LBP, joint LBP, and histograms of image intensity for a quantitative analysis of pulmonary emphysema in CT. In [19], feature extraction was principally used to evaluate chest CT images on the basis of structure and local brightness. The authors in [20] presented the local ternary co-occurrence pattern (LTCoP). In [21], a biomedical image retrieval approach known as the local mesh pattern (LMeP) was proposed, and the approach in [22] established the LMePVEP algorithm.

For high-level feature descriptors, the authors in [23, 24] present studies on efficient image retrieval using convolutional neural networks (CNNs), which have been widely used in machine learning applications. The authors in [25] proposed a medical image retrieval system based on local bit plane decoded AlexNet descriptors (LBpDAD), which combines the benefits of local bit plane decoding with features produced by a neural network such as AlexNet. The authors in [26] introduced the histogram of compressed scattering coefficients (HCSCs) method, creating a new feature that employs the scattering transform and a particular form of deep network to describe the textural content of CT images. Furthermore, [27] proposed an integrated scattering feature based on two separate forms of compressed scattering data: data concentration and canonical correlation analysis (CCA). The authors presented an image reconstruction network (IR-Net) in [28], where the input image is encoded into a set of features and then reconstructed from the encoded features.

When it comes to query expansion, the expansion benefits from the label data of the top-ranked images that are obtained and saved in a feedback session. There have been numerous successful expansion techniques in the literature, based on local, global, and CNN features, and a recent review of query expansion in data retrieval is given in [29]. The authors in [30] break the expansion model into two components: offline and online retrieval. In the offline procedure, the Laplacian score method is generalized for computation, while in the online retrieval component the query is classified according to the feature scores of the relevant items in the database. Finally, the original query is replaced by a set of first-page results. Their tests with image sets and single objects far outperformed competing methods.

The authors in [31] introduced a query expansion approach in which a pre-trained CNN model is used, treating the convolutional layer's learned filters as visual word detectors. Combined with geometric verification, query expansion techniques are particularly effective when top-relevant images are used to expand the query-relevant features into eventual successful and valid matches, as shown in [32].

The authors of [33] extended automatic query expansion by proposing three improvements: spatial verification was strengthened, re-ranking was performed by reflecting the previously evaluated results, and an approach was suggested that expands the query by integrating matching features outside the original query boundary using spatial context. In a more recent study [34], query expansion was treated as a discriminative learning problem in which an aggregation model is learned with supervision; the proposed LAttQE model shares information between the top-ranked items and the query through attention. Top-ranked techniques are increasingly used, for example, in building a framework for multimodal query expansion through user interaction methods [35].

The authors in [36] achieved a significant level of accuracy in the retrieval of MRI and histopathological images by introducing an expansion approach for features extracted by pre-trained Residual Networks (ResNets).

Proposed Method

The proposed method has two important parts, which are shown in Fig. 2. The first part is a strong feature extractor, and the second is an efficient matching and retrieval method for medical images. Accordingly, two deep feature extractors and the RbQE (retrieval based on query expansion) method are used in the proposed method. Based on pre-trained models, the deep feature extractor extracts compact, high-level features to represent all images in the medical database. There are two aims in using a deep neural network instead of raw pixels in the analysis of medical images. The first is to extract invariant features, which are more robust against interferences such as noise and lighting changes that appear during the generation of medical images. The second is that the deep feature extractor does not need to be retrained, provided it was trained offline on a huge image database, even when analyzing various types and formats of medical images. Consequently, the chosen deep model can dramatically increase computational efficiency and lower calculation costs in comparison to other retrieval systems that also use deep models.

Fig. 2 Illustration of the RbQE method

In addition, the RbQE method is used to improve matching and retrieval in CBMIR by expanding the deep features of the original query and constructing a new query. The RbQE method relies on two search processes: a rapid search and a final search. First, a rapid search of the database retrieves the top-ranked images for the original query from each database class, and a new query expansion (NQE) is formed for each class. Second, in the final search, the NQE most similar to the original query is selected and used as the final new query expansion (FNQE), which is one of the main benefits of our proposed method. The next subsections provide more information on these feature extractors and the query expansion method.

Deep Feature Extraction

We use robust and efficient deep features to extract more discriminative, high-level representations of medical images, thereby minimizing the interference problem. Deep learning has gained enormous popularity recently, with promising applications in a variety of areas [37]. Although numerous architectures have been proposed and put into practice, the basic idea behind deep learning has not changed: it is a feature representation learning approach that operates on huge amounts of unprocessed image data and can exploit different levels of representation. Computational models with many processing layers L (\(L>1\)) learn data representations at multiple levels of abstraction: after the input layer, each layer transforms the representation of the preceding layer into a more abstract one. In this way, complex structures can be obtained indirectly from large-format imagery and, ideally, used to reconstruct the original image or the query image after the most distinctive variations have been learned layer by layer.

In this paper, two types of supervised CNN models are used as deep neural networks for medical image retrieval. A CNN is a form of neural network designed to process images and capture the local features they contain. To work with high-resolution images, a CNN relies on three properties. First, each convolution kernel covers only a small spatial region but extends through the full depth, so that even a small kernel can distinguish between different images. Second, since each convolutional feature map uses the same convolution kernel, the same deep features can be detected at different locations in the input image. Finally, by subsampling from the convolutional layer to the pooling layer, the image's dimensionality is reduced and computing efficiency is increased. Figure 3 shows one pre-trained DCNN model (AlexNet) and Fig. 4 shows another (VGG-19); both were trained offline on the ImageNet database [38], which contains millions of labelled images.

Fig. 3 The pre-trained CNN (AlexNet) on the ImageNet database

Significantly deeper neural networks are not well suited to medical image processing, because the small differences between similar biomedical images are difficult to capture with high-level features: the small disparity disappears with greater abstraction. Yet such small differences are particularly essential in biomedical images and can be used precisely to discriminate between biomedical images of several types, such as those in the OASIS-MRI database used in our research. As shown in Fig. 3, AlexNet, which is inspired by biological processes in which an object is recognized step by step from the low level to the semantic level, is typically composed of four key components. First, the convolutional layers, the greatest highlight of AlexNet, connect each unit to a limited region of the input through a convolutional kernel, much like receptive fields in the human visual system. Second, the convolutional layers are usually followed by activation functions; the ReLU (rectified linear unit) activation is used to extract more complicated features from the input signals. Third, the pooling layers reduce the dimensionality of the feature maps and decrease the sensitivity of the convolutional layers. Finally, fully connected layers at the end of the AlexNet structure combine the features into a feature vector and provide the prediction result. In the AlexNet training procedure, the backpropagation approach minimizes the loss between the predictions and the ground truth until the error loss is considerably reduced or a certain number of iterations has been completed. We use the trained AlexNet as an extractor of biomedical features, taking the output of fully connected layer-6 (fc6); various studies have shown that layer-6 features are more efficient than layer-7 features in biomedical image processing [39,40,41,42,43].
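As a minimal sketch of this feature extraction step (assuming a recent PyTorch/torchvision, which the paper itself does not mention), the 4096-dimensional fc6 activations can be read out by truncating the classifier of a pre-trained AlexNet; the layer indices below follow torchvision's AlexNet definition, where classifier[1] is the first fully connected layer (fc6):

```python
# Sketch: fc6 (4096-D) deep features from a pre-trained AlexNet, assuming torchvision.
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

alexnet = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1)
alexnet.eval()

# Keep the classifier only up to (and including) fc6 and its ReLU.
fc6_extractor = torch.nn.Sequential(
    alexnet.features,
    alexnet.avgpool,
    torch.nn.Flatten(),
    *list(alexnet.classifier.children())[:3],  # Dropout, Linear(9216->4096), ReLU
)

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def alexnet_fc6(image_path: str) -> torch.Tensor:
    """Return the 4096-D deep feature vector for one medical image (converted to RGB)."""
    img = preprocess(Image.open(image_path).convert("RGB")).unsqueeze(0)
    with torch.no_grad():
        return fc6_extractor(img).squeeze(0)   # shape: (4096,)
```

The same extractor is applied offline to every database image so that only a single forward pass per query is needed online.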

Fig. 4 The pre-trained DCNN (VGG-19) on the ImageNet database

Very deep convolutional networks with up to 19 weight layers (VGG-19) are also employed as a feature extractor in our paper. VGG-19 contains 16 convolutional layers and 3 fully connected (FC) layers, as shown in Fig. 4, and the number of channels is quite small, starting at 64 in the first layer and doubling after each max-pooling layer until it reaches 512. In this network, the image passes through a stack of convolutional layers whose filters have a very narrow \(3 \times 3\) receptive field, the smallest size that captures the notion of up/down, left/right, and center. The convolution stride is one pixel, and the spatial padding of the \(3 \times 3\) convolutional layers is 1 pixel. There are five max-pooling layers, each performed over a \(2 \times 2\) pixel window with stride 2. Three FC layers follow the stack of convolutional layers: the first two (FC1, FC2) have 4096 channels (features) each, the third (FC3) has 1000 channels (features), and the final layer is a soft-max layer. Here we utilize the first fully connected layer (FC1) of VGG-19 as the feature vector extractor; various studies show that FC1 features are more efficient than FC2 features in biomedical image processing, except on the TCIA-CT database, where FC2 features achieved higher accuracy than FC1 features.
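Analogously, a short sketch (same torchvision assumption and preprocessing as for AlexNet above) of reading out VGG-19's FC1 activations; in torchvision's VGG-19, classifier[0] is FC1 (Linear 25088 to 4096) followed by a ReLU:

```python
# Sketch: FC1 (4096-D) deep features from a pre-trained VGG-19, assuming torchvision.
import torch
import torchvision.models as models

vgg19 = models.vgg19(weights=models.VGG19_Weights.IMAGENET1K_V1)
vgg19.eval()

fc1_extractor = torch.nn.Sequential(
    vgg19.features,                          # 16 conv layers interleaved with 5 max-pool layers
    vgg19.avgpool,
    torch.nn.Flatten(),
    *list(vgg19.classifier.children())[:2],  # Linear(25088->4096), ReLU  => FC1
)

def vgg19_fc1(batch: torch.Tensor) -> torch.Tensor:
    """batch: (N, 3, 224, 224) normalized images -> (N, 4096) deep feature vectors."""
    with torch.no_grad():
        return fc1_extractor(batch)
```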

Query Expansion Method

As shown in Fig. 2, which illustrates the complete idea of expanding the deep features of the original query and reformulating a new query for the final search, the RbQE technique uses the mean values of the deep features of the top-ranked images retrieved after a rapid search, in which a "Query with 4096 Deep Features" (QDF) of the original query is compared with the deep features of all images in the database. From each class in the database, the top ten images most similar to the original query are retrieved, and the mean of the deep features over these ten images is calculated. This process produces a number of NQEs equal to the number of classes in the database. The NQE most similar to the original query is then taken as the final NQE (FNQE), and the FNQE is used for the final search. Table 1 provides a simple numerical example of building an NQE, where the feature vector dimension for each image is 4096 for both AlexNet and VGG-19. Figure 5 illustrates the proposed algorithm for the RbQE method.

Fig. 5 RbQE method algorithm

Table 1 NQE based on mean values
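To make the flow concrete, the following sketch (NumPy, with hypothetical variable names; it assumes the 4096-D deep features of all database images and their class labels have already been extracted as described above) builds the per-class NQEs from the top 10 rapid-search results, selects the FNQE, and runs the final search:

```python
# Minimal NumPy sketch of the RbQE search, under the assumptions stated above.
import numpy as np

def euclidean(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Euclidean distance between vector a and each row of matrix b (Eq. 1)."""
    return np.sqrt(((b - a) ** 2).sum(axis=1))

def rbqe_retrieve(query, feats, labels, top_k=10):
    """query: (4096,) deep features; feats: (N, 4096); labels: (N,) class ids."""
    # Rapid search: per class, average the deep features of the top_k closest images (NQE).
    nqes = {}
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        order = np.argsort(euclidean(query, feats[idx]))[:top_k]
        nqes[c] = feats[idx[order]].mean(axis=0)       # new query expansion for class c

    # FNQE: the NQE closest to the original query.
    best_class = min(nqes, key=lambda c: np.linalg.norm(query - nqes[c]))
    fnqe = nqes[best_class]

    # Final search: rank the whole database against the FNQE.
    ranking = np.argsort(euclidean(fnqe, feats))
    return ranking[:top_k], best_class
```

The expansion is thus fully automatic: no relevance feedback from the user is needed between the rapid and final searches.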

Experimental Framework

This section presents the computational methods used to compare the performance of the proposed method with other modern retrieval methods. The name and abbreviations of all methods used for comparison with the proposed method are presented in Table 2.

Table 2 Name and Abbreviations of all methods used in the comparison

Image Similarity Estimation

Similarity values for both the rapid and the final search are calculated with the Euclidean distance (ED). Let X = (\(x_{1}\), \(x_{2}\),..., \(x_{n}\)) and Y = (\(y_{1}\), \(y_{2}\),..., \(y_{n}\)) be two feature vectors of dimension n; the similarity is computed as follows:

$$\begin{aligned} ED \left( X,Y\right) =\sqrt{\sum \limits _{i=1}^{n} \left( x_{i}-y_{i}\right) ^2} \end{aligned}$$
(1)
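As a small illustration of Eq. (1) (assuming NumPy and SciPy, which are not part of the original paper), the same formula can be applied to a single pair of vectors or, via `cdist`, to all query/database pairs at once:

```python
# Sketch of Eq. (1): Euclidean distance for one pair and for all pairs at once.
import numpy as np
from scipy.spatial.distance import cdist

X = np.array([[1.0, 2.0, 3.0]])             # one query feature vector
Y = np.array([[4.0, 6.0, 3.0],
              [1.0, 2.0, 3.0]])             # two database feature vectors
print(np.sqrt(((X[0] - Y[0]) ** 2).sum()))  # Eq. (1) directly: 5.0
print(cdist(X, Y, metric="euclidean"))      # same formula for all pairs: [[5. 0.]]
```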

Performance Estimation

In the experiments, every image in the database is used as a query, and a retrieved image is relevant only if it belongs to the same category as the query. Average Retrieval Precision (ARP), Average Retrieval Rate (ARR), and \(F_{score}\) are the three performance metrics used to evaluate each retrieval strategy.

$$\begin{aligned} precision:P(q)=\frac{\mathrm{Number\;of\;relevant\;images\;retrieved}}{\mathrm{Number\;of\;images\;retrieved}} \end{aligned}$$
(2)
$$\begin{aligned} recall:R(q)=\frac{\mathrm {Number\;of\;relevant\;images\;retrieved}}{\mathrm{Number\;of\;relevant\;images\;in\;the\;database}} \end{aligned}$$
(3)
$$\begin{aligned} ARP (\%)=\frac{100}{\mid DB \mid }\sum \limits _{i=1}^{\mid DB \mid } P(I_i) \end{aligned}$$
(4)
$$\begin{aligned} ARR (\%)=\frac{100}{\mid DB \mid }\sum \limits _{i=1}^{\mid DB \mid } R(I_i) \end{aligned}$$
(5)
$$\begin{aligned} F_{score} (\%)=\frac{2 \times \text {ARP}\times \text {ARR}}{\text {ARP}+ \text {ARR}} \end{aligned}$$
(6)

where \(\mid DB \mid\) indicates the count of all database images.
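A compact sketch of Eqs. (2)-(6) (NumPy, hypothetical argument names; it assumes the class labels of the top-k retrieved images are known for every query) could look as follows:

```python
# Sketch of ARP, ARR and F_score (Eqs. 2-6) from top-k retrieval results.
import numpy as np

def arp_arr_fscore(retrieved_labels, query_labels, class_sizes, k=10):
    """retrieved_labels: (|DB|, k) labels of the top-k results for each query image;
    query_labels: (|DB|,) true label of each query;
    class_sizes: dict mapping a label to the number of images of that class in the database."""
    precisions, recalls = [], []
    for ret, q in zip(retrieved_labels, query_labels):
        relevant = np.sum(ret[:k] == q)
        precisions.append(relevant / k)                # Eq. (2)
        recalls.append(relevant / class_sizes[q])      # Eq. (3)
    arp = 100.0 * np.mean(precisions)                  # Eq. (4)
    arr = 100.0 * np.mean(recalls)                     # Eq. (5)
    f = 2 * arp * arr / (arp + arr)                    # Eq. (6)
    return arp, arr, f
```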

Image Model Databases

Experiments were carried out on four publicly available image databases with different formats to test the performance of the RbQE method: the TCIA-CT database [53], the EXACT09-CT database [54], and the NEMA-CT database [55] for CT image retrieval, and the OASIS-MRI database [56] for MRI image retrieval. Figures 6, 7, 8, and 9 show sample images from each class of these databases, respectively. The four databases are summarized in Table 3 in terms of number of images, image size, number of classes, and number of images per class.

Table 3 Databases summary used in the experimental framework
Fig. 6 Sample images from each class of TCIA-CT database

Fig. 7 Sample images from each class of EXACT09-CT database

Fig. 8 Sample images from each class of NEMA-CT database

Fig. 9 Sample images from each class of OASIS-MRI database

Experimental Results

This section includes several experiments that demonstrate the efficacy of the proposed RbQE method and compare its results to those of the existing methods listed in Table 2. The RbQE method applies two different search techniques: a rapid search over each database class using one query image selected from the database's image collection, where every image in the database in turn serves as a query, followed by a final search using the final NQE (FNQE). Note that all searches are automated, without user participation or suggestion, which is a strong point. The performance of the proposed method is compared with that of modern methods, whether deep learning-based or not.

Retrieval Performance on TCIA-CT Database

The performance of the RbQE method on the TCIA-CT database was evaluated using two feature extractors, AlexNet and VGG-19, in addition to VGG-16, to demonstrate that VGG-19 with RbQE outperforms VGG-16 with RbQE. The retrieval results are shown in Table 4 in terms of ARP, ARR, and \(F_{score}\). When compared with other methods, the proposed RbQE method with VGG-19 performs best on the top 10 images. In terms of ARP, ARR, and \(F_{score}\), the proposed method outperforms \(ST-CCA_v\) by 0.84%, 0.16%, and 0.27%, respectively. Figure 10 exhibits the query outcomes of the RbQE method with VGG-19 features on the TCIA-CT database, showing that all top 10 retrieved images belong to the same class as the query image.

Fig. 10 Retrieved images for a TCIA-CT database query using RbQE with VGG-19

Table 4 Performance of different methods on TCIA-CT database with the top 10 matches considered

Retrieval Performance on EXACT09-CT Database

The comparison methods used in the "Retrieval Performance on TCIA-CT Database" section are also considered here and evaluated with the same experimental parameters as for the TCIA-CT database. The retrieval results of RbQE with the different feature extractors are shown in Table 5. In comparison to other methods, the AlexNet features with the RbQE method achieve the highest performance on the top 10 images; the result of RbQE with VGG-16 exceeds RbQE with VGG-19 only on this database, while all descriptors with the RbQE method exceed the \(ST-CCA_{v}\) method. Compared to \(ST-CCA_{v}\), AlexNet with the RbQE method improves ARP, ARR, and \(F_{score}\) by 4.86%, 1.64%, and 2.47%, respectively. The top 10 images obtained using the RbQE search technique with AlexNet features are shown in Fig. 11.

Fig. 11 Retrieved images for an EXACT09-CT database query using RbQE with AlexNet

Table 5 Performance of different methods on EXACT09-CT database with the top 10 matches considered

Retrieval Performance on NEMA-CT Database

We also use the NEMA-CT database to evaluate the performance of RbQE with different feature extractors and other modern methods. The proposed RbQE with VGG-19 features achieves the highest accuracy on the top 10 images and is superior to all other descriptors used with RbQE. The retrieval results of the RbQE method with VGG-19 improve on the HCSCs method by 1.24%, 0.18%, and 0.36%, as shown in Table 6. The top 10 images retrieved for a query using the RbQE method with VGG-19 features are shown in Fig. 12.

Fig. 12 Retrieved images for a NEMA-CT database query using RbQE with VGG-19

Table 6 Performance of different methods on NEMA-CT database with top 10 matches considered

Retrieval Performance on OASIS-MRI Database

The efficiency of the RbQE method with different feature extractors was also compared against another medical image retrieval method, IR-Net [28], which was tested on a benchmark database, the Open Access Series of Imaging Studies (OASIS) MRI database [56]. We follow the same comparison settings as IR-Net. Table 8 presents the performance on the top 10 images in terms of ARP, and Table 7 shows that the RbQE method with AlexNet, VGG-16, and VGG-19 features exceeds the other existing methods group-wise in terms of ARP. On the top 10 images, RbQE with AlexNet features achieves the highest accuracy compared to RbQE with VGG-16 and VGG-19. The retrieval results of the proposed method are improved by 14.51% on average group-wise compared with the IR-Net method. The results of a query using the RbQE method with AlexNet features are shown in Fig. 13.

Fig. 13 Retrieved images for an OASIS-MRI database query using RbQE with AlexNet

Table 7 Performance of different methods on the OASIS database in terms of group-wise ARP
Table 8 Performance of different methods on the OASIS database in terms of ARP for the top 10 matches

Time Complexity

The average feature extraction time, average retrieval time, and total CPU time in seconds are shown in Table 9 for the proposed RbQE with the different feature extraction methods (VGG-16, VGG-19, and AlexNet) over each database (TCIA-CT, EXACT09-CT, NEMA-CT, and OASIS-MRI). All experiments were carried out on a computer equipped with an Intel(R) Core(TM) i7-4510U processor running at 2.00 GHz, 8 GB of RAM, and a 64-bit Windows 10 Enterprise LTSC operating system. The total CPU time of AlexNet is lower than that of VGG-16 and VGG-19. The retrieval times of VGG-16, VGG-19, and AlexNet are equal on the same database because they have the same feature vector dimension (4096).

Table 9 CPU elapsed time (s) for the proposed RbQE with the different feature extraction methods over all four test databases

Discussion

As described in the main framework of the methodology, there are two search processes: a rapid search over each class of images using a single query image, followed by a final search utilizing the newly expanded queries. One of the key advantages of our proposed method is that the images retrieved by the rapid search are fed into the expansion process automatically, without any user intervention or suggestion. The newly created query is then used in the final search, and all evaluation metrics are computed on the results of this search. Our proposed method has proven superior in retrieval ability to all the existing and state-of-the-art methods considered. It improves three basic processes in the CBMIR framework: feature extraction, similarity measurement, and query expansion.

Firstly, in the feature extraction process, we focus on extracting deep, high-level features able to represent medical images with high accuracy, especially since medical images contain more details than natural images. These details are difficult to represent with local descriptors that extract low-level features, which widens the semantic gap between the human visual system (HVS) and the imaging system, because information is lost when high-level semantics are converted to low-level features. To reduce this problem and achieve high accuracy, we focused on deep learning descriptors that produce high-level features for medical images, and we found that pre-trained DCNN models can be used to achieve these goals. After testing and comparing many pre-trained DCNN models, we found that AlexNet and VGGNets achieve high accuracy in representing medical images. Owing to its characteristics, as mentioned before, AlexNet achieves high accuracy in representing the most complex and difficult datasets (EXACT09-CT and OASIS-MRI), which exhibit high similarity between classes and complex details inside the images, as shown in Figs. 7 and 9, respectively. Among the VGGNets, we found that VGG-19 provided a better representation of the TCIA-CT and NEMA-CT datasets than VGG-16 and AlexNet.

Secondly, for similarity measurement, the Euclidean distance (ED) has been used in other CBMIR methods, but we wanted to determine whether another similarity measure could further enhance our results, so we tested several measures, including the Euclidean distance, Manhattan distance, and chi-square distance. In the end, we found that ED achieves high accuracy in similarity measurement and enhances the results.

Thirdly, the query expansion process is the core of the RbQE method and has two parts: the first obtains the NQE from the deep features of the top 10 images of each class to enhance the retrieval process, and the second calculates the similarity between the original query and all NQEs to find the NQE most similar to the original query, indicating that the original query belongs to the same class as that NQE; this NQE is then used in the final search, which further improves retrieval.

On the other hand, retrieval accuracy is comparatively more crucial for medical retrieval, particularly for diagnostic purposes, than implementation efficiency. Our method addresses both factors, achieving high retrieval accuracy with low implementation time.

Conclusion

In this paper, we proposed an efficient method (RbQE) for the retrieval of medical images. Our method relies on expanding the query image in a fully automatic process by reformulating it based on the mean value of the top-ranked images from each class. DCNNs (AlexNet and VGG-19) are used as extractors of deep, high-level features. Our method was tested on four publicly available databases with different formats (TCIA-CT, EXACT09-CT, NEMA-CT, and OASIS-MRI), and the results showed that it achieves high accuracy compared to other state-of-the-art CBMIR methods.