
1 Introduction

The development of image and video processing technologies and the exponential increase of new multimedia services raise the critical issue of assessing visual quality. For several years, a number of investigations have been conducted to design robust Image Quality Assessment (IQA) metrics. Such metrics aim at predicting image quality that correlates well with Mean Opinion Scores (MOS). No-Reference IQA (NR-IQA) metrics are of particular interest as they assume no knowledge of the reference image and can be embedded in practical, real-time applications. Three approaches may be used in the design of IQA algorithms. The first one seeks to mimic the behavior of the Human Visual System (HVS). The HVS models used in this context include relevant properties such as the contrast sensitivity function, masking effects and detection mechanisms. A number of investigations [1] have shown that these models, when included in IQA algorithms, improve their performance. The second approach is well suited for assessing the quality of images affected by known distortions. Its algorithms quantify one or more distortions such as blockiness [20, 27], blur [2, 22] or ringing [9, 10] and score the image accordingly. The third and last approach is a general-purpose method. It considers that the HVS is very sensitive to structural information in the image, so that any loss of structural information results in a perceptual loss of quality. To quantify this loss, the approach relies on Natural Scene Statistics (NSS). NSS-based algorithms generally combine a learning-based approach with NSS-based extracted features. When a large ground truth is available, statistical modeling algorithms can achieve good performance. However, further effort is still required to reach the subjective consistency of the HVS.

The work proposed in this paper is motivated by the promising results obtained when visual attention models are used in IQA. Computational visual saliency models extract regions that attract the human gaze; these regions are of great interest in IQA. This paper presents a new NR-IQA metric that uses saliency maps to better weight the extracted distortions and combines these weighted distortions using a MultiVariate Gaussian Distribution (MVGD).

Fig. 1. Overall synopsis of the proposed multi-scale approach

2 The Proposed Approach

Figure 1 presents the overall synopsis of the proposed multi-scale approach, namely the SABIQ (SAliency-based Blind Image Quality) index. First, a multi-scale decomposition is performed on the input image and a saliency map is computed at each level. The base level corresponds to the original image, while the remaining ones are obtained by low-pass filtering followed by sub-sampling. Secondly, different distortion maps are generated at the same scale levels. At each level, the Renyi entropy of the sub-sampled image is also computed. Thirdly, for each level, each computed distortion map is weighted by the corresponding saliency map in order to emphasize degradations in visually attractive areas. Finally, the weighted distortion maps at each level are combined with the computed Renyi entropy to design a multiresolution distortion map. The final stage of the pipeline is a simple Bayesian model that predicts the quality score of the input image. The Bayesian approach maximizes the probability that the image has a certain quality score given the features extracted from the image. The associated posterior probability is modeled as a MultiVariate Gaussian Distribution (MVGD).
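As a rough illustration of this front end (not the authors' implementation), the sketch below builds the multi-scale pyramid by Gaussian low-pass filtering and dyadic sub-sampling and computes a histogram-based Renyi entropy; the filter choice, the entropy order and the function names are assumptions.

```python
# Minimal sketch of the multi-scale front end, assuming a Gaussian low-pass
# filter and a histogram-based Renyi entropy (illustrative choices only).
import numpy as np
from scipy.ndimage import gaussian_filter

def build_pyramid(image, n_levels=3, sigma=1.0):
    """Base level is the input image; each next level is low-pass filtered
    then sub-sampled by a factor of two."""
    levels = [image.astype(np.float64)]
    for _ in range(n_levels - 1):
        smoothed = gaussian_filter(levels[-1], sigma=sigma)
        levels.append(smoothed[::2, ::2])
    return levels

def renyi_entropy(values, alpha=2.0, n_bins=256):
    """Renyi entropy of order alpha computed from a histogram of the values."""
    hist, _ = np.histogram(values.ravel(), bins=n_bins)
    p = hist / max(hist.sum(), 1)
    p = p[p > 0]
    return np.log(np.sum(p ** alpha)) / (1.0 - alpha)
```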

2.1 Visual Saliency Map

Visual attention is the ability of the HVS to rapidly direct our gaze towards regions of interest in our visual environment. Two attentional mechanisms are involved in this selection: bottom-up and top-down. The main features known to influence bottom-up attention include color, orientation and motion. Top-down attention is rather driven by the observer's experience, task and expectations. Many investigations have helped in understanding visual attention, and many computational saliency models have been proposed in the literature [12, 13]. A recent review of the state of the art in visual attention is given in [5]. Most of these models follow a bottom-up approach and are based on the Feature Integration Theory of Treisman and Gelade [28]. They compute a 2D map that highlights locations where fixations are likely to occur. These image-based (stimulus-driven) models share the same architecture but vary in the selection of characteristics used to compute the global saliency map.

Saliency models have been applied in various domains, including computer vision [21], robotics [6] and visual signal processing [7, 29]. In the context of IQA algorithms, saliency models are intended to extract the most relevant visual features that, when combined, produce a quality score highly correlated with human judgment [3].

Much research has been devoted to modeling the phenomenon whereby a human viewer focuses on attractive points at first glance, and many saliency models have been proposed in the literature.

Saliency models can be categorized into (1) pixel-based models and (2) object-based models. The pixel-based models aim to highlight pixel locations where fixations are likely to occur. The object-based models focus on detecting salient objects in a visual scene. The majority of saliency models in the literature are pixel-based saliency models, such as ITTI [11], STB [30], PQFT [8], etc.

In this paper, the ITTI model [12] is employed. This model combines multiscale image features into a single topographical saliency map. Three channels (intensity, color and orientation) are used as low-level features. First, feature maps are calculated for each channel via center-surround difference operations. Three kinds of conspicuity maps are then obtained by across-scale combination. The final saliency map is built by combining all the conspicuity maps.
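For intuition only, the following heavily simplified sketch illustrates the center-surround principle on the intensity channel alone; the full ITTI model additionally uses color and orientation channels, a dedicated normalization operator and across-scale combination, none of which are reproduced here. The pyramid depth and scale pairs are illustrative assumptions.

```python
# Simplified, intensity-only illustration of center-surround saliency.
import numpy as np
from scipy.ndimage import gaussian_filter
from skimage.transform import resize

def intensity_saliency(gray, levels=6, center_scales=(1, 2), deltas=(2, 3)):
    """Center-surround differences on an intensity Gaussian pyramid,
    summed across scale pairs and normalized to [0, 1]."""
    pyr = [gray.astype(np.float64)]
    for _ in range(levels - 1):
        pyr.append(gaussian_filter(pyr[-1], 1.0)[::2, ::2])
    target_shape = pyr[0].shape
    sal = np.zeros(target_shape)
    for c in center_scales:
        for d in deltas:
            center = resize(pyr[c], target_shape, anti_aliasing=False)
            surround = resize(pyr[c + d], target_shape, anti_aliasing=False)
            sal += np.abs(center - surround)
    return (sal - sal.min()) / (sal.max() - sal.min() + 1e-12)
```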

2.2 Distortion Maps

Many studies have shown that image quality degradations are well measured by features of local structure [31], contrast [31, 32], and multi-scale, multi-orientation decompositions [34].

Contrast Distortion Map. The image gradient is an interesting descriptor to capture both local image structure and local contrast [33]. According to the same study, the partial derivatives and gradient magnitudes vary with the strength of the applied distortions.

Following this strategy, and in order to generate the contrast distortion map, we compute the horizontal and vertical gradient component images \(\partial {I}/\partial {x}\) and \(\partial {I}/\partial {y}\) from the image I. From these two gradient images, the gradient magnitude image is computed as \(\sqrt{(\partial {I}/\partial {x})^2 + (\partial {I}/\partial {y})^2}\) and then modeled by a Weibull distribution. This distribution fits the gradient magnitude of natural images well [25], and its two parameters (the scale parameter and the shape parameter) roughly approximate the local contrast and the texture activity of the gradient magnitude map, respectively. Larger values of the scale parameter imply greater local contrast.

However, instead of computing the contrast on the entire image, the image is first partitioned into equally sized \(n\times n\) blocks (referred to as local image patches), and the local contrast is then computed for each block, finally yielding a local contrast map \(\mathcal {M}_C\).
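A possible sketch of this computation is given below, assuming square \(n\times n\) blocks and scipy's Weibull fit; the block size and the small epsilon added to keep magnitudes strictly positive are illustrative choices, not the authors' settings.

```python
# Sketch of the local contrast map: gradient magnitude, then a per-block
# Weibull fit whose scale parameter approximates local contrast.
import numpy as np
from scipy.stats import weibull_min

def contrast_map(image, n=16):
    gy, gx = np.gradient(image.astype(np.float64))
    grad_mag = np.sqrt(gx ** 2 + gy ** 2)
    rows, cols = grad_mag.shape[0] // n, grad_mag.shape[1] // n
    m_c = np.zeros((rows, cols))
    for r in range(rows):
        for c in range(cols):
            block = grad_mag[r * n:(r + 1) * n, c * n:(c + 1) * n].ravel()
            # Fit returns (shape, loc, scale); loc is fixed to 0 and the
            # scale parameter is kept as the local contrast estimate.
            _, _, scale = weibull_min.fit(block + 1e-6, floc=0)
            m_c[r, c] = scale
    return m_c
```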

Structural Distortion Map. The structural distortion map considered here uses structural distortion features extracted from both spatial and frequency information. To extract image structure information in the frequency domain, the image is partitioned into equally sized \(n\times n\) local image patches and a 2D-DCT (Discrete Cosine Transform) is applied to each patch. The feature extraction is thus performed locally in the spatio-frequency domain, in accordance with the local spatial visual processing property of the HVS [4]. To capture degradations depending on directional information in the image, the block DCT coefficients are modeled along three orientations (0, 45 and 90\(^\circ \)). For each orientation, a Generalized Gaussian distribution is fitted to the associated coefficients, and the coefficient \(\zeta \) is computed from the histogram model as \(\zeta =\sigma (X)/\mu (X)\), where \(\sigma (X)\) and \(\mu (X)\) are the standard deviation and the mean of the DCT coefficient magnitudes, respectively. In order to select the most significant of the three generated distortion maps, the variance of \(\zeta \) is computed for each orientation. The distortion map associated with the highest variance of \(\zeta \) is finally chosen and serves as the structural distortion map, namely \(\mathcal {M}_S\).

Since the DC (Direct Current) coefficient does not convey any structural information, it is removed from all computations.
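The block-level computation of \(\zeta \) could be sketched as follows. The partition of the DCT coefficients into three orientation sectors by the angle of their frequency index is an assumption inspired by BLIINDS2-style models, and the GGD fit itself is omitted here since \(\zeta \) only requires the mean and standard deviation of the coefficient magnitudes.

```python
# Sketch of the per-block, per-orientation coefficient zeta = sigma / mu
# computed on 2D-DCT magnitudes, with the DC coefficient removed.
import numpy as np
from scipy.fft import dctn

def block_zeta(block):
    coeffs = dctn(block.astype(np.float64), norm='ortho')
    u, v = np.meshgrid(np.arange(block.shape[1]), np.arange(block.shape[0]))
    angle = np.degrees(np.arctan2(v, u))       # 0 = horizontal, 90 = vertical
    not_dc = (u + v) > 0                       # discard the DC coefficient
    zetas = []
    for lo, hi in [(0, 30), (30, 60), (60, 90.1)]:   # three orientation sectors
        sel = not_dc & (angle >= lo) & (angle < hi)
        mags = np.abs(coeffs[sel])
        zetas.append(np.std(mags) / (np.mean(mags) + 1e-12))
    return zetas                               # one zeta per orientation sector
```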

Multi-orientation Image Property Map. It is widely accepted that the HVS is sensitive to spatial frequency and orientation. In order to capture this sensitivity, the steerable pyramid transform [26] is used.

Let \(a(i,j,f,\theta )\) be an original coefficient produced by the decomposition process, located at position \((i,j)\) in frequency band f and orientation band \(\theta \). The associated squared and normalized coefficient \(r(i,j,f,\theta )\) is defined as:

$$\begin{aligned} r(i,j,f,\theta )=k \frac{a(i,j,f,\theta )^2}{\sum _{\phi \in \left[ 0, 45, 90, 135\right] } a(i,j,f,\phi )^2+\sigma ^2} \end{aligned}$$
(1)

In this paper, four orientation bands with 45\(^\circ \) bandwidths, centered at 0, 45, 90 and 135\(^\circ \), plus one isotropic low-pass filter are used, yielding five response maps \(\{R_{\theta }, R_\text {iso}\}, \theta \in [ 0, 45, 90, 135]\). The distortion map associated with the highest value of the variance is finally selected and serves as the frequency variation distortion map, namely \(\mathcal {M}_F\).

From the four orientation bands, we compute an energy ratio in order to take into account the modification of the local spectral signatures of an image. This approach is inspired by the BLIINDS2 quality index [24]. Each orientation map \(\{R_{\theta }\}, \theta \in [ 0, 45, 90, 135]\) is decomposed into equally sized \(n\times n\) blocks. For each obtained patch, the average energy in band \(\theta \) is modeled by the variance of its coefficients, \(e_\theta =\sigma _\theta ^2\).

For each \(\theta \in [45, 90, 135]\), the relative distribution of energies in lower and higher bands is then computed as:

$$\begin{aligned} E_\theta = \frac{|e_\theta - 1/n \sum _{t<\theta }e_t|}{|e_\theta + 1/n \sum _{t<\theta }e_t|} \end{aligned}$$
(2)

where \(1/n \sum _{t<\theta }e_t\) represents the average energy over the bands below \(\theta \) (n being the number of such bands). Three distortion maps are then generated.

The distortion map associated with the highest value of the variance of \(E_\theta \) is finally selected and serves as the energy ratio distortion map, namely \(\mathcal {M}_E\).
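A sketch of the per-block energies \(e_\theta \) and ratios \(E_\theta \) of Eq. (2) is given below. The oriented response maps are assumed to come from a steerable pyramid implementation (e.g., the pyrtools package) after the normalization of Eq. (1); here they are simply passed in as a dictionary, and the block size is illustrative.

```python
# Sketch of the energy ratio maps of Eq. (2) from four oriented response maps.
import numpy as np

def energy_ratio_maps(responses, n=16):
    """responses: dict mapping orientation (0, 45, 90, 135) to a 2-D response map."""
    angles = sorted(responses)                     # [0, 45, 90, 135]
    rows, cols = responses[angles[0]].shape[0] // n, responses[angles[0]].shape[1] // n
    e = {t: np.zeros((rows, cols)) for t in angles}
    for t in angles:
        for r in range(rows):
            for c in range(cols):
                block = responses[t][r * n:(r + 1) * n, c * n:(c + 1) * n]
                e[t][r, c] = np.var(block)         # e_theta = variance of band theta
    ratio_maps = {}
    for i, t in enumerate(angles[1:], start=1):    # theta in {45, 90, 135}
        lower = np.mean([e[angles[j]] for j in range(i)], axis=0)  # avg of lower bands
        ratio_maps[t] = np.abs(e[t] - lower) / (np.abs(e[t] + lower) + 1e-12)
    return ratio_maps
```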

2.3 Multiscale Features Computation

In this block, each distortion map is combined with the saliency map in order to obtain a saliency-based distortion map. From each saliency-based distortion map, a pooling strategy is applied by averaging the highest 10th percentile of coefficients across the distortion map. This pooling strategy is motivated by the fact that the “worst” distortions in an image heavily influence subjective impressions and that they are concentrated in the few coefficients having the highest values [18]. The obtained values are referred to as \(df^{10}(\cdot )\), where \((\cdot )\) denotes one of the computed distortion maps \(\{\mathcal {M}_C,\mathcal {M}_S,\mathcal {M}_F,\mathcal {M}_E\}\). In order to capture information about the spatial distribution of the distortions (spread over space or isolated), the 100th percentile average of the local scores is also computed; the obtained values are referred to as \(df^{100}(\cdot )\). The whole computation thus leads to 8 distortion features \(\{df^{10}(k),df^{100}(k)\},\) \(\forall k\in \{\mathcal {M}_C,\mathcal {M}_S,\mathcal {M}_F,\mathcal {M}_E\}\).
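A possible reading of this pooling, for illustration only:

```python
# Percentile pooling of a (saliency-weighted) distortion map: the 10th-percentile
# feature averages only the highest 10% of coefficients, the 100th-percentile
# feature averages all of them.
import numpy as np

def percentile_pool(dist_map, percent=10):
    values = np.sort(dist_map.ravel())[::-1]                 # largest first
    k = max(1, int(np.ceil(values.size * percent / 100.0)))
    return values[:k].mean()

# df10 = percentile_pool(weighted_map, 10); df100 = percentile_pool(weighted_map, 100)
```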

The final feature is computed at each scale level l as

$$\begin{aligned} \text {final-feature}^p_l(k)= df^p_l(k)* \text {entropy}_l \end{aligned}$$
(3)

where \(\ p \in \{10,100\}\), \(\ k \in \{\mathcal {M}_C,\mathcal {M}_S,\mathcal {M}_F,\mathcal {M}_E\}\), \(df^p_l(k)\) represents the value of the distortion feature \(df^{p}(k)\) at level l, and \(\text {entropy}_l\) is the Renyi entropy of the associated saliency-based distortion map. This strategy allows us to include information about the anisotropy of the distortion maps. In this paper, the number of scales l is set to 3, as this value achieves the best performance.
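For one scale level, the feature assembly could be sketched as follows; this is illustrative only, it reuses the percentile_pool and renyi_entropy helpers sketched earlier, and the dictionary keys are arbitrary labels for the four maps.

```python
def level_features(weighted_maps):
    """weighted_maps: dict {'C': M_C, 'S': M_S, 'F': M_F, 'E': M_E}, each already
    weighted by the saliency map of the current level."""
    feats = []
    for key in ('C', 'S', 'F', 'E'):
        ent = renyi_entropy(weighted_maps[key])      # entropy of the weighted map
        for p in (10, 100):
            feats.append(percentile_pool(weighted_maps[key], p) * ent)   # Eq. (3)
    return feats   # 8 features per level; with the 3 levels used here, 24 in total
```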

2.4 Probabilistic Model and Quality Score Prediction

The computed features and the DMOS (Difference of Mean Opinion Scores) values of the training images are then used by the learning block to fit an MVGD. The resulting SABIQ model is given by:

$$\begin{aligned}&\text {SABIQ}\left( x\right) = \nonumber \\&\,\,\, \frac{1}{\left( 2\pi \right) ^{k/2}\left| \varSigma \right| ^{1/2}}\exp \left( -\frac{1}{2}\left( x-\beta \right) ^{T}\varSigma ^{-1}\left( x-\beta \right) \right) \end{aligned}$$
(4)

where \(x = \left( \{\text {final-feature}^p_l(k)\}, DMOS\right) \) corresponds to the extracted features (Eq. 3) augmented with the DMOS value. \(\beta \) and \(\varSigma \) denote the mean vector and covariance matrix of the MVGD model and are estimated using the maximum likelihood method. To assess the quality of a test image, its extracted features are paired with candidate DMOS values lying between 0 and 100 with a step of 0.5 and fed into the learned SABIQ model; the candidate that maximizes the modeled probability is retained as the predicted quality score.
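As an illustration of this prediction step (a minimal sketch, not the authors' code), the snippet below fits the joint MVGD over (features, DMOS) on training data and scores a test image by scanning candidate DMOS values; the function names and the use of scipy are assumptions.

```python
# MVGD fit on (features, DMOS) and prediction by maximizing the joint density
# over a DMOS grid from 0 to 100 with a step of 0.5.
import numpy as np
from scipy.stats import multivariate_normal

def fit_mvgd(train_features, train_dmos):
    """train_features: (N, d) array, train_dmos: (N,) array."""
    X = np.column_stack([train_features, train_dmos])
    beta = X.mean(axis=0)                              # mean vector
    sigma = np.cov(X, rowvar=False)                    # covariance matrix
    return multivariate_normal(mean=beta, cov=sigma, allow_singular=True)

def predict_dmos(model, features, grid=np.arange(0.0, 100.5, 0.5)):
    candidates = np.column_stack([np.tile(features, (len(grid), 1)), grid])
    return grid[np.argmax(model.logpdf(candidates))]
```

For instance, predict_dmos(fit_mvgd(F_train, y_train), f_test) would return the predicted DMOS of a test image whose feature vector is f_test.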

Table 1. SROCC values of NR-IQA models on each distortion type for the TID2013 database.
Table 2. SROCC values of NR-IQA models on each distortion type for the CSIQ database.

3 Performance Evaluation

3.1 Apparatus

To compare NR-IQA algorithms, two publicly available databases are used: (1) the TID2013 database [23] and (2) the CSIQ database [15]. Since the LIVE database [14] has been used to train both the proposed metric and most of the competing NR-IQA schemes, it was not used to evaluate performance. To train our model, we used the LIVE database, running multiple train-test sequences. For each sequence, the image database is divided into distinct training and test sets: 80% of the LIVE IQA database content was used for the training set and the remaining 20% for the test set. Each training set therefore contains 23 reference images and their associated distorted images. The quality scores are computed using a bootstrap process with 999 replicates.

To assess the performance of SABIQ, the Spearman Rank Order Correlation Coefficient (SROCC) between DMOS values and predicted scores is computed for SABIQ and for six state-of-the-art NR-IQA methods: BRISQUE [17], BLIINDS2 [24], DIIVINE [19], CORNIA [17], ILNIQE [33] and SSEQ [16], all of which are widely accepted in the research community.
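For reference, a simplified stand-in for this evaluation protocol (repeated random 80/20 splits grouped by reference image and the median SROCC over repetitions, rather than the 999-replicate bootstrap mentioned above) could look as follows; the function names and arguments are hypothetical.

```python
# Repeated content-independent 80/20 splits and median SROCC over repetitions.
import numpy as np
from scipy.stats import spearmanr

def median_srocc(features, dmos, ref_ids, train_fn, predict_fn, n_splits=100, seed=0):
    rng = np.random.default_rng(seed)
    refs = np.unique(ref_ids)
    sroccs = []
    for _ in range(n_splits):
        train_refs = rng.choice(refs, size=int(0.8 * len(refs)), replace=False)
        train = np.isin(ref_ids, train_refs)       # split by reference image
        model = train_fn(features[train], dmos[train])
        preds = [predict_fn(model, f) for f in features[~train]]
        rho, _ = spearmanr(dmos[~train], preds)
        sroccs.append(rho)
    return np.median(sroccs)
```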

3.2 Performance Evaluation

The SROCC between predicted and subjective DMOS is reported in Table 1 for the TID2013 database. From Table 1, one observes that SABIQ performs much better than the six other NR-IQA methods when the SROCC value for the whole database is considered. This significant gain in performance is likely induced by the use of visual attention in the weighting of distortions. When single distortions are considered, SABIQ achieves performance comparable with CORNIA and performs better than the five remaining methods. For multiple distortions, SABIQ performs better than BRISQUE, BLIINDS2, DIIVINE and SSEQ, and competes very well with CORNIA and ILNIQE.

Similar results are shown in Table 2 for the CSIQ database. SABIQ achieves better results for 4 out of the 6 distortions and outperforms all the competing NR-IQA algorithms when the entire database is considered. In this case, the gain in performance is about 7% compared to ILNIQE and at least 32% compared to the other metrics.

We also trained the methods on TID2013, excluding the multiply-distorted subsets (MD), and then tested them on the two other databases and on the remaining MD subsets of TID2013. The results are shown in Table 3. ILNIQE and SABIQ clearly outperform the other methods when trained on single distortions. On the LIVE database, ILNIQE and SABIQ achieve almost the same results, which is not surprising since many recent NR-IQA schemes reach high correlations on that database. Furthermore, SABIQ presents the highest SROCC value on the CSIQ database. All these results tend to highlight the high generalization capability of the proposed approach.

Table 3. SROCC values when trained on TID2013, excluding multi-distortion subsets (MD)

4 Conclusion

In this paper, we investigated how the visual attention property of the HVS can be embedded in NR-IQA algorithm design and to what extent it can improve the prediction of image quality. The proposed approach, namely SABIQ, relies on a computational model of visual attention to compute saliency maps. At each of the three levels of the multiresolution scheme, distortion maps of the input image are generated and weighted by the saliency maps in order to emphasize degradations in visually attractive regions. The extracted features are used by a probabilistic model to predict the final quality score. The obtained results demonstrate the effectiveness of the approach.