Validation of a new objective method to assess lipid layer thickness without the need of an interferometer

García-Marqués, José Vicente; Talens-Estarelles, Cristian; García-Lázaro, Santiago; Cerviño, Alejandro

doi:10.1007/s00417-021-05378-8

Pathology
Open access
Published: 06 September 2021

Volume 260, pages 655–676, (2022)

Graefe's Archive for Clinical and Experimental Ophthalmology


José Vicente García-Marqués¹,
Cristian Talens-Estarelles¹,
Santiago García-Lázaro¹ &
…
Alejandro Cerviño ORCID: orcid.org/0000-0001-8014-3279¹

Abstract

Purpose

This study aimed to develop and validate new metrics to objectively assess the lipid layer thickness (LLT) through the analysis of grey intensity values obtained from the Placido disk pattern reflected onto the tear film.

Methods

Ocular surface parameters were measured using Oculus Keratograph 5 M in 94 healthy volunteers (43.8 ± 26.8 years). Subjects’ LLT was subjectively classified into 4 groups using an interferometry-based grading scale. New metrics based on the intensity of the Placido disk images were calculated and compared between groups. The repeatability of the new metrics was assessed, and their diagnostic ability was analysed through receiver operating characteristics (ROC) curves. The level of agreement between the new objective tool and the existing subjective classification scale was analysed by means of accuracy, weighted Kappa index and F-measure.

Results

Mean pixel intensity, median pixel intensity and relative energy at 5.33 s after blinking achieved the highest performance, with a correlation with LLT between r = 0.655 and 0.674 (p < 0.001), sensitivity between 0.92 and 0.94, specificity between 0.79 and 0.81, area under the ROC curve between 0.89 and 0.91, accuracy between 0.76 and 0.77, weighted Kappa index of 0.77 and F-measure between 0.86 and 0.87.

Conclusion

The analysis of grey intensity values in videokeratography can be used as an objective tool to assess LLT. These new metrics could be included in a battery of clinical tests as an easy, repeatable, objective and accessible method to improve the detection and monitoring of dry eye disease and meibomian gland dysfunction.


Introduction

The lipid layer is the outermost layer of the tear film (TF) and is almost entirely derived from meibum, which is secreted by the meibomian glands. The lipid layer plays a vital role in the stabilization of the TF. It also spreads the whole TF over the ocular surface, lowers the surface tension at the air interface of the TF and prevents the aqueous layer from evaporating [1, 2].

Given the key role of the lipid layer in maintaining the properties of the TF, the assessment of the lipid layer thickness (LLT) is essential in dry eye disease (DED) and Meibomian gland dysfunction (MGD) [3, 4]. One of the most common methods for assessing the lipid layer is the evaluation of the colour and brightness of its interference patterns using an interferometer.

The Tearscope Plus™ is an interferometer developed to assess the LLT [4]. However, this is a subjective technique which requires an experienced clinician to classify the interference patterns. It has been reported that subjective diagnostic tests, such as grading scales, rely on the examiner’s ability, which might decrease inter- and intra-observer repeatability [3, 5,6,7]. Likewise, in some cases, the interference patterns are difficult to grade, especially when dealing with thinner lipid layers [3, 8]. Currently, only the LipiView® system can provide quantitative values of the LLT. However, it has a small area of measurement and it only measures the LLT in blinking conditions [9, 10].

Lately, several studies have tried to solve the aforementioned problems by developing algorithms, based on the analysis of the texture, structure or colour of the interference patterns, which objectively assess the LLT [8, 11,12,13,14,15,16,17,18,19]. Likewise, other authors have used high-resolution microscopy systems to characterize the LLT [20] or have combined optical coherence tomography with interferometry to develop novel imaging systems [19, 21]. Nonetheless, none of these methods has been globally accepted and most of them are considerably time-consuming. Moreover, they require an interferometer, an instrument that is too costly and sophisticated to be implemented in the clinic and is more suitable for research purposes.

During corneal topography measurement, the TF acts as a mirror and reflects the projected Placido disk ring pattern. The Placido disk rings appear lighter than the background. A healthy TF surface forms a well-structured reflected pattern with good intensity of reflection, while an altered TF produces an irregular pattern with low reflectivity [22]. Accordingly, the primary aim of the present study was to develop and validate a novel method to objectively assess the lipid layer through the analysis of grey intensity values obtained from the Placido disk pattern reflected onto the TF, without the need of an interferometer, thus making the method widely accessible.

The basis of the method is that a thicker lipid layer has more lipids [1], which will reflect the light of the Placido disk ring pattern with higher intensity. We hypothesized that high grey intensity values might be related to a thicker lipid layer, while low grey intensity values might be related to a thinner lipid layer. This method was developed following previous research, which shows that the analysis of grey level intensity values of videokeratoscopy images may significantly improve the diagnosis of DED in comparison to other image analysis approaches [22].

Material and methods

Ninety-four healthy volunteers ranging in age from 18 to 90 years (43.8 ± 26.8 years) were enrolled in this study. Only the right eye of participants was assessed to avoid subjects’ data duplication. Subjects had no prior history of ocular disease or injury in the last 3 months. No exclusion based on ocular surface parameters was made to evaluate different TF status. Contact lens users were instructed not to wear their contact lenses within a week before the examination. The work was performed in accordance with the tenets of the Declaration of Helsinki and was approved by the Ethics Committee of the University of Valencia. Written consent of each subject was obtained after a verbal explanation of the study protocol.

Ocular surface measurements

Participants’ ocular surface was evaluated using Oculus Keratograph 5 M (K5 M; Oculus GmbH, Wetzlar, Germany). Measurements were taken by the same experienced researcher following the guidelines of the Tear Film and Ocular Surface Dry Eye Workshop II (TFOS DEWS II) Diagnostic Methodology report [3] and were performed in the following order to avoid TF destabilization: Ocular Surface Disease Index (OSDI), Dry Eye Questionnaire-5 (DEQ-5), total bulbar redness, tear meniscus height (TMH), LLT, non-invasive keratograph break-up time (NIKBUT), meibomian glands expressibility and upper eyelid meibography. The illuminance, temperature and humidity of the room were maintained constant at 200 lx, 24.1 ± 1.6 °C and 44.9 ± 5.0%, respectively.

OSDI and DEQ-5 were used for scoring the ocular surface symptoms of subjects. Bulbar redness was assessed three consecutive times, and an average value was calculated [23], while TMH was obtained by capturing the meniscus immediately post-blink [24].

The LLT was recorded using Oculus Keratograph 5 M and assessed through the lipid layer interference pattern, which was subjectively classified by a masked and experienced examiner into 4 groups using a standardised grading scale [6, 25]: 1 = open meshwork (13–15 nm); 2 = closed meshwork (30–50 nm); 3 = wave (50–80 nm); and 4 = colour fringe (90–140 nm).

The moment of the first break-up of the TF (first NIKBUT) and the average time of all break-ups (mean NIKBUT) were also obtained. A total of three measurements were carried out, one every 3 min so that the TF stabilized between assessments, and the mean and median values of these three measurements were calculated [3].

The expressibility of the central 8 meibomian glands of the upper eyelid was assessed using a subjective grading scale [6, 26, 27]. Upper eyelid meibography was captured using non-contact infrared meibography, and meibomian glands drop-out was objectively calculated using ImageJ tool (Wayne Rasband, National Institutes of Health, Bethesda, MD) as the ratio between gland loss area and eyelid area [28].

Data analysis using the proposed algorithm

Oculus Keratograph 5 M was used to record a video of the NIKBUT measurement at 32 frames per second with a spatial resolution of 680 × 512 pixels. This video was recorded and saved to be later analysed. The proposed software was developed using Matlab R2019a® (MathWorks, Natick, MA). The software automatically decomposed the video into frames with a time interval of 0.031 s between them. The examiner manually selected the frames at 0.33, 5.33, 10.33, 15.33 and 20.33 s after blinking. The frame of 0.33 was selected since the eye was completely open after this time in all videos. Likewise, intervals of 5.00 s from this moment on were chosen to analyse whether the grey intensity values changed over time.
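The relation between a time after blinking and a frame number can be sketched as follows. This is only an illustration: in the study the examiner selected the frames manually, and the function name `frame_index` and the `blink_frame` parameter are hypothetical names introduced here.

```python
FPS = 32  # frame rate reported for the NIKBUT video

def frame_index(seconds_after_blink, blink_frame=0, fps=FPS):
    """Index of the frame closest to a given time after the blink.

    blink_frame is the (hypothetical) index of the frame at which
    the blink ends; it is an assumption of this sketch.
    """
    return blink_frame + round(seconds_after_blink * fps)

# Time points analysed in the study, in seconds after blinking.
times = [0.33, 5.33, 10.33, 15.33, 20.33]
indices = [frame_index(t) for t in times]
print(indices)
```

At 32 frames per second, consecutive frames are 1/32 = 0.03125 s apart, which matches the reported interval of about 0.031 s.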

Once the frames of interest were selected, the software automatically processed the images. First, RGB images were transformed into grey-level images. Given that input images contained irrelevant information of external areas, the centre of the Placido disk ring pattern was isolated by the examiner through Matlab. After clicking the centre of the image, the software automatically selected a square of 241 × 241 pixels surrounding the centre of the rings (region of interest, ROI), as the area to perform the image processing.
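A minimal sketch of these two steps, grey-level conversion and extraction of the 241 × 241 pixel square, is given below in plain Python rather than Matlab. The luma weights are those commonly used for RGB-to-grey conversion and are an assumption here, since the text does not state which weighting the software applied; the function names are hypothetical.

```python
def to_grey(rgb_image):
    """Convert rows of (R, G, B) tuples to grey levels.

    The weights 0.2989, 0.5870 and 0.1140 are an assumption of this sketch.
    """
    return [[0.2989 * r + 0.5870 * g + 0.1140 * b for (r, g, b) in row]
            for row in rgb_image]

def crop_square(image, centre_row, centre_col, size=241):
    """Return a size x size square centred on the clicked pixel."""
    half = size // 2
    top, left = centre_row - half, centre_col - half
    if (top < 0 or left < 0
            or top + size > len(image) or left + size > len(image[0])):
        raise ValueError("ROI extends beyond the image")
    return [row[left:left + size] for row in image[top:top + size]]

# Synthetic 680 x 512 frame (512 rows, 680 columns) of uniform grey.
frame = [[(10, 10, 10)] * 680 for _ in range(512)]
roi = crop_square(to_grey(frame), centre_row=256, centre_col=340)
```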

Next, a band-pass filter was used to eliminate the background illumination and highlight the rings. The images were then smoothed by applying a Gaussian filter with a sigma of 4 pixels to remove the remaining noise from the background [29]. After that, the final ROI was selected by the examiner, who manually selected the region of the image comprising solely the pupil, to avoid the influence of the iris on the results.
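The Gaussian smoothing step can be illustrated with a separable filter written in plain Python. This sketch covers only the smoothing, not the band-pass filter, whose design is not described in the text. The kernel radius of three sigma and the edge replication at the borders are assumptions of this sketch.

```python
import math

def gaussian_kernel(sigma=4.0, radius=None):
    """Normalised 1-D Gaussian kernel (radius of 3 sigma assumed)."""
    if radius is None:
        radius = int(3 * sigma)
    weights = [math.exp(-(k * k) / (2 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth_1d(values, kernel):
    """Convolve a 1-D sequence with the kernel, replicating the edges."""
    radius = len(kernel) // 2
    last = len(values) - 1
    out = []
    for i in range(len(values)):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), last)
            acc += w * values[j]
        out.append(acc)
    return out

def gaussian_smooth(image, sigma=4.0):
    """Smooth rows first, then columns (the 2-D Gaussian is separable)."""
    kernel = gaussian_kernel(sigma)
    rows = [smooth_1d(row, kernel) for row in image]
    cols = [smooth_1d(list(col), kernel) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]

flat = [[5.0] * 10 for _ in range(8)]
smoothed = gaussian_smooth(flat)
```

Because the kernel is normalised, a uniform image is left unchanged, which is a quick sanity check for any implementation of this step.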

Finally, to increase the differences between normal and altered TFs, each pixel value of the resulting image was multiplied by 255 and divided by 85, thus enhancing the contrast between rings and non-ring spaces. These values were selected since they produced the highest possible contrast enhancement.
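Multiplying by 255 and dividing by 85 is equivalent to a gain of 3. A minimal sketch of this step follows; clipping the result at 255 is an assumption made here for an 8-bit grey scale and is not stated in the text.

```python
def enhance_contrast(image, numerator=255, denominator=85, max_level=255):
    """Scale every pixel by numerator/denominator (a gain of 3 by default).

    Values above max_level are clipped; the clipping is an assumption
    of this sketch.
    """
    return [[min(p * numerator / denominator, max_level) for p in row]
            for row in image]

enhanced = enhance_contrast([[0, 50, 85, 200]])
print(enhanced)
```

Under this assumption, any grey level of 85 or more saturates at 255, so the enhancement stretches the lower third of the range across the full scale.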

Once the images were processed, histograms were obtained from their pixel intensity values and metrics were calculated (Fig. 1). Figure 2 shows a summary of the main steps of image processing.

The basis of our method is that a thicker lipid layer has more lipids [1], which will reflect the Placido disk rings with higher intensity. Thus, higher grey intensity values might be related to a thicker LLT, while lower grey intensity values could be related to a thinner LLT.

Mean, standard deviation (SD), median, mode, kurtosis and skewness of the histogram of the grey level intensity values were calculated. The minimum grey level in the image was also calculated. Besides, energy, relative energy, entropy and SD irregularity were calculated as follows [22]:

$$\mathrm{Energy}=\frac{\sum p^{2}}{n}$$

$$\mathrm{Relative\ energy}=\frac{\sum \left(\frac{p}{p_{\max}}\right)^{2}}{n}$$

$$\mathrm{Entropy}=\frac{-\sum p\,\log(p)}{n}$$

$$\mathrm{SD\ irregularity}=\frac{\sum \left(\frac{p-x}{p_{\max}}\right)^{2}}{n}$$

where p = pixel grey value; n = number of pixels of the ROI; p_max = maximum pixel intensity; and x = mean pixel intensity.
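The four formulas above translate directly into code. The sketch below takes a flat list of ROI grey values; the function name is hypothetical, the base of the logarithm (2) is an assumption because it is not recoverable from the text, and pixels with a value of zero are skipped in the entropy term since their logarithm is undefined.

```python
import math

def reflectivity_metrics(pixels, log_base=2):
    """Energy, relative energy, entropy and SD irregularity of an ROI.

    pixels is a flat list of grey values; log_base is an assumption.
    """
    n = len(pixels)
    p_max = max(pixels)
    if p_max == 0:
        raise ValueError("ROI is entirely black")
    x = sum(pixels) / n  # mean pixel intensity
    return {
        "energy": sum(p * p for p in pixels) / n,
        "relative_energy": sum((p / p_max) ** 2 for p in pixels) / n,
        "entropy": -sum(p * math.log(p, log_base)
                        for p in pixels if p > 0) / n,
        "sd_irregularity": sum(((p - x) / p_max) ** 2 for p in pixels) / n,
    }

metrics = reflectivity_metrics([1, 2, 4])
```

For the toy input [1, 2, 4], the energy is (1 + 4 + 16)/3 = 7 and the relative energy is (1/16 + 4/16 + 16/16)/3 = 0.4375.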

Metrics were divided by the number of pixels of the ROI (n) so that all images were comparable independently of the size of the ROI. Finally, the total area under the pixel intensity three-dimensional curve of the image was calculated and divided by the number of pixels in the ROI (Fig. 3).

Statistical analysis

Statistical analysis was carried out using SPSS v26.0 for Windows (IBM Corp, Armonk, NY, USA). Outcomes were shown as the mean ± SD.

Differences in new metrics depending on time after blinking

A repeated mixed model ANOVA was used to evaluate the differences in pixel intensity values depending on the time after blinking. The Bonferroni correction was used for the post hoc comparisons between pairs of time points.

Repeatability of new metrics

As three NIKBUT videos were recorded, all three were analysed to calculate the repeatability of the software in the calculation of the Placido disk reflectivity metrics. The repeatability of each Placido disk reflectivity metric was assessed by calculating the within-subject SD (S_w), the coefficient of variation (CoV) and the repeatability coefficient (CoR) [30,31,32].
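These three indices can be sketched as follows, assuming the commonly used definitions: S_w as the square root of the mean within-subject variance, CoV as 100 × S_w divided by the overall mean, and CoR as 1.96 × √2 × S_w (about 2.77 × S_w). The text does not spell out the formulas, so these definitions are an assumption, although they are consistent with the reported extremes (7.07 × 2.77 ≈ 19.59).

```python
import math
from statistics import mean, variance

def repeatability(measurements):
    """Return (S_w, CoV in %, CoR) from repeated readings per subject.

    measurements holds one list of repeated readings for each subject.
    """
    within_var = mean([variance(subject) for subject in measurements])
    s_w = math.sqrt(within_var)
    grand_mean = mean([x for subject in measurements for x in subject])
    cov = 100 * s_w / grand_mean
    cor = 1.96 * math.sqrt(2) * s_w
    return s_w, cov, cor

# Two hypothetical subjects with three readings each.
s_w, cov, cor = repeatability([[10, 12, 14], [20, 20, 20]])
```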

Correlations of new metrics with DED signs and symptoms

Spearman's rho correlations were used to analyse the correlations between ocular surface signs and symptoms and the new metrics for the whole sample. Moreover, the sample was divided into different groups according to the cut-off values reported by the Diagnostic Methodology report of TFOS DEWS II [3].

Differences in Placido disk reflectivity metrics between groups were assessed by means of the Mann–Whitney U test or the Kruskal–Wallis test. A p-value less than 0.05 was defined as statistically significant.
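For reference, the Spearman coefficient is the Pearson correlation computed on ranks, with tied values given their average rank. The study used SPSS; the plain-Python sketch below only illustrates the calculation, and the function names are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation of the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)

rho = spearman_rho([1, 2, 3, 4], [10, 20, 30, 40])
```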

Multiple linear regressions

Multiple linear regressions were performed to assess the predictability of tear film-dynamic metrics to ocular signs that had statistically significant correlations. Multiple linear models were constructed with new metrics as dependent variables and current metrics as independent variables to assess the relative importance of each independent variable and their contribution to the change of dependent variables. The following assumptions were checked: the linear relationship between the independent and dependent variables, normal distribution of residuals, homoscedasticity of residuals and predicted values and absence of multicollinearity between independent variables.

Diagnostic ability and validation of new metrics

Each new metric was validated by means of receiver operating characteristics (ROC) curves. The probability density functions for an altered (LLT = 1) or normal (LLT ≥ 2) LLT were calculated [3], and different parameters were obtained for each ROC curve: sensitivity, specificity, area under the ROC curve, the cut-off value that optimizes the diagnosis, Youden index and discriminant power [33].
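The choice of an optimal cut-off can be sketched as below. The sketch assumes that the optimal cut-off is the one maximizing the Youden index (sensitivity + specificity − 1) and that discriminant power follows the usual definition, (√3/π)·[ln(Se/(1 − Se)) + ln(Sp/(1 − Sp))]; neither formula is given in the text. Values at or below the cut-off are called altered, which suits metrics that increase with LLT; the direction would be reversed for kurtosis and skewness.

```python
import math

def roc_point(values, is_altered, cutoff):
    """Sensitivity and specificity when values <= cutoff are called altered."""
    pairs = list(zip(values, is_altered))
    tp = sum(1 for v, a in pairs if a and v <= cutoff)
    fn = sum(1 for v, a in pairs if a and v > cutoff)
    tn = sum(1 for v, a in pairs if not a and v > cutoff)
    fp = sum(1 for v, a in pairs if not a and v <= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def best_cutoff(values, is_altered):
    """Cut-off with the highest Youden index (first one wins on ties)."""
    best = None
    for cutoff in sorted(set(values)):
        sens, spec = roc_point(values, is_altered, cutoff)
        youden = sens + spec - 1
        if best is None or youden > best[1]:
            best = (cutoff, youden, sens, spec)
    return best

def discriminant_power(sens, spec):
    """Undefined when sensitivity or specificity equals 0 or 1."""
    return (math.sqrt(3) / math.pi) * (
        math.log(sens / (1 - sens)) + math.log(spec / (1 - spec)))

# Hypothetical metric values; True marks grade 1 (altered) eyes.
values = [10, 20, 30, 40, 50, 60]
altered = [True, True, True, False, False, False]
cutoff, youden, sens, spec = best_cutoff(values, altered)
```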

Finally, each Placido disk image was objectively classified into LLT groups depending on the cut-off values obtained in the ROC curves. The level of agreement between this objective classification and the subjective one was analysed by calculating the accuracy, Kappa index, weighted Kappa index with quadratic weights and F-measure for each metric, as in previous studies [34,35,36,37]. These indexes denote a high level of agreement between tests when their values are near 1 [34,35,36,37].
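The agreement indexes can be sketched as follows, treating the subjective grade as the reference. The weighted Kappa uses quadratic disagreement weights, (i − j)²/(k − 1)², as stated above. How the F-measure was aggregated across grades is not described, so the sketch computes it for a single target grade, which is an assumption.

```python
def accuracy(subjective, objective):
    """Proportion of eyes given the same grade by both methods."""
    hits = sum(1 for s, o in zip(subjective, objective) if s == o)
    return hits / len(subjective)

def quadratic_weighted_kappa(subjective, objective, categories):
    """Weighted Kappa with quadratic disagreement weights."""
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(subjective)
    observed = [[0] * k for _ in range(k)]
    for s, o in zip(subjective, objective):
        observed[index[s]][index[o]] += 1
    row = [sum(r) for r in observed]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    num = den = 0.0
    for i in range(k):
        for j in range(k):
            w = (i - j) ** 2 / (k - 1) ** 2
            num += w * observed[i][j]
            den += w * row[i] * col[j] / n  # expected count by chance
    return 1 - num / den

def f_measure(subjective, objective, target):
    """Harmonic mean of precision and recall for one target grade."""
    pairs = list(zip(subjective, objective))
    tp = sum(1 for s, o in pairs if s == target and o == target)
    fp = sum(1 for s, o in pairs if s != target and o == target)
    fn = sum(1 for s, o in pairs if s == target and o != target)
    return 2 * tp / (2 * tp + fp + fn)

# Hypothetical grades for six eyes.
subj = [1, 1, 2, 2, 3, 3]
obj = [1, 2, 2, 3, 3, 3]
kappa = quadratic_weighted_kappa(subj, obj, categories=[1, 2, 3])
```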

Results

The described algorithm was applied to ninety-four eyes from 94 volunteers, 54 females (57.4%) and 40 males (42.6%). The mean age was 43.8 ± 26.8 years, ranging from 18 to 90 years. The algorithm was able to obtain objective metrics in all subjects.

Placido disk reflectivity metrics over time

Table 1 shows the mean values and SD for each Placido disk reflectivity metric at 0.33, 5.33, 10.33, 15.33 and 20.33 s after blinking. Repeated mixed model ANOVA showed statistically significantly higher pixel intensity values at 10.33, 15.33 and 20.33 s than at 0.33 s. Nevertheless, the CoV revealed a low variability of the metrics over time. Thus, the pixel intensity of the Placido disk was stable within the same subject throughout the measuring period. The CoV between time points after blinking ranged from 4.42 to 16.92%. Total area under the pixel intensity curve, mean pixel intensity, SD of pixel intensity, median pixel intensity and skewness had a CoV < 10%, which indicates that these metrics changed little after blinking.

Table 1 Mean values for each Placido disk reflectivity metric

Repeatability of Placido disk reflectivity metrics

Table 2 shows the repeatability scores for each metric. All metrics showed acceptable repeatability since S_w, CoR and CoV values were low, and the variability between the three measurements was not high. S_w achieved values between 2 × 10⁻⁶ and 7.07, CoR between 6 × 10⁻⁶ and 19.59 and CoV between 0.09 and 5.15.

Table 2 Repeatability of each Placido disk reflectivity metric

Correlations between new metrics and DED signs and symptoms

Following the results of the previous sections, which showed low variation of the metrics over time, only the metrics at 0.33, 5.33 and 10.33 s after blinking were further assessed. Metrics at 15.33 and 20.33 s were excluded from further analysis because most patients need to forcefully suppress blinking to reach these times and, thus, in most cases they do not represent a real scenario.

The statistically significant Spearman correlations between each Placido disk reflectivity metric and DED signs and symptoms are shown in Table 3. Generally, there were moderate negative correlations between the new metrics based on the grey intensity of pixels of Placido disk images and age, meibomian glands drop-out percentage, bulbar redness, TMH and OSDI. Meanwhile, Placido disk reflectivity metrics were positively correlated with LLT and NIKBUT. The correlation with LLT was the strongest. Given that LLT was statistically correlated with age (r = − 0.298, p = 0.002), glands drop-out (r = − 0.271, p = 0.004), mean first NIKBUT (r = − 0.209, p = 0.008), median first NIKBUT, mean mean NIKBUT and median mean NIKBUT, it might be possible that the correlation of the new metrics with the other ocular surface metrics was a consequence of the correlation with LLT. Nevertheless, LLT was not statistically correlated with bulbar redness, TMH or OSDI.

Table 3 Statistically significant Spearman's rho correlations between Placido disk reflectivity metrics and DED signs and symptoms

Entropy was the only metric which was not correlated with LLT. Likewise, the new metrics were not correlated with meibomian glands expressibility or DEQ-5 score. The metrics measured at 5.33 and 10.33 s can be considered the best to describe the LLT since they revealed the strongest correlations.

Differences between groups

The new metrics were analysed according to age and the different ocular surface parameters. Table 4 shows the statistically significant differences in Placido disk reflectivity metrics between classification groups. These outcomes were in accordance with the correlations. Statistically significantly higher pixel intensity values were found in younger subjects and in subjects with lower glands drop-out, higher NIKBUT, lower TMH and thicker LLT. However, no statistical differences were found between grade 3 (wave) and 4 (colour fringe) interference patterns in the assessment of LLT (p > 0.005).

Table 4 Statistically significant differences in Placido disk reflectivity metrics for each ocular surface parameter

Multiple linear regressions

Since the metrics at 5.33 s after blinking proved able to differentiate between grades 1 (open meshwork), 2 (closed meshwork) and 3 (wave) of the LLT, only the metrics at 5.33 s after blinking are assessed in this section of the manuscript.

Multiple linear regressions (Table 5) were performed to show which current metrics were associated with the new metrics, so that interactions between current metrics would not mislead the results. Multiple linear regressions showed that the new metrics were statistically significantly associated with LLT, which explained between 7.1 and 47.0% of the variability depending on the metric. Kurtosis and skewness showed a weak association with gland drop-out percentage instead of with LLT. Energy also appeared to be associated with the first median NIKBUT together with LLT. No association was found with the remaining variables. Generally, these results suggest that the main predictor of the new metrics was LLT.

Table 5 Multiple linear regressions for new metrics at 5.33 s where the independent variables included were gland drop-out percentage, bulbar redness, lipid layer thickness, tear meniscus height, first and mean NIKBUT, gland expressibility, OSDI and DEQ-5

Diagnostic capability and validation of the new metrics

Table 6 summarizes the diagnostic power and the cut-off values for each new metric when grade 1 LLT was compared with the other grades. The newly developed metrics were powerful indicators to detect subjects with an altered lipid layer (grade 1, open meshwork) since the area under the curve, sensitivity and specificity obtained were high. Mean pixel intensity, median pixel intensity and relative energy were the metrics with the highest sensitivity, specificity, area under the curve, Youden index, discriminant power, accuracy, Kappa index and F-measure.

Table 6 ROC curve parameters of newly developed metrics to differentiate grade 1 LLT from other grades at 5.33 s

Tables 7 and 8 show the diagnostic power of each new metric to differentiate between grades 1 and 2, and between grades 2 and 3, respectively. This step allowed finding the cut-off values for each new metric to objectively classify the lipid layer into different grades. The cut-off value which optimizes the diagnosis determines the best score to diagnose the disease. Thus, a subject with a higher score than the cut-off value in kurtosis and skewness was classified into the thinner LLT group, while a subject with a higher score than the cut-off value in the rest of the newly developed metrics was classified into the thicker LLT group. The SD of pixel intensity had a low specificity to distinguish between grades 1 and 2, which could lead to the lipid layer being misclassified.
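The classification rule described above can be sketched as a pair of threshold comparisons. The cut-off values in the example are hypothetical placeholders, not the values of Tables 7 and 8, and the function name is introduced here for illustration. Because the metrics could not separate grades 3 and 4, the highest class returned stands for grade 3 or above.

```python
def classify_llt(value, cutoff_1_2, cutoff_2_3, higher_means_thicker=True):
    """Assign an LLT grade (1, 2, or 3 and above) from two cut-offs.

    higher_means_thicker should be False for kurtosis and skewness,
    where a score above the cut-off indicates the thinner group.
    """
    if higher_means_thicker:
        if value > cutoff_2_3:
            return 3
        if value > cutoff_1_2:
            return 2
        return 1
    if value > cutoff_1_2:
        return 1
    if value > cutoff_2_3:
        return 2
    return 3

# Hypothetical cut-offs for a metric that increases with LLT.
grades = [classify_llt(v, cutoff_1_2=40, cutoff_2_3=80) for v in (30, 50, 90)]
print(grades)
```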

Table 7 ROC curve parameters of newly developed metrics to differentiate between grades 1 and 2 LLT at 5.33 s

Table 8 ROC curve parameters of newly developed metrics to differentiate between grades 2 and 3 LLT at 5.33 s

Once the cut-off values were calculated, the lipid layer was objectively classified. The level of agreement between the newly developed objective and existing subjective classifications was evaluated (Table 9). Since different LLT grades were evaluated, the weighted Kappa index was calculated [37]. Mean pixel intensity, median pixel intensity and relative energy were the metrics with the highest area under the curve, the best relationship between sensitivity and specificity and the highest agreement between objective and subjective methods for LLT classification.

Table 9 Agreement between the subjective and objective classification of LLT for each parameter at 5.33 s

Discussion

The assessment of LLT plays an essential role in DED and MGD because of the relevance of the lipid layer in the TF [1, 4]. Existing tests lack objectivity or precision, are time-consuming or are inaccessible to most clinicians because they require an interferometer [8, 10, 11, 13,14,15,16,17,18,19]. The present article introduces a new self-developed technique for the non-invasive objective evaluation of the LLT which can be implemented in any Placido disk topographer.

The present work has tested the validity and applicability of new metrics calculated from the grey level intensity values of the Placido disk pattern reflected onto the TF. Alonso-Caneiro et al. [22] performed a similar study, in which they used texture analysis of videokeratoscopy images and reported that the proposed technique offered clinical utility in the diagnosis of DED (area under the curve from 0.77 to 0.82, sensitivity of 0.9 and specificity of 0.6). However, the authors did not explain why this could be a predictor of DED since they did not study the correlations of the metric with ocular surface parameters. Therefore, they did not show which parameter of the TF they were measuring.

The present work makes three important contributions: (1) the development of a new method to assess LLT in an unbiased, objective, quick and non-invasive way; (2) the possibility of assessing the lipid layer without the need of an interferometer, making the method widely accessible; (3) the validation of the new technique through the study of its repeatability, diagnostic capability and correlations with ocular surface parameters.

Correlations between Placido disk reflectivity metrics and ocular surface parameters

Moderate, statistically significant positive correlations were found between grey level intensities of the Placido disk pattern and LLT and NIKBUT. The correlations between the newly developed metrics and age, meibomian glands drop-out, bulbar redness, TMH and OSDI (Table 3) might be a consequence of their correlation with LLT since LLT is also correlated with age, meibomian glands drop-out and NIKBUT [38,39,40,41,42].

Despite the above, in the present study, LLT revealed no correlation with bulbar redness, TMH and OSDI. Finis et al. [41] likewise found no significant correlation between DED symptoms and LLT, although this was not in accordance with other reports [39, 40, 43, 44]. The new metrics, though less strongly correlated with bulbar redness, TMH and OSDI than with LLT, could still be used to assess these ocular surface parameters.

Entropy measures the randomness of a grey level distribution [22] and as a result might change as the TF becomes thinner and the Placido disk pattern becomes more unstructured [22]. This metric was not correlated with LLT, although it revealed a significant correlation with glands drop-out, bulbar redness, TMH, NIKBUT and OSDI, and thus, it might be used to predict these parameters.

Moreover, although the new metrics were correlated with LLT, no statistically significant correlations were found with meibomian glands expressibility, even though previous research did find a correlation between these parameters [41].

Differences between groups

When the sample was subjectively divided into 4 different LLT groups, using grade scales of interference patterns, statistically significant differences in the new metrics were found between them (Table 4). The measurements at 5.33 s after blinking were the best to differentiate among the different LLT grades since metrics were able to distinguish between grades 1 and 2 and grades 2 and 3. Nonetheless, the algorithm could not differentiate between grades 3 (wave) and 4 (colour fringe pattern). This could be due to the fact that grade 4 differs from grade 3 in that 4 is the only grade, in the interference scale, to imply a coloured pattern, which cannot be detected using grey level values. Hence, as already reported by other authors [8], it would be necessary to incorporate a colour analysis to differentiate between grades 3 and 4.

Nevertheless, since the TFOS DEWS II diagnostic report reported that a subject is classified as having DED when the LLT has a grade of 1, differentiating between grade 3 and 4 has a low clinical utility. Additionally, thinner patterns are more difficult to characterize by an examiner [3, 14].

In addition to being capable of differentiating between LLT grades, the metrics at 5.33 s after blinking are obtained under more realistic conditions than those at later times, as subjects are not required to forcefully suppress blinking. Moreover, metrics at 0.33 s might not have achieved a performance similar to that at 5.33 s in assessing LLT since, at 0.33 s after blinking, the lipid layer might not have stabilized yet.

Placido disk reflectivity metrics over time

Repeated mixed model ANOVA showed statistically significantly higher pixel intensity values at 10.33, 15.33 and 20.33 s than at 0.33 s (Table 1). This might be due to the fact that the sample size decreased as the time after blinking increased: only subjects with larger NIKBUT values were able to keep the eye open for 20.33 s. This may be behind the observed differences, as LLT and NIKBUT were positively correlated with pixel intensity.

Nevertheless, although ANOVA revealed differences in the metrics between time points when all subjects were analysed together, the CoV, which evaluated the variability in each subject individually, revealed a low variability of the metrics over time.

Repeatability of each Placido disk reflectivity metric

The present method has the limitation that it is semiautomatic, since the centre of the Placido disk pattern and the ROI must be selected manually by the examiner. In spite of this, the repeatability was acceptable for all metrics (Table 2) and the analysis can be carried out in less than 10 s. It has been previously reported that this time is considered appropriate for a clinical test [45].

Multiple linear regressions

As the correlations showed, LLT was the clinical parameter most strongly correlated with the new metrics. Nevertheless, other parameters were also correlated. This could introduce bias, since different metrics can confound the results and affect the classification of LLT. Therefore, multiple linear regression analysis was performed to show which current metrics are independently associated with the new metrics. The results showed that, for most metrics, LLT was the only associated parameter. This suggests that the new metrics are predictors of LLT and can be used to objectively assess it. Nevertheless, kurtosis and skewness were associated with gland drop-out, and energy was associated with LLT together with NIKBUT.

Diagnostic capability and validation of the new metrics

ROC curves were calculated to analyse the diagnostic ability of the new metrics. It has been previously reported that a 70% level of sensitivity and specificity is acceptable for the diagnosis of a disease [6]. Sensitivity and specificity were higher than 0.7 for most of the developed new metrics.

According to the classification in previous reports [46], the newly developed metrics showed areas under the curve between acceptable (0.74) and outstanding (0.91) discrimination. Thus, the new metrics can be considered powerful aids to objectively assess the lipid layer.

It has been reported that accuracy, F-measure and kappa index denote good agreement between tests when they are close to 1 [33,34,35,36,37]. Generally, the agreement between the new metrics and subjective classification methods of LLT showed an accuracy between 0.63 and 0.77, an F-measure between 0.78 and 0.87 and a Kappa index between 0.61 and 0.77 (very good agreement) (Table 9).

Mean pixel intensity, median pixel intensity and relative energy at 5.33 s after blinking were the metrics with the highest diagnostic capability in terms of sensitivity, specificity, area under the curve, Youden index and discriminant power (Table 6) and the metrics with the highest agreement with the subjective grading in terms of accuracy, Kappa index and F-measure (Table 9).

In comparison with previous studies on the analysis of interference patterns [8, 10,11,12,13,14,15,16,17,18], the new metrics showed slightly lower diagnostic ability and agreement with the subjective classification of LLT. Nevertheless, this method adds the possibility of objectively assessing the LLT without the need of an interferometer, which might broaden the assessment of the lipid layer in clinical practice.

This study has some limitations. First, statistically significant correlations between the new metrics and age were found, so age might act as a confounding factor. As in previous studies, age could not be excluded from the analysis because of its strong association with DED and MGD [39, 47]. Furthermore, the surrounding illumination and the focussing of the Placido disk pattern should be carefully controlled. In addition, LLT was not measured objectively. However, it was measured subjectively with a validated grading scale, which suggests that the present method is able to objectify the subjective measurement of this grading scale. It has been reported that this subjective grading scale is correlated with LLT [3, 4, 6, 7]. Therefore, these issues are not expected to affect the results significantly. Future studies could assess how well the new metrics predict objectively measured LLT. Finally, the method only measures the grey intensity values of the Placido disk pattern within the pupil. Nevertheless, this is not expected to influence the outcomes, since all the metrics were designed to be pupil-independent. Moreover, the present study has demonstrated that analysing the pixels within the pupil area is sufficient to assess LLT.
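To make the idea of restricting the analysis to pixels within the pupil concrete, the following Python sketch computes the mean and median grey level inside a circular region of interest. It is only an illustration under stated assumptions: the circular mask, the function name and the tiny image are hypothetical and do not reproduce the algorithm used in the study.

```python
from statistics import mean, median


def roi_intensity(image, cx, cy, radius):
    """Mean and median grey level of the pixels inside a circular ROI.

    `image` is a list of rows of grey values; (cx, cy) is the assumed pupil
    centre in pixel coordinates and `radius` the assumed pupil radius.
    """
    values = [
        image[y][x]
        for y in range(len(image))
        for x in range(len(image[0]))
        if (x - cx) ** 2 + (y - cy) ** 2 <= radius ** 2
    ]
    return mean(values), median(values)


# Invented 5 x 5 image: only the five pixels of the central cross are non-zero.
toy_image = [
    [0, 0, 0, 0, 0],
    [0, 0, 20, 0, 0],
    [0, 10, 30, 50, 0],
    [0, 0, 40, 0, 0],
    [0, 0, 0, 0, 0],
]

print(roi_intensity(toy_image, cx=2, cy=2, radius=1))  # (30, 30)
```

Pixels outside the mask do not contribute, so the summary values depend only on the grey levels inside the chosen region.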

Conclusions

Overall, the analysis of grey level intensity values in videokeratography is able to assess TF behaviour. Grey level intensity can be used as an alternative biomarker to objectively grade LLT. The method proved to be quick, objective, non-invasive and repeatable, with acceptable sensitivity and specificity. Therefore, it could be easily included in a battery of tests to improve the detection and monitoring of DED and MGD in clinical practice.

Further research is needed to assess the performance of these metrics in subjects diagnosed with DED or MGD. Likewise, the software could be further developed to be fully automatic and to distinguish between grades 3 and 4 of LLT. Although these outcomes are preliminary, they are highly encouraging. This study could be the basis for future work that attempts to assess LLT objectively without the need for an interferometer.

Abbreviations

CI: Confidence interval
CoR: Coefficient of repeatability
CoV: Coefficient of variation
DED: Dry eye disease
DEQ-5: Dry Eye Questionnaire-5
LLT: Lipid layer thickness
MGD: Meibomian gland dysfunction
NIKBUT: Non-invasive keratograph break-up time
OSDI: Ocular Surface Disease Index
ROC: Receiver operating characteristics
ROI: Region of interest
SD: Standard deviation
S_w: Within-subject standard deviation
TF: Tear film
TFOS DEWS II: Tear Film and Ocular Surface Dry Eye Workshop II
TMH: Tear meniscus height

References

1. Willcox MDP, Argüeso P, Georgiev GA et al (2017) TFOS DEWS II tear film report. Ocul Surf 15:366–403. https://doi.org/10.1016/j.jtos.2017.03.006
2. Ewen King-Smith P, Reuter KS, Braun RJ et al (2013) Tear film breakup and structure studied by simultaneous video recording of fluorescence and tear film lipid layer images. Investig Ophthalmol Vis Sci 54:4900–4909. https://doi.org/10.1167/iovs.13-11878
3. Wolffsohn JS, Arita R, Chalmers R et al (2017) TFOS DEWS II Diagnostic Methodology report. Ocul Surf 15:539–574. https://doi.org/10.1016/j.jtos.2017.05.001
4. Guillon J-P (1998) Non-invasive tearscope plus routine for contact lens fitting. Contact Lens Anterior Eye 21:S31–S40. https://doi.org/10.1016/S1367-0484(98)80035-0
5. Arita R, Itoh K, Maeda S et al (2009) Proposed diagnostic criteria for obstructive meibomian gland dysfunction. Ophthalmology 116:2058-2063.e1. https://doi.org/10.1016/j.ophtha.2009.04.037
6. Tomlinson A, Bron AJ, Korb DR et al (2011) The international workshop on meibomian gland dysfunction: report of the diagnosis subcommittee. Investig Ophthalmol Vis Sci 52:2006–2049. https://doi.org/10.1167/iovs.10-6997f
7. Nichols JJ, Berntsen DA, Mitchell GL, Nichols KK (2005) An assessment of grading scales for meibography images. Cornea 24:382–388. https://doi.org/10.1097/01.ico.0000148291.38076.59
8. Remeseiro B, Penas M, Barreira N et al (2013) Automatic classification of the interferential tear film lipid layer using colour texture analysis. Comput Methods Programs Biomed 111:93–103. https://doi.org/10.1016/j.cmpb.2013.04.007
9. Markoulli M, Duong TB, Lin M, Papas E (2018) Imaging the tear film: a comparison between the Subjective Keeler Tearscope-Plus™ and the Objective Oculus® Keratograph 5M and LipiView® interferometer. Curr Eye Res 43:155–162. https://doi.org/10.1080/02713683.2017.1393092
10. Remeseiro B, Penedo MG, García-Resúa C et al (2014) Dry eye characterization by analyzing tear film images. In: Ng EYK, Acharya-Rajendra U, Rangayyan RM, Suri JS (eds) Ophthalmological imaging and applications, 1st edn. New York, pp 449–475
11. Remeseiro B, Oliver KM, Tomlinson A et al (2015) Automatic grading system for human tear films. Pattern Anal Appl 18:677–694. https://doi.org/10.1007/s10044-014-0402-x
12. da Cruz LB, Souza JC, de Sousa JA et al (2020) Interferometer eye image classification for dry eye categorization using phylogenetic diversity indexes for texture analysis. Comput Methods Programs Biomed 188. https://doi.org/10.1016/j.cmpb.2019.105269
13. Hwang H, Jeon H-J, Yow KC et al (2017) Image-based quantitative analysis of tear film lipid layer thickness for meibomian gland evaluation. Biomed Eng Online 16. https://doi.org/10.1186/s12938-017-0426-8
14. García-Resúa C, Fernández MJG, González Penedo MF et al (2013) New software application for clarifying tear film lipid layer patterns. Cornea 32:538–546. https://doi.org/10.1097/ICO.0b013e31824d0d04
15. Peteiro-Barral D, Remeseiro B, Méndez R, Penedo MG (2017) Evaluation of an automatic dry eye test using MCDM methods and rank correlation. Med Biol Eng Comput 55:527–536. https://doi.org/10.1007/s11517-016-1534-5
16. Wu D, Boyer KL, Nichols JJ, King-Smith PE (2010) Texture based prelens tear film segmentation in interferometry images. Mach Vis Appl 21:253–259. https://doi.org/10.1007/s00138-008-0155-x
17. Ramos L, Penas M, Remeseiro B et al (2011) Texture and color analysis for the automatic classification of the eye lipid layer. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics) 6692 LNCS:66–73
18. Bai Y, Nichols JJ (2019) In vivo thickness measurement of the lipid layer and the overall tear film by interferometry. Opt Lett 44:2410–2413. https://doi.org/10.1364/OL.44.002410
19. Lu H, Wang MR, Wang J, Shen M (2014) Tear film measurement by optical reflectometry technique. J Biomed Opt 19. https://doi.org/10.1117/1.JBO.19.2.027001
20. Bai Y, Ngo W, Nichols JJ (2019) Characterization of the thickness of the tear film lipid layer using high resolution microscopy. Ocul Surf 17:356–359. https://doi.org/10.1016/j.jtos.2018.12.003
21. Bai Y, Nichols JJ (2017) Advances in thickness measurements and dynamic visualization of the tear film using non-invasive optical approaches. Prog Retin Eye Res 58:28–44. https://doi.org/10.1016/j.preteyeres.2017.02.002
22. Alonso-Caneiro D, Szczesna-Iskander DH, Iskander DR et al (2013) Application of texture analysis in tear film surface assessment based on videokeratoscopy. J Optom 6:185–193. https://doi.org/10.1016/j.optom.2013.07.006
23. Wu S, Hong J, Tian L et al (2015) Assessment of bulbar redness with a newly developed keratograph. Optom Vis Sci 92:892–899. https://doi.org/10.1097/OPX.0000000000000643
24. Xie W, Zhang X, Xu Y, Yao Y-F (2018) Assessment of tear film and bulbar redness by keratograph 5M in pediatric patients after orthokeratology. Eye Contact Lens 44:S382–S386. https://doi.org/10.1097/ICL.0000000000000501
25. Guillon J-P (1998) Use of the tearscope plus and attachments in the routine examination of the marginal dry eye contact lens patient. Adv Exp Med Biol 438:859–867
26. Bron AJ, Benjamin L, Snibson GR (1991) Meibomian gland disease. Classification and grading of lid changes. Eye 5:395–411. https://doi.org/10.1038/eye.1991.65
27. Shimazaki J, Sakata M, Tsubota K (1995) Ocular surface changes and discomfort in patients with meibomian gland dysfunction. Arch Ophthalmol 113:1266–1270. https://doi.org/10.1001/archopht.1995.01100100054027
28. Pult H, Nichols JJ (2012) A review of meibography. Optom Vis Sci 89:E760-769
29. Esmaeili M, Dehnavi A, Rabbani H, Hajizadeh F (2016) Three-dimensional segmentation of retinal cysts from spectral-domain optical coherence tomography images by the use of three-dimensional curvelet based K-SVD. J Med Signals Sens 6:166–171
30. Martínez-Albert N, Esteve-Taboada JJ, Montés-Micó R et al (2019) Repeatability assessment of biometric measurements with different refractive states and age using a swept-source biometer. Expert Rev Med Devices 16:63–69. https://doi.org/10.1080/17434440.2019.1557517
31. McAlinden C, Khadka J, Pesudovs K (2015) Precision (repeatability and reproducibility) studies and sample-size calculation. J Cataract Refract Surg 41:2598–2604. https://doi.org/10.1016/j.jcrs.2015.06.029
32. Bland JM, Altman DG (2010) Statistical methods for assessing agreement between two methods of clinical measurement. Int J Nurs Stud 47:931–936. https://doi.org/10.1016/j.ijnurstu.2009.10.001
33. Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. In: AAAI Workshop - technical report. pp 24–29
34. Viera AJ, Garrett JM (2005) Understanding interobserver agreement: the kappa statistic. Fam Med 37:360–363
35. Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174. https://doi.org/10.2307/2529310
36. Rosenfield GH, Fitzpatrick-Lins K (1986) A coefficient of agreement as a measure of thematic classification accuracy. Photogramm Eng Remote Sens 52:223–227
37. Cohen J (1968) Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychol Bull 70:213–220. https://doi.org/10.1037/h0026256
38. Eom Y, Lee J-S, Kang S-Y et al (2013) Correlation between quantitative measurements of tear film lipid layer thickness and meibomian gland loss in patients with obstructive meibomian gland dysfunction and normal controls. Am J Ophthalmol 155:1104-1110.e2. https://doi.org/10.1016/j.ajo.2013.01.008
39. Pult H, Riede-Pult BH, Nichols JJ (2012) Relation between upper and lower lids’ meibomian gland morphology, tear film, and dry eye. Optom Vis Sci 89:E310–E315. https://doi.org/10.1097/OPX.0b013e318244e487
40. Hosaka E, Kawamorita T, Ogasawara Y et al (2011) Interferometry in the evaluation of precorneal tear film thickness in dry eye. Am J Ophthalmol 151:18-23.e1. https://doi.org/10.1016/j.ajo.2010.07.019
41. Finis D, Pischel N, Schrader S, Geerling G (2013) Evaluation of lipid layer thickness measurement of the tear film as a diagnostic tool for Meibomian gland dysfunction. Cornea 32:1549–1553. https://doi.org/10.1097/ICO.0b013e3182a7f3e1
42. Bron AJ, Tiffany JM (2004) The contribution of meibomian disease to dry eye. Ocul Surf 2:149–164. https://doi.org/10.1016/S1542-0124(12)70150-7
43. Best N, Drury L, Wolffsohn JS (2012) Clinical evaluation of the oculus keratograph. Contact Lens Anterior Eye 35:171–174. https://doi.org/10.1016/j.clae.2012.04.002
44. Foulks GN (2007) The correlation between the tear film lipid layer and dry eye disease. Surv Ophthalmol 52:369–374. https://doi.org/10.1016/j.survophthal.2007.04.009
45. Remeseiro B, Bolon-Canedo V, Peteiro-Barral D et al (2014) A methodology for improving tear film lipid layer classification. IEEE J Biomed Heal Informatics 18:1485–1493. https://doi.org/10.1109/JBHI.2013.2294732
46. Hosmer DW, Lemeshow S, Sturdivant RX (2013) Applied logistic regression. Hoboken, New Jersey
47. Rico-del-Viejo L, Benítez-del-Castillo JM, Gómez-Sanz FJ et al (2019) The influence of meibomian gland loss on ocular surface clinical parameters. Contact Lens Anterior Eye 42:562–568. https://doi.org/10.1016/j.clae.2019.04.004

Funding

Open access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This work was supported by the University of Valencia (“Atracció de Talent” scholarship, UV-INV-PREDOC18F2-886420) awarded to José Vicente García-Marqués; and the Ministerio de Educación, Cultura y Deporte (“Formación de Profesorado Universitario” scholarship, FPU17/03665) awarded to Cristian Talens-Estarelles.

Author information

Authors and Affiliations

Department of Optics and Optometry and Vision Sciences, University of Valencia, C/Dr Moliner, 50, 46100, Burjassot, Valencia, Spain
José Vicente García-Marqués, Cristian Talens-Estarelles, Santiago García-Lázaro & Alejandro Cerviño

Corresponding author

Correspondence to Alejandro Cerviño.

Ethics declarations

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the University of Valencia and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Disclaimer

The funding sources had no role in the design of the study.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

García-Marqués, J.V., Talens-Estarelles, C., García-Lázaro, S. et al. Validation of a new objective method to assess lipid layer thickness without the need of an interferometer. Graefes Arch Clin Exp Ophthalmol 260, 655–676 (2022). https://doi.org/10.1007/s00417-021-05378-8

Received: 26 May 2021
Revised: 04 August 2021
Accepted: 09 August 2021
Published: 06 September 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s00417-021-05378-8
