Introduction

The lipid layer is the outermost layer of the tear film (TF) and is almost entirely derived from meibum, which is secreted by the meibomian glands. The lipid layer plays a vital role in the stabilization of the TF. It also spreads the whole TF over the ocular surface, lowers the surface tension at the air interface of the TF and prevents the aqueous layer from evaporating [1, 2].

Given the key role of the lipid layer in maintaining the properties of the TF, the assessment of the lipid layer thickness (LLT) is essential in dry eye disease (DED) and Meibomian gland dysfunction (MGD) [3, 4]. One of the most common methods for assessing the lipid layer is the evaluation of the colour and brightness of its interference patterns using an interferometer.

The Tearscope Plus™ is an interferometer developed to assess the LLT [4]. However, this is a subjective technique which requires an experienced clinician to classify the interference patterns. It has been reported that subjective diagnostic tests, such as grading scales, rely on the examiner’s ability, which might decrease inter and intra-observer repeatability [3, 5,6,7]. Likewise, in some cases, the grading of the interference patterns is difficult to perform, especially when dealing with thinner lipid layers [3, 8]. Currently, only the LipiView® system can provide quantitative values of the LLT. However, it has a small area of measurement and it only measures the LLT in blinking conditions [9, 10].

Lately, several studies have tried to solve the aforementioned problems by developing algorithms, based on the analysis of the texture, structure or colour of the interference patterns, which objectively assess the LLT [8, 11,12,13,14,15,16,17,18,19]. Likewise, other authors have used high-resolution microscopy systems to characterize the LLT [20] or have combined optical coherence tomography with interferometry to develop novel imaging systems [19, 21]. Nonetheless, none of these methods has been globally accepted and most of them are considerably time-consuming. Moreover, they require interferometers to be performed, which are too costly and sophisticated to be implemented in the clinic, being more suitable for research purposes.

During corneal topography measurement, the TF acts as a mirror and reflects the projected Placido disk ring pattern. Placido disk rings show lighter than the background. The healthy TF surface forms a well-structured and reflected pattern with good intensity of reflection, while an altered TF produces an irregular pattern with low reflectivity [22]. Accordingly, the primary aim of the present study was to develop and validate a novel method to objectively assess the lipid layer through the analysis of grey intensity values obtained from the Placido disk pattern reflected onto the TF, without the need of an interferometer, thus making the method widely accessible.

The base of the method is that a thicker lipid layer has more lipids [1], which will reflect the light of the Placido disk ring pattern with higher intensity. We hypothesized that high grey intensity values might be related to a thicker lipid layer, while low grey intensity values might be related to a thinner lipid layer. This method was developed following previous research, which shows that the analysis of grey level intensity values of videokeratoscopy images may significantly improve the diagnosis of DED in comparison to other image analysis approaches [22].

Material and methods

Ninety-four healthy volunteers ranging in age from 18 to 90 years (43.8 ± 26.8 years) were enrolled in this study. Only the right eye of participants was assessed to avoid subjects’ data duplication. Subjects had no prior history of ocular disease or injury in the last 3 months. No exclusion based on ocular surface parameters was made to evaluate different TF status. Contact lens users were instructed not to wear their contact lenses within a week before the examination. The work was performed in accordance with the tenets of the Declaration of Helsinki and was approved by the Ethics Committee of the University of Valencia. Written consent of each subject was obtained after a verbal explanation of the study protocol.

Ocular surface measurements

Participants’ ocular surface was evaluated using Oculus Keratograph 5 M (K5 M; Oculus GmbH, Wetzlar, Germany). Measurements were taken by the same experienced researcher following the guidelines of the Tear Film and Ocular Surface Dry Eye Workshop II (TFOS DEWS II) Diagnostic Methodology report [3] and were performed in the following order to avoid TF destabilization: Ocular Surface Disease Index (OSDI), Dry Eye Questionnaire-5 (DEQ-5), total bulbar redness, tear meniscus height (TMH), LLT, non-invasive keratograph break-up time (NIKBUT), meibomian glands expressibility and upper eyelid meibography. The illuminance, temperature and humidity of the room were maintained constant at 200 lx, 24.1 ± 1.6 °C and 44.9 ± 5.0%, respectively.

OSDI and DEQ-5 were used for scoring the ocular surface symptoms of subjects. Bulbar redness was assessed three consecutive times, and an average value was calculated [23], while TMH was obtained by capturing the meniscus immediately post-blink [24].

The LLT was recorded using Oculus Keratograph 5 M and assessed through the lipid layer interference pattern, which was subjectively classified by a masked and experienced examiner into 4 groups using a standardised grading scale [6, 25]: 1 = open meshwork (13–15 nm); 2 = closed meshwork (30–50 nm); 3 = wave (50–80 nm); and 4 = colour fringe (90–140 nm).

The moment of the first break-up of the TF (first NIKBUT) and the average time of all break-ups (mean NIKBUT) were also obtained. A total of three measurements were carried out, one every 3 min so that the TF stabilized between assessments, and the mean and median values of these three measurements were calculated [3].

The expressibility of the central 8 meibomian glands of the upper eyelid was assessed using a subjective grading scale [6, 26, 27]. Upper eyelid meibography was captured using non-contact infrared meibography, and meibomian glands drop-out was objectively calculated using ImageJ tool (Wayne Rasband, National Institutes of Health, Bethesda, MD) as the ratio between gland loss area and eyelid area [28].

Data analysis using the proposed algorithm

Oculus Keratograph 5 M was used to record a video of the NIKBUT measurement at 32 frames per second with a spatial resolution of 680 × 512 pixels. This video was recorded and saved to be later analysed. The proposed software was developed using Matlab R2019a® (MathWorks, Natick, MA). The software automatically decomposed the video into frames with a time interval of 0.031 s between them. The examiner manually selected the frames at 0.33, 5.33, 10.33, 15.33 and 20.33 s after blinking. The frame of 0.33 was selected since the eye was completely open after this time in all videos. Likewise, intervals of 5.00 s from this moment on were chosen to analyse whether the grey intensity values changed over time.

Once the frames of interest were selected, the software automatically processed the images. First, RGB images were transformed into grey-level images. Given that input images contained irrelevant information of external areas, the centre of the Placido disk ring pattern was isolated by the examiner through Matlab. After clicking the centre of the image, the software automatically selected a square of 241 × 241 pixels surrounding the centre of the rings (region of interest, ROI), as the area to perform the image processing.

Next, a band-pass filter was used to eliminate the background illumination and highlight the rings. Furthermore, the images were then smoothed by applying a 4-pixel sigma Gaussian filter to remove the remaining noise from the background [29]. After that, the final ROI was selected by the examiner, who manually selected the region of the image comprising solely the pupil, to avoid the influence of the iris on the results.

Finally, to increase the differences between normal and altered TFs, each pixel value of the resulting image was multiplied by 255 and divided by 85, thus enhancing the contrast between rings and non-ring spaces. These values were selected since they produced the highest possible contrast enhancement.

Once the images were processed, histograms were obtained from their pixel intensity values and metrics were calculated (Fig. 1). Figure 2 shows a summary of the main steps of image processing.

Fig. 1
figure 1

Images and histograms of the main steps of the image processing in a random frame. From left to right: selection of the centre of the image; band-pass filter implementation; Gaussian filter implementation; selection of the final ROI and contrast enhancement (final image). In the histograms, axis “x” represents the grey level intensities (0–255), while axis “y” shows the number of pixels

Fig. 2
figure 2

The main steps of the image processing

The base of our method is that a thicker lipid layer has more lipids [1], which will reflect the Placido disk rings with higher intensity. Thus, higher grey intensity values might be related to a thicker LLT, while lower grey intensity values could be related to a thinner LLT.

Mean, standard deviation (SD), median, mode, kurtosis and skewness of the histogram of the grey level intensity values were calculated. The minimum grey level in the image was also calculated. Besides, energy, relative energy, entropy and SD irregularity were calculated as follows [22]:

$$\mathrm{Energy as}:\frac{\sum p^2}{n}$$
$$\mathrm{Relative energy as}:\frac{\sum (\frac{p}{\mathrm{pmax}})^2}{n}$$
$$\mathrm{Entropy}:\frac{-\sum p.\mathrm{logr}(p)}{n}$$
$$\mathrm{SD irregularity as}:\frac{\sum \left(\frac{p-x}{\mathrm{pmax}}\right)^2}{n}.$$

where p = pixel grey value; n = number of pixels of the ROI; pmax = maximum pixel intensity; and x = mean pixel intensity values.

Metrics were divided by the number of pixels of the ROI (n) so that all images were comparable independently of the size of the ROI. Finally, the total area under the pixel intensity three-dimensional curve of the image was calculated and divided by the number of pixels in the ROI (Fig. 3).

Fig. 3
figure 3

Three-dimensional graphic of the grey intensity values in the image. The “x” and “y” axes represent the size of the image and “z” the grey intensity value for each pixel

Statistical analysis

Statistical analysis was carried out using SPSS v26.0 for Windows (IBM Corp, Armonk, NY, USA). Outcomes were shown as the mean ± SD.

Differences in new metrics depending on time after blinking

Repeated mixed model ANOVA was used to evaluate the differences in pixel intensity values depending on the moment after blinking. Bonferroni was used to assess the post hoc differences between paired moments.

Repeatability of new metrics

As three NIKBUT videos were recorded, the three videos were analysed so as to calculate the repeatability of the software in the calculation Placido disk’ reflectivity metrics. Repeatability of each Placido disk’ reflectivity metric was assessed by calculating the within-subject SD (Sw), coefficient of variation (CoV) and the repeatability coefficient (CoR) [30,31,32].

Correlations of new metrics with DED signs and symptoms

Rho Spearman correlations were used to analyse the correlations between ocular surface signs and symptoms and new metrics, for the whole sample. Moreover, the sample was divided into different groups according to the cut-off values reported by the Diagnostic Methodology report of TFOS DEWS [3].

Differences in Placido disk’ reflectivity metrics between groups was assessed by means of Mann–Whitney U test or Kruskal–Wallis test. A p-value less than 0.05 was defined as statistically significant.

Multiple linear regressions

Multiple linear regressions were performed to assess the predictability of tear film-dynamic metrics to ocular signs that had statistically significant correlations. Multiple linear models were constructed with new metrics as dependent variables and current metrics as independent variables to assess the relative importance of each independent variable and their contribution to the change of dependent variables. The following assumptions were checked: the linear relationship between the independent and dependent variables, normal distribution of residuals, homoscedasticity of residuals and predicted values and absence of multicollinearity between independent variables.

Diagnostic ability and validation of new metrics

Each new metric was validated by means receiver operating characteristics (ROC) curves. The probability density functions for an altered (LLT = 1) or normal (LLT ≥ 2) LLT were calculated [3], and different parameters were obtained for each ROC curve: sensitivity, specificity, area under the ROC curve, the cut-off value that optimizes the diagnosis, Youden index and discriminant power [33].

Finally, each Placido disk image was objectively classified into LLT groups depending on the cut-off values obtained in the ROC curves. The level of agreement between this objective classification and the subjective ones was analysed by calculating the accuracy, Kappa index, weighted Kappa index with quadratic weights and F-measure for each metric as in previous studies [34,35,36,37]. The three indexes denote high level of agreement between tests when the values are near 1 [34,35,36,37].

Results

The described algorithm was applied to ninety-four eyes from 94 volunteers, 54 females (57.4%) and 40 males (42.6%). The mean age was 43.8 ± 26.8 years, ranging from 18 to 90 years. The algorithm was able to obtain objective metrics in all subjects.

Placido disk reflectivity metrics over time

Table 1 shows the mean values and SD for each Placido disk reflectivity metric at 0.33, 5.33, 10.33, 15.33 and 20.33 s after blinking. Repeated mixed model ANOVA showed statistical higher pixel intensity values at 10.33, 15.33 and 20.33 s than at 0.33 s. Nevertheless, CoV revealed a low variability of metrics over time. Thus, pixel intensity of the Placido disk was stable in the same subject throughout the measuring period. CoV between seconds after blinking achieved values between 4.42 and 16.92%. Total area under pixel intensity curve, mean pixel intensity, SD of pixel intensity, median pixel intensity and skewness had a CoV < 10%, which evidenced that metrics did not change after blinking.

Table 1 Mean values for each Placido disk reflectivity metric

Repeatability of Placido disk reflectivity metrics

Table 2 shows the repeatability scores for each metric. All metrics showed acceptable repeatability since Sw, CoR and CoV values were low, and the variability between the three measurements was not high. Sw achieved values between 2 × 10−6 and 7.07, CoR between 6 × 10−6 and 19.59 and CoV between 0.09 and 5.15.

Table 2 Repeatability of each Placido disk reflectivity metric

Correlations between new metrics and DED signs and symptoms

Following the results of the previous sections, showing no variation of the metrics over time, only the metrics at 0.33, 5.33 and 10.33 s after blinking were further assessed. Metrics at 15.33 and 20.33 s were excluded from further analysis as most patients need to suppress blinking forcefully, and thus, they do not represent in most cases a real scenario.

Spearman’s significant correlations between each Placido disk reflectivity metric and DED signs and symptoms are shown in Table 3. Generally, there were moderate negative correlations between new metrics based on the grey intensity of pixels of Placido disk images and age, meibomian glands drop-out percentage, bulbar redness, TMH and OSDI. Meanwhile, Placido disk reflectivity metrics were positively correlated with LLT and NIKBUT. The correlation with LLT was the strongest. Given that LLT was statistically correlated with age (r =  − 0.298, p = 0.002), glands drop-out (r =  − 0.271, p = 0.004), mean first NIKBUT (r =  − 0.209, p = 0.008), median first NIKBUT, mean mean NIKBUT and median mean NIKBUT, it might be possible that the correlation of new metrics with the other ocular surface metrics was as consequence of the correlation with LLT. Nevertheless, LLT was not statistically correlated with bulbar redness, TMH and OSDI.

Table 3 Statistically significant Rho Spearman correlations between Placido disk reflectivity metrics and DED signs and symptoms

Entropy was the only metric which was not correlated with LLT. Likewise, new metrics were not correlated with meibomian glands expressibility or DEQ-5 score. The metrics measured at 5.33 and 10.33 can be considered the best to describe the LLT since they revealed the strongest correlations.

Differences between groups

The new metrics were analysed according to age and the different ocular surface parameters. Table 4 shows the statistically significant differences in Placido disk reflectivity metrics between classification groups. These outcomes were in accordance with correlations. Statistically higher pixel intensity values were found in young subjects, lower glands drop-out, high NIKBUT, low TMH and thick LLT. However, no statistical differences were found between grade 3 (wave) and 4 (colour fringe) interference patterns in the assessment of LLT (p > 0.005).

Table 4 Statistically significant differences in Placido disk reflectivity metrics for each ocular surface parameter

Multiple linear regressions

Since the metrics at 5.33 s after blinking have proved to differentiate between grades 1 (open meshwork), 2 (closed meshwork) and 3 (wave) of the LLT, only the metrics at 5.33 s after blinking will be assessed in this section of the manuscript.

Multiple linear regressions (Table 5) were performed to show the current metrics that were associated with new metrics, avoiding that the interaction between current metrics mislead results. Multiple linear regressions showed that new metrics were statistically significant associated with LLT, explaining the variability between 7.1 and 47.0% depending on the metric. Kurtosis and skewness showed a weak association with gland drop-out percentage instead of with LLT. Energy also appeared to be associated with the first median NIKBUT together with LLT. No association was found with the remaining variables. Generally, these results suggest that the main predictor factor of new metrics was LLT.

Table 5 Multiple linear regressions for new metrics at 5.33 s where the independent variables included were gland drop-out percentage, bulbar redness, lipid layer thickness, tear meniscus height, first and mean NIKBUT, gland expressibility, OSDI and DEQ-5

Diagnostic capability and validation of the new metrics

Table 6 summarizes the diagnostic power and the cut-off values for each new metric when grade 1 LLT was compared with other grades. New developed metrics were powerful indicators to detect subjects with an altered lipid layer (grade 1 — open meshwork) since the area under the curve, sensitivity and specificity obtained were high. Mean pixel intensity, median pixel intensity and relative energy were the metrics with the highest sensitivity, specificity, area under the curve, Youden index, discriminant power, accuracy, Kappa index and F-measure.

Table 6 ROC curve parameters of newly developed metrics to differentiate grade 1 LLT from other grades at 5.33 s

Tables 7 and 8 show the diagnostic power of each new metric to differentiate between grades 1 and 2, and between grades 2 and 3, respectively. This step allowed finding the cut-off values for each new metric to objectively classify the lipid layer into different grades. The cut-off value which optimizes the diagnosis determines the best score to diagnose the disease. Thus, a subject with a higher score than the cut-off value in kurtosis and skewness was classified into the thinner LLT group, while a subject with a higher score than the cut-off value in the rest of the newly developed metrics was classified into the thicker LLT group. The SD of pixel intensity had a low specificity to distinguish between grades 1 and 2, which could lead to the lipid layer being misclassified.

Table 7 ROC curve parameters of new developed metrics to differentiate between grades 1 and 2 LLT at 5.33 s
Table 8 ROC curve parameters of new developed metrics to differentiate between grade 2 and 3 LLT at 5.33 s

Once the cut-off values were calculated, the lipid layer was objectively classified. The level of agreement between the newly developed objective and existing subjective classifications was evaluated (Table 9). Since different LLT grades were evaluated, the weighted Kappa index was calculated [37]. Mean pixel intensity, median pixel intensity and relative energy were the metrics with the highest area under the curve, best relationship between sensitivity and specificity and higher agreement between objective and subjective methods for LLT classification.

Table 9 Agreement between the subjective and objective classification of LLT for each parameter at 5.33 s

Discussion

The assessment of LLT plays an essential role in DED and MGD because of the relevance of the lipid layer in the TF [1, 4]. Existing tests lack objectivity, preciseness, are time-consuming or are inaccessible for most clinicians due to the need of an interferometer to be performed [8, 10, 11, 13,14,15,16,17,18,19]. The present article introduces a new self-developed technique for the non-invasive objective evaluation of the LLT which can be implemented in any Placido disk topograph.

The present work has tested the validity and applicability of new metrics calculated from the grey level intensity values of the Placido disk pattern reflected onto the TF. Alonso-Caneiro et al. [22] performed a similar study, in which they used texture analysis of videokeratoscopy images and denoted that the proposed technique offered clinical utility in the diagnosis of DED (area under the curve from 0.77 to 0.82, sensitivity of 0.9 and specificity of 0.6). However, the authors did not explain why this could be a predictor of DED since they did not study the correlations of the metric with ocular surface parameters. Therefore, they did not evidence which parameter of the TF they were measuring.

The present work makes three important contributions: (1) the development of a new method to assess LLT in an unbiased, objective, quick and non-invasive way; (2) the possibility of assessing the lipid layer without the need of an interferometer, making the method widely accessible; (3) the validation of the new technique through the study of its repeatability, diagnostic capability and correlations with ocular surface parameters.

Correlations between Placido disk reflectivity metrics and ocular surface parameters

Moderate positive significant correlations were found between grey level intensities of the Placido disk pattern and LLT and NIKBUT. The correlations between new developed metrics and age, meibomian glands drop-out, bulbar redness, TMH and OSDI (Table 3) might be a consequence of their correlation with LLT since LLT is also correlated with age, meibomian glands drop-out and NIKBUT [38,39,40,41,42].

Despite the above, in the present study, LLT revealed no correlation with bulbar redness, TMH and OSDI. Finis et al.[41] neither found a significant correlation between DED symptoms and LLT, although this was not in accordance with others [39, 40, 43, 44]. New metrics, though less strongly correlated with bulbar redness, TMH and OSDI than with LLT, could still be used to assess these ocular surface parameters.

Entropy measures the randomness of a grey level distribution [22] and as a result might change as the TF becomes thinner and the Placido disk pattern becomes more unstructured [22]. This metric was not correlated with LLT, although it revealed a significant correlation with glands drop-out, bulbar redness, TMH, NIKBUT and OSDI, and thus, it might be used to predict these parameters.

Moreover, despite that new metrics were correlated with LLT, no statistically significant correlations were found with meibomian glands expressibility, although previous research did find a correlation between these parameters [41].

Differences between groups

When the sample was subjectively divided into 4 different LLT groups, using grade scales of interference patterns, statistically significant differences in the new metrics were found between them (Table 4). The measurements at 5.33 s after blinking were the best to differentiate among the different LLT grades since metrics were able to distinguish between grades 1 and 2 and grades 2 and 3. Nonetheless, the algorithm could not differentiate between grades 3 (wave) and 4 (colour fringe pattern). This could be due to the fact that grade 4 differs from grade 3 in that 4 is the only grade, in the interference scale, to imply a coloured pattern, which cannot be detected using grey level values. Hence, as already reported by other authors [8], it would be necessary to incorporate a colour analysis to differentiate between grades 3 and 4.

Nevertheless, since the TFOS DEWS II diagnostic report reported that a subject is classified as having DED when the LLT has a grade of 1, differentiating between grade 3 and 4 has a low clinical utility. Additionally, thinner patterns are more difficult to characterize by an examiner [3, 14].

In addition to being capable of differentiating between LLT grades, the metrics at 5.33 s after blinking are performed under more realistic conditions than at later times, as subjects are not required to forcefully suppress blinking. Moreover, metrics at 0.33 s might not have achieved a similar performance than at 5.33 s in assessing LLT since at 0.33 s after blinking, the lipid layer might not have stabilized yet.

Placido disk reflectivity metrics over time

Repeated mixed model ANOVA showed statistical higher pixel intensity values at 10.33, 15.33 and 20.33 s than at 0.33 s (Table 1). This might be due to the fact that the sample size decreased as the seconds after blink increased. Thus, only subjects with larger NIKBUT values were able to maintain the eye opened for 20.33 s. This may be behind the observed differences as LLT and NIKBUT were positively correlated with pixel intensity.

Nevertheless, despite that ANOVA revealed differences in the metrics between periods, when all subjects were analysed together, CoV, which evaluated the variability in each subject individually, revealed a low variability of metrics over time.

Repeatability of each Placido disk reflectivity metric

The present method has the limitation that is semiautomatic since the centre of the Placido disk pattern and the ROI must be selected manually by the examiner. In spite of this, the repeatability was acceptable in all metrics (Table 2) and the analysis can be carried out in less than 10 s. It has been previously reported that this time is considered appropriate for a clinical test [45].

Multiple linear regressions

As correlations showed, LLT was the clinical parameter that was more strongly correlated with new metrics. Nevertheless, other parameters were also correlated. This could be a bias since different metrics can confound results, affecting the classification of LLT. Therefore, multiple linear regression analysis has been performed to show which current metrics are independently associated with new metrics. Results showed that for most metrics, LLT was the only parameter associated. This suggests that new metrics are predictors of LLT and can be used to objectively assess it. Nevertheless, kurtosis and skewness were associated with gland drop-out and energy with LLT together with NIKBUT.

Diagnostic capability and validation of the new metrics

ROC curves were calculated to analyse the diagnostic ability of the new metrics. It has been previously reported that a 70% level of sensitivity and specificity is acceptable for the diagnosis of a disease [6]. Sensitivity and specificity were higher than 0.7 for most of the developed new metrics.

According to the classification on previous reports [46], the newly developed metrics showed areas under the curve between acceptable (0.74) and outstanding (0.91) discrimination. Thus, new metrics can be considered powerful aides to objectively assess the lipid layer.

It has been reported that accuracy, F-measure and kappa index denote good agreement between tests when they are close to 1 [33,34,35,36,37]. Generally, the agreement between new metrics and subjective classification methods of LLT showed an accuracy between 0.63 and 0.77, an F-measure between 0.78 and 0.87 and a Kappa index between 0.61 and 0.77 (very good agreement) (Table 8).

Mean pixel intensity, median pixel intensity and relative energy at 5.33 s after blinking were the metrics with the highest diagnostic capability in terms of sensitivity, specificity, area under the curve, Youden index and discriminant power (Table 5) and the metrics with the highest agreement with the subjective grading in terms of accuracy, Kappa index and F-measure (Table 8).

In comparison with previous studies on the analysis of interference patterns [8, 10,11,12,13,14,15,16,17,18], the new metrics showed slightly lower diagnostic ability and agreement with the subjective classification of LLT. Nevertheless, this method adds the possibility of objectively assessing the LLT without the need of having an interferometer, which might broad the assessment of the lipid layer in clinical practice.

This study had some limitations to consider. First, statistically significant correlations between new metrics and age were found. Consequently, age might act as a possible confounding factor. As in previous studies, age could not be excluded from the analysis because of its strong association with DED and MGD [39, 47]. Furthermore, the surrounding illumination and the focussing of the Placido disk pattern should be carefully controlled. In addition, LLT has not been measured objectively. However, it has been measured subjectively with a validated grading scale, which suggests that the present method is able to objectify the subjective measurement of this grading scale. It has been reported that this subjective grading scale is correlated with LLT [3, 4, 6, 7]. Therefore, these issues are not expected to affect results significantly. Future studies could assess the predictability of LLT measured objectively with the new metrics. Finally, the method only measures the grey intensity values of the Placido disk pattern within the pupil. Nevertheless, this issue is not expected to influence the outcomes since all the metrics have been designed to be pupil-independent. Moreover, the present study has demonstrated that the analysis of the pixels within the pupil area is enough to assess LLT.

Conclusions

Overall, the analysis of grey level intensity values in videokeratography is able to assess TF behaviour. Grey level intensity can be used as an alternative biomarker to objectively grade LLT. It has been demonstrated that the method is quick, objective, non-invasive, repeatable and with acceptable sensitivity and specificity. Therefore, it could be easily included in a battery of tests to improve the detection and monitoring of DED and MGD in clinical practice.

Further research is needed to assess the performance of these metrics in subjects diagnosed with DED or MGD. Likewise, the software could be further developed to be fully automatic and to distinguish between grades 3 and 4 of LLT. Nonetheless, although these outcomes are preliminary, they are highly encouraging. This study could be the base for future works which attempt to assess LLT objectively without the need of an interferometer.