Introduction

Dry eye disease (DED) is a multifactorial disease of the tear film and ocular surface with an estimated prevalence of 11–22% in the US population, predominantly women over 55 years of age [1,2,3,4,5]. With the aging of the world population, it becomes increasingly important that researchers and clinicians strive to understand, diagnose, and treat DED better. In this review, we will discuss methods for studying DED, and how these techniques can inform clinicians on how to better treat the disease. One tool that has added much to our knowledge of this disease is the controlled adverse environment (CAE®) challenge, which is an ocular surface stress test that exacerbates the signs and symptoms of DED in a safe and controllable manner, in much the same way a stress test is used in cardiovascular medicine to safely provoke a response in subjects.

The tear film is an exquisite balance of aqueous, lipid, and mucin components that serves to protect the ocular surface and to create and maintain a transparent refractive surface for optimal visual performance [6]. Hundreds, if not thousands, of tear components protect the eye from infection, promote rapid healing, and provide adequate nutrition to the avascular cornea. Blinking assists in meibomian gland secretion and spreading of the tear film, as well as mixing and promoting outflow by creating negative pressure in the lacrimal sac. Deficiencies in tear constituents may lead to an unstable tear film, a drying of the ocular surface, and visual disturbances caused by optical aberrations [7]. Exposure of the ocular surface and epithelial desquamation due to tear film breakup will lead to inflammatory and neurogenic signals that manifest as signs of keratitis and symptoms of ocular discomfort commonly described as dryness, stinging, burning, foreign body sensation, and pain [8, 9]. Alterations in blink patterns are characteristic of the disease and contribute to an overall diminution of visual function [10, 11]. Visual tasks such as reading, driving, watching television, and using a computer become particularly troublesome for the DED patient, and can greatly compromise quality of life [12,13,14,15,16].

Tear film deficiencies are caused by a variety of factors including aging and cellular oxidation, neuroendocrine signaling, autoimmune reactions to the lacrimal and/or accessory glands, and inflammation of goblet cells or meibomian glands. Regardless of the underlying cause, DED is associated with chronic inflammation of the ocular surface [2, 17,18,19,20,21], and is exacerbated by harsh environmental conditions. The multifactorial pathophysiology of DED creates many potential therapeutic targets for drug candidates, with a gamut of activities including anti-inflammatories, immunomodulators, secretagogues, anti-evaporatives, receptor agonists and antagonists, wound-healing promoters, and hormonal and nutritional supplements. A brief summary of potential dry eye target therapeutics is presented in Table 1.

Table 1 Examples of therapeutic agents targeting DED

A diagnosis of DED can be made with the presence of just symptoms, just signs, or both. A lack of correlation between signs and symptoms can, in fact, be a characteristic of the disease [22,23,24,25]. The cornea is a highly innervated and exquisitely sensitive tissue, the signaling from which evokes a response in many surrounding systems: the lacrimal gland, goblet cells, meibomian glands, lid musculature, etc. [26]. It has been hypothesized that in the early stages of DED, patients with still-healthy innervation may present with symptoms but little corneal and conjunctival staining or other objective findings. As DED progresses, patients experience damage to the corneal nerves serving the lacrimal functional unit, resulting in a loss of sensitivity and diminished symptomatology, impaired compensatory mechanisms such as tearing and blinking, and more severe keratitis [27, 28].

Compliance with Ethics Guidelines

This article is based on previously conducted studies and does not involve any new studies of human or animal subjects performed by any of the authors.

DED and the Environment

One of the difficulties of studying and treating dry eye stems from its variability with environment and behavior. In DED, the inherent milieu created by age, neuroendocrine function, lacrimal and meibomian gland health, and inflammatory state does not allow the subject to respond adequately to environmental stress [29, 30]. Factors such as wind, humidity, temperature, contact lens wear, visual tasking, season, diurnal rhythms, and pollutants all affect tear film stability and the ocular surface. Lifestyle also contributes greatly to ocular drying: outdoor weekend sporting activities; weekday conditions in an arid office environment with extreme air-conditioning, lighting, and visual tasking such as driving, reading, computer work [31,32,33], television, and phone viewing; or flying at altitudes where low relative humidity causes hyper-evaporation [34]. When faced with these adverse environmental and behavioral conditions, tear production will increase in a normal subject to maintain a stable protective barrier and optimal refractive conditions. The DED subject lacks one or more of the compensatory mechanisms that offset these adverse conditions: the secretory capacity to increase aqueous tear production, upregulation of mucin expression to improve tear quality, and increased blinking to prevent evaporation and refresh the tear film. As a result, evaporation, desiccation, and damage to the ocular surface ensue [30, 35].

The natural variability of external conditions and of internal responses to them creates a difficult paradigm for the study of DED and its treatment. When evaluating a potential therapeutic agent, the ebb and flow of signs and symptoms that occurs throughout the day can obscure subtle improvements derived from treatment. This variability is assimilated into the resulting dataset, leading to large standard deviations and a dampening of the quantifiable treatment response that together necessitate very large sample sizes to maintain statistical power. Additionally, the powerful effect of placebo acting as a tear substitute in dry eye studies leads to even less differentiation between active and control groups, necessitating the exclusion of placebo responders a priori with a run-in period before randomization [30].
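
To illustrate why this variability matters for study planning, consider the standard normal-approximation sample size formula for a two-arm parallel comparison; the sketch below (with purely hypothetical effect and variability values, not data from any cited trial) shows that the required number of subjects per arm grows with the square of the within-group standard deviation.

# Illustrative sketch only: hypothetical numbers, not data from any cited study.
from scipy.stats import norm

def n_per_group(delta, sd, alpha=0.05, power=0.80):
    """Approximate subjects per arm for a two-sided, two-sample comparison
    of means, using the standard normal-approximation formula."""
    z_alpha = norm.ppf(1 - alpha / 2)  # critical value for the type I error rate
    z_beta = norm.ppf(power)           # value corresponding to the desired power
    return 2 * ((z_alpha + z_beta) * sd / delta) ** 2

# A hypothetical 0.5-unit treatment difference in a staining score:
print(round(n_per_group(delta=0.5, sd=1.0)))  # ~63 subjects per arm
print(round(n_per_group(delta=0.5, sd=2.0)))  # ~251 per arm when variability doubles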

The CAE Chamber

During a CAE study, subjects are screened for baseline signs and symptoms of dry eye, as well as for a medical and medication history consistent with the disease. They enter the chamber, which provides a highly standardized atmosphere of low relative humidity, increased airflow, and constant visual tasking. These "perfect storm" conditions overcome a dry eye subject's ability to maintain a stable tear film, such that ocular surface desiccation and the associated signs and symptoms of DED can be reproducibly studied and modified under controlled conditions. Measures of dry eye are assessed immediately before and after the 90-min challenge, and symptoms are monitored frequently during the challenge. See Table 2 for a list of published clinical trials involving CAE exposure and treatments of dry eye.

Table 2 List of published DED clinical trials utilizing the CAE

Patient Selection

When investigators select patients for clinical trials, they must consider severity of DED, responder subtypes, the drug candidate's mechanism of action, and other contributory factors such as duration of disease, age, gender, lifestyle, and concomitant systemic diseases. These studies have very rigidly defined inclusion and exclusion criteria, and as a result, the rate of acceptance into studies is very low. When the criteria for inclusion are not well laid out, subjects may be entered with misdiagnoses, or with concomitant medications or diseases that mask the therapeutic response to the test drug. Examples are conjunctivochalasis and recurrent corneal erosion, which present with similar symptoms of foreign body sensation, grittiness, irritation, blurred vision, and tearing. Other conditions such as allergy, epithelial basement membrane dystrophy, lid wiper epitheliopathy, giant papillary conjunctivitis, Salzmann's nodular degeneration, and asthenopia can masquerade as DED.

Other subjects who are potential candidates for a DED clinical trial may have underlying systemic diseases that contribute to ocular surface damage. Patients with long-standing acne rosacea may also have advanced meibomian gland dysfunction (MGD) and may pass screening criteria required for a DED trial [36]. However, because the meibomian glands are damaged, very few, if any, topical agents would modify this condition, and treatment would fail [37]. Similarly, patients diagnosed with uncontrolled type 2 diabetes mellitus will continually present with signs and symptoms that resemble DED, yet will not be responsive to treatment [38].

Reflex tearing can confound the results of a CAE exposure, and thus subjects with excessive compensatory tearing should also be excluded from CAE trials. Some patients exposed to a CAE may experience a period of temporary relief from their ocular discomfort due to reflex tearing. This natural compensation is inversely correlated with DED severity: it occurs more quickly in normal subjects (~ 10 min) and in mild-to-moderate dry eye patients (~ 20 min) than in those with moderate-to-severe dry eye (> 40 min, or not at all) [35].

CAE As a Screening and Subject Enrichment Tool

The response to CAE exposure is used in clinical trials both as a screening and subject-enrichment tool and as a means of establishing treatment differences. Selecting subjects who respond to CAE exposure with worsened signs and symptoms of DED ensures that all enrolled subjects present with a similar, predictable baseline and have a modifiable response with which to demonstrate change with intervention. Thus, the first important function of the CAE is to provide an entering subject population with known and reproducible dry eye disease. By using a positive CAE response as an inclusion criterion, the study is completed with a predefined population whose baseline characteristics and response to an adverse environment have been defined.

It is known that, depending on mechanism of action, some therapies might affect primarily symptoms, some primarily signs, and some both. We also know that some DED patients have more signs than symptoms, depending on disease phenotype and duration. Because the CAE elicits both, CAE response can identify distinct pools of dry eye subjects: symptom responders, sign responders, and mixed responders. Furthermore, the regulatory environment now recognizes the lack of a contemporaneous association between signs and symptoms [23, 30]. This independence makes it even more critical to choose the right DED population in which to test a prospective therapy, and the CAE response allows this.
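
As a minimal sketch of how such responder pools might be partitioned in an analysis dataset, the example below compares pre- to post-CAE changes in a sign score and a symptom score against study-defined cutoffs; the field names and thresholds are hypothetical illustrations, not criteria taken from any cited protocol.

# Hypothetical sketch: classify subjects by the character of their CAE response.
# Field names and cutoffs are illustrative, not protocol-defined values.
from dataclasses import dataclass

@dataclass
class CAEResponse:
    staining_change: float  # post-CAE minus pre-CAE corneal staining score
    symptom_change: float   # post-CAE minus pre-CAE ocular discomfort score

def classify(resp: CAEResponse, sign_cutoff: float = 1.0, symptom_cutoff: float = 1.0) -> str:
    worsened_sign = resp.staining_change >= sign_cutoff
    worsened_symptom = resp.symptom_change >= symptom_cutoff
    if worsened_sign and worsened_symptom:
        return "mixed responder"
    if worsened_sign:
        return "sign responder"
    if worsened_symptom:
        return "symptom responder"
    return "non-responder (would not meet CAE-based inclusion criteria)"

print(classify(CAEResponse(staining_change=2.0, symptom_change=0.5)))  # sign responder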

CAE As a Platform for Evaluating Treatments

Use of the CAE model in combination with stringent, study-specific inclusion and exclusion criteria and sensitive diagnostic tools allows for the selection of patients based on their calibrated response to environmental stress and not only on naturally presenting signs and symptoms that fluctuate hourly and daily within and among subjects. CAE-based selection of subjects also minimizes the natural regression to the mean that occurs in an environmental study, for which subjects must present at the visit with adequate scores for signs and symptoms. While the CAE approach results in a higher rate of non-eligibility, the smaller pool of subjects is more homogeneous and provides greater statistical power.

The second critical function of the CAE is the evaluation of interventions. By integrating sensitive and reproducible endpoints into the model, changes in a subject's response to CAE exposure can be quantified. CAE response as an endpoint evaluates the protective benefit of a drug or its ability to improve a subject's compensatory mechanisms. This information is extremely useful for homing in on the potential therapeutic activity of the test product. The expected time course for a maximum effect on CAE response depends on the drug's characteristics, and it is critical to time the primary endpoint around the time of expected maximum drug effect.

A typical study design includes two baseline CAE challenges prior to randomization. The initial challenge confirms a subject’s response with the protocol-defined degree of sign and/or symptom worsening. Between these two baseline CAE exposures, the subject is usually placed on a placebo regimen, called a run-in period, to exclude subjects whose signs and symptoms are alleviated by supplemental eye drops alone. This is an important step taken prior to initiating treatment, since the drug effect will be compared to the placebo-vehicle effect in double-masked randomized groups. The drug vehicle solutions mandated for use as negative controls in DED and all ophthalmic studies behave as tear substitutes, lubricating the ocular surface, and this benefit will narrow the differences between the study treatment and the placebo. Thus, the run-in period prevents vehicle-treated placebo responders from study entry, thereby minimizing potential noise in the data and optimizing the active drug’s ability to demonstrate improvement. The second confirmatory challenge is thus critical to assure (1) that subjects will respond adequately to the CAE after use of placebo and (2) that subjects have stable and reproducible DED.

As an efficacy tool, CAE challenge has advantages over environmental assessments of drug effects, particularly if the mechanism of action of the drug suggests activity as a protective agent or as an enhancer of compensatory mechanisms. However, assessments of signs and symptoms prior to initiating the challenge also establish the “environmental” status of this enriched population of DED subjects with naturally occurring disease, allowing for multiple opportunities to observe a drug effect. The CAE chamber has both stationary and exact-replica mobile units for use in multi-center studies. Using the same mobile units at various sites is another means of minimizing variability among research centers.

Highly sensitive diagnostic endpoints are integrated into CAE exposure, allowing investigators to finely tailor study designs to match a therapeutic agent's mode of action. A drug that increases aqueous tear production or improves meibomian gland function might be expected to cause a significant improvement in the discomfort and keratitis provoked by the CAE. Similarly, a drug acting as a mucin secretagogue might stabilize the tear film from within such that corneal epithelial cells are better protected from unfavorable conditions, resulting in significantly less pre- to post-CAE keratitic staining [29, 39].

A variation of the CAE is used to exaggerate adverse environmental conditions more quickly by directing highly focused, rapid airflow to the eye. This model induces greater central corneal staining and a faster onset of symptoms and might be most appropriate for certain mechanisms of action.

Another modification to the CAE is the repeat CAE in which the subject is challenged morning, afternoon and evening. This diurnal model simulates the episodic environmental insults that DED patients experience throughout the day, and useful information can be gained from understanding how the subject’s response changes throughout the day, and how time awake can greatly influence results. This approach is useful for quantifying the cumulative worsening of signs and symptoms and the effectiveness and duration of barrier protective therapies such as artificial tears.

Refining the Assessments of Dry Eye Signs and Symptoms

In studying dry eye, it is critical that measures of disease severity be accurate, reproducible, and sensitive. In CAE and non-CAE studies alike, we must be able to demonstrate a change in signs and symptoms with intervention. A scientific approach to the grading of signs and symptoms has allowed for more calibrated and reproducible assessments, which is particularly important when more than one site is involved in a study. Grading systems for dry eye redness and for vital dye staining of ocular surface damage have been validated and tested over time. Refined methodology for tear film break-up time has also been shown to measure signs of dry eye more accurately.

Vital dye staining of damaged ocular surface cells with fluorescein (for the cornea) and/or lissamine green (for the conjunctiva) with grading of severity by region is a common and useful measure of dry eye. Many grading systems exist, and an accurate scoring of the severity of keratitis and conjunctival damage is essential to understanding the effectiveness of an intervention. Finely calibrated grading systems divide the ocular surface into five physiologically and anatomically relevant regions, some of which have been shown to be more sensitive to dry eye and its treatment. Investigators are trained across sites to assure that grading is standardized and reproducible. Software now aids in objectifying these scoring systems [41], and this allows for greater reproducibility across study sites when conducting a multi-centered clinical trial (Fig. 1).

Fig. 1 Quantification of ocular surface damage by digital imaging
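
As an illustration of the kind of objective scoring such software can provide (a generic sketch, not a description of the system cited in [41]; the threshold and region boundaries are assumptions made for demonstration), a corneal photograph can be reduced to a per-region fraction of bright fluorescein-stained pixels:

# Generic sketch only: quantify fluorescein staining from an RGB corneal
# photograph as the fraction of bright green pixels in each region.
# The threshold and the five-region split below are illustrative assumptions,
# not the method of the grading software cited in the text.
import numpy as np

def staining_fraction(region: np.ndarray, threshold: int = 180) -> float:
    """region: H x W x 3 RGB array; returns the fraction of pixels whose
    green (fluorescein) channel exceeds the hypothetical brightness threshold."""
    green = region[..., 1].astype(float)
    return float((green > threshold).mean())

def regional_scores(image: np.ndarray) -> dict:
    """Split the image into five approximate corneal regions and score each."""
    h, w = image.shape[:2]
    regions = {
        "superior": image[: h // 3, :],
        "inferior": image[2 * h // 3 :, :],
        "nasal":    image[h // 3 : 2 * h // 3, : w // 3],
        "temporal": image[h // 3 : 2 * h // 3, 2 * w // 3 :],
        "central":  image[h // 3 : 2 * h // 3, w // 3 : 2 * w // 3],
    }
    return {name: staining_fraction(roi) for name, roi in regions.items()}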

Tear film break-up time (TFBUT) is another measure of dry eye, and involves defining in seconds the time between blinks before a dry spot is observed by slit lamp after instillation of fluorescein. Historically, this measure was performed with excessive amounts of fluorescein (25–40 µL drops) that flooded the tear film and so provided inaccurate information on tear film breakup. When the drop quantity was reduced to 2–5 µL, the interblink evaporation of innate tears became a more relevant measure, with < 5 s defined as indicative of a definitive DED diagnosis [42]. Improved understanding of the relationship between patient symptoms and TFBUT has resulted in a simple noninvasive measure of tear film instability that is easily implemented during a routine office visit or in a patient's home. This test, known as the symptomatic tear film break-up time, involves simply identifying the time in seconds between blinks before symptoms of discomfort ensue, which usually occurs within one second of tear film breakup [43]. The ease of this technique allows patients to independently monitor their condition under various circumstances and to evaluate symptom relief with treatments, even at home.

Conjunctival vessel dilation is another hallmark sign of dry eye, and its horizontal, fine, linear pattern is subtly different from the redness seen with other anterior segment conditions such as allergy and infection. Many ophthalmic conditions share redness as a clinical sign, but the particular pattern of redness varies considerably across pathological conditions. The conjunctival, scleral, episcleral, and ciliary vessel beds are known to respond to different disease states through variations in color, location, pattern, and degree of redness. In the case of DED, redness presents as fine linear conjunctival and ciliary vessel dilation. Automated software has been developed to analyze conjunctival structure and redness from still image photography [44], detecting vascular patterns and quantifying the change in redness to demonstrate the therapeutic effect of a drug (Fig. 1). This technology is another example of how to improve on basic clinical grading to better standardize a multi-centered trial.

Schirmer's test is a measure of aqueous tear production that has long been in use as a clinical diagnostic test for aqueous-deficient DED; however, it fails to reveal other types of tear deficiency and appears to be resistant to modification.

At a unique crossroads between signs and symptoms of DED are the alterations in blink patterns that arise both as cause and effect of an unstable tear film. In fact, the importance of blink was not appreciated until the tear film breakup time technique was modified to use a greatly reduced quantity of fluorescein. The lower thresholds for breakup that indicate the presence of dry eye were found to be synchronized with blink [42]. Understanding that the drying tear film triggers a blink led to more in-depth study of modifications in blink in the context of dry eye [45, 46]. DED patients were found to have a faster blink rate, which can account for some disturbances in visual function, and were less able to prolong the time between blinks during demanding visual tasks, bound as they are by the primary concern of refreshing the tear film [47]. This inability to vary blink with visual task is another major contributor to the visual dysfunction and fatigue common in DED subjects [47]. Furthermore, dry eye subjects were shown to have lid closures of very long duration (apparent microsleeps); these rest the eye and refresh the tear film such that the time between blinks can subsequently be lengthened [10].

Techniques for monitoring blink have been developed to study what happens to the ocular surface between blinks, and these are used as endpoints to study DED and the effect of interventions. Initially, a simple ratio of tear film breakup time to the interblink interval, called the Ocular Protection Index (OPI), was used to identify when a subject's ocular surface is compromised [48]. This measure was improved with the OPI 2.0 System, which evolved to assess tear film stability simultaneously with natural blink under normal visual conditions [45, 46, 49]. The OPI 2.0 System implements fully automated software algorithms that provide real-time measurements of corneal exposure (breakup area) for each inter-blink interval (IBI) during a 1-min video. Utilizing this method, the mean breakup area (MBA) and OPI 2.0 (MBA/IBI) can be calculated and analyzed [45, 46, 49]. Continuous monitoring of blinks with a headset connected to a cell phone can be implemented within the CAE chamber to study how blink is modified under conditions of stress and how blink patterns might be normalized through treatment [12].
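
These two indices reduce to simple arithmetic, illustrated below with purely hypothetical values; note that the aggregation of breakup areas across intervals is shown here in simplified form, whereas the published OPI 2.0 System computes and averages these measurements automatically from video.

# Worked illustration with hypothetical values. OPI compares tear film breakup
# time (TFBUT) to the interblink interval (IBI); a value below 1 means the tear
# film breaks up before the next blink, leaving the surface exposed. OPI 2.0 is
# the mean breakup area (MBA) divided by the IBI, per the description above.

def opi(tfbut_s: float, ibi_s: float) -> float:
    return tfbut_s / ibi_s

def opi_2_0(breakup_areas: list[float], ibis_s: list[float]) -> float:
    """Simplified aggregation: mean breakup area over the mean interblink
    interval for the intervals captured in a 1-min video."""
    mba = sum(breakup_areas) / len(breakup_areas)
    mean_ibi = sum(ibis_s) / len(ibis_s)
    return mba / mean_ibi

print(opi(tfbut_s=4.0, ibi_s=6.0))                   # 0.67: breakup precedes the blink
print(opi_2_0([0.12, 0.30, 0.05], [5.0, 7.5, 4.0]))  # ~0.03 exposed area per second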

Symptom grading is critical in a disease such as dry eye, which has a strong subjective component and which, unlike most diseases, can comprise only symptoms without signs. To grade symptoms accurately, scoring systems must be implemented that are easy to use and reflect the disease state in an accurate and reproducible manner. Various tools are used for assessing both retrospective and immediate symptoms using 0–4 and 0–5 scales, as well as different qualifiers (discomfort, burning, dryness, grittiness, stinging), to be completed in-office and at home as part of diaries. During the CAE, symptom queries occur throughout the exposure and are key to confirming that the adverse environment challenge is effective, as well as providing valuable data that can be analyzed in multiple ways to reveal treatment modification of symptoms [12, 50, 51].

Quality of life questionnaires are also useful tools for assessing DED. In a CAE-based study, these subjective questionnaires provide additional environmental symptom endpoints along with diary data that together complement the within-CAE symptom measures, adding to the in-depth and multi-faceted understanding of a drug’s effects on symptoms. A short, 4-question questionnaire that focuses on the key aspects in which dry eye disrupts quality of life (daily activities, reading, watching television, and driving at night) has proven most valuable.

Assessments of visual function are a critical component of studying DED. The inter-blink interval visual acuity decay (IVAD) test provides a measurement of visual function in real time, identifying when blurring occurs in the time between blinks [52]. A suite of reading and contrast sensitivity tests [53] can also be incorporated into the CAE model. In the low contrast reading test, subjects are asked to read simple sentences at a constant print size in decreasing contrast levels. The IReST measures reading speed and errors under natural conditions, i.e., reading simple, standardized, and contextual paragraph texts. With the Wilkins reading test, reading rate is measured for 20 lines of text consisting of 15 simple words randomly arranged without context to eliminate any variability introduced by comprehension [54]. The menu reading test assesses scanning/reading function by asking subjects to read a simple restaurant menu.

Newer and more elaborate measures for evaluating DED include confocal microscopy, tear osmolarity, tear film lipid interferometry, meibography, and tear assaying for cytokines, enzymes, and other tear products. Pre- and post-CAE analyses of various mucins in tears have been integrated as a means of further classifying patients into subgroups predictive of treatment response to secretagogues. Implementation of these techniques across multiple sites and practices would require standardization and inter-rater reliability data to assure their sensitivity and specificity.

Understanding the pharmacological target of a candidate therapeutic agent’s mechanism of action in DED is critical for selecting the sign or symptom endpoint that will best demonstrate drug efficacy. Endpoints must be standardized, reliable, reproducible, and possess the sensitivity to detect clinically relevant changes caused by the agent undergoing evaluation. The timing and order of these endpoint assessments, as well as rigorous investigator training to ensure consistency between visits and sites, are key factors to the success of a study. By assessing these precise standardized endpoints tailored for use in dry eye before and after a CAE challenge, a drug’s activity during conditions of ocular surface stress is reliably evaluated.

A Sampling of CAE Trials

Regardless of a drug's mechanism of action, its evaluation in the context of a CAE challenge provides a deeper understanding of how the drug can improve DED. Many dry eye therapeutic candidates have been assessed using the CAE, either as an efficacy endpoint or for the purposes of subject selection. Anti-inflammatory agents of diverse mechanisms (free radical scavengers, steroids, cytokine inhibitors, integrin inhibitors, calcineurin inhibitors, SYK kinase inhibitors, etc.), secretagogues, wound healing promoters, barrier function molecules, hormonal therapies, and devices have been assessed with the aid of a CAE challenge. Several of these studies have been published. Positive findings have been reported for treatment of DED with iontophoresis of dexamethasone phosphate in the CAE chamber [55]. The CAE was a critical component in one Phase 2 and in one Phase 3 study of lifitegrast [56, 57]. A thymosin β4 peptide was tested in another Phase 2 CAE trial, and positive results were shown for discomfort scores in the CAE after a month of treatment [50]. Mucogenic agents such as MIM-D3, a selective tyrosine kinase (TrkA) receptor agonist and secretagogue, have been shown to reduce the pre- to post-CAE change in staining that occurs in dry eye subjects [39]. The mitochondrial antioxidant SkQ1 was shown to be effective in improving central corneal staining, lid margin redness, and dry eye symptoms [51]. Among others, Phase 2 CAE studies were also completed with a resolvin, an endogenous immune-response mediator [58], and with a novel formulation of cyclosporine that has been approved for marketing in the EU (Ikervis®, Santen) [59].

The CAE can also be used to evaluate the effects of contact lenses, solutions, or tear supplements in both DED and normal subjects. Situational dry eye is very common in normal subjects who experience contact lens-associated dryness and ocular surface discomfort due to adverse environmental conditions or behaviors such as intensive visual tasking or monitor use. Barrier protection can ameliorate these CAE-induced signs and symptoms of ocular distress even in normal subjects, and evidence of barrier protection in a CAE exposure would be an important differentiator for clinicians challenged with the hundreds of marketed products and the need to make informed recommendations to patients. One example was the finding of greater relief of discomfort under CAE-related adverse conditions with senofilcon A lenses compared to habitual lens wear or no lens wear [40]. The mucosal drying effect of antihistamines is also exacerbated by CAE exposure, providing greater magnitudes of change that allow for differentiation among products: an early CAE study showed that a 4-day loratadine treatment was associated with more dryness and 93% more corneal and conjunctival staining after CAE than cetirizine [40]. Finally, the CAE was also used to validate the OPI 2.0 method of assessing ocular surface compromise as a function of mean breakup area across the cornea between blinks [49].

While the CAE is an experimental method not intended for widespread clinical testing or for diagnosis of potential DED patients, the information collected from these studies is very valuable to practicing clinicians. Without the noise created by the widely varying external milieu, clinicians can understand how a drug affects DED, whether as a stimulator of aqueous production, a mucogenic protector, an anti-inflammatory, or a simple barrier to evaporation. Future treatments of DED are forthcoming, such as the Allergan intranasal device for tear stimulation (https://clinicaltrials.gov/ct2/show/NCT02798289). Regulatory agencies and clinicians now recognize that subjects are tired of ineffectual drops that they must continually instill, or of therapies that work in only small subgroups of patients. Therapy may indeed have to consist of combinations of complementary drugs and devices, as is done in ocular hypertension and glaucoma, and identifying treatments that might target symptoms or signs separately is essential now that we know these might be independent components of the disease. The ability to compare and contrast onset and duration of activity within the same paradigm also informs the clinician on how new or existing products will behave outside of the clinic. The CAE in fact provides an essential grounding platform for studying the many facets of the fast-moving target that is dry eye disease.

Conclusion

To conclude, the CAE reproduces in a safe, clinical setting the challenges and situations that dry eye subjects encounter every day. As a tool in clinical trials, the CAE reduces or mitigates the variability that plagues dry eye studies and limits their ability to demonstrate the effects of interventions. The CAE can be used as an enrichment tool to enroll patients with modifiable and appropriate signs and symptoms of DED, as well as an endpoint that demonstrates drug efficacy. The CAE can be tailored to highlight the mechanism of action of a drug, ultimately making studies smaller in scale, better standardized, and more precise. The information culled from these studies is useful to clinicians who ideally might match a drug’s mechanism of action and activity in the CAE to the type of dry eye of the subject.