Introduction

Traditional clinical assessment of psychiatric disorders has been limited by recall and symptom-state bias, while experimental assessment has been criticised for being cross-sectional in nature and subject to significant interrater variability. This is particularly problematic in the context of mood and anxiety disorders where symptoms are known to be episodic, fluctuating and context-specific. These limitations have significantly impeded the understanding of the mechanisms of psychiatric illness and the development of effective treatments. The transdiagnostic nature of many symptoms poses an additional question, as it remains unclear whether different disorders share common symptoms or whether these common symptoms arise from inaccuracies and a lack of sensitivity in diagnostic categorisation.

The rapid evolution of mobile and wearable technology in the last decade, the ubiquity of mobile networks and high rates of smartphone ownership have offered new opportunities to address the challenge of traditional diagnostic techniques. Smartphones enable the collection of prospective, high frequency self-report data about symptoms in ecologically valid settings while the wide range of sensors within smartphones enable collection of passive data such as movement, light exposure and temperature. High engagement with smartphones also offers potential to quantify the use of applications, phone-calls and messaging as a proxy for symptomology. Wearable technologies such as wrist worn actigraphs and mobile ECGs devices provide additional data which can complement symptom monitoring. All these routes of digital remote monitoring therefore offer the possibility of capturing signals more representative of the underlying physiology of a disorder in comparison to traditional clinical assessment, offering potential to understand mechanisms of disease in psychiatry.

Here we will review the current literature on the use of digital remote techniques to inform our understanding of mechanisms and prediction in psychiatry.

Digital Phenotyping

Digital phenotyping is defined as moment-by-moment quantification of the individual-level human phenotype in situ using data from personal digital devices [1]. Data can be collected actively, for example using self-report questionnaires, or passively, for example movement data collected from a smartphone. Self-report data collected online or via mobile phones are widely used in clinical and research settings and has been shown to correlate with clinician rated depressive and manic symptoms on the HDRS-17 and YMRS [2, 3••]. The resolution and temporal nature of such data can offer an insight into the mechanisms of psychiatric disorders.

In bipolar disorder particularly, high frequency prospective mood monitoring such as that conducted using the TrueColours [4•] system gives a richer, more accurate phenotype than conventional descriptions (Fig. 1). It has revealed a complex picture in which chronic mood variation – mood instability - is more typical than the discrete episodes which form part of the diagnostic criteria. Moreover, self-reported mood instability is common in the population (14%) and a risk factor for bipolar disorder [5].

Fig. 1
figure 1

example of a mood chart from an individual with bipolar disorder. Red lines denote Altman Mania Rating Scale scores, blue lines denote Quick Inventory of Depression Scale scores

The nature of the nature of mood instability also appears to be diagnostically specific. In the AMoSS study [6••] participants with bipolar disorder or borderline personality disorder and healthy volunteers reported their mood on a daily basis using a smartphone app. The three groups could be clearly differentiated on the degree of mood instability as well as the way in which different mood states related to one another.

The temporal relationship between different symptoms have also been described while phenotyping other diagnostic groups. Van Voorhees 2018 [7] employed monitoring using handheld devices to demonstrate that PTSD symptoms predicted subsequent irritability and hostile affect, but that the inverse was not true. These findings suggest that trauma history and symptoms should be implicated in anger management in this patient group. Employing similar methods, Simons 2018 found that PTSD symptoms predict conduct problems the following day in the context of alcohol dependence [8] and Short 2018 demonstrated the effect of maladaptive strategies such as avoidance on PTSD symptoms the following day [9]. In panic disorder Helbig-Lang 2012 used similar temporal analysis to investigate the relationship between anticipatory anxiety and panic attacks in panic disorder and agoraphobia. While panic attacks increased anticipatory anxiety, heightened anticipatory anxiety did not precede panic attacks, contrary to cognitive models of panic [10].

Passive Correlates of Mood States

The rapid expansion in wearable technologies and smartphone sensors has enabled the collection of data streams that do not require any active input from users. This passive data provides information about a range of symptoms and behaviours can be used to predict current mood state. Phone use has been explored in a number of studies as social interaction is commonly affected by mood state. In bipolar disorder significant correlations have been reported between the length and number of outgoing phone calls, number of outgoing text messages per day and symptoms of mania [11]. In bipolar depression Beiwinkel et al. found that the number of outgoing texts negatively correlated with depressive symptoms on the Hamilton Depression Rating Scale although a reversal of this correlation was not observed with the Young Mania Rating Scale score [12]. Changes in voice and speech are consistently reported as correlating with mood states in the context of structured interview calls [13] as well as clinical interactions and unstructured voice data [14, 15].

Activity as measured using accelerometers have revealed similarly mixed results with some studies reporting positive correlations between activity and mood while others have failed to find any significant relationship [16]. Geolocation parameters have also been explored in the context of mood disorders. Palmius et al. were able to predict current depressive symptoms with an accuracy of over 86% in individuals with bipolar disorder while Faurholt-Jepsen et al. reported significant positive correlations between cell tower ID changes per day and both rating scale scores (YMRS and HDRS-17), with depressive symptoms being associated with reduced movement and manic symptoms associated with increased movement [17••, 18]. However, the direction of causality remains unclear.

The chronic relapsing and remitting nature of psychiatric disorders is such that many researchers have sought to use remote monitoring to predict future mood states with a view to enabling earlier initiation of preventative treatment. A variety of mathematical approaches have been used including relaxation oscillatory models [19], control charts [20] and rough paths theory [21] with varying results. It seems likely that for any model to gain the necessary accuracy upon which to base treatment decisions, predictions would need to be made at in individual rather than group level.

Indication of Underlying Mechanisms

The combination of self-reported mood data and data derived from wearables is beginning to provide new insights about the underlying mechanisms in psychiatric disorders. The combination of actigraphy and heart rate data collected via wearable devices have revealed a specific circadian phenotype in borderline personality disorder (BPD). Individuals with BPD had clear evidence of phase delay in the diurnal rhythm of their activity, sleep and heart rate when compared with bipolar disorder and healthy controls. Inspection of correlation coefficients controlling for gender and employment status revealed strong associations between sleep/circadian rhythm parameters and symptoms of impulsivity and mood instability in BPD which were not seen in the other groups. The findings suggest that disturbances initiating/maintaining sleep may be an exacerbating factor for borderline psychopathology and that stabilisation and consolidation of rest-activity rhythms might be considered a treatment target [22,23,24]. In depressed patients the measurement of HRV as an index of vagal function have been proposed as a mediator of the relationship between low mood and cardiovascular disease [25].

Remote monitoring also provides insight into the psychological mechanisms of mood and anxiety disorders. Using ecological momentary assessment (EMA) methods, Kashdan 2014 demonstrated an association between momentary experiential avoidance and anxiety symptoms during social interactions in naturalistic settings, highest among participants with SAD [26]. An EMA study by Wenze 2018 provided further insight into this association, using lag analyses to demonstrate a bidirectional relationship between experiential avoidance and negative mood, thoughts and stress [27].

Thompson 2017 used handheld electronic devices to investigate the role of should affect - the belief of how one should feel in the moment - in major depressive disorder and generalised anxiety disorder (GAD) in naturalistic settings [28]. In both disorders, participants reported higher levels of should positive affect and lower levels of should negative affect than healthy controls, even once mean affect was accounted for. Kircanski 2015 used experience sampling methods to demonstrate prevalent rumination in participants with depression and anxiety [29], and Ruscio 2015 used EMA methods to demonstrate that individuals reported more rumination after stressful events compared to controls, even after the stressfulness of an event had been adjusted for, and that rumination predicted future symptoms of anxiety [30].

As well as psychological mechanisms, remote monitoring offers insight into how behavioural factors interact with anxiety disorders. Farmer 2015 studied the effect of stressful social events over a period of 2 weeks in participants with SAD and found increases in negative affect and decreases in positive affect and self-esteem on days with more stressful social events, and that this sensitivity to stress was more marked compared to a group of healthy controls [31]. Likewise, Morgan 2017 assessed youth with SAD, GAD and/or separation anxiety and found them more sensitive to positive events and rewarding social interactions than healthy controls, experiencing an altered positive affect profile in response to positive events, informing cognitive and behavioural theories of anxiety [32].

Applications in Treatment Development

The use of remote monitoring in combination with existing treatments of known efficacy may also enable greater understanding of underlying disease mechanisms as well as predictors of therapeutic response. One of the major barriers to new drug development in psychiatry is the lack of well validated early markers of treatment response which means that treatment trials are lengthy and prohibitively expensive to run. The use of existing treatments to generate early markers of treatment response has already been hugely successful in depression. Using the Emotional Test Battery Harmer et al. have shown that early changes in negative affective bias predict subsequent improvement in mood [33]. This is now widely accepted as providing an early marker of (and cognitive model for) antidepressant efficacy using short-term drug administration in healthy volunteers and patient groups. The use of remote monitoring in this context may enable experimental models for other diagnoses to be generated. The OxLith trial is using lithium to identify early clinic, neurocognitive and biological effects of treatment as captured using remote monitoring in combination with a range of imaging modalities [34].

Ethics

Digital remote monitoring also raises a number of ethical concerns [35]. Of particular interest is respect for privacy and autonomy, especially with regard to technologies that involve long-term monitoring and aim to predict or detect illness relapse. For instance, it may be difficult to safeguard autonomy when collecting data for use in machine learning techniques where the use of the data is not hypothesis-driven and may not be clear at time of collection. There is also the possibility that self-monitoring may encourage coercion or exacerbate power relationships between service providers and users.

A related ethical concern is that remote monitoring may affect participants’ identity and encourage medicalisation of everyday experiences. For instance, a qualitative study highlighted that participants worry that reactive changes in mood may be misinterpreted as illness relapse, while others raised concerns that users themselves may become pre-occupied with collecting self-report data in naturalistic settings [36•].

Similarly, the collection and storage of large volumes of data involves risks relating to confidentiality and data security, especially when data is collected which is not typically understood to be medical in nature. Other potential harms include the risk that remote monitoring may increase stigma or discrimination against certain groups of people, especially those with mental illness. Not only may wearable devices attract stigma, but the use of data and modelling relating to prediction when used by third parties such as commercial enterprise or government may result in discrimination against those with mental disorders [37•]. Likewise, although engagement with smartphones and wearables is growing, there is concern that advances may not be shared equally due to individuals’ limited access to technology.

The use of remote monitoring also raises the possibility of medical harm. The lack of regulation and validation of many commercially available remote monitoring methods is especially relevant in this regard [38]. Participants might be provided with poor information, or encouraged towards self-diagnosis and self-treatment rather seeking appropriate support [39]. Equally concerning are suggestions that when used inappropriately, remote interventions may even increase symptoms of anxiety/mental distress [40]. Although remote monitoring offers a wealth of data to be collected, it is imperative that such data is validated and shown to be clinically significant before broader conclusions are drawn from it.

Future Directions

The use of remote monitoring has expanded rapidly in recent years and there are an abundance of publicly and commercially available applications and wearables. While these are popular among consumers significant concerns have been raised about their validity and safety. Nicholas et al. reported that none of the symptom apps that they identified had been subject to rigorous research or cited published material and only a small proportion had a privacy policy (Nicholas et al. 2015). Many wearables have been shown to give inaccurate readouts of sleep and activity and none of the commercially available devices publish their analysis algorithms or allow user to access raw data. At present there is no consensus as to how remotely collected data should be reported or analysed although a set of reporting guidelines for mood data is in preparation [41]. The FDA is now in the process of developing guidance for the use of digital health medical devices and has launched a pilot precertification plan for digital health technology oversight.

There are also broader challenges for clinicians and service-users when interacting with remote monitoring. A better understanding of mechanisms and prediction informed by remotely collected data may disrupt traditional psychiatric classifications and may require changes to the phenotyping of mental disorders. This is especially true as we develop potential to collect data relating to physiology, movement, light exposure and other metrics which may not have historically been thought of as relevant to psychiatric diagnosis. Nonetheless, such shifts may promise exciting developments for prediction and treatment in psychiatric care.