Abstract
Purpose of Review
With the increasing prevalence of commercially available wearable digital devices in sleep medicine, this review aims to address some of the benefits and concerns with using these devices, with a specific focus on population-based studies.
Recent Findings
There has been a recent proliferation of sleep-related research studies that are using wearable digital devices, likely due to their relatively low cost and widespread use among consumers. As technology improves, it seems that wearables likely provide sleep information on par with actigraphy, which has been traditionally used in many sleep-related research studies. We also discuss issues unique to wearables, such as accessing data and data privacy, and the potential for decentralized digital research studies.
Summary
Wearable devices have the potential to change how we approach population-based sleep research studies, particularly when considering decentralized digital trials.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The use of mobile health technologies, including wearable sensors, in sleep research has increased substantially over the last several years. The PubMed library contained 3 articles containing the terms “sleep” and “wearable” in 2012; this has now increased to 1262 articles by the end of 2022, suggesting that sleep researchers increasingly recognize the potential of harnessing wearable devices in their investigations of sleep.
Many of the benefits of using commercially available wearable devices in research studies are clearly apparent—the technology is of relatively low cost, readily available, and continues to improve in accuracy, convenience, and impact on care [1]. Furthermore, wearables passively collect data without the need for substantial levels of participant interaction, and without the need to travel to a sleep lab for polysomnography (PSG). One of the greatest potential benefits is in the ability of wearables to evaluate multiple nights (sometimes years) of sleep, thereby providing longitudinal data that may be more reflective of a participant’s normal sleep patterns compared to what can generally be obtained with more traditional methods. Longitudinal assessments, especially when gathered from large, diverse populations, can help us better understand how sleep variability and different sleep patterns might impact human health outcomes. Wearables also provide value to participants since they can return personalized health data in a user-friendly, real-time way via data visualizations, which may also help with engagement and retention in sleep-related research. Furthermore, wearables capture other data (e.g., steps, exercise) that might affect sleep, which cannot be done using PSG.
Given the emerging possibilities of utilizing wearables in sleep research, we aim to provide a comprehensive review regarding the use of wearable devices in population-based studies that focus on sleep, including potential challenges for researchers as well as future directions in the arena.
Where the Field has Been: Traditional Assessments of Sleep
Previously, the only assessment of sleep that has been done on a population scale is survey assessment using subjective recall. For example, the largest database of population sleep in the USA comes from the Behavioral Risk Factor Surveillance System (BRFSS)—where the Centers for Disease Control and Prevention (CDC) query randomly chosen individuals over the phone regarding how much interviewees think they sleep in a 24-h period; this database has often been tapped as a resource for sleep researchers [2,3,4]. The Nurses’ Health Study [5], with over 70,000 participants, was another influential, population-level study that utilized surveys for the assessment of sleep. However, survey-based data are fraught with limitations, including the use of subjective recall (instead of the use of objective measurement), the use of ordinal numbers (e.g., 5 vs 6 h of sleep per night), uncertainty as to whether this number means hours slept last night or on average over many nights, includes naps or non-nighttime sleep, and the cross-sectional nature of the query as opposed to longitudinally tracked sleep [6].
On the opposite end of the spectrum, the most in-depth assessment of sleep comes from PSG, which remains the gold standard for sleep assessment, diagnosis for many sleep disorders, and is currently the most accurate measure of sleep duration and architecture. Major limitations remain for PSG when considering population-level studies. PSG must occur in a sleep laboratory, outside of the natural environment of the participant, thereby limiting the number of participants in a research study. This also limits participant selection based on where a PSG laboratory might be available, and cost is an issue both in obtaining data and in analysis. Furthermore, PSG does not allow for the collection of longitudinal sleep information without repeated returns to a sleep laboratory (and if it did, it would likely alter the timing and duration of sleep it purports to measure).
Actigraphy, which has been used in sleep research studies since the late 1970s, is a non-invasive method of measuring gross motor activity, and has been used to monitor human rest/activity cycles. Actigraphs are generally wrist-worn devices that rely on accelerometry and have been used successfully in many sleep-related research studies to estimate sleep duration (though not sleep architecture) [7, 8]. These devices use (lack of) body movements (usually from the wrist/hand) to infer periods of sleep. While actigraphy cannot provide sleep staging data and is certainly not the gold standard for sleep measurements, it provides reasonable estimates of sleep duration, wake after sleep onset (WASO), sleep latency (when paired with a sleep diary), and sleep efficiency. Notably, actigraphy has been used in many different populations, including those who are ambulatory, chronically ill, and even those who are hospitalized [7, 9,10,11,12,13]. While some software can provide automatic scoring of the data, actigraphy is generally manually scored with an experienced scorer determining potential “rest intervals” of when a person is thought to be trying to sleep, and then an algorithm applied to these intervals to determine sleep vs. wake. The Cole-Kripke algorithm is an example of how some actigraphy programs determine sleep vs. not sleep and takes into account activity during a given epoch as well as activity in the surrounding time periods [14]. Actigraphy has been used successfully in moderate to very large studies of sleep duration for many years. The Study of Osteoporotic Fractures (SOF) enrolled almost 3000 women in the 1980s to examine what factors, including actigraphically measured sleep and activity, contributed to the development of osteoporosis [15]. Similarly, the Osteoporotic Fractures in Men Study (MrOS) used actigraphy in a group of almost 6000 men in the early 2000s, from which multiple sleep-related insights were gathered [16, 17]. The Hispanic Community Health Study/Study of Latinos (HCHS/SOL) study has used actigraphy in their ancillary studies, such as the Sueño Sleep Ancillary Study, and Study of Latinos – Investigation of Neurocognitive Aging (SOL-INCA) [18, 19], both of which have provided insights into the sleep and sleep-related health outcomes in a large Latino population. Furthermore, the UK Biobank study utilized actigraphy in over 100,000 individuals, where participants were asked to wear an actigraphy device on their wrist for seven consecutive days. These data have already led to novel sleep-related observations such as insight into possible sleep phenotypes, despite the modest duration of recording (only 7 days) and lack of use of PSG [20, 21]. Such data, when combined with genetic information, helped confirm—and also extend—knowledge of genes important for sleep timing and duration.
Where the Field is Now: Wearables, Smartphones, and What They Measure
Wearables
Wearables are like actigraphy in many ways—they use a similar underlying technology (3-axis accelerometry, which captures movement in the x-, y-, and z-planes), they infer sleep from a lack of movement, and are able to estimate similar sleep metrics as actigraphy through likely similar algorithms as described above (though the algorithms and analyses from these devices are proprietary and not publicly available). Many devices now also utilize a green-light photoplethysmography (PPG) or infrared sensing to determine heart rate, heart rate variability, and pulse oximetry, and can also now estimate respiratory rate (so-called multi-sensor wearables). Most commercially purchased devices are worn on the wrist or finger, with fewer that are worn on the chest or hip. The technology is also often connected to smartphones and other devices. The expectation is that wearables provide at least similar data to actigraphy, with claims or hopes that these additional inputs can help approach PSG level of accuracy.
In terms of accuracy in healthy populations, some studies suggest that wearables over-estimate sleep duration compared to PSG on the order of only 8 min (Fig. 1) [22,23,24,25,26,27,28]. Furthermore, recent studies have shown many newer devices performed as well as or better than actigraphy on sleep vs. wake measures in children to adults, though not as accurate as PSG [24, 29]. Given the reliance on movement (or lack thereof) for estimation of sleep, wearables have variable accuracy in being able to assess the time of sleep onset. The inclusion of heart rate, heart rate variability, and respiratory rate in device algorithms has improved the assessment of time to sleep onset, but not all devices carry these capabilities [30]. Circadian rhythm measurement is often of interest in sleep-related research and is easily measured with actigraphy. To our knowledge, no population-level studies have been conducted with wearables to evaluate circadian rhythms. However, some recent studies have shown promise in being able to extract circadian activity rhythm data using multiple parameters from the devices, and sometimes in conjunction with additional smartphone data [31,32,33].
The measurement of sleep staging has gained momentum in terms of the technology and algorithms used, but further improvements are likely needed prior to comfortably relying on staging data [34, 35]. Most sleep architecture information from wearables is gained from accelerometry but also heart rate variability, which changes when transitioning from one sleep stage to the next [27]. For example, in de Zambotti et al.’s validation study of the Fitbit Charge 2 (Fig. 1), they found 0.81 sensitivity in detecting light sleep (N1 + N2 stages), 0.49 sensitivity in detecting deep sleep (N3), and 0.74 sensitivity in detecting rapid-eye-movement (REM) sleep [36]. While these estimations certainly show promise, they do not yet provide PSG-level sleep architecture information. Indeed, the de Zambotti et al. study utilized slightly older technology, and wearables continue to improve in terms of accuracy.
Smartphones
Smartphones, which have become almost ubiquitous in the USA, have the potential to passively monitoring a person’s daily activities, including sleep-related measures. These devices allow for data collection from a person’s so-called digital footprint [37], which includes when they start/stop using their phones over the day, peak times of phone use, and typing speed. Others have harnessed these data (most using apps downloaded to the phone, or GPS mobility patterns) to track sleep/wake timing, sleep duration, and circadian-related patterns, with good success [38,39,40,41,42]. Ceolini and Ghosh recently used 2 years of data from 401 participants to analyze over 300 million smartphone touchscreen interactions in an effort to determine the presence of multi-day rhythms across the cohort [43], which underscores the potential for passively collected digital data in providing new insights into sleep and circadian-related patterns in human health.
Considerations When Using Wearables in Sleep Research
Intellectual Property, Proprietary Sleep Algorithms, and Future Open Framework Efforts
The data obtained from wearables in a research study do not necessarily “belong” to the researcher but remain with the company providing the device. Device companies vary in how much data they share, in what format, and which data sharing platforms and apps to which they can connect. Furthermore, the algorithms that underly sleep detection are proprietary, and thus the granular level data—such as that obtained with actigraphy—are often unavailable to the researcher. This leaves a reliance on the company provided algorithm, with no ability for the researcher to make their own assessments of sleep vs. wake or to make further manipulations of the data. While Fitbit has recently allowed for an extended level of granularity for approved research requests, this drawback is certainly a limitation of the use of wearables for sleep research. Additionally, questions remain regarding if and how the different algorithms for different devices might impact the collected results analysis; in our experience, sleep data provided through company algorithms (e.g., Apple vs. Fitbit) have variations that can impact results, even for basic sleep measurements such as sleep duration. Given the proprietary nature of the company algorithms, the development of open frameworks to standardize data analysis across devices has been difficult, though progress towards this goal is being made. For example, others with expertise in validating consumer devices for sleep research are working to standardize steps towards testing consumer sleep technology performance against PSG and have published open-source code for this purpose [44]. Perez-Pozuelo et al. recently published a device-agnostic heart rate–based algorithm applicable across multiple devices to predict sleep metrics with good results; their code library is open-source and thus available for others to use [26].
Data Management
Collection and aggregation of the sleep data must be considered. As wearables have become more prevalent for research, companies have made participant data (from participants who have consented to share their data) more accessible via application programming interfaces (API) and access to cloud-based storage of data. Third-party platforms are also sometimes available to help collect and aggregate data for research studies, depending on the device [45]. Additionally, some large digital studies, such as the DETECT study [46], collected data from different types of devices, and this aggregation from different platforms may be done increasingly in the future.
Device-related Issues
Data storage on the devices themselves is limited, and devices must be synced regularly to participants’ smartphones (and therefore the cloud-based applications that store individual data). Thus, participants generally need to understand this limitation and must have the knowledge or available assistance in ensuring their phones and devices are set to sync appropriately. Data loss can occur if syncing does not occur regularly; for example, Fitbit stores high-resolution data on the device itself for 5–7 days. If a device is not synced for more than 5 days, older high-resolution data are removed permanently and deemed unrestorable. Furthermore, firmware updates are often offered by device manufacturers, but variably installed by the participant. This could impact how many individuals are using the most optimized version of software, potentially impacting the quality of data collection. If studies are on a smaller scale, these issues are more readily dealt with using direct communication with participants and allowing for frequent check-ins to ensure devices are syncing and updated appropriately. However, depending on the geographic extent and volume of participants, these issues could be insurmountable and impact the reliability of the data collected.
Participant Habits
Participants vary in how they use wearable devices, and many questions remain surrounding sleep specifically. Some individuals are consistent users (Fig. 2), while others may use their devices on specific days or only when they are interested in tracking their data. Not all people wear their devices overnight, particularly given that some devices require charging overnight (though some devices have up to a 5- to 10-day battery life, Table 1), which could preclude a consistent assessment of sleep. This is an important consideration, especially when compared to actigraphy where the battery life can often last several weeks. Additionally, it can be difficult to track short naps, as some devices do not reliably capture naps shorter than 1 h. Furthermore, if an individual is consistently wearing their device to sleep and is actively tracking their sleep, it is possible the individual is concerned about their sleep habits (as opposed to an individual who habitually takes off their device at night and is not concerned enough about their sleep to track or investigate it), which could bias representative samples in sleep research.
Validation in the Population Studied
More investigation is needed regarding the use of wearable trackers for sleep research in healthy vs. unhealthy populations, though it is unlikely to be much different than the variation found in actigraphy measurements. Most of the validation data that is available for wearables has been conducted in a younger, ambulatory population. Despite this, many studies are utilizing wearable devices in populations with one or more comorbidities. For example, one recent systematic review investigated the use of wearable sensors for the monitoring of diabetes-related parameters in studies published between 2010 and 2020 and found that out of 26 studies, accelerometer-based devices with PPG were the most common with the intent of measuring activity and heart rate, with no specific validation data used [47]. Notably, de Zambotti et al. recently called for a move towards standardized performance evaluation approach to new technologies in sleep health measurements as opposed to the traditionally termed “validation” studies [48]. This is an effort to address the multiple new devices (and analytics algorithms), many with increased technologic capabilities, that are being used for sleep research—most without access to the raw data and proprietary algorithms that would allow for a scientific evaluation—in a variety of different populations.
Data Privacy
Data privacy issues are not unique to digital sleep health studies but apply to any study where participants are asked to share wearable and other health data electronically. While the number of individuals who use wearables has increased substantially, there are increasing concerns regarding data privacy [49]. Wearables collect an enormous amount of data—not only physical activity and sleep metrics, but personal information about the participants, including zip code, GPS location, and time stamps. In addition to the security of the databases themselves that store large datasets, there are also concerns regarding the exposures participants may face when linking their devices to cloud-based storage systems. Each layer of sharing has the potential to expose the participant to additional risks, with potential for reidentification remaining a large concern. A recent systemic review of 72 studies where participants shared biometric data of some kind, showed high risk to the participants of reidentification, with sometimes as little as 1–300 s of data needed to identify the an individual participant [50]. On an individual level, participants may also be concerned about which aspects of their data will be shared and how the information will be used. In one study, non-English speaking participants expressed concern about the term “tracker,” in activity tracker and how this would be applied to them [51].
Bias Within Samples and Diversity-Related Issues in Wearables
Bias within samples from consumer wearables is a major consideration for researchers, and relates to increasing the diversity of participant populations, which remains important for the advancement and strengthening of clinical research, including in sleep research [52, 53].
First, in regard to selection bias and representative samples, equity is certainly lacking in access to digital resources as well as the specific demographic groups that regularly purchase and use activity trackers. Even with growing adoption of broadband internet and smartphones among rural and lower-income Americans, there is still a digital divide in the use of internet-connected devices including mobile health technologies [54,55,56]. More studies are needed, but Pew research and data reported by Chandrasekaran et al. suggest that about 20–30% of Americans use a smartwatch or wearable fitness tracker, with women reporting higher usage than men [57, 58]. Individuals earning ≥ $75,000 annually were found to be more than twice as likely to wear a fitness tracker compared to those earning less; college graduates, white adults, and those who enjoyed exercise were more likely to wear a device as well [57]. Furthermore, more adults under the age of 50 use wearables compared to older adults, and suburban populations own more wearables than their urban and rural counterparts [58]. In general, those who buy and wear these devices are possibly more health conscious and may possess a greater understanding of health-related information. These consumer patterns naturally lend themselves to selection bias, particularly when examining secondary data from digital devices. Unless researchers are providing devices to their intended population or filtering participants in some capacity, a degree of selection bias should be expected. However, these studies are no less in important in adding to the literature given the potential to reach large populations.
Another source of potential bias in how wearables are used comes from the underlying technology itself. Some studies have reported that wearable devices, and the technology that underlies even clinical-grade pulse oximetry, have lower accuracy among users with darker skin tones, specifically for heart rate and oxygen sensing, with a recent class action lawsuit claiming racial bias against a high-profile wearable technology company [59,60,61].
All of the sources of bias discussed above have an impact on how diverse participants are included in sleep-related research. While these are all very important considerations, there may also be a flip side for digital devices in that there is also a potential to help bridge important gaps in research-related diversity issues. Studies in underserved and underrepresented communities show that while cost and knowledge of fitness trackers were barriers to having a device, the majority of individuals report they would use a device or mobile health application, and would be willing to share data for research purposes [51, 62]. Given that digital devices are relatively inexpensive, they may be more cost-effective to deploy to underserved communities compared to the more expensive actigraphy devices. Furthermore, in a bring-your-own-device (BYOD) approach (discussed below), enrolling participants who already have a device may free up funding and other resources to provide devices for those who are unable to afford them. Additionally, digitally deployed studies—where recruitment is decentralized (also discussed below)—could have a further reach in recruiting diverse populations than traditional in-person studies [63]. Finally, wearables can feedback health (and sleep) data back to participants, particularly in the context of digital research studies, where survey results, interpretations, and general recommendations can also be readily returned to participants.
When considering how wearables impact diversity, two opposing arguments could be made, as outlined below.
Bring-Your-Own-Device Approach
While investigators may be able to provide devices for hundreds or even thousands of individuals, it would become a much larger, and likely unsustainable, undertaking to provide devices for the tens to hundreds of thousands of individuals that often comprise population-level studies. Instead, given that up to 30% of individuals in the USA (equating to roughly 100 million people) currently use a wearable device, a bring-your-own-device (BYOD) approach to harness data from devices that individuals are already using may be the most effective [57]. For example, we were previously able to obtain sleep data from a large population (over 150,000 individuals) of Fitbit users and assess how sleep duration and variability correlated with self-reported BMI in the cohort [64]. This study was feasible only because these individuals already owned activity trackers and used them consistently prior to accessing the data. The DETECT study utilized a prospective BYOD approach based on data from wearables (including sleep information) in an effort to track COVID-19 infections [65], monitor long-COVID, and study vaccine reactogenicity [66, 67]. This study has enrolled over 40,000 participants to date [46]. Similarly, the TemPredict study analyzed biometric data from over 60,000 individuals wearing an OuraRing in an effort to detect COVID-19 [68]. While the latter two studies arose from a need (and likely public desire) to help fight COVID-19, they demonstrate how effective digital recruiting can be on a large scale. Furthermore, the BYOD design also allows for allocation of research funding and resources to provide devices to those who do not already have one or who cannot afford one, thus combatting some of the diversity issues described above.
Where we Might be Headed: Decentralized Digital Studies
As intimated above, most studies utilizing activity trackers are site-based, small, and observational, even though the potential for large population studies is immense. The current paradigm of in-person sleep research where one institution (or multiple institutions in multi-site trials) recruits hundreds to thousands of participants may pose limits on how data from wearable devices are retrieved, processed, and aggregated. Furthermore, recruitment for site-based research has geographic constraints since most study participants reside near study sites for in-person visits, which are typically located near urban academic medical centers. This selection bias results in homogenous study cohorts that do not fully represent the diversity of the real-word patient population, thus creating evidence that only applies to a limited group of patients and further propagates health disparities [69]. The decentralized study approach can leverage direct-to-participant engagement using digital recruitment methods through social media networks, apps, and other online channels without geographic restrictions [70, 71]. Sleep research may be able to move towards the same idea on a large scale with the utilization of wearables for fully remote sleep tracking [72]. With the use of 3rd party platforms and/or smartphone application, data collection from wearables among large study populations are increasingly feasible and more easily managed, in large part due to the COVID-19 pandemic [73, 74]. Furthermore, without research coordinators to conduct in-person screening, recruitment, and enrollment, the lack of manpower that sometimes limits studies would be a smaller hurdle in the conduction of large investigations. While the UK Biobank study recruited 500,000 participants (with ~ 100 K subsequently invited to wear accelerometer devices) over 22 sites [75], and the All of Us Research Program (AOURP) in the USA has recruited over 600,000 individuals to date (with participants offered the BYOD option to share sleep data) via digital and in-person recruitment, these studies had far more resources than most investigators have access to [76]. One of the main disadvantages of decentralizing studies is likely related to reliable data collection and ensuring that participants are completing any questionnaires/surveys and reliably linking their devices to the study—it may be unrealistic to expect the same level of engagement in a digital study that one would find in an in-person study.
An interesting aspect to consider in decentralized studies is the ability to bring in electronic health record (EHR) data, which has the potential to make data sets more robust as it provides background health information, as well as possibly vital signs and laboratory measurements [77]. Having access to these aspects of an individual’s healthcare can help enhance sleep-related research, and can also help us understand how sleep impacts important healthcare outcomes. Our approach to date is to have participants link their EHR data into digital studies when possible; however, there are major networks currently aggregating EHR data for the investigation of research questions. For examples, PCORnet studies have harnessed large-volume, real-world data through different Clinical Research Networks that contain patient data from multiple institutions all the way from major healthcare centers to community health clinics. Kaiser Permanente has been aggregating patient data for research purposes for decades [78, 79], and has recently aimed to have a biobank of over 400,000 patients that includes EHR and some biomarker data. The Veteran’s Administration (VA) also has an EHR database that can be mined for health research. While some sleep-related diagnoses may be able to be extracted from (e.g., insomnia, OSA) such datasets, future next steps may be to have individuals link their wearable device data into the EHR (and thus these networks) to make help networks link patient health data to daily digital footprints that include sleep data.
Future Considerations
Until recently, applications for wearable devices have revolved around the consumer, with a goal of increasing users in the wearable market by the owner company. Now, however, some devices are becoming cleared for specific medical use purposes (for example, Fitbit and Apple watch technology is FDA approved for atrial fibrillation), and clinical purposes that involve sleep disorders may be developed in the future, with the intent of diagnosis, monitoring, or even treatment. Given the proliferation of commercial wearables, we expect that the presence of these devices in the research space for discoveries related to sleep will only increase. Researchers may need to become more comfortable with the limitations of the data collection from these devices (e.g., not having PSG or granular actigraphy data available) and also recognize that differences in algorithms between devices can impact research findings. As the consumer space evolves, researchers may also need to adapt to different wearables. It also remains to be seen how popular wearables remain in the consumer setting as well, which could impact how these devices are utilized for in a sleep research context.
At the current juncture, wearables show promise in becoming a core component of the next wave of sleep-related research studies. One of the next stepping stones would be increasing the number of interventional studies, as most of the population-level studies completed using wearables to date have been observational in nature. Certainly, most modern wearables seem reasonable substitutes for actigraphy, and we hope that as technology and algorithms improve, deeper sleep-level data will also become available. Over time, we also hope that more granular data is made available by the companies that market these devices. Additionally, it would be interesting to see if wearables themselves could be used as an intervention on a large scale to help individuals track their own biometric data to improve their own sleep and possibly other potentially sleep-dependent clinical outcomes. Interestingly, not all studies with wearables have been positive—for example, step counters used for weight loss may have instead led participants to reward high step counts with additional food [80]. And, for sleep devices, some sleep data leads people to try and perfect their sleep, causing frustration where none existed before—so-called orthosomnia [81]. Important questions remain around accuracy with different skin tones, and health equity.
Conclusions
We anticipate an increase in wearable-based sleep research in the near future, and we may also see wearables become more prominent in the clinical setting as another tool to improve outcomes. More randomized clinical trials are needed in the population-level sleep space, and decentralized digital studies may serve as an important opportunity in accomplishing this goal.
References
Henriksen A, Haugen Mikalsen M, Woldaregay AZ, Muzny M, Hartvigsen G, Hopstock LA, Grimsgaard S. Using fitness trackers and smartwatches to measure physical activity in research: analysis of consumer wrist-worn wearables. J Med Internet Res. 2018;20(3):e110. https://doi.org/10.2196/jmir.9157.
Grandner MA, Patel NP, Perlis ML, Gehrman PR, Xie D, Sha D, et al. Obesity, diabetes, and exercise associated with sleep-related complaints in the American population. Z Gesundh Wiss. 2011;19(5):463–74. https://doi.org/10.1007/s10389-011-0398-2.
Williams NJ, Grandner MA, Wallace DM, Cuffee Y, Airhihenbuwa C, Okuyemi K, et al. Social and behavioral predictors of insufficient sleep among African Americans and Caucasians. Sleep Med. 2016;18:103–7. https://doi.org/10.1016/j.sleep.2015.02.533.
Grandner MA, Martin JL, Patel NP, Jackson NJ, Gehrman PR, Pien G, et al. Age and sleep disturbances among American men and women: data from the U.S. Behavioral Risk Factor Surveillance System. Sleep. 2012;35(3):395–406. https://doi.org/10.5665/sleep.1704.
Ayas NT, White DP, Al-Delaimy WK, Manson JE, Stampfer MJ, Speizer FE, et al. A prospective study of self-reported sleep duration and incident diabetes in women. Diabetes Care. 2003;26(2):380–4.
Jungquist CR, Mund J, Aquilina AT, Klingman K, Pender J, Ochs-Balcom H, et al. Validation of the Behavioral Risk Factor Surveillance System Sleep questions. J Clin Sleep Med. 2016;12(3):301–10. https://doi.org/10.5664/jcsm.5570.
Ancoli-Israel S, Cole R, Alessi C, Chambers M, Moorcroft W, Pollak CP. The role of actigraphy in the study of sleep and circadian rhythms. Sleep. 2003;26(3):342–92.
Sadeh A, Hauri PJ, Kripke DF, Lavie P. The role of actigraphy in the evaluation of sleep disorders. Sleep. 1995;18(4):288–302.
Jaiswal SJ, McCarthy TJ, Wineinger NE, Kang DY, Song J, Garcia S, et al. Melatonin and Sleep in preventing hospitalized delirium: a randomized clinical trial. Am J Med. 2018;131(9):1110–7. https://doi.org/10.1016/j.amjmed.2018.04.009. e4
Cook DJ, Arora VM, Chamberlain M, Anderson S, Peirce L, Erondu A, et al. Improving hospitalized children’s sleep by reducing excessive overnight blood pressure monitoring. Pediatrics. 2020;146(3) https://doi.org/10.1542/peds.2019-2217.
DePietro RH, Knutson KL, Spampinato L, Anderson SL, Meltzer DO, Van Cauter E, Arora VM. Association between inpatient sleep loss and hyperglycemia of hospitalization. Diabetes Care. 2017;40(2):188–93. https://doi.org/10.2337/dc16-1683.
Ancoli-Israel S, Clopton P, Klauber MR, Fell R, Mason W. Use of wrist activity for monitoring sleep/wake in demented nursing-home patients. Sleep. 1997;20(1):24–7.
Palesh O, Aldridge-Gerry A, Zeitzer JM, Koopman C, Neri E, Giese-Davis J, et al. Actigraphy-measured sleep disruption as a predictor of survival among women with advanced breast cancer. Sleep. 2014;37(5):837–42. https://doi.org/10.5665/sleep.3642.
Quante M, Kaplan ER, Cailler M, Rueschman M, Wang R, Weng J, et al. Actigraphy-based sleep estimation in adolescents and adults: a comparison with polysomnography using two scoring algorithms. Nat Sci Sleep. 2018;10:13–20. https://doi.org/10.2147/NSS.S151085.
Stone KL, Ancoli-Israel S, Blackwell T, Ensrud KE, Cauley JA, Redline S, et al. Actigraphy-measured sleep characteristics and risk of falls in older women. Arch Intern Med. 2008;168(16):1768–75. https://doi.org/10.1001/archinte.168.16.1768.
Blackwell T, Yaffe K, Ancoli-Israel S, Redline S, Ensrud KE, Stefanick ML, et al. Association of sleep characteristics and cognition in older community-dwelling men: the MrOS sleep study. Sleep. 2011;34(10):1347–56. https://doi.org/10.5665/SLEEP.1276.
Stone KL, Blackwell TL, Ancoli-Israel S, Cauley JA, Redline S, Marshall LM, et al. Sleep disturbances and risk of falls in older community-dwelling men: the outcomes of Sleep Disorders in Older Men (MrOS Sleep) Study. J Am Geriatr Soc. 2014;62(2):299–305. https://doi.org/10.1111/jgs.12649.
Ramos AR, Weng J, Wallace DM, Petrov MR, Wohlgemuth WK, Sotres-Alvarez D, et al. Sleep patterns and hypertension using actigraphy in the Hispanic Community Health Study/Study of Latinos. Chest. 2018;153(1):87–93. https://doi.org/10.1016/j.chest.2017.09.028.
Agudelo C, Tarraf W, Wu B, Wallace DM, Patel SR, Redline S, et al. Actigraphic sleep patterns and cognitive decline in the Hispanic Community Health Study/Study of Latinos. Alzheimers Dement. 2021;17(6):959–68. https://doi.org/10.1002/alz.12250.
Katori M, Shi S, Ode KL, Tomita Y, Ueda HR. The 103,200-arm acceleration dataset in the UK Biobank revealed a landscape of human sleep phenotypes. Proc Natl Acad Sci U S A. 2022;119(12):e2116729119. https://doi.org/10.1073/pnas.2116729119.
Doherty A, Jackson D, Hammerla N, Plotz T, Olivier P, Granat MH, et al. Large scale population assessment of physical activity using wrist worn accelerometers: the UK Biobank study. PLoS One. 2017;12(2):e0169649. https://doi.org/10.1371/journal.pone.0169649.
de Zambotti M, Baker FC, Willoughby AR, Godino JG, Wing D, Patrick K, Colrain IM. Measures of sleep and cardiac functioning during sleep using a multi-sensory commercially-available wristband in adolescents. Physiol Behav. 2016;158:143–9. https://doi.org/10.1016/j.physbeh.2016.03.006.
Razjouyan J, Lee H, Parthasarathy S, Mohler J, Sharafkhaneh A, Najafi B. Improving sleep quality assessment using wearable sensors by including information from postural/sleep position changes and body acceleration: a comparison of chest-worn sensors, wrist actigraphy, and polysomnography. J Clin Sleep Med. 2017;13(11):1301–10. https://doi.org/10.5664/jcsm.6802.
Chinoy ED, Cuellar JA, Huwa KE, Jameson JT, Watson CH, Bessman SC, et al. Performance of seven consumer sleep-tracking devices compared with polysomnography. Sleep. 2021;44(5) https://doi.org/10.1093/sleep/zsaa291.
Roberts DM, Schade MM, Mathew GM, Gartenberg D, Buxton OM. Detecting sleep using heart rate and motion data from multisensor consumer-grade wearables, relative to wrist actigraphy and polysomnography. Sleep. 2020;43(7) https://doi.org/10.1093/sleep/zsaa045.
Perez-Pozuelo I, Posa M, Spathis D, Westgate K, Wareham N, Mascolo C, et al. Detecting sleep outside the clinic using wearable heart rate devices. Sci Rep. 2022;12(1):7956. https://doi.org/10.1038/s41598-022-11792-7.
Miller DJ, Sargent C, Roach GD. A validation of six wearable devices for estimating sleep, heart rate and heart rate variability in healthy adults. Sensors (Basel). 2022;22(16) https://doi.org/10.3390/s22166317.
Baron KG, Duffecy J, Berendsen MA, Cheung Mason I, Lattie EG, Manalo NC. Feeling validated yet? A scoping review of the use of consumer-targeted wearable and mobile technology to measure and improve sleep. Sleep Med Rev. 2018;40:151–9. https://doi.org/10.1016/j.smrv.2017.12.002.
Burkart S, Beets MW, Armstrong B, Hunt ET, Dugger R, von Klinggraeff L, et al. Comparison of multichannel and single-channel wrist-based devices with polysomnography to measure sleep in children and adolescents. J Clin Sleep Med. 2021;17(4):645–52. https://doi.org/10.5664/jcsm.8980.
Quer G, Gouda P, Galarnyk M, Topol EJ, Steinhubl SR. Inter- and intraindividual variability in daily resting heart rate and its associations with age, sex, sleep, BMI, and time of year: Retrospective, longitudinal cohort study of 92,457 adults. PLoS One. 2020;15(2):e0227709. https://doi.org/10.1371/journal.pone.0227709.
Bowman C, Huang Y, Walch OJ, Fang Y, Frank E, Tyler J, et al. A method for characterizing daily physiology from widely used wearables. Cell Rep Methods. 2021;1(4) https://doi.org/10.1016/j.crmeth.2021.100058.
Lee HA, Lee HJ, Moon JH, Lee T, Kim MG, In H, et al. Comparison of wearable activity tracker with actigraphy for sleep evaluation and circadian rest-activity rhythm measurement in healthy young adults. Psychiatry Investig. 2017;14(2):179–85. https://doi.org/10.4306/pi.2017.14.2.179.
McMahon M, McConley I, Hashim C, Schnyer DM. Fitbit validation for rest-activity rhythm assessment in young and older adults. Smart Health. 2023;29:100418. https://doi.org/10.1016/j.smhl.2023.100418.
de Zambotti M, Cellini N, Goldstone A, Colrain IM, Baker FC. Wearable sleep technology in clinical and research settings. Med Sci Sports Exerc. 2019;51(7):1538–57. https://doi.org/10.1249/MSS.0000000000001947.
Liang Z, Chapa-Martell MA. Accuracy of Fitbit wristbands in measuring sleep stage transitions and the effect of user-specific factors. JMIR Mhealth Uhealth. 2019;7(6):e13384. https://doi.org/10.2196/13384.
de Zambotti M, Goldstone A, Claudatos S, Colrain IM, Baker FC. A validation study of Fitbit Charge 2 compared with polysomnography in adults. Chronobiol Int. 2018;35(4):465–76. https://doi.org/10.1080/07420528.2017.1413578.
Bidargaddi N, Musiat P, Makinen VP, Ermes M, Schrader G, Licinio J. Digital footprints: facilitating large-scale environmental psychiatric research in naturalistic settings through data from everyday technologies. Mol Psychiatry. 2017;22(2):164–9. https://doi.org/10.1038/mp.2016.224.
Lin YH, Wong BY, Pan YC, Chiu YC, Lee YH. Validation of the mobile app-recorded circadian rhythm by a digital footprint. JMIR Mhealth Uhealth. 2019;7(5):e13421. https://doi.org/10.2196/13421.
Wu C, McMahon M, Fritz H, Schnyer DM. Circadian rhythms are not captured equal: exploring circadian metrics extracted by differentcomputational methods from smartphone accelerometer and GPS sensors in daily life tracking. Digit Health. 2022;8:20552076221114200. https://doi.org/10.1177/20552076221114201.
Braund TA, Zin MT, Boonstra TW, Wong QJJ, Larsen ME, Christensen H, et al. Smartphone sensor data for identifying and monitoring symptoms of mood disorders: a longitudinal observational study. JMIR Ment Health. 2022;9(5):e35549. https://doi.org/10.2196/35549.
Lin YH, Wong BY, Lin SH, Chiu YC, Pan YC, Lee YH. Development of a mobile application (App) to delineate “digital chronotype” and the effects of delayed chronotype by bedtime smartphone use. J Psychiatr Res. 2019;110:9–15. https://doi.org/10.1016/j.jpsychires.2018.12.012.
Druijff-van de Woestijne GB, McConchie H, de Kort YAW, Licitra G, Zhang C, Overeem S, Smolders K. Behavioural biometrics: using smartphone keyboard activity as a proxy for rest-activity patterns. J Sleep Res. 2021;30(5):e13285. https://doi.org/10.1111/jsr.13285.
Ceolini E, Ghosh A. Common multi-day rhythms in smartphone behavior. NPJ Digit Med. 2023;6(1):49. https://doi.org/10.1038/s41746-023-00799-7.
Menghini L, Cellini N, Goldstone A, Baker FC, de Zambotti M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. 2021;44(2) https://doi.org/10.1093/sleep/zsaa170.
Conroy B, Silva I, Mehraei G, Damiano R, Gross B, Salvati E, et al. Real-time infection prediction with wearable physiological monitoring and AI to aid military workforce readiness during COVID-19. Sci Rep. 2022;12(1):3797. https://doi.org/10.1038/s41598-022-07764-6.
Quer G, Radin JM, Gadaleta M, Baca-Motes K, Ariniello L, Ramos E, et al. Wearable sensor data and self-reported symptoms for COVID-19 detection. Nat Med. 2021;27(1):73–7. https://doi.org/10.1038/s41591-020-1123-x.
Rodriguez-Leon C, Villalonga C, Munoz-Torres M, Ruiz JR, Banos O. Mobile and wearable technology for the monitoring of diabetes-related parameters: systematic review. JMIR Mhealth Uhealth. 2021;9(6):e25138. https://doi.org/10.2196/25138.
de Zambotti M, Menghini L, Grandner MA, Redline S, Zhang Y, Wallace ML, Buxton OM. Rigorous performance evaluation (previously, “validation”) for informed use of new technologies for sleep health measurement. Sleep Health. 2022;8(3):263–9. https://doi.org/10.1016/j.sleh.2022.02.006.
Jiang D, Shi G. Research on data security and privacy protection of wearable equipment in healthcare. J Healthc Eng. 2021;2021:6656204. https://doi.org/10.1155/2021/6656204.
Chikwetu L, Miao Y, Woldetensae MK, Bell D, Goldenholz DM, Dunn J. Does deidentification of data from wearable devices give us a false sense of security? A systematic review. Lancet Digital Health https://doi.org/10.1016/S2589-7500(22)00234-5.
Holko M, Litwin TR, Munoz F, Theisz KI, Salgin L, Jenks NP, et al. Wearable fitness tracker use in federally qualified health center patients: strategies to improve the health of all of us using digital health devices. NPJ Digit Med. 2022;5(1):53. https://doi.org/10.1038/s41746-022-00593-x.
Sharma A, Palaniappan L. Improving diversity in medical research. Nat Rev Dis Primers. 2021;7(1):74. https://doi.org/10.1038/s41572-021-00316-8.
Spong CY, Bianchi DW. Improving public health requires inclusion of underrepresented populations in research. JAMA. 2018;319(4):337–8. https://doi.org/10.1001/jama.2017.19138.
Reddick CG, Enriquez R, Harris RJ, Sharma B. Determinants of broadband access and affordability: an analysis of a community survey on the digital divide. Cities. 2020;106:102904. https://doi.org/10.1016/j.cities.2020.102904.
Dolcini MM, Canchola JA, Catania JA, Song Mayeda MM, Dietz EL, Cotto-Negron C, Narayanan V. National-level disparities in internet access among low-income and Black and Hispanic youth: current population survey. J Med Internet Res. 2021;23(10):e27723. https://doi.org/10.2196/27723.
Early J, Hernandez A. Digital disenfranchisement and COVID-19: broadband internet access as a social determinant of health. Health Promot Pract. 2021;22(5):605–10. https://doi.org/10.1177/15248399211014490.
Chandrasekaran R, Katthula V, Moustakas E. Patterns of use and key predictors for the use of wearable health care devices by US adults: insights from a national survey. J Med Internet Res. 2020;22(10):e22443. https://doi.org/10.2196/22443.
Vogels E: About one-in-five Americans use a smart watch or fitness tracker. https://www.pewresearch.org/fact-tank/2020/01/09/about-one-in-five-americans-use-a-smart-watch-or-fitness-tracker/ (2020). Accessed 6/20/2022 2022.
Shcherbina A, Mattsson CM, Waggott D, Salisbury H, Christle JW, Hastie T, et al. Accuracy in wrist-worn, sensor-based measurements of heart rate and energy expenditure in a diverse cohort. J Pers Med. 2017;7(2) https://doi.org/10.3390/jpm7020003.
Colvonen PJ, DeYoung PN, Bosompra NA, Owens RL. Limiting racial disparities and bias for wearable devices in health science research. Sleep. 2020;43(10) https://doi.org/10.1093/sleep/zsaa159.
Fawzy A, Wu TD, Wang K, Robinson ML, Farha J, Bradke A, et al. Racial and ethnic discrepancy in pulse oximetry and delayed identification of treatment eligibility among patients with COVID-19. JAMA Intern Med. 2022;182(7):730–8. https://doi.org/10.1001/jamainternmed.2022.1906.
Liu P, Astudillo K, Velez D, Kelley L, Cobbs-Lomax D, Spatz ES. Use of mobile health applications in low-income populations: a prospective study of facilitators and barriers. Circ Cardiovasc Qual Outcomes. 2020;13(9):e007031. https://doi.org/10.1161/CIRCOUTCOMES.120.007031.
Ramos E, Baca-Motes K, Pandit JA, Ajayi TA. Improving participant representation in the era of digital clinical studies. Trends Mol Med. 2022;28(12):1019–21. https://doi.org/10.1016/j.molmed.2022.07.004.
Jaiswal SJ, Quer G, Galarnyk M, Steinhubl SR, Topol EJ, Owens RL. Association of sleep duration and variability with body mass index: sleep measurements in a large US population of wearable sensor users. JAMA Intern Med. 2020;180(12):1694–6. https://doi.org/10.1001/jamainternmed.2020.2834.
Gadaleta M, Radin JM, Baca-Motes K, Ramos E, Kheterpal V, Topol EJ, et al. Passive detection of COVID-19 with wearable sensors and explainable machine learning algorithms. NPJ Digit Med. 2021;4(1):166. https://doi.org/10.1038/s41746-021-00533-1.
Radin JM, Quer G, Ramos E, Baca-Motes K, Gadaleta M, Topol EJ, Steinhubl SR. Assessment of prolonged physiological and behavioral changes associated with COVID-19 infection. JAMA Netw Open. 2021;4(7):e2115959. https://doi.org/10.1001/jamanetworkopen.2021.15959.
Quer G, Gadaleta M, Radin JM, Andersen KG, Baca-Motes K, Ramos E, et al. Inter-individual variation in objective measure of reactogenicity following COVID-19 vaccination via smartwatches and fitness bands. NPJ Digit Med. 2022;5(1):49. https://doi.org/10.1038/s41746-022-00591-z.
Mason AE, Hecht FM, Davis SK, Natale JL, Hartogensis W, Damaso N, et al. Detection of COVID-19 using multimodal data from a wearable device: results from the first TemPredict Study. Sci Rep. 2022;12(1):3463. https://doi.org/10.1038/s41598-022-07314-0.
Rosen CJ, Cook NL, Wilkins CH. Diversity in clinical trials - next steps. N Engl J Med. 2022;387(15):e34. https://doi.org/10.1056/NEJMp2201379.
Radin JM, Peters S, Ariniello L, Wongvibulsin S, Galarnyk M, Waalen J, Steinhubl SR. Pregnancy health in POWERMOM participants living in rural versus urban zip codes. J Clin Transl Sci. 2020;4(5):457–62. https://doi.org/10.1017/cts.2020.33.
Radin JM, Steinhubl SR, Su AI, Bhargava H, Greenberg B, Bot BM, et al. The Healthy Pregnancy Research Program: transforming pregnancy research through a ResearchKit app. NPJ Digit Med. 2018;1:45. https://doi.org/10.1038/s41746-018-0052-2.
Jaiswal SJ, Quer G, Galarnyk M, Steinhubl SR, Topol EJ, Owens RL. Association of sleep duration and variability with body mass index: sleep measurements in a large US population of wearable sensor users. JAMA Intern Med. 2020. https://doi.org/10.1001/jamainternmed.2020.2834.
Best J. Wearable technology: covid-19 and the rise of remote clinical monitoring. BMJ. 2021;372:n413. https://doi.org/10.1136/bmj.n413.
Channa A, Popescu N, Skibinska J, Burget R. The rise of wearable devices during the COVID-19 pandemic: a systematic review. Sensors (Basel). 2021;21(17) https://doi.org/10.3390/s21175787.
Matthews PM, Sudlow C. The UK Biobank. Brain. 2015;138(Pt 12):3463–5. https://doi.org/10.1093/brain/awv335.
The “All of Us” Research Program. New England Journal of Medicine. 2019;381(7):668-76 https://doi.org/10.1056/NEJMsr1809937.
Pawelek J, Baca-Motes K, Pandit JA, Berk BB, Ramos E. The power of patient engagement with electronic health records as research participants. JMIR Med Inform. 2022;10(7):e39145. https://doi.org/10.2196/39145.
Palzes VA, Weisner C, Chi FW, Kline-Simon AH, Satre DD, Hirschtritt ME, et al. The Kaiser Permanente Northern California Adult Alcohol Registry, an electronic health records-based registry of patients with alcohol problems: development and implementation. JMIR Med Inform. 2020;8(7):e19081. https://doi.org/10.2196/19081.
Selby JV, Friedman GD, Quesenberry CP Jr, Weiss NS. A case-control study of screening sigmoidoscopy and mortality from colorectal cancer. N Engl J Med. 1992;326(10):653–7. https://doi.org/10.1056/NEJM199203053261001.
Jakicic JM, Davis KK, Rogers RJ, King WC, Marcus MD, Helsel D, et al. Effect of wearable technology combined with a lifestyle intervention on long-term weight loss: the IDEA randomized clinical trial. JAMA. 2016;316(11):1161–71. https://doi.org/10.1001/jama.2016.12858.
Baron KG, Abbott S, Jao N, Manalo N, Mullen R. Orthosomnia: are some patients taking the quantified self too far? J Clin Sleep Med. 2017;13(2):351–4. https://doi.org/10.5664/jcsm.6472.
de Zambotti M, Goldstone A, Claudatos S, Colrain IM, Baker FC. A validation study of Fitbit Charge 2 compared with polysomnography in adults. Chronobiol Int. 2017:1–12. doi: https://doi.org/10.1080/07420528.2017.1413578.
Funding
Open access funding provided by SCELC, Statewide California Electronic Library Consortium. This work was funded in part by grant number UL1TR002550 from the National Center for Advancing Translational Sciences at the National Institutes of Health.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Human and Animal Rights and Informed Consent
No animals or human subjects were utilized in the writing of this review article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jaiswal, S.J., Pawelek, J.B., Warshawsky, S. et al. Using New Technologies and Wearables for Characterizing Sleep in Population-based Studies. Curr Sleep Medicine Rep 10, 82–92 (2024). https://doi.org/10.1007/s40675-023-00272-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40675-023-00272-7