Abstract
Osteoarthritis (OA) is a chronic condition often associated with pain, affecting approximately fourteen percent of the population, and increasing in prevalence. A globally aging population have made treating OA-associated pain as well as maintaining mobility and activity a public health priority. OA affects all mammals, and the use of spontaneous animal models is one promising approach for improving translational pain research and the development of effective treatment strategies. Accelerometers are a common tool for collecting high-frequency activity data on animals to study the effects of treatment on pain related activity patterns. There has recently been increasing interest in their use to understand treatment effects in human pain conditions. However, activity patterns vary widely across subjects; furthermore, the effects of treatment may manifest in higher or lower activity counts or in subtler ways like changes in the frequency of certain types of activities. We use a zero inflated Poisson hidden semi-Markov model to characterize activity patterns and subsequently derive estimators of the treatment effect in terms of changes in activity levels or frequency of activity type. We demonstrate the application of our model, and its advance over traditional analysis methods, using data from a naturally occurring feline OA-associated pain model.
Similar content being viewed by others
Introduction
To anyone who suffers chronic and persistent musculoskeletal pain, the negative impact on their quality of life is constant. In the U.S., more than 100 million people (nearly one third of the population) suffer from persistent pain with an associated economic cost of $600 billion USD annually; this cost exceeds that of cardiovascular disease, cancer, and diabetes combined1. The most significant contribution to this cost comes from the impact of osteoarthritis (OA) and other musculoskeletal pain1. OA results in the deterioration of all components of joints2 and is often associated with pain. OA-associated pain results in significant morbidity and economic costs3; these costs, an aging population, and the growing knowledge of the health and psychological benefits of maintaining mobility and activity, have made the treatment of OA and the associated pain a public health priority4,5,6,7,8. However, recent review papers show that the current practice of translational biomedical research is not producing new therapeutics for pain control in humans9,10,11. While these reviews highlight the lack of translation of basic research into new approved therapeutics for treatment of persistent pain in humans, they also discuss how the processes involved could be optimized to improve the chances of successful translation, including discussion of improved models and more relevant outcome measures. As OA and associated pain affects all mammals, the study of OA in non-human animals is both important in its own right (to increase the function and quality of life in affected animals) and for its ability to generate new knowledge about the treatment of OA in humans. Recently, the use of naturally occurring OA in pet animals has been suggested as a means of helping to improve translational research for the development of analgesic therapeutics in humans12. A significant advantage of naturally occurring disease in pets as a model of human conditions is the variation and complexity of the model. Measurement of spontaneous activity as an outcome measure may be particularly relevant to translational work as spontaneous activity may relate to spontaneous pain—something that has been difficult to model in animals. Just as the variability in naturally occurring disease is an advantage, the variability in clinically meaningful outcome measures such as activity is a challenge. This could be the reason for the lack of use of activity as a functional outcome measure in pain studies in humans, despite the importance of mobility and activity to quality of life7. The work we are reporting is motivated by our involvement in studies of OA-associated pain and the treatment effects on mobility and activity in domestic cats13,14. We posit that the methodology we describe presents a significant advance in how such activity data are analyzed when used as an outcome measure and has relevance to both large animal models and clinical evaluation in humans.
The current study uses data from a randomized cross-over study designed to evaluate the effectiveness of meloxicam, a nonsteroidal anti-inflammatory drug (NSAID), in owned, pet adult domestic cats with OA-associated pain15. In this study, cat activity patterns were measured at one-minute intervals using an omnidirectional accelerometer16. Accelerometer readings are integers quantifying the intensity of change in acceleration over the preceding, pre-defined epoch. Thus, for each subject, the accelerometer produces high-frequency, integer-valued, longitudinal data. The main goal of the present study is to re-evaluate the use of and analysis of activity data to determine whether meloxicam is effective in reducing OA-associated pain. We aimed to use the objectively measured physical activity recorded via accelerometers and analyze these data using a novel zero-inflated hidden Poisson semi-Markov model. The primary hypothesis of the present study is that meloxicam reduces OA-associated pain and this manifests through increased activity during more intense activity during the treatment period relative to the placebo period. One common approach for the analyses of such data is to aggregate the observed accelerometer data within each subject and treatment condition and subsequently to use an ANOVA to compare treatment with control15,16,17. Alternatively, one could aggregate the data over a shorter time interval, e.g., a day, and model the aggregated process using methods for smooth longitudinal data18. However, such aggregation can obscure changes in behavior patterns which do not produce a large difference in mean activity. For example, an effective treatment that reduces pain may lead to higher quality rest (more zero activity readings), but also more high-intensity movement (more high activity readings), potentially producing no change in mean activity. High volume activity data has the potential to be a more sensitive outcome measure, but thus far, analysis of such complex, high-volume multidirectional effects on activity, that takes into account the wide variation in individual activity patterns, has not been developed.
We propose modeling the minute-by-minute accelerometer data using a hidden semi-Markov model. Such a model is scientifically appealing in that the hidden states can be viewed as corresponding to latent (unobserved) activities, e.g., running, walking, resting, etc., and state duration corresponds to the length of time a subject is engaged in a given activity. Latent state-space models are common in the analysis of mobility data measured using wearable computing19,20,21,22,23. In the context of treatment evaluation, hidden Markov models have also been used to model latent health states and subsequently conduct inference for activity patterns in terms of transitions among these states and activity patterns within each state24,25; however, previous applications have considered relatively coarse time scales, e.g., daily or weekly measurements, and subsequently low data volume. In the current application, subject activity is measured every minute and as a result each subject has more than one-hundred thousand measurements during the observation period. Such high volume allows for flexible modeling and estimation of distinct parameters for each subject; this is important as activity patterns can vary widely across subjects.
In the current example of the spontaneous cat OA-pain model, more than 70% of the observations are zero and therefore we propose a zero-inflated hidden Poisson semi-Markov model with patient-specific intensities in each latent state. The proposed model is an extension of a zero-inflated Poisson hidden Markov model26,27 to the hidden semi-Markov model framework that allows for distinct state duration densities28,29,30. Furthermore, to facilitate computationally efficient estimation with high-frequency data, we propose a two stage estimation procedure31,32 that can be used with millions of observations without specialized computing resources.
The data for this work come from a study of the treatment effects of meloxicam (a non-steroidal anti-inflammatory drug) in a feline model of spontaneous OA-associated pain15,33. This study was a randomized, double-blind, placebo-controlled, crossover study to evaluate the effectiveness of meloxicam treatment to improve mobility and function in owned, pet cats with OA-associated pain. This study was approved by the Animal Care and Use Committee (Protocol # 11-102-O) at North Carolina State University College of Veterinary Medicine (NCSU-CVM), and written owner consent was granted in each case following verbal discussion of the study.
Results
Figure 1 displays the daily activity profile for a typical subject. There are multiple prolonged intervals in which there is no movement. In total, more than 70% of the minute-by-minute activity counts are zero; furthermore, the non-zero counts are heavily right-skewed owing to infrequent but high-intensity activities. Thus, such cat accelerometer data are heavy-tailed with an excessive proportion of zeros.
Figure 2 displays mean activity counts for each subject across each period of the study. In this figure, each gray line shows the mean activity count for a single subject in the study, and the black line shows the overall mean activity count for all subjects in their associated treatment group. The left panel shows data for the subjects that were randomized to meloxicam in the first treatment period, whereas the right panel shows data for the subjects that were randomized to placebo in the first treatment period. In both panels, one can see that the overall mean activity count is higher under meloxicam than placebo, but only marginally. A total of 41 out of 58 cats have a numerically larger mean activity count in the meloxicam period than in the placebo period. However, there is large between-subject variability. The standard errors for the group mean activity count at each treatment period range from 16.7 to 18.1.
We applied the proposed zero-inflated hidden semi-Markov model (detailed in the Methods section) to the longitudinal accelerometer data from this study of the treatment effect of meloxicam. We chose the number of latent activity states to be six using minimum Bayesian information criterion (BIC) as our data-driven selection method. The covariates included in the model were: treatment status, weekend (yes/no), night (yes/no), sex, Body Condition Score (BCS), age, total OA score, and treatment sequence.
Table 1 displays the estimated treatment effects in terms of mean activity counts within each latent state. There are significant effects (p < 0.05) in latent states 4–6 suggesting that cats treated with meloxicam may have increased mean activity while engaged in more intensive activities (e.g., running, jumping, etc.). The effect size of the treatment on those more intensive activities ranges from 7.28 to 9.14%. Table 2 shows the mean proportion of time spent in each latent state estimated using the Viterbi algorithm34. It can be seen that the proportion of time spent in each activity is similar across the treatment and placebo periods. The proportion of time spent is each latent activity states is similar before and after the meloxicam treatment (Table 2); what significantly changes is the mean activity counts in more intense states (4–6), i.e., cats become more active in those intense activity states (Table 1). The mean activity count for each subject in the placebo and meloxicam treatment period for latent state 4–6 is shown in Fig. 3.
Figure 4 shows a QQ-plot of the estimated quantiles using the fitted model and the observed quantiles of the activity counts. It can be seen that they follow each other closely and that there are no indications of serious lack-of-fit.
Discussion
We proposed a zero-inflated Poisson hidden semi-Markov model for activity data captured by accelerometer readings in a naturally occurring model of OA-associated pain. The proposed model can be used to characterize the impact of treatment on activity including changes in activity duration, intensity, and frequency. In the context of OA-associated pain, understanding such impacts is critical to fully understanding the effects of treatment on patient mobility and therefore quality of life.
Our method presents a significant advance in the analysis of therapeutic efficacy using activity counts as compared to the original analysis in Gruen et al15 as well as subsequent work in Gruen et al18. In Gruen et al15, the activity counts were averaged within each treatment period for each cat, and an ANOVA model was used to determine the effectiveness of meloxicam. Their analysis showed a non-significant increase (p = 0.31) in the activity counts among the cats that received meloxicam in the first treatment period. This may be due to the effect of data aggregation method, which can smooth away important effects and reduce statistical power. Gruen et al18 aggregated the counts across days and then used functional data analysis methods to formally assess treatment effect. This method failed to identify a significant difference between normal and arthritic cats (p = 0.541) due to the same limitations discussed above.
In contrast, our proposed subject-specific method not only preserves the original data ‘as is’ but also accounts for the large between-subject variability. More importantly, our approach can analyze the conditional treatment effect in each activity state. Indeed, our model found that meloxicam significantly improves the mobility of cats in more intensive activities, while the activity patterns did not change in less intensive activities. Although an easy assumption is that the increase in more intense activities is beneficial, the clinical relevance of these findings is, at present, unknown.
Our approach is relevant to humans. Cats, like people, show patterns of activity, with large periods of relative or absolute inactivity. Cats are generally crepuscular—that is, they show phasic patterns of activity, as do humans. Such zero-inflated patterned activity is thus common to cats and humans, and so our approach is relevant to human activity data. As the approach does not need to know what activity is being performed, it negates the need for algorithms to detect specific activities.
Future work could support our approach by evaluating concurrent video capture of moving subjects to verify that as more intense activities are being performed, the relevant latent states change. Additionally, research is needed to define the clinical relevance of the changes we have detected using the zero-inflated hidden Poisson semi-Markov model.
Our refinement of the analysis of high-volume activity data in this naturally occurring model of pain provides a clinically relevant outcome measure that can be used in naturally occurring models osteoarthritis in companion animals to provide highly relevant data on the potential efficacy of putative drugs prior to Phase II and III efficacy studies in humans, thus improving the translational paradigm by optimizing the critical go/no-go decision prior to Phase II. Such sophisticated approaches to high-volume activity data have not been applied to humans and our approach also has relevance to the analysis of activity data in humans to better understand the impact of therapeutics in a more refined manner.
Methods
The original study was approved by the Animal Care and Use Committee (Protocol # 11–102-O) at North Carolina State University College of Veterinary Medicine (NCSU-CVM), and written owner consent was granted in each case following verbal discussion of the study. The study was performed in accordance with the relevant guidelines and regulations. Full details of the study have been previously published15 and pertinent methodological information is provided in Supplementary File 1.
A total of 66 subjects were enrolled in the original study of which 58 had available accelerometer measurements from both arms of the cross-over design. Subjects were given placebo during the open label baseline period (Weeks 1 and 2). In the first blinded treatment period (Weeks 3, 4, and 5) half of the subjects were randomized to receive meloxicam and the remainder were randomized to receive placebo. At the end of the first blinded phase, subjects entered a three-week blinded washout (Weeks 6, 7, and 8) before switching treatments for the second blinded phase. All subjects wore an accelerometer (Actical) on their collar throughout the entire study, with the epoch set at 1 min, providing an activity count for every minute of the 11-week study for each subject.
Zero-inflated Poisson hidden semi-Markov model
We assume that the observed data are of the form {(Wi, Zi1, Xi1, Yi1, …, ZiT, XiT, YiT)} and comprise n independent replicates of the trajectory (W, Z1, X1, Y1, …, ZT, XT, YT), where: \(W \in {\mathbb{R}}^{d}\) denotes the baseline (pre-treatment) subject characteristics; T is a fixed time horizon; \(Z^{t} \in {\mathbb{R}}^{p}\) denotes environmental factors at t = 1, …, T; \(X^{t} \in {\mathbb{R}}^{q}\) encodes treatment administered at time t = 1, … ,T; and \(Y^{t} \in {\mathbb{N}}\) denotes the integer accelerometer activity reading at time t = 1, … ,T. In our application, \(Z^{t} \in \{ 0, 1\}^{2}\) comprises indicators coding period of day and weekend versus weekday, and \(X^{t} \in \{ 0, 1\}^{1}\) encodes treatment indicators (active versus placebo). While we develop our models allowing for rather general environmental factors and treatment processes, having binary factors in our application makes it simpler to enforce a stochastic ordering on the distributions indexed by latent states which is important for interpretability and coherence of the fitted models, e.g., if one wants the latent states ordered by mean activity intensity within each subject and to have these latent states align across subjects. We model the evolution of the accelerometer data using a zero-inflated Poisson hidden semi-Markov model which develop in the remainder of this subsection.
Let \(S_{i}^{t} \in \left\{ {1, \ldots , M} \right\}\) denote the latent state, i.e., the unobserved activity type, of subject i = 1, …, n at time t = 1, … ,T. At every single minute, each subject is assumed to be in one of the M latent activity states, with the convention that “1” represents a state where a low level of physical activity is typical; “2” represents a state where higher level activity is typical and so forth. The duration in each state is characterized by state-specific parameters and estimated from the data.
We assume the latent states evolve according to a semi-Markov model indexed by: (i) an initial state distribution, \(\delta_{m,i} = P\left( {S_{i}^{1} = m} \right)\) for m = 1, …, M; (ii) state duration distributions \(r_{m,i} \left( {v;x^{t} ,z^{t} } \right) = P(S_{i}^{t + 1} = m, \ldots ,S_{i}^{t + v} \ne m|S_{i}^{t} = m,X_{i}^{t} = x^{t} ,Z_{i}^{t} = z^{t} )\) for v > 0, m = 1, … , M; and (iii) the transition probabilities \(Q_{i} \left( {m,l;x^{t} ,z^{t} } \right) = P{(}S_{i}^{t + 1} = l {|} S_{i}^{t} = m, S_{i}^{t + 1} \ne m, X_{i}^{t} = x^{t} ,Z_{i}^{t} = z^{t} )\) for \(m, l = 1, \ldots ,M\). We model the initial state distribution nonparametrically using sample frequencies of the estimated latent states. We assume a multinomial logistic regression model for the transition probabilities so that
where \(d_{m,k,i}\) and \({\varrho }_{m,k,i}\) are unknown coefficient vectors for m, k = 1, …, M. We assume that the state duration distribution follows a latent accelerated failure time model so that.
where \(f_{m,i}\) is a base-density corresponding to a null treatment condition (coded as \(x^{t} \equiv 0\)) and \(c_{m,i} , \eta_{m,i}\) are unknown coefficient vectors for m = 1, … ,M. We further assume that \(Y_{i}^{t} \bot \left( {Z_{i}^{1} ,X_{i}^{1} ,S_{i}^{1} , \ldots , Z_{i}^{t - 1} ,X_{i}^{t - 1} ,S_{i}^{t - 1} } \right) | \left( {Z_{i}^{t} , X_{i}^{t} , S_{i}^{t} } \right)\) and that the accelerometer activity readings are distributed as.
when the subject is in the latent state 1, and
when the subject is in latent state m = 2, 3, …, M. Thus, the accelerometer readings are modeled as a zero-inflated Poisson with intensity function \(\lambda_{i}^{t} :S \times {\mathbb{R}}^{q} \times {\mathbb{R}}^{p} \to {\mathbb{R}}_{ + }\) and weight functions \(p_{i}^{t} :S \times {\mathbb{R}}^{q} \times {\mathbb{R}}^{p} \to \left[ {0, 1} \right]\) for i = 1, …, n, t = 1, …, T. Furthermore, for each subject i, time t, and latent state m, we assume that these functions are of the form
where \(b_{0,k,i} ,b_{1,k,i} ,\gamma_{k,i} ,\) k = 0, 1, …, M are unknown coefficient vectors. In our application, in which \(x^{t}\) and \(z^{t}\) are represented as binary vectors, we impose the constraints \(b_{0,m + 1,i} > b_{0,m,i} , b_{1,m + 1,i} \ge b_{1,m,i} ,\gamma_{m + 1,i} \ge \gamma_{m,i}\) for m = 1, …, M—1. These constraints ensure that the intensity functions of the latent states are monotone increasing, i.e., \(\lambda_{i}^{t} \left( {m + 1, z,x} \right) > \lambda_{i}^{t} \left( {m,z,x} \right)\) for all z, x, and m.
The preceding model describes individual level accelerometer data as a function of evolving covariate and treatment information. The idea of using the framework of hidden Markov model to analyze physical activity data can also be seen in the recent work of Huang et al.35 and van Kuppevelt et al.36. However, the cat accelerometer data we consider are much sparser than the human accelerometer data in their work, which motivates a zero-inflation component in our model. Moreover, we can include exogenous, time-varying variables like treatment period, night indicator, and weekend indicator in our model so as to explicitly address their effects on the activity patterns of cats. Our individual level model is a nontrivial extension of the zero-inflated Poisson hidden Markov model in the literature26,27, which not only allows for distinct state duration densities but also incorporates covariates and treatment in both transition probabilities and latent state durations.
To draw more general, i.e., population-level, inferences we model individual-level treatment effect parameters as functions of baseline subject information as follows. We assume that \({\rm E}(b_{1,m,i } |W_{i} = w) = \varOmega_{m,0} + \varOmega_{m,1} w\), where \(\varOmega_{m,0} \in {\mathbb{R}}^{q}\) and \(\varOmega_{m,1} \in {\mathbb{R}}^{q \times d}\) are matrices of unknown coefficients encoding treatment effect on the outcome for m = 0, 1, …, M. Similarly, we assume that \({\rm{{E}(}}c_{m,i } {|}W_{i} = w) = \varGamma_{m,0} + \varGamma_{m,1} w\) and \({{{\rm E}(}}d_{m,l,i } {|}W_{i} = w) = \varLambda_{m,l,0} + \varLambda_{m,l,1} w\), where \(\varGamma_{m,0} , \varLambda_{m,l,0} \in {\mathbb{R}}^{q}\) and where \(\varGamma_{m,1} , \varLambda_{m,l,1} \in {\mathbb{R}}^{q \times d}\) encode the effect of treatment on state duration and transition probabilities for \(m, l = 1, \ldots , M\). The idea of aggregating individual-level models to conduct population-level inference has been applied in other contexts including longitudinal and growth curve analyses31,37,38.
Estimation
We estimate the subject-specific parameters using maximum-likelihood estimation via the forward–backward recursive representation of the likelihood39; the number of latent states, M, is selected using BIC38. Subsequently, the population-level parameters are estimated by regressing the subject-specific maximum likelihood estimators on baseline covariates using least squares. This two-stage approach, which will be detailed shortly, is computationally efficient for high-frequency data like those collected in the meloxicam study. Let
be the collection of subject-specific parameters for i = 1, …, n. We construct estimators \(\hat{\theta }_{i,T}\) of \(\theta_{i}\) by maximizing the log-likelihood (see supplementary file 2 for details).
For m = 0, 1, …, M, let \(\hat{b}_{{1,{\rm{m}},{\rm{i}},{\rm{T}}}}\) denote the maximum likelihood estimator of \(b_{1,m,i}\). Subsequently, define \(\hat{\varOmega }_{m,0,n}\), \(\hat{\varOmega }_{m,1,n} = argmin_{{\varOmega_{m,0} ,\varOmega_{m,1} }} \mathop \sum \limits_{i = 1}^{n} ||\hat{b}_{1,m,i,T} - \varOmega_{m,0} - \varOmega_{m,1} W_{i} ||^{2}\) to be the two-stage estimator of \(\varOmega_{m,0} , \varOmega_{m,1}\). The estimators \(\hat{\varGamma }_{m,0,n} ,\hat{\varGamma }_{m,1,n} ,\hat{\varLambda }_{m,l,0,n} ,\hat{\varLambda }_{m,l,1,n}\) of \(\varGamma_{m,0} ,\varGamma_{m,1} ,\varLambda_{m,l,0} ,\varLambda_{m,l,1}\) are defined analogously.
The statistical analysis in the manuscript is implemented using the software R version 2.0.6. We developed an R package “ziphsmm” to analyze such high-volume, zero-inflated accelerometer data, which is publicly available online (https://cran.r-project.org/web/packages/ziphsmm/index.html).
Theoretical properties
Throughout we use a star superscript, e.g.,\(\theta_{i}^{*}\) to denote the population analogue of the maximum likelihood estimator of a parameter indexing the proposed model; if the model is correctly specified, then under the conditions stated below, these parameters will correspond to the true parameters indexing the generative model. In order to characterize the limiting behavior of the proposed estimators, we make the following assumptions for all \(i = 1, \ldots ,n\) and \(m, l = 1, \ldots ,M.\)
-
(A0)
The dimension of the subject-specific parameters is j, where j is known; \(\theta_{i}^{*}\) is an interior point of \(\Theta\), which is a compact subset of \({\mathbb{R}}^{j}\).
-
(A1)
The state duration distributions \(r_{m,i} \left( . \right)\) has finite support.
-
(A2)
There exists \(0 < \sigma_{ - } \le \sigma_{ + } < 1\) such that \({\upsigma }_{ - } \le P_{{\theta_{i }^{*} }} (S_{i}^{t + 1} = l|S_{i}^{t} = m) \le \sigma_{ + }\), for \({\rm{t}} = 1,{ } \ldots ,{\rm{T}}\); and \(\mathop \sum \limits_{{{\rm{m}} = 1}}^{{\rm{M}}} P_{{\theta_{i}^{*} }} \left( {Y_{i}^{t} = y{|}S_{i}^{t} = m} \right) > 0\) for all \({\rm{y}} \in {\rm{supp Y}}_{{\rm{i}}}^{{\rm{t}}}\) and \({\rm{t}} = 1,{ } \ldots ,{\rm{T}}\).
-
(A3)
For each \({\uptheta } \in {\Theta }\) the transition kernel indexed by \({\uptheta }\) is Harris recurrent and aperiodic40,41,42.
-
(A4)
The transition kernel is continuous in \({\uptheta }\) in an open neighborhood of \({\uptheta }_{{\rm{i}}}^{*}\).
-
(A5)
The hidden semi-Markov model is identifiable up to label-switching43,44.
These assumptions are standard in latent-state models45,46,47,48,49. Assumption (A0) avoids non-regularity occurring at boundary points; the assumption of a fixed dimension could be relaxed, for example, to the assumption that one has a strongly consistent estimator of j. Assumption (A1) is common in semi-markov models and simplifies asymptotic arguments; if the finite support condition does not hold, one can use a nested sequence of finite approximations at the expense of more delicate asymptotic arguments50,51. Assumption (A4) is a regularity condition that avoids non-standard asymptotic behavior associated with non-smooth functionals52,53. Assumptions (A2), (A3), (A5) ensure that the model is well-defined; we show that (A5) holds for the zero-inflated Poisson semi-Markov model in the supplementary material. A proof of the following lemma and theorem is also provided in the supplementary material.
Lemma 1. Under (A0)–(A5), the MLE for the subject-specific hidden semi-Markov model in the first stage is strongly consistent, \(\hat{\theta }_{i,T} \to \theta_{i}^{*}\) almost surely as \({\rm{T}} \to \infty\) for all \({\rm{i}} = 1, \ldots ,{\rm{n}}.\)
Define \({\Omega }_{{\rm{m}}}^{*} = \left[ {{\Omega }_{{{\rm{m}},0}}^{*} , \varOmega_{m,1}^{*} } \right]\) and the corresponding estimator \(\hat{\varOmega }_{{{\rm{m}},{\rm{n}}}} = \left[ {\hat{\varOmega }_{{{\rm{m}},0,{\rm{n}}}} { },\hat{\varOmega }_{m,1,n} } \right].{ }\) Let \({\Gamma }_{{\rm{m}}}^{*} ,\hat{\varGamma }_{m,n} , \varLambda_{m,l} , \hat{\varLambda }_{m,l,n}\) be defined analogously. Further, define the augmented design matrix \(\tilde{W} \in {\mathbb{R}}^{{n \times \left( {d + 1} \right)}}\) whose ith row is \(\widetilde{{W_{i} }} = \left[ {1, W_{i}^{^{\prime}} } \right] \in {\mathbb{R}}^{d + 1}\). Then the second stage regression models can be equivalently written as \({\rm{vec}}\left( {\hat{b}_{1,m,T} } \right) = \left( {I \otimes \tilde{W}} \right){\rm{vec}}\left( {\varOmega_{m} } \right) + \nu_{m} ,\) \({\rm{vec}}\left( {\hat{c}_{1,m,T} } \right) = \left( {I \otimes \tilde{W}} \right){\rm{vec}}\left( {\varGamma_{m} } \right) + \eta_{m}\), \({\rm{vec}}\left( {\hat{d}_{m,l,T} } \right) = \left( {I \otimes \tilde{W}} \right){\rm{vec}}\left( {\varLambda_{m,l} } \right) + \xi_{m,l}\), where vec is the vectorization operator, and \({\upnu }_{{\rm{m}}} , \eta_{m} , \xi_{m,l}\) are independent mean zero residuals. Define \(\tilde{B} = {\rm{I}} \otimes \tilde{W} \in {\mathbb{R}}^{{{\rm{qn}} \times {\rm{q}}\left( {{\rm{d}} + 1} \right)}} { }\) whose ith row is \(\tilde{B}_{i}\).
Theorem 3.2. Under (A0)–(A5), and further assume that \({\rm{H}} = {\rm{n}}^{ - 1} \mathop {\lim }\limits_{{{\rm{n}} \to \infty }} \tilde{W}^{{\prime }} W\) is a positive definite matrix. Then provided \({\rm{T}},{\rm{n}} \to \infty\) with \(\frac{{\rm{n}}}{{\rm{T}}} \to 0\), each of the following converges in distribution to a Gaussian distribution with mean zero and identity covariance:
for all \({\rm{m}},{\rm{l}} = 1, \ldots ,{\rm{M}}.\)
In the preceding theorem, the requirement that \(\frac{{\rm{n}}}{{\rm{T}}} \to 0\) is natural in applications using high-frequency accelerometer data where the number of observations per subject can be several orders of magnitude larger than the number of subjects. In the supplementary file 2, we have included extensive simulation experiments to evaluate the performance of the proposed treatment effect estimator.
References
National Research Council. Relieving pain in America: A blueprint for transforming prevention, care, education, and research. Washington, DC (2011).
Buckwalter, J. A., Saltzman, C. & Brown, T. The impact of osteoarthritis: Implications for research. Clin. Orthop. Relat. Res. 427, 6–15 (2004).
Chen, A., Gupte, C., Akhtar, K., Smith, P. & Cobb, J. The global economic cost of osteoarthritis: how the UK compares. Arthritis (2012).
Hadler, N. Osteoarthritis as a public health problem. Clin. Rheum Dis. 11, 175–185 (1985).
Elders, M. J. The increasing impact of arthritis on public health. J. Rheumatol. Suppl. 60, 6–8 (2000).
Hootman, J. M., Helmick, C. G. & Bradley, J. T. A public health approach to addressing arthritis in older adults: The most common cause of disability. Am. J. Public Health 102, 426–433 (2012).
McClintock, M. K., Dale, W., Laumann, E. O. & Waite, L. Empirical redefinition of comprehensive health and well-being in the older adults of the United States. Proc. Nat. Acad. Sci. 113, 3071–3080 (2016).
Park, J., Mendy, A. & Vieira, R. Various types of arthritis in the united states: Prevalence and age-related trends from 1999 to 2014. Am. J. Public Health 108, 256–258 (2018).
Vierck, C. J., Hannson, P. T. & Yezierski, R. P. Clinical and pre-clinical pain assessment: Are we measuring the same thing?. Pain 135, 7–10 (2008).
Mogil, J. S. Animal models of pain: Progress and challenges. Nat. Rev. Neurosci. 10, 283–294 (2009).
Hayes, A. G., Arendt-Nielsen, L. & Tate, S. Multiple mechanism have been tested in pain: How can we improve the chances of success?. Curr. Opin. Pharmacol. 14, 11–17 (2014).
Lascelles, B. D. X., Brown, D. C., Maixner, W. & Mogil, J. S. Spontaneous painful disease in companion animals can facilitate the development of chronic pain therapies for humans. Osteoarthr. Cartil. 26, 175–183 (2017).
Lascelles, B. D. X. et al. Cross-sectional study of the prevalence of radiographic degenerative joint disease in domesticated cats. Vet. Surg. 39, 535–544 (2010).
Slingerland, L., Hazewinkel, H., Meij, B. P., Picavet, P. & Voorhout, G. Cross-sectional study of the prevalanece of clinical feature of osteoarthritis in 100 cats. Vet. J. 187, 304–309 (2011).
Gruen, M. E., Griffith, E. H., Thomson, A. E., Simpson, W. & Lascelles, B. D. X. Criterion validation testing of clinical metrology instruments for measuring degenerative joint disease associated mobility impairment in cats. PLoS ONE 10(7), e0131839 (2015).
Lascelles, B. D. X. et al. Evaluation of a digitally integrated accelerometer-based activity monitor for the measurement of activity in cats. Vet. Anaesth. Analg. 35, 173–183 (2008).
Gruen, M. E. et al. A feline-specific anti-nerve growth factor antibody improves mobility in cats with degenerative joint disease-associated pain: A pilot proof of concept study. J. Vet. Int. Med. 30, 1138–1148 (2016).
Gruen, M. E. et al. The use of functional data analysis to evaluate activity in a spontaneous model of degenerative joint disease associated pain in cats. PLoS ONE 12(1), e0169576 (2017).
He, J., Li, H. & Tan, J. Real-time daily activity classification with wireless sensor networks using hidden markov model. in Engineering in Medicine and Biology Society, 29th Annual International Conference of the IEEE, 3192–3195 (2007).
Mannini, A. & Sabatini, A. M. Machine learning method for classifying human physical activity from on-body accelerometers. Sensors 10, 1154–1175 (2010).
Nickel C., Busch, C., Rangarajan, S. & Mobius, M. Using hidden markov models for accelerometer-based biometric gait recognition. in Signal Processing and its Applications, IEEE 7th International Colloquium, 58–63 (2011).
Lee, Y. & Cho, S. Activity recognition using hierarchical hidden markov models on a smartphone with 3d accelerometer. Hybrid Artif. Intell. Syst. 6678, 460–467 (2011).
Ronao, C. A. & Cho, S. Human activity recognition using smartphone sensors with two-stage continuous hidden markov models. in International Conference on Natural Computation, 10th International Conference of the IEEE, 681–686 (2014).
Scott, S. L., James, G. M. & Sugar, C. A. Hidden markov models for longitudinal comparisons. J. Am. Stat. Assoc. 100, 359–369 (2005).
Shirley, K. E., Small, D. S., Lynch, K. G., Matiso, S. A. & Oslin, D. W. Hidden markov models for alcoholism treatment trial data. Ann. Appl. Stat. 4, 366–395 (2010).
DeSantis, S. M. & Bandyopadhyay, D. Hidden markov models for zero-inflated poisson counts with an application to substance use. Stat. Med. 30, 1678–1694 (2011).
Olteanu, M. & Ridgway, J. Hidden markov models for time series of counts with excess zeros. in European Symposium on Artificial Neural Networks, 2012 Proceedings, 133–138 (2012).
Ferguson, J. D. Variable duration models for speech. in Proceedings of the Symposium on the Application of HMMs to Text and Speech, 143–179 (1980).
Russel, M. & Moore, R. Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition. in Acoustics, Speech, and Signal Processing, IEEE International Conference on ICCASSP. Volume 10, 5–8 (1985).
Levinson, S. E. Continuously variable duration hidden markov models for automatic speech recognition. Comput. Speech Lang. 1, 29–45 (1986).
Stukel, T. A. & Demidenko, E. Two-stage method of estimation for general linear growth curve models. Biometrics 53, 720–728 (1997).
Hanushek, E. A. Efficient estimators for regressing regression coefficients. Am. Stat. 28, 66–67 (1974).
Gruen, M. E., Thomson, A., Simpson, W. & Lascelles, B. D. X. Detection of clinically relevant pain relief in cats with degenerative joint disease associated pain. J. Vet. Int. Med. 28, 346–350 (2014).
Viterbi, A. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans. Inf. Theory 13, 260–269 (1967).
van Kuppevelt, D. et al. Segmenting accelerometer data from daily life with unsupervised machine learning. PLoS ONE 14(1), e0208692 (2019).
Huang, Q. et al. Hidden markov models for monitoring circadian rhythmicity in telemetric activity data. J. R. Soc. Interface 15, 20170885 (2018).
Diggle, P. Analysis of Longitudinal Data (Oxford University Press, Oxford, 2002).
Verbeke, G. & Molenberghs, G. Linear Mixed Models for Longitudinal Data (Springer, Berlin, 2009).
Rabiner, L. R. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989).
Schwarz, G. et al. Estimating the dimension of a model. Ann Stat 6, 461–464 (1978).
Athreya, K. B. & Lahiri, S. N. Measure Theory and Probability Theory (Springer, 2006).
Meyn, S. P. & Tweedie, R. L. Markov chains and stochastic stability (Springer, 2012).
Allman, E. S. et al. Identifiability of parameters in latent structure models with many observed variables. Ann. Stat. 37(6A), 3099–3132 (2009).
Gassiat, E., Cleynen, A. & Robin, S. Finite state space non parametric hidden Markov models are in general identifiable. arXiv: 1306.4657 (2013).
Leroux, B. G. Maximum-likelihood estimation for hidden Markov models. Stoch. Process Their Appl. 40, 127–143 (1992).
Bickel, P. J. et al. Asymptotic normality of the maximum likelihood estimator for general hidden Markov models. Ann. Stat. 26, 1614–1635 (1998).
Le Gland, F. & Mevel, L. Exponential forgetting and geometric ergodicity in hidden markov models. Math. Control Signals Syst. 13, 63–93 (2000).
Douc, R. et al. Asymptotics of the maximum likelihood estimator for general hidden Markov models. Bernoulli 7, 381–420 (2001).
Douc, R. et al. Asymptotic properties of the maximum likelihood estimator in autoregressive models with Markov regime. Ann. Stat. 32, 2254–2304 (2004).
Barbu, V. S. & Limnios, N. Semi-Markov chains and hidden semi-Markov models toward applications: their use in reliability and DNA analysis (Springer, 2009).
Trevezas, S. & Limnios, N. Exact mle and asymptotic properties for nonparametric semi-Markov models. J. Nonparametr. Stat. 23, 719–739 (2011).
Van der Vaart, A. On differentiable functionals. Ann. Stat. 19, 178–204 (1991).
Potscher, B. M. & Prucha, I. R. Dynamic nonlinear econometric models: Asymptotic theory (Springer, 2013).
Acknowledgements
Laber acknowledges support from the National Institutes of Health (P01 CA142538) and the National Science Foundation (DMS-1555141, DMS-1513579). Staicu acknowledges support from the National Science Foundation (DMS-1454942). Funding for the original study was provided by a grant (45079) from Boehringer-Ingelheim Vetmedica, Inc.
Author information
Authors and Affiliations
Contributions
B.D.X.L. designed the original study; B.D.X.L. performed the original study; A.-M.L., E.L., Z.X. and B.D.X.L. designed the current study. Z.X., A.M.-L. and E.L. performed the analyses and prepared the figures; Z.X. and B.D.X.L. drafted the manuscript; all authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
BDXL has participated in sponsored CE for Boehringer-Ingelheim (manufacturers of Metacam used in the original study). A-MS, EL and ZX have no conflicts of interest to declare.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xu, Z., Laber, E., Staicu, AM. et al. Novel approach to modeling high-frequency activity data to assess therapeutic effects of analgesics in chronic pain conditions. Sci Rep 11, 7737 (2021). https://doi.org/10.1038/s41598-021-87304-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-87304-w
- Springer Nature Limited