Fitting Kent models to compositional data with small concentration
- 409 Downloads
Compositional data can be transformed to directional data by the square root transformation and then modelled by using the Kent distribution. The current approach for estimating the parameters in the Kent model for compositional data relies on a large concentration assumption which assumes that the majority of the transformed data is not distributed too close to the boundaries of the positive orthant. When the data is distributed close to the boundaries with large variance significant folding may result. To treat this case we propose new estimators of the parameters derived based on the actual folded Kent distribution which are obtained via the EM algorithm. We show that these new estimators significantly reduce the bias in the current estimators when both the sample size and amount of folding is moderately large. We also propose using a saddlepoint density approximation for the Kent distribution normalising constant in order to more accurately estimate the shape parameters when the concentration is small or only moderately large.
KeywordsCompositional data Directional data EM algorithm Folding Kent distribution Maximum likelihood estimation Saddlepoint approximation Square root transformation
- Scealy, J.L.: Modelling techniques for compositional data using distributions defined on the hypersphere. PhD thesis, Australian National University (2010) Google Scholar