Classification of Poorly Time Sampled Light Curves of Periodic Variable Stars
Classification of periodic variable light curves is important for scientific knowledge discovery and efficient use of telescopic resources for source follow-up. In practice, labeled light curves from catalogs with hundreds of flux measurements (the training set) may be used to classify curves from ongoing surveys with tens of flux measurements (the test set). Statistical classifiers generally assume that the probability of class given light curve features is the same for training and test sets. This assumption is unlikely to hold when the number of flux measurements per light curve varies widely between the two sets. We employ two methods to correct the problem—noisification and denoisification. With noisification we alter the training set to mimic the distribution of the test set and then construct a classifier on these altered data. With denoisification we construct a classifier on the well-sampled curves in the training set and probabilistically infer what poorly sampled curves in the test set would look like if we continued obtaining flux measurements. On periodic variable sources from a simulated data set and the OGLE survey, both of these methods outperform making no adjustments for training–test set differences.
The authors acknowledge the generous support of a cyber-enabled discovery and innovation (CDI) grant (No. 0941742) from the National Science Foundation. This work was performed in the CDI-sponsored Center for Time Domain Informatics (http://cftd.info).
- 6.Eyer L, Cuypers J (2000) Predictions on the number of variable stars for the gaia space mission and for surveys such as the ground-based international liquid mirror telescope. In: IAU Colloq. 176: The impact of large-scale surveys on pulsating star research, vol 203, pp 71–72Google Scholar
- 7.Eyer L et al (2010) The variable universe through the eyes of GAIA. arXiv:1011.4527v1Google Scholar