The Robust Models of Retention for Thin Layer Chromatography

  • Miron B. Kursa
  • Łukasz Komsta
  • Witold R. Rudnicki
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 103)


We present an application of the machine learning methods for modelling the retention constants in the thin layer chromatography. First a feature selection algorithm is applied to reduce the feature space and then the regression models are built with a help of the random forest algorithm. The models obtained in this way have better correlation with the experimental data than the reference models built with linear regression. They are also robust—the cross-validation tests shows that the accuracy on unseen data is on average identical to the cross-validated accuracy obtained on the training set.


random forest feature selection thin layer chromatography 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)zbMATHCrossRefGoogle Scholar
  2. 2.
    Héberger, K.: Quantitative structure—(chromatographic) retention relationships. Journal of Chromatography A 1158(1-2), 273–305 (2007)CrossRefGoogle Scholar
  3. 3.
    Kaliszan, R.: Quantitative relationships between molecular structure and chromatographic retention. Implications in physical, analytical, and medicinal chemistry. Critical Reviews in Analytical Chemistry 16, 323–383 (1986)Google Scholar
  4. 4.
    Kaliszan, R.: Quantitative structure retention relationships. Analytical Chemistry 64, 619–631 (1992)CrossRefGoogle Scholar
  5. 5.
    Komsta, L.: A functional-based approach to the retention in thin layer chromatographic screening systems. Analytica Chimica Acta 629(1-2), 66–72 (2008)CrossRefGoogle Scholar
  6. 6.
    Komsta, L.: Quick prediction of the retention of solutes in 13 thin layer chromatographic screening systems on silica gel by classification and regression trees. Journal of Separation Science 31(15), 2899–2909 (2008)CrossRefGoogle Scholar
  7. 7.
    Komsta, Ł.: Prediction of the retention in thin layer chromatography screening systems by atomic contributions. Analytica Chimica Acta 593(2), 224–237 (2007)CrossRefGoogle Scholar
  8. 8.
    Kursa, M.B., Jankowski, A., Rudnicki, W.R.: Boruta—A System for Feature Selection. Fundamenta Informaticae 101(4), 271–285 (2010)MathSciNetGoogle Scholar
  9. 9.
    Kursa, M.B., Rudnicki, W.R.: Feature Selection with the Boruta Package. Journal Of Statistical Software 36(11) (2010)Google Scholar
  10. 10.
    Moffat, A.: Clarke’s Analysis of Drugs and Poisons, 3rd edn. Pharmaceutical Press, London (2004)Google Scholar
  11. 11.
    Pyka, A.: The application of topological indexes in TLC. Journal of Planar Chromatography—Modern TLC 14, 152–159 (2001)Google Scholar
  12. 12.
    R Development Core Team: R: A Language and Environment for Statistical Computing (2010),
  13. 13.
    Tetko, I.V., Gasteiger, J., Todeschini, R., Mauri, A., Livingstone, D., Ertl, P., Palyulin, V.a., Radchenko, E.V., Zefirov, N.S., Makarenko, A.S., Tanchuk, V.Y., Prokopenko, V.V.: Virtual computational chemistry laboratory-design and description. Journal of Computer-Aided Molecular Design 19(6), 453–463 (2005)CrossRefGoogle Scholar
  14. 14.
    Wang, Q., Zhang, L.: Review of research on quantitative structure-retention relationships in thin-layer chromatography. Journal of Liquid Chromatography & Related Technologies 22(1), 1–14 (1999)zbMATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Miron B. Kursa
    • 1
  • Łukasz Komsta
    • 2
  • Witold R. Rudnicki
    • 1
  1. 1.Interdisciplinary Centre for Mathematical and Computational ModellingUniversity of WarsawWarsawPoland
  2. 2.Department of Medicinal ChemistrySkubiszewski Medical UniversityLublinPoland

Personalised recommendations