Abstract
There are two ways to evaluate the properties of unknown chemical compounds. One is by traditional approaches, which measure the desired data from the experiments and the other is by predicting them in the theoretical approaches using a kind of prediction model. The latter are considered to be more effective because they are less time consuming and cost efficient, and there is less risk in conducting the experiments. Besides, it is inconvenient to conduct experiments to obtain experimental data, especially for new materials or high molecular substances. Several methods using regression model and neural network for predicting the physical properties have been suggested so far. However, the existing methods have many problems in terms of accuracy and applicability. Therefore, an improved method for predicting the properties is needed. A new method for predicting the physical property was proposed to predict 15 physical properties for the chemicals which consist of C, H, N, O, S and Halogens. This method was based on the group contribution method that was oriented from the assumption that each fragment of a molecule contributes a certain amount to the value of its physical property. In order to improve the accuracy of the prediction of the physical properties and the applicability, we extended the database, significantly modifying the existing group contribution methods, and then established a new method for predicting the physical properties using support vector machine (SVM) which is a statistical theory that has never been used for predicting the physical properties. The SVM-based approach can develop nonlinear structure property correlations more accurately and easily in comparison with other conventional approaches. The results from the new estimation method are found to be more reliable, accurate and applicable. The newly proposed method can play a crucial role in the estimation of new compounds in terms of the expense and time.
Similar content being viewed by others
References
M. S. Mannan, W. J. Rogers and A. Aldeeb, A systematic approach to reactive chemicals analysis, Proc. HAZARDS XVI, Manchester, U.K. 41–58 (2001).
C. Bruneton, C. Hoff and P. Barton, Computers and Chemical Engineering, 22(6), 735 (1998).
K.G. Joback, Unified approach to physical property estimation using multivariate statistical techniques, S.M. Thesis, Massachusetts Institute of Technology, Cambridge (1984).
K.G. Joback, Designing molecules possessing desired physical property values Vol.1, Ph. D. Thesis, Massachusetts Institute of Technology, Cambridge (1989a).
K.G. Joback, Designing molecules possessing desired physical property values Vol. 2, Ph. D. Thesis, Massachusetts Institute of Technology, Cambridge (1989).
K. G. Joback, Fluid Phase Equilibria, 185(1), 45 (2001).
H. J. Liaw, C. C. Yur and Y. F. Lin, J. of Loss Prevention in the Process Industries, 13(6), 499 (2000).
L. Constantinou and R. Gani, AIChE Journal, 40(10), 1697 (1994).
K. H. Lee, J.Y. Jung and I. B. Lee, Korea J. Chem. Eng., 31, 744 (1993).
DIPPR (Design Institute for Physical Properties), http://dippr.byu.edu
V.N. Vapnik, The nature of statistical learning theory, Springer Verlag, New York, U.S.A. 53–67 (1995).
V. Cherkassky and F. Muler, Learning from data: Concepts, theory, and methods, John Wiley & Sons, New York, U.S.A. 353–387 (1998).
H. J. Liaw, C. J. Chen and C. C. Yur, J. of Loss Prevention in the Process Industries, 14(5), 371 (2001).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, C.J., Lee, G., So, W. et al. A new estimation algorithm of physical properties based on a group contribution and support vector machine. Korean J. Chem. Eng. 25, 568–574 (2008). https://doi.org/10.1007/s11814-008-0096-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11814-008-0096-0