Incorporating Molecule’s Stereoisomerism within the Machine Learning Framework
An important field of chemoinformatics is the prediction of molecular properties. Within this field, graph kernels constitute a powerful framework because they combine a natural encoding of molecules as graphs with classical statistical tools. Unfortunately, some molecules are encoded by the same graph and differ only in the three-dimensional orientation of their atoms in space, yet have different properties. Such molecules are called stereoisomers. The properties that distinguish them cannot be predicted by the usual graph methods, which do not encode stereoisomerism. In this paper we propose to encode the stereoisomerism property of each atom of a molecule by a local subgraph. A kernel between bags of such subgraphs provides a similarity measure that incorporates stereoisomerism properties. We then propose two extensions of this kernel that incorporate, in each subgraph, information about its surroundings.
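The idea of a kernel between bags of local subgraphs can be illustrated with a minimal sketch. It is not the kernel defined in the paper. It assumes that each local subgraph has already been reduced to a canonical string code, and that two subgraphs are compared with a simple Dirac (exact match) kernel. The function name `bag_kernel` and the code strings are hypothetical and serve only as illustration.

```python
from collections import Counter


def bag_kernel(bag_a, bag_b):
    """Sum a Dirac kernel over all pairs of subgraph codes from two bags.

    Each bag is a list of hashable codes, one per local subgraph.
    Assumption: equal codes mean identical subgraphs, stereo included.
    """
    counts_a = Counter(bag_a)
    counts_b = Counter(bag_b)
    # Each shared code contributes (occurrences in a) * (occurrences in b).
    return sum(n * counts_b[code] for code, n in counts_a.items())


# Hypothetical codes: two stereoisomers share one subgraph and differ
# in the code of the subgraph around their stereocenter.
mol_r = ["C@:Br,Cl,F,H", "C:H,H,H"]
mol_s = ["C@@:Br,Cl,F,H", "C:H,H,H"]

self_similarity = bag_kernel(mol_r, mol_r)
cross_similarity = bag_kernel(mol_r, mol_s)
```

With these toy bags the cross similarity is lower than the self similarity, because the two stereocenter codes do not match. An encoding that ignored stereoisomerism would give both molecules identical bags and therefore identical kernel values.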
Keywords: Asymmetric Carbon; Support Vector Regression Machine; Graph Kernel; Adjacency Relationship; Dimensional Orientation