Use of PLS Discriminant Analysis for Revealing the Absence of a Compound in an Electron Ionization Mass Spectral Database
A mathematical model is proposed for detecting that a compound to be identified is absent from an electron ionization mass spectral library. The model, based on PLS discriminant analysis, can be treated as a “black box” that answers whether the compound sought is absent from or present in the database. The match factors of the top ten candidates were used as input data. More than 5000 objects (mass spectra) were used for training, validation, and testing. The developed classification model correctly predicts that a compound is absent from the library in 28.4% of cases, while only 1.2% of compounds present in the database are incorrectly classified as absent.
Keywords: identification of organic compounds, GC/MS, mass spectral library, mass spectral database, classification, PLS discriminant analysis
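The abstract does not describe the internals of the model, so the following is only a minimal sketch of the general idea: PLS discriminant analysis implemented as PLS1 regression (NIPALS) on a 0/1 class label, with the ten top-ranked match factors as the input vector. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `fit_pls_da` and `predict_score`, the synthetic hit lists and their 0–999 scale, the choice of two latent variables, and the 0.5 decision threshold.

```python
import random

def fit_pls_da(X, y, n_components=2):
    """Fit PLS1 by NIPALS on mean-centred data; y is coded 1 = absent, 0 = present."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # X residuals
    f = [v - y_mean for v in y]                                # y residuals
    components = []
    for _ in range(n_components):
        # Weight vector: direction in X-space with maximal covariance with y.
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(p)]
        norm = sum(v * v for v in w) ** 0.5
        if norm == 0.0:
            break  # nothing left in X that covaries with y
        w = [v / norm for v in w]
        t = [sum(E[i][j] * w[j] for j in range(p)) for i in range(n)]  # scores
        tt = sum(v * v for v in t)
        loadings = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(p)]
        q = sum(f[i] * t[i] for i in range(n)) / tt
        # Deflate X and y before extracting the next latent variable.
        E = [[E[i][j] - t[i] * loadings[j] for j in range(p)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        components.append((w, loadings, q))
    return x_mean, y_mean, components

def predict_score(model, x):
    """Return the predicted class value for one vector of ten match factors."""
    x_mean, y_mean, components = model
    e = [x[j] - x_mean[j] for j in range(len(x))]
    score = y_mean
    for w, loadings, q in components:
        t = sum(e[j] * w[j] for j in range(len(e)))
        score += q * t
        e = [e[j] - t * loadings[j] for j in range(len(e))]
    return score

def synthetic_hit_list(absent, rng):
    """Purely synthetic match factors (assumed 0-999 scale), sorted descending."""
    top = rng.uniform(500, 750) if absent else rng.uniform(880, 990)
    factors = [top]
    for _ in range(9):
        factors.append(factors[-1] - rng.uniform(5, 25))
    return factors

def make_set(n_per_class, rng):
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append(synthetic_hit_list(bool(label), rng))
            y.append(label)
    return X, y

rng = random.Random(0)
train_X, train_y = make_set(200, rng)
test_X, test_y = make_set(50, rng)

model = fit_pls_da(train_X, train_y, n_components=2)
# Assumed decision rule: call the compound "absent" when the score exceeds 0.5.
predicted = [1 if predict_score(model, x) > 0.5 else 0 for x in test_X]
accuracy = sum(p == t for p, t in zip(predicted, test_y)) / len(test_y)
print(f"held-out accuracy on synthetic data: {accuracy:.2f}")
```

In a sketch like this, moving the threshold above 0.5 would reduce the number of present compounds wrongly flagged as absent at the cost of detecting fewer true absences. The abstract does not state which decision rule produced its reported rates, so the threshold here should not be read as the authors' choice.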