Learning Manifolds in Forensic Data

  • Frédéric Ratle
  • Anne-Laure Terrettaz-Zufferey
  • Mikhail Kanevski
  • Pierre Esseiva
  • Olivier Ribaux
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4132)


Chemical data related to illicit cocaine seizures is analyzed using linear and nonlinear dimensionality reduction methods. The goal is to find relevant features that could guide the data analysis process in chemical drug profiling, a recent field in the crime mapping community. The data has been collected using gas chromatography analysis. Several methods are tested: PCA, kernel PCA, isomap, spatio-temporal isomap and locally linear embedding. ST-isomap is used to detect a potential time-dependent nonlinear manifold, the data being sequential. Results show that the presence of a simple nonlinear manifold in the data is very likely and that this manifold cannot be detected by a linear PCA. The presence of temporal regularities is also observed with ST-isomap. Kernel PCA and isomap perform better than the other methods, and kernel PCA is more robust than isomap when introducing random perturbations in the dataset.


Locally Linear Embedding Kernel Principal Component Analysis Learn Manifold Linear Embedding Nonlinear Dimensionality Reduction 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Ratle, F., Terrettaz, A.L., Kanevski, M., Esseiva, P., Ribaux, O.: Pattern analysis in illicit heroin seizures: a novel application of machine learning algorithms. In: Proc. of the 14th European Symposium on Artificial Neural Networks. d-side publi. (2006)Google Scholar
  2. 2.
    Guëniat, O., Esseiva, P.: Le Profilage de l’Hëroïne et de la Cocaïne, Presses polytechniques et universitaires romandes, Lausanne (2005)Google Scholar
  3. 3.
    Esseiva, P., Dujourdy, L., Anglada, F., Taroni, F., Margot, P.: A methodology for illicit drug intelligence perspective using large databases. Forensic Science International 132, 139–152 (2003)CrossRefGoogle Scholar
  4. 4.
    Esseiva, P., Anglada, F., Dujourdy, L., Taroni, F., Margot, P., Du Pasquier, E., Dawson, M., Roux, C., Doble, P.: Chemical profiling and classification of illicit heroin by principal component analysis, calculation of inter sample correlation and artificial neural networks. Talanta 67, 360–367 (2005)CrossRefGoogle Scholar
  5. 5.
    Madden, M.G., Ryder, A.G.: Machine Learning Methods for Quantitative Analysis of Raman Spectroscopy Data. In: Proceedings of the International Society for Optical Engineering (SPIE 2002), vol. 4876, pp. 1130–1139 (2002)Google Scholar
  6. 6.
    O’Connell, M.L., Howley, T., Ryder, A.G., Madden, M.G.: Classification of a target analyte in solid mixtures using principal component analysis, support vector machines, and Raman spectroscopy. In: Proceedings of the International Society for Optical Engineering (SPIE 2005), vol. 4876, pp. 340–350 (2005)Google Scholar
  7. 7.
    Schölkopf, B., Smola, A., Müller, K.R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 10, 1299–1319 (1998)CrossRefGoogle Scholar
  8. 8.
    Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323–2326 (2000)CrossRefGoogle Scholar
  9. 9.
    Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000)CrossRefGoogle Scholar
  10. 10.
    Jenkins, O.C., Matarić, M.J.: A spatio-temporal extension to isomap nonlinear dimension reduction. In: Proc. of the 21st International Conference on Machine Learning (2004)Google Scholar
  11. 11.
    Stork, D.G., Yom-Tov, E.: Computer Manual in MATLAB to accompany Pattern Classification. Wiley, Hoboken (2004)Google Scholar
  12. 12.
    Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, NewYork (2001)MATHGoogle Scholar
  13. 13.
    Kruskal, J.B., Wish, M.: Multidimensional Scaling. SAGE Publications, Thousand Oaks (1978)Google Scholar
  14. 14.
    Bengio, Y., Paiement, J.F., Vincent, P., Delalleau, O., Le Roux, N., Ouimet, M.: Out-of-samples extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering. Advances in Neural Information Processing Systems 16 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Frédéric Ratle
    • 1
  • Anne-Laure Terrettaz-Zufferey
    • 2
  • Mikhail Kanevski
    • 1
  • Pierre Esseiva
    • 2
  • Olivier Ribaux
    • 2
  1. 1.Institut de Géomatique et d’Analyse du Risque, Faculté des Géosciences et de l’EnvironnementUniversité de LausanneAmphipôleSwitzerland
  2. 2.Institut de Police Scientifique et de Criminologie, Ecole des Sciences CriminellesUniversité de LausanneBatochimeSwitzerland

Personalised recommendations