Abstract
The dating of documents is one of the pending issues to be resolved in many fields. Although there are a variety of analytical methodologies focused on the inks of documents to estimate the dating of documents, the analysis of documents itself attracted little attention. A non-destructive method based on Fourier transformed infrared (FTIR) was proposed in this research to estimate the dating of documents. First, FTIR spectra of journals dated from 1940 to 1980, naturally aged and conserved in the China Agriculture University library were collected. Second, the least squares support vector machines (LS-SVM) was applied to distinguish documents of different dating, and the accuracy was 99.26%. In order to find interested wavenumber that influence the dating process of documents, sparse partial least squares (sPLS) was applied to select informative variables. The average of selected variables was 483 after 100 runs, and the selected variables were focused on the absorption peaks of inorganic components and cellulose. Splicing sPLS with LS-SVM (sPLS–LS-SVM) built model to see the effective of selected variables. Average accuracy of sPLS–LS-SVM model was 99.34%, even the best result could reach 100.00% after 100 circle times. The present work indicates that the possibility of FTIR combined with chemometrics can estimate the dating of documents accurately. Additionally, the wavenumber which influence the dating of documents are mostly focused on cellulose and inorganic components.
Similar content being viewed by others
References
Abdel-Rahman EM, Mutanga O, Odindi J, Adam E, Odindo A, Ismail R (2014) A comparison of partial least squares (PLS) and sparse PLS regressions for predicting yield of Swiss chard grown under different irrigation water sources using hyperspectral data. Comput Electron Agric 106:11–19
Andrasko J (2001) HPLC analysis of ballpoint pen inks stored at different light conditions. J Forensic Sci 46:21–30
Brazeau L, Gaudreau M (2007) Ballpoint pen inks: the quantitative analysis of ink solvents on paper by solid-phase microextraction. J Forensic Sci 52:209–215
Calcerrada M, Garcia-Ruiz C (2015) Analysis of questioned documents: a review. Anal Chim Acta 853:143–166
Camps-Valls G (2011) Support vector machines in remote sensing: the tricks of the trade. In: SPIE remote sensing conference
Chung D, Keles S (2010) Sparse partial least squares classification for high dimensional data. Stat Appl Genet Mol Biol. https://doi.org/10.2202/1544-6115.1492
Dupont AL, Seemann A, Lavedrine B (2012) Capillary electrophoresis with electrospray ionisation-mass spectrometry for the characterisation of degradation products in aged papers. Talanta 89:301–309
Gomez-de Anda F, Gallardo-Velazquez T, Osorio-Revilla G, Dorantes-Alvarez L, Calderon-Dominguez G, Nogueda-Torres B, de-la-Rosa-Arana JL (2012) ‘Feasibility study for the detection of Trichinella spiralis in a murine model using mid-Fourier transform infrared spectroscopy (MID-FTIR) with attenuated total reflectance (ATR) and soft independent modelling of class analogies (SIMCA). Vet Parasitol 190:496–503
Liu YZ, Yu J, Xie MX, Liu Y, Han J, Jing TT (2006) Classification and dating of black gel pen ink by ion-pairing high-performance liquid chromatography. J Chromatogr A 1135:57–64
Liu YJ, Tran T, Postma G, Buydens LMC, Jansen J (2018) Estimating the number of components and detecting outliers using Angle Distribution of Loading Subspaces (ADLS) in PCA analysis. Anal Chim Acta 1020:17–29
Martínez JR, Nieto-Villena A, de la Cruz-Mendoza JA, Ortega-Zarzosa G, Guerrero AL (2017) Monitoring the natural aging degradation of paper by fluorescence. J. Cult Herit 26:22–27
Mees C, Souard F, Delporte C, Deconinck E, Stoffelen P, Stevigny C, Kauffmann JM, De Braekeleer K (2018) Identification of coffee leaves using FT-NIR spectroscopy and SIMCA. Talanta 177:4–11
Missori M, Mondelli C, De Spirito M, Castellano C, Bicchieri M, Schweins R, Arcovito G, Papi M, Castellano AC (2006) Modifications of the mesoscopic structure of cellulose in paper degradation. Phys Rev Lett 97:238001
Ortiz-Herrero L, Blanco ME, García-Ruiz C, Bartolomé L (2018) Direct and indirect approaches based on paper analysis by Py-GC/MS for estimating the age of documents. J Anal Appl Pyrolysis 131:9–16
Senior S, Hamed E, Masoud M, Shehata E (2012) Characterization and dating of blue ballpoint pen inks using principal component analysis of UV–Vis absorption spectra, IR spectroscopy, and HPTLC. J Forensic Sci 57:1087–1093
Silva CS, Pimentel MF, Amigo JM, Garcia-Ruiz C, Ortega-Ojeda F (2018) Chemometric approaches for document dating: handling paper variability. Anal Chim Acta 1031:28–37
Souguir Z, Dupont AL, De La Rie ER (2017) Formation of brown lines in paper: characterization of cellulose degradation at the wet-dry interface. Biomacromolecules 9:2546–2552
Suykens JAK, Van Gestel T, De Brabanter J, De Moor B, Vandewalle J (2002) Least squares support vector machines, Chapter 3. World Scientific: 5 TohTuck Link, Singapore, pp 71–75
Wang J, Luo G, Sun S, Wang Z, Wang Y (2001) Systematic analysis of bulk blue ballpoint pen ink by FTIR spectrometry. J Forensic Sci 46:1093–1097
Ward JL, Snow MS, Olson JE, Ball D, Adamic ML (2018) Carbon-14 content in tree and soil samples at the Idaho National Laboratory nuclear site. Nucl Instrum Methods B 437:103–109
Weyermann C, Kirsch D, Costa Vera C, Spengler B (2007) A GC/MS study of the drying of ballpoint pen ink on paper. Forensic Sci Int 168:119–127
Xu Y, Wang J, Yao L (2006) Dating the writing age of black roller and gel inks by gas chromatography and UV–Vis spectrophotometer. Forensic Sci Int 162:140–143
Yang B, Shao Q, Pan L, Li W (2018) A study on regularized weighted least square support vector classifier. Pattern Recog Lett 108:48–55
Zięba-Palus J, Wesełucha-Birczyńska A, Trzcińska B, Kowalski R, Moskal P (2017) Analysis of degraded papers by infrared and Raman spectroscopy for forensic purposes. J Mol Struct 1140:154–162
Acknowledgments
The authors would like to thank Yue Huang, Hong Yan and Qianqian Li for the critical review of the manuscript. The authors are also grateful to China Agricultural University Library for providing samples used in this work.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
Authors declare that they have no conflict of interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xia, J., Huang, Y., Zhang, J. et al. Development of a chemometric methodology based on FTIR spectra for paper dating. Cellulose 27, 5323–5335 (2020). https://doi.org/10.1007/s10570-019-02892-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10570-019-02892-1