Structure Annotation in the Polish Corpus of Suicide Notes

  • Michał Marcińczuk
  • Monika Zaśko-Zielińska
  • Maciej Piasecki
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6836)


The Polish Corpus of Suicide Notes (henceforth PCSN) is constructed to meet the needs of forensic linguistics. Suicide notes are messages created in a borderline situation, shortly before death. Hence the annotation schema requires a complex description of the document structure, the textual content, and the linguistic properties of each note. TEI was selected as the basis for the document encoding schema. The adaptation and extension of TEI are discussed with examples, covering such aspects of encoding as letter structure, various layers of changes and omissions, error correction, and extra-linguistic elements.
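As a rough illustration of the kinds of encoding the abstract lists, the sketch below builds a small TEI-style fragment in Python. It uses only standard TEI P5 element names (`opener`, `salute`, `del`, `add`, `choice`/`sic`/`corr`, `gap`, `closer`, `signed`). It is not the PCSN schema itself, whose adaptations and extensions are described in the paper. The letter text is invented and neutral, and the helper names `build_note` and `corrected_text` are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_note() -> ET.Element:
    """Build a tiny TEI-style letter body (invented text, generic TEI elements)."""
    body = ET.Element("body")

    # Letter structure: opening formula.
    opener = ET.SubElement(body, "opener")
    ET.SubElement(opener, "salute").text = "Dear Anna,"

    p = ET.SubElement(body, "p")
    p.text = "I will "

    # Layers of change: a word crossed out and replaced by the writer.
    deleted = ET.SubElement(p, "del")
    deleted.text = "call"
    added = ET.SubElement(p, "add")
    added.text = "write"
    added.tail = " "

    # Error correction: original spelling kept next to the editor's correction.
    choice = ET.SubElement(p, "choice")
    ET.SubElement(choice, "sic").text = "tommorow"
    ET.SubElement(choice, "corr").text = "tomorrow"
    choice.tail = " at "

    # Omission: a passage the transcriber could not read.
    gap = ET.SubElement(p, "gap", reason="illegible")
    gap.tail = "."

    # Letter structure: closing formula with signature.
    closer = ET.SubElement(body, "closer")
    ET.SubElement(closer, "signed").text = "M."
    return body


def corrected_text(elem: ET.Element) -> str:
    """Return a reading text: deletions dropped, corrections applied, gaps marked."""
    parts = [elem.text or ""]
    for child in elem:
        if child.tag == "del":
            pass  # deleted material is left out of the reading text
        elif child.tag == "gap":
            parts.append("[...]")
        elif child.tag == "choice":
            corr = child.find("corr")
            parts.append(corr.text if corr is not None and corr.text else "")
        else:
            parts.append(corrected_text(child))
        parts.append(child.tail or "")
    return "".join(parts)


if __name__ == "__main__":
    note = build_note()
    print(ET.tostring(note, encoding="unicode"))
    print(corrected_text(note.find("p")))
```

Keeping the deleted word, the misspelling, and the gap in the markup means the original wording remains available for analysis, while a normalised reading text can still be derived when needed.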


Keywords: forensic linguistics, suicide note, structure annotation




Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Michał Marcińczuk (1)
  • Monika Zaśko-Zielińska (2)
  • Maciej Piasecki (1)

  1. Institute of Informatics, Wrocław University of Technology, Wrocław, Poland
  2. Institute of Polish Philology, University of Wroclaw, Wrocław, Poland
