1 Introduction

Classification of Bloodstains Forensic bloodstain pattern analysis involves analysing the shape and location of bloodstains at the crime scene to obtain case-relevant information. Since blood as a biological fluid is subject to physical laws, identical drop shapes indicate identical formation dynamics. Thus, there are classification schemes designed by forensic scientists who try to classify blood samples according to their shape. It is assumed that blood patterns of the same class have also been formed in the same way. Among the best known classification schemes are the system by Gardner et al. which is based on the velocity of blood drops [7] and the system by Bevel and Gardner which is based on the causative force [4]. This system fundamentally distinguishes between actively and passively induced bloodstains. Passive bloodstains are generated by the pure action of gravity, while active bloodstains are additionally accelerated by other external forces.

Fig. 1

A schematic depiction of one of the most common classification schemes for bloodstain patterns in Europe. It distinguishes between traces that were generated passively (“Passive”), traces that were created actively through transfer (“Transfer”), and traces that were created actively through projection (“Projected”). Traces that are difficult to assign fall into the category “Miscellaneous”. (modified from [4]; expiratory stain and fly spot taken from [4])

At the scene of a crime, this classification makes it possible to distinguish whether a bloodstain is the direct result of an active force or whether it has passively dripped from wounds or elevated objects. Further, the system by Bevel and Gardner [4] subdivides active bloodstains into transfer bloodstains and projected bloodstains, resulting in four supercategories: passive traces, transfer traces, projected traces, and miscellaneous. Each of these supercategories contains a different number of blood patterns that can be distinguished from each other. An overview is shown in Fig. 1.

Deep learning via Inception v3 Google's open-source framework TensorFlow is easy to use and, thanks to its many interfaces, can be operated on almost all systems [1]. We utilize this framework to train a convolutional neural network (CNN) with the aim of automatically distinguishing blood spatters from drip stains.

CNNs are artificial neural networks inspired by biological processes in the brain. This method of deep learning is often used for processing images or audio files. A CNN basically consists of units of one or more convolutional layers followed by a pooling layer. While in the convolutional layer a small convolutional matrix is applied to the input to determine the activity of the individual neurons, in the pooling layer excess information is discarded. For this purpose, the matrix is divided into areas of equal size and only the most active neuron in each area is passed on (“max-pooling”). This keeps the memory requirement small and increases the calculation speed. After one or more such units of convolutional and pooling layers, a so-called fully connected layer usually follows. In this last layer, the number of neurons typically corresponds to the number of predefined classes that the network should distinguish. The associated softmax function translates the activities of the fully connected layer into a probability distribution. Overall, this simplified structure is used to extract information from 2D or 3D data and to calculate the probabilities with which the input matches the given classes. Among the best-known applications of CNNs are speech and image recognition.
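The three core operations just described (convolution, max-pooling, and softmax) can be sketched in a few lines of NumPy. This is our own minimal illustration, not the code of the presented method; the image size, kernel values, and logits are purely illustrative:

```python
import numpy as np

def convolve2d(image, kernel):
    """Valid 2-D convolution: slide a small matrix over the input."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def max_pool(feature_map, size=2):
    """Max-pooling: keep only the most active neuron in each area."""
    h, w = feature_map.shape
    trimmed = feature_map[:h - h % size, :w - w % size]
    return trimmed.reshape(h // size, size, w // size, size).max(axis=(1, 3))

def softmax(logits):
    """Translate fully-connected activities into a probability distribution."""
    e = np.exp(logits - logits.max())
    return e / e.sum()

image = np.random.rand(8, 8)                  # toy grayscale input
kernel = np.array([[1., 0.], [0., -1.]])      # toy edge-like filter
features = max_pool(convolve2d(image, kernel))
probs = softmax(np.array([2.0, 0.5]))         # two classes, as in this study
print(features.shape)  # (3, 3)
```

In a real CNN such as Inception v3, many of these convolution/pooling units are stacked, and the filter values are learned during training rather than fixed by hand.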

While conventional CNNs stacked more and more layers in the hope of increasing performance, a new approach emerged during the ImageNet Recognition Challenge: Inception, in its current version Inception v3 from GoogLeNet, increases performance through architectural tricks [16]. Different, reduced filter sizes improve performance and make prominent image areas easier to recognize, even if they are of different sizes (Fig. 2).

Fig. 2

A schematic representation of the Inception model. It consists of symmetric and asymmetric building blocks, including convolutions, average-pooling, max-pooling, concatenations, dropouts, and fully connected layers. A special feature is transfer learning, which makes it possible to retrain only the final layer of an existing model, reducing both the training time and the amount of required training data. (modified from [13])

In 2015, the Google Brain Team released the TensorFlow framework [1], which was designed to simplify programming, especially in the area of machine learning, since it includes many ready-made software solutions and is compatible with many major programming languages. In TensorFlow, mathematical operations are represented in the form of a graph, which describes the sequential flow of all operations to be performed. The CNN Inception v3 is included with TensorFlow version 1.4.1 and offers scientists an excellent opportunity to apply pre-trained models to their own data or to train their own models. In this publication, self-created two-dimensional image data of bloodstains are classified. Inception v3 is therefore the ideal platform within TensorFlow, because it promises high performance for this kind of data and is easy to adopt.

Motivation and Aim Experienced bloodstain pattern analysts can classify bloodstain patterns at a crime scene with a high success rate, but it has been shown that contextual information from the crime scene is often incorporated into pattern classification decisions, which leads to a 20% higher proportion of misclassifications [17]. Based on this, the present study evaluates whether it is possible to automate the process of bloodstain classification. This approach promises a purely objective view of the bloodstains. Deep learning approaches are well suited to image recognition, where the information of many thousands or millions of pixels has to be mapped to a few permitted classes [18]. Many fields and disciplines of forensic science already benefit from computer-aided analysis, which shortens computation time and enables high-throughput analysis [3, 6, 9, 10, 14]. In the field of forensic bloodstain pattern analysis, new approaches to digital analysis have also emerged in recent years [2, 5, 8, 19]. There are even finished software products intended to support the interpretation of bloodstains at the crime scene [11, 12]. Especially in this field, the possibility of automatic classification would be enormously helpful.

To establish a starting point for automatic bloodstain pattern classification, this publication focuses only on the distinction between passively originated drip stains and actively originated blood spatters. By definition, drip stains are created by blood drops falling solely under gravity, whereas blood spatters are created by blood drops thrown through the air by a force acting on a source of liquid blood [15]. The differentiation of drip stains and blood spatters is comparatively simple, since they can easily be distinguished visually: while drip stains are almost perfectly round, blood spatters take on different shapes depending on the applied force.
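This visual distinction can even be quantified with a simple shape measure. The following sketch (our own illustration, independent of the CNN presented later) uses the eigenvalues of the second image moments of a binary stain mask to measure elongation; a ratio near 1 indicates an almost perfectly round drip stain, while larger ratios indicate spatter-like elongation:

```python
import numpy as np

def elongation(mask):
    """Ratio of the principal variances of the stain's pixel coordinates.
    1.0 = perfectly round; larger values = more elongated."""
    ys, xs = np.nonzero(mask)
    coords = np.vstack([xs - xs.mean(), ys - ys.mean()])
    eigvals = np.sort(np.linalg.eigvalsh(np.cov(coords)))
    return eigvals[1] / eigvals[0]

yy, xx = np.mgrid[0:64, 0:64]
# Toy masks: a circular "drip stain" and an elongated "spatter"
round_stain = ((xx - 32) ** 2 + (yy - 32) ** 2) < 15 ** 2
elongated_stain = (((xx - 32) / 3.0) ** 2 + (yy - 32) ** 2) < 10 ** 2
print(elongation(round_stain) < elongation(elongated_stain))  # True
```

A hand-crafted measure like this captures only the shape factor; the appeal of the CNN approach is that it learns such discriminating features, and others, directly from the data.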

2 Data Acquisition and Method

Untreated, mixed porcine blood from a local butchery was used to create the bloodstain patterns. To prevent premature blood clotting, it was transported in airtight bags, stored in a refrigerator, and used within 12 h.

For the passively originated drip stains, blood was drawn into a commercially available disposable pipette and dropped vertically onto paper strips from heights between 25 and 35 cm. Care was taken to ensure that the drops detached on their own and did not fall on already formed bloodstains. The finished blood-stained paper strips were placed on a DIN A3 sheet of paper. To take the pictures, a camera was mounted on a tripod orthogonally to the background at a height of 45 cm. The paper strips were pulled underneath and photographed.

For the actively originated blood spatters, a structure consisting of cardboard boxes, a polystyrene board, a cardboard tube, a metal plate, and several A3 sheets of paper was designed. This structure was fixed with adhesive tape and made impermeable to liquids. It thereby became possible to exert a constant force on a puddle of 15 ml of blood placed in the middle of the box by dropping the cardboard tube onto it from a height of 50 cm. After removing the blood-spattered paper from the setup and cutting it lengthwise into strips, the resulting active blood spatters could be photographed by a vertically positioned camera at a height of 45 cm (Fig. 3).

The images were transferred via a storage medium to an average PC (specifications: Ubuntu 18.04.4 LTS; 4 GB RAM; AMD Ryzen 7 3700x 8-core processor; 42 GB HDD; VMSVGA 30 MB RAM) and then edited with the freely available image editing software GIMP 2. The image processing involved white balancing and cutting out the individual bloodstains (\(400\times 400\) pixels). Each bloodstain was then saved as an individual image file. This procedure resulted in a total of 2926 images. Of these, 2560 were divided into blood spatters (n = 1595) and drip stains (n = 965) according to their origination and placed in two folders, which represented the classes for the following training. The remaining 366 images were not subdivided and were later used as the test data set.

In total, this yielded a training data set (1595 blood spatters and 965 drip stains) and a test data set (366 images without class labels).

The training program for the CNN Inception v3 was created using the programming language Python 2.7. Essential libraries provided by pip for the presented method were TensorFlow, Pandas, Tensorboard, NumPy, Argparse, Hashlib, and Tarfile. A complete listing of all necessary libraries can be found in the supplements. In the end, 2000 training steps were used to make the classification accuracy of the model as high as possible. One step corresponds to one complete training epoch (batch size = 100). For randomization, the images were modified by cropping (reducing the content to a maximum of 50%) and flipping (to the left or to the right). Already cropped images were also flipped.
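The cropping and flipping steps can be sketched as follows. This is a hedged NumPy illustration of the distortions described above, not the actual TensorFlow retraining code; we assume here that "max. 50%" means at most half of each image side is removed:

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed for reproducibility

def random_crop(img, max_crop_fraction=0.5):
    """Crop a random window keeping at least (1 - max_crop_fraction)
    of each side -- an assumption about the paper's '50%' setting."""
    h, w = img.shape[:2]
    new_h = int(rng.integers(int(h * (1 - max_crop_fraction)), h + 1))
    new_w = int(rng.integers(int(w * (1 - max_crop_fraction)), w + 1))
    top = int(rng.integers(0, h - new_h + 1))
    left = int(rng.integers(0, w - new_w + 1))
    return img[top:top + new_h, left:left + new_w]

def random_flip(img):
    """Flip left/right with probability 0.5."""
    return img[:, ::-1] if rng.random() < 0.5 else img

img = np.arange(400 * 400).reshape(400, 400)   # stands in for one 400x400 stain image
augmented = random_flip(random_crop(img))
```

Such random distortions effectively enlarge the training set, since the network rarely sees exactly the same pixels twice.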

The training of the CNN using TensorFlow was carried out over the given 2000 training steps with an internal evaluation during training. For this, our training data set was split into three subsets: 80% training data, 10% internal test data (not to be confused with our own test data set), and 10% internal validation data. For the training data and the internal test data, the class association is removed, so it is unknown which images show drip stains or blood spatters. In the internal validation data set, this information about class affiliation is retained. This way, an evaluation step can be inserted every 10 training steps, in which the current training status is queried with the internal validation data and, if necessary, a shift in the training focus can be initiated. At the end of each validation step, the internal test data are classified using the current training state of the CNN, which makes it possible to see in retrospect how good the adjustments were. This is intended to prevent overfitting during training. After training, our test data set (366 images) was given to the CNN for classification. These images were not used in training; their class affiliation was known to us but unknown to the CNN. Using this classification result, we evaluated the accuracy of the trained CNN.
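A deterministic way to realize such an 80/10/10 split, suggested by the Hashlib dependency in the library list above, is to hash each file name into a percentage bucket, as TensorFlow's image-retraining script does. The function below is our own sketch of that idea, not the authors' code:

```python
import hashlib

def assign_subset(filename, validation_pct=10, testing_pct=10):
    """Deterministically map a file name to 'training', 'validation',
    or 'testing' by hashing it into one of 100 buckets."""
    digest = hashlib.sha1(filename.encode("utf-8")).hexdigest()
    pct = int(digest, 16) % 100
    if pct < validation_pct:
        return "validation"
    if pct < validation_pct + testing_pct:
        return "testing"
    return "training"

# Illustrative file names; 2560 matches the training set size in the text.
files = ["stain_%04d.jpg" % i for i in range(2560)]
buckets = {"training": 0, "validation": 0, "testing": 0}
for f in files:
    buckets[assign_subset(f)] += 1
print(buckets)  # roughly an 80/10/10 split
```

Because the assignment depends only on the file name, an image always lands in the same subset across training runs, which keeps the internal evaluation honest.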

Fig. 3

Schematic depiction of the generation of the CNN using TensorFlow. Bloodstains generated in the laboratory were divided into two categories and provided to the network as a training dataset. The finished model is able to automatically classify unknown bloodstains from both categories with high accuracy

3 Results

Training the network with 2560 images took 5:42 min. The resulting model had a size of 87.5 MB. As can be seen in Fig. 4, the two predefined classes are mostly separated after only a few training steps. The few exceptions are gradually separated over the course of the 2000 training steps.

Fig. 4

Schematic depiction of the results of the internal cross-validation during training. On the left-hand side, the internal validation data are shown. Since the class affiliation is known here, the focus of the network can be adjusted if the classification accuracy is not yet optimal. On the right-hand side, the internal test data are shown. The adjustments made based on the validation data also affect the accuracy of their classification. Over the course of the 2000 training steps, the test data can be used to track the approximate increase in the classification accuracy of the training data. While the majority of the images can be clearly assigned after only 50 steps, there are some difficult images that can only be cleanly separated after the full 2000 training steps. This slow but steady separation is marked with arrows

The classification of the 366 test images took 30 s. The classification result contains a statement about whether an image shows a drip stain or a blood spatter, as well as a so-called score: a percentage, based on the features previously learned by the model, indicating how certain the statement is. 365 of the 366 test images were correctly classified in this way (99.73%).
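This label-plus-score output corresponds to taking the arg-max of the network's softmax distribution and reporting its probability as a percentage. A minimal sketch (class names from this study; the logit values are invented for illustration):

```python
import numpy as np

CLASSES = ["blood spatter", "drip stain"]

def classify(logits):
    """Return the most probable class and its softmax probability (the score)."""
    e = np.exp(logits - np.max(logits))
    probs = e / e.sum()
    idx = int(np.argmax(probs))
    return CLASSES[idx], float(probs[idx])

label, score = classify(np.array([3.2, 0.4]))
print(label, round(score * 100, 1))  # blood spatter 94.3
```

The score is thus not a calibrated error probability but the model's relative certainty over the two allowed classes.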

4 Discussion

With the help of TensorFlow, a CNN was trained to distinguish between passively originated drip stains and actively originated blood spatters. Of the 366 test images, 99.73% were correctly classified. This result demonstrates that it is possible to classify bloodstains automatically and that the simple approach presented here is suited for this purpose.

Despite this encouraging result, there are limitations so far. In previous experiments (data not shown here), we found that our model currently still has problems classifying wet bloodstains. The present dataset includes only dried bloodstains for training and testing, because the surface of wet bloodstains reflects nearby light sources from above. These reflections at the drop surface were apparently misinterpreted by the model.

To avoid this problem in the future, more wet blood drops should be included in the training dataset, or a separate class could be defined that contains only wet bloodstains. Furthermore, as with almost all classification difficulties of a CNN, the problem can be mitigated with more data.

Therefore, we are already generating additional images of different classes of blood trace patterns. In the future, we will also have to think about integrating images from other databases or crime scenes into our dataset. This way, the approach would always remain up-to-date.

To find out which other factors can negatively influence the classification, we tried to detect those features in the images that are mainly responsible for the classification result. With the help of a heat map (Fig. 5), we can show that essentially two factors influence the classification: the droplet shape and the distribution of solid blood components. Within a few minutes, a bloodstain gradually dries out under the influence of atmospheric oxygen. This causes the solid blood components to contract further and further, leaving behind an edge area of dried blood plasma residue. Depending on the substrate condition and the angle of impact, this area takes on different shapes and is wetted with blood to different degrees. In the case of actively formed blood spatters on walls, gravity during drying ensures that this edge area forms on only one side, namely the upper one. Blood spatters on horizontal surfaces do not have this special feature; here, only the drop shape determines the classification result.
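The text does not name the heat-map technique underlying Fig. 5. One common, model-agnostic option that produces comparable maps is occlusion sensitivity: grey out one image region at a time and record how much the class score drops. The sketch below is our own illustration and uses a stand-in scoring function in place of the trained CNN:

```python
import numpy as np

def occlusion_heatmap(image, score_fn, patch=8):
    """For each patch-sized region, record the drop in the class score
    when that region is replaced by the image mean (greyed out)."""
    h, w = image.shape
    baseline = score_fn(image)
    heat = np.zeros((h // patch, w // patch))
    for i in range(0, h - patch + 1, patch):
        for j in range(0, w - patch + 1, patch):
            occluded = image.copy()
            occluded[i:i + patch, j:j + patch] = image.mean()
            heat[i // patch, j // patch] = baseline - score_fn(occluded)
    return heat  # high values = regions the classification depends on

# Toy example: a bright "stain" in the centre and a stand-in "model"
# whose score is simply the mean brightness of that central region.
img = np.zeros((32, 32))
img[12:20, 12:20] = 1.0
heat = occlusion_heatmap(img, score_fn=lambda x: x[12:20, 12:20].mean())
print(heat.shape)  # (4, 4)
```

Regions where greying out the pixels barely changes the score contribute little to the decision, which is exactly the information visualized in Fig. 5.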

Fig. 5

Visual representation of the decision-making basis for classification. The first row shows the original images, while the second row uses a heat map to highlight which regions in the image were particularly relevant to the classification result. The higher the proportion of yellow in an area, the more that area contributed to the decision. In the bottom row, the heat map was superimposed on the original image to further identify the relevant regions. It is noticeable that apparently not only the shape contributes to the classification, but that features such as the degree of wetting with blood are also used in the decision (color figure online)

5 Outlook

Since the approach presented here worked well, we are currently repeating it with a larger number of training images. Building on this publication, we now plan to separate other bloodstain patterns and to take into account other factors such as texture or background color. A comparison of the classification quality of our approach with that of experts in forensic bloodstain pattern analysis also suggests itself.

We will continue to collect information about the image areas responsible for decision-making when classifying additional blood pattern classes in the future, as this may also provide useful guidance for manual classification, because the CNN reveals visual differences in an unbiased manner. In our vision, the method presented here provides a simple and effective basis for a stepwise integration of computer-aided methods into forensic bloodstain pattern analysis. Figure 6 depicts a schematic overview of a fully automated BPA in 3D space.

Fig. 6

Our vision of an automated forensic bloodstain pattern analysis. After a crime scene with relevant bloodstains has been digitally scanned, all bloodstains must first be extracted from the background for processing. Subsequently, the pattern classification (yellow = presented in this publication) gives a first overview of the crime sequence and helps to determine those bloodstains which are suitable for determining the Area of Convergence (the point of contact of the trajectories in space, which gives information about the position of the victim or perpetrator). To determine the convergence point, boundaries must first be set around the so-called primary stains, from which the angle of incidence can be calculated. Other forensic aspects of blood, such as bloodstain age estimation, can now also be realized in the 3D model (color figure online)

We have already successfully implemented the partial steps shown in Fig. 6 and are currently optimizing them with additional bloodstain patterns. In upcoming publications, we plan to present a stand-alone software solution that includes all components.