Deep learning in forensic gunshot wound interpretation—a proof-of-concept study

Oura, Petteri; Junno, Alina; Junno, Juho-Antti

doi:10.1007/s00414-021-02566-3

Deep learning in forensic gunshot wound interpretation—a proof-of-concept study

Original Article
Open access
Published: 06 April 2021

Volume 135, pages 2101–2106, (2021)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Legal Medicine Aims and scope Submit manuscript

Deep learning in forensic gunshot wound interpretation—a proof-of-concept study

Download PDF

Petteri Oura¹,
Alina Junno^2,3 &
Juho-Antti Junno^2,3,4

4733 Accesses
10 Citations
8 Altmetric
Explore all metrics

Abstract

While the applications of deep learning are considered revolutionary within several medical specialties, forensic applications have been scarce despite the visual nature of the field. For example, a forensic pathologist may benefit from deep learning-based tools in gunshot wound interpretation. This proof-of-concept study aimed to test the hypothesis that trained neural network architectures have potential to predict shooting distance class on the basis of a simple photograph of the gunshot wound. A dataset of 204 gunshot wound images (60 negative controls, 50 contact shots, 49 close-range shots, and 45 distant shots) was constructed on the basis of nineteen piglet carcasses fired with a .22 Long Rifle pistol. The dataset was used to train, validate, and test the ability of neural net architectures to correctly classify images on the basis of shooting distance. Deep learning was performed using the AIDeveloper open-source software. Of the explored neural network architectures, a trained multilayer perceptron based model (MLP_24_16_24) reached the highest testing accuracy of 98%. Of the testing set, the trained model was able to correctly classify all negative controls, contact shots, and close-range shots, whereas one distant shot was misclassified. Our study clearly demonstrated that in the future, forensic pathologists may benefit from deep learning-based tools in gunshot wound interpretation. With these data, we seek to provide an initial impetus for larger-scale research on deep learning approaches in forensic wound interpretation.

Automated wound segmentation and classification of seven common injuries in forensic medicine

Article Open access 28 June 2023

Development and evaluation of deep learning algorithms for assessment of acute burns and the need for surgery

Article Open access 31 January 2023

Burns Depth Assessment Using Deep Learning Features

Article Open access 16 October 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Artificial intelligence uses trained algorithms to mimic human cognitive functions [1], especially in the context of interpreting complex data [2]. In contrast to conventional methods, artificial intelligence algorithms are allowed to approach problems freely without strict programming [3]. Deep learning is a subcategory under artificial intelligence, utilizing neural networks in a wide range of concepts such as image, text, and speech recognition [2, 4, 5]. While the applications of artificial intelligence and deep learning techniques are considered revolutionary within the healthcare sector and several medical specialties [2, 4, 6, 7], forensic applications have been relatively scarce [3, 8,9,10,11] and centered on subfields other than forensic pathology. This somewhat surprising, given the visual nature of forensic pathology at both microscopic and macroscopic levels.

The vast majority of missile wounds are caused by firearms [12]. In the USA, more than 30,000 people sustained lethal gunshot trauma in 2014, roughly two-thirds being suicides and one-third homicides [13]. As a thorough investigation of the course of events is in the public interest with potentially far-reaching legal consequences, the forensic pathologist’s conclusion of events should naturally be as accurate as possible. However, in case of lacking background information (in terms of, e.g., weapon type, bullet type, gunshot trajectory, and shooting distance [12, 14, 15]), these need to be inferred solely on the basis of the wounds and injuries detected by examining the victim’s body. To the best of our knowledge, there are no previous studies addressing the potential of deep learning in assisting the forensic pathologist in these cases.

In our proof-of-concept study, we aimed to test the hypothesis that deep learning algorithms have potential to predict shooting distance class out of four possibilities on the basis of a simple photograph of a gunshot wound. Due to the preliminary nature of our study, we utilized nineteen piglet carcasses as our material. A dataset of 204 images of gunshot wounds (60 negative controls, 50 contact shots, 49 close-range shots, and 45 distant shots) were used to train, validate, and test the ability of neural net architectures to correctly classify images on the basis of shooting distance. With these data, we sought to provide an initial impetus for larger-scale research on deep learning approaches to forensic wound interpretation.

Materials and methods

Material

This study tested the potential of deep learning methods to predict shooting distance class on the basis of a photograph of the gunshot wound. Nineteen carcasses of freshly died farmed piglets (weight range 2.05–4.76 kg) were used as the study material. The piglets had all sustained a natural death and were stored in a cold restricted room for ≤ 5 days until collected for this experiment. Exclusion criteria included external deformity and abnormal or blotchy skin pigmentation. Pig and piglet carcasses have been frequently and successfully used in studies on taphonomic processes and other forensic purposes [16]. While pigs do not perfectly act as a substitute for humans, they were chosen for this proof-of-concept study in order to avoid the obvious ethical concerns related to the use of human cadavers and minimize the risk of sensationalism. Importantly, pigs have rather similar skin attributes to humans [16, 17].

This study did not involve laboratory animals, living animals, or human cadavers. No piglets were harmed for this experiment, as all of them had died naturally in young age and were collected for this study afterwards. The carcasses were delivered for disposal according to local regulations immediately after the experiment.

Infliction of gunshot wounds

The .22 Long Rifle (5.6 × 15 mm) is a popular rimfire caliber in rifles and pistols. Contrary to general view as a low-powered and less dangerous caliber, .22 Long Rifle firearms are often used in shootings and homicides [18]. For example in Australia 43% [19] and in New Zealand 33% [20] of firearm homicides were committed using a .22 caliber rimfire. For this study, we selected the Ruger Standard Model semiautomatic pistol which is one of the most popular rimfire pistols globally. A 5 1/2 barrel length version of Mark II model was used in our experiment.

We selected to use the pistol version of .22 Long Rifle ammunition as we assumed that the powder type and amount are optimized with the shorter barrel length of pistols. We used Lapua Pistol King cartridge with 2.59 g (40 grain) round nose unjacketed lead bullet. Reported muzzle velocity with the barrel length of 120 mm is 270 m/s.

For this proof-of-concept study, we restricted shooting distances to three discrete classes, namely contact shot (0 cm), close-range shot (20 cm), and distant shot (100 cm). These were selected on the basis of previous literature regarding gunshot wound interpretation and classification among humans [12, 14, 15]. Fifty rounds of each distance were fired, with a grand total of 150 rounds in our experiment. Examples of typical wound appearances are presented in Fig. 1.

Photography of gunshot wounds

A professional 24.2-megapixel digital single-lens reflex Canon EOS 77D camera with Canon EF-S 17–55 mm f/2.8 IS USM lens (Canon, Tokyo, Japan) were used to obtain photographs of the entry wounds. Photographs were taken in a 90-degree angle towards the carcass, after ensuring an even lighting.

Experimental setting

As the experiment involved a firearm, safety measures were carefully planned beforehand. The experiment was performed in a restricted area, and only researchers whose presence was essential were present during the experiment. A researcher with a valid firearms license and long-term experience with firearms, including the one used in this study, performed the experiments (JAJ).

Due to the preliminary nature of the study, we aimed to keep all circumstantial factors fixed, except for shooting distance. First, a carcass was visually inspected and positioned on its left/right side onto a sandbed with a paperboard between the carcass and the sandbed. A set of clean photographs (negative controls) were obtained. Then, a round of 5–10 shots (depending on the size of the carcass) was fired vertically in a 90-degree angle at its upward-facing side, excluding face and distal limbs. Each carcass was randomized to sustain either contact shots, close-range shots, or distant shots using a random number generator (SPSS Statistics version 26, IBM, Armonk, NY, USA). An adjustable ruler was used to ensure the accuracy of shooting distance. Lastly, the carcass was photographed. The external circumstances such as weather and lighting conditions remained unchanged during the course of the experiment.

Image curation and post-processing

At the post-experiment phase, the full-sized photographs obtained during the experiment were uploaded to Photoshop Elements version 2021 (Adobe, San Jose, CA, USA) for visual inspection. Gunshot wounds (n = 144) were identified and extracted using a 420 × 420-pixel square cutter in .png format. Negative controls (n = 60) were extracted from photographs with no gunshot wounds following an identical method, with the exception that the extracted regions were chosen at random, without allowing two regions to overlap. The final dataset comprised a total of 204 images. Examples of extracted images are presented in Fig. 1.

Using a random number generator, the images of the final dataset were randomly divided into three sets (i.e., training, validation, and test sets) to be utilized in the deep learning procedure. In order to ensure sufficient set sizes, we used a 60%-20%-20% ratio in the division. Each of the three sets contained roughly equal percentages of negative controls, contact shots, close-range shots, and distant shots (Table 1).

Table 1 Division of data into training, validation, and test sets

Full size table

Deep learning procedure

Deep learning was performed using the open-source AIDeveloper software version 0.1.2 [5] which has a variety of neural net architectures readily available for image classification. The curated images were uploaded to AIDeveloper as 32 × 32-pixel grayscale images in order to streamline the deep learning process due to the relatively small sample size.

First, we used the training and validation image sets to train models (up to 3000 epochs) by means of all the available architectures. The specific parameters used in AIDeveloper are presented in Supplementary Table 1. Based on the validation accuracies obtained during the training process, we selected the best-performing model within each architecture. The best-performing models were then tested using the test set; performance metrics of all trained models are presented in Supplementary Table 2. In the results section, we present the model with the highest performance metrics from the test set round.

We assessed the models on the basis of the following metrics: testing accuracy (i.e., (true positive + true negative)/all), precision (i.e., true positive/(true positive + false positive)), recall (i.e., true positive/(true positive + false negative)), F1 value (i.e., 2 x (precision x recall)/(precision + recall)), and area under the receiver operating characteristics curve (i.e., a measure of the performance of a test relative to the ideal where there are no false negative or false positive results) [5, 21]. We did not aim to publish the algorithm of the best-performing model, as its applicability among other populations (e.g., external piglet carcasses and human cadavers) was likely to be poor. However, the algorithms are available from the corresponding author on request.

Results

Of the explored neural network architectures, a trained multilayer perceptron based model (MLP_24_16_24) reached the highest testing accuracy of 98%. The components of this model are specified in Table 2. A confusion matrix (i.e., a contingency table where predicted class is plotted against true class) and performance metrics for each class are presented in Tables 3 and 4, respectively. Of the testing set, the trained model was able to correctly classify all negative controls, contact shots, and close-range shots, whereas one distant shot was misclassified.

Table 2 Neural net architecture of the best-performing multilayer perceptron model (MLP_24_16_24)

Full size table

Table 3 Confusion matrix demonstrating true and predicted classes on test set data (bold signifies correct predictions)

Full size table

Table 4 Performance metrics of the trained neural network model against the test set

Full size table

Discussion

In this proof-of-concept study, we aimed to study the ability of deep learning methods to correctly estimate shooting distance on the basis of gunshot wound photographs. In our dataset of 204 images, a trained multilayer perceptron based neural net architecture proved most accurate, reaching the highest testing accuracy of 98%. As such, our results encourage larger-scale research on deep learning approaches to forensic wound interpretation.

Although artificial intelligence and deep learning techniques have clinical applications in several medical specialties [2, 4, 6, 7], forensic applications have been relatively scarce [3, 8,9,10,11] and centered on subfields other than forensic pathology. To the best of our knowledge, this is the first study to address deep learning in gunshot wound interpretation. The present dataset comprised images from four discrete classes, namely contact shot, close-range shot, and distant shot wounds, as well as negative controls with no wounds. Each of the wound types was considered to have distinct visual features, thus providing a basis for the deep learning approach. Of the independent testing set, the fully trained multilayer perceptron based model was able to correctly classify all negative controls (100%), contact shots (100%), and close-range shots (100%) and misclassified one distant shot as a negative control (88.9%). Even though the division into four classes was relatively rough, the present results suggest that forensic pathologists may benefit from deep learning algorithms in gunshot wound interpretation.

The main limitations of the study include a relatively small sample comprised of non-human material (204 photographs from 19 piglet carcasses) and the focus on only four gunshot wound classes. While only one weapon type and caliber were used in this preliminary study, further studies are needed to challenge the deep learning process with higher levels of variation in the dataset (e.g., weapon type, bullet type, shot trajectory, and circumstantial factors). Apart from the training, validation, and test sets needed for the training and testing of the algorithms, this study had no external validation set. Furthermore, we did not compare the performance of the algorithm to the gold standard (i.e., an experienced forensic pathologist). Therefore, the applicability of the present algorithm is highly limited in external settings. However, given the proof-of-concept nature and the high classification accuracy of the best-performing algorithm reached in this study, it seems clear that future research is needed to develop more robust and widely applicable algorithms for forensic use.

Future studies are encouraged to develop robust algorithms using large image sets of human material from diverse sources. In large datasets, shooting distance should be modelled continuously instead of discrete classes in order to increase the applicability of the estimates. Alongside shooting distance, algorithms should also be developed for other relevant indices such as weapon type, bullet type, and gunshot trajectory [12, 14, 15], or in a wider context also for other missile wounds, blunt trauma, or virtually any lesion with forensic significance. Ideally, prospective multicenter image libraries of substantial magnitude would offer outstanding material for the systematic development of accurate and scientifically validated algorithms.

From the legal perspective, the use of artificial intelligence to generate robust and comprehensible evidence remains problematic. Although artificial intelligence may be subject to human-like bias through the way it processes datasets and assesses events, it is difficult to examine similarly to human witnesses in the courtroom [22]. However, we hope that deep learning algorithms will, in the future, provide forensic pathologists with tools to utilize when constructing a view of the course of events. For example, the tools may assist in screening complex datasets for a specific detail or highlight areas of potential interest to the forensic pathologist. Importantly, the final interpretation would always remain with the forensic pathologist.

In this proof-of-concept study, a trained deep learning algorithm was able to predict shooting distance class (negative control, contact shot, close-range shot, or distant shot) on the basis of a simple photograph of a gunshot wound with an overall accuracy of 98%. The results of this study encourage larger-scale research on deep learning approaches to forensic wound interpretation.

Data availability

The datasets and algorithms generated and analyzed during the study are available from the corresponding author on request.

References

Russell S, Norvig P (2021) Artificial intelligence: a modern approach (4th Edition). Pearson, Hoboken
Google Scholar
Hosny A, Parmar C, Quackenbush J, Schwartz L, Aerts H (2018) Artificial intelligence in radiology. Nat Rev Cancer 18(8):500–510
Article CAS Google Scholar
Margagliotti G, Bollé T (2019) Machine learning & forensic science. Forensic Sci Int 298:138–139
Article Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Article CAS Google Scholar
Kräter M, Abuhattum S, Soteriou D, Jacobi A, Krüger T, Guck J et al (2020) AIDeveloper: deep learning image classification in life science and beyond. bioRxiv 2020.03.03.975250
Yu K, Beam A, Kohane I (2018) Artificial intelligence in healthcare. Nat Biomed Eng 2:719–731
Article Google Scholar
Rajkomar A, Dean J, Kohane I (2019) Machine learning in medicine. New Engl J Med 380(14):1347–1358
Article Google Scholar
Nikita E, Nikitas P (2020) On the use of machine learning algorithms in forensic anthropology. Leg Med (Tokyo) 47:101771
Article Google Scholar
Fan F, Ke W, Wu W, Tian X, Lyu T, Liu Y et al (2020) Automatic human identification from panoramic dental radiographs using the convolutional neural network. Forensic Sci Int 314:110416
Article Google Scholar
Peña-Solórzano C, Albrecht D, Bassed R, Burke M, Dimmock M (2020) Findings from machine learning in clinical medical imaging applications - Lessons for translation to the forensic setting. Forensic Sci Int 316:110538
Article Google Scholar
Porto L, Lima L, Franco A, Pianto D, Machado C, Vidal F (2020) Estimating sex and age from a face: a forensic approach using machine learning based on photo-anthropometric indexes of the Brazilian population. Int J Legal Med 134(6):2239–2259
Article Google Scholar
Saukko P, Knight B (2004) Knight’s forensic pathology, 3rd edn. Hodder Arnold, London
Book Google Scholar
Wolfson J, Teret S, Frattaroli S, Miller M, Azrael D (2016) The US public’s preference for safer guns. Am J Public Health 106(3):411–413
Article Google Scholar
Dolinak D, Matshes E, Lew E (2005) Forensic Pathology: principles and practice. Elsevier Academic Press, Burlington
Google Scholar
Denton J, Segovia A, Filkins J (2006) Practical pathology of gunshot wounds. Arch Pathol Lab Med 130(9):1283–1289
Article Google Scholar
Matuszewski S, Hall M, Moreau G, Schoenly K, Tarone A, Villet M (2020) Pigs vs people: the use of pigs as analogues for humans in forensic entomology and taphonomy research. Int J Legal Med 134(2):793–810
Article Google Scholar
Avci P, Sadasivam M, Gupta A, De Melo W, Huang Y, Yin R et al (2013) Animal models of skin disease for drug discovery. Expert Opin Drug Discov 8(3):331–355
Article CAS Google Scholar
Alpers P (1998) “Harmless” .22 calibre rabbit rifles kill more people than any other type of gun: March 1998. 22R Fact Sheet. Available at: https://www.gunpolicy.org/documents/5561-22-calibre-rabbit-rifles-kill-more-people-than-any-other/file, Nov 19, 2020
Data on firearms and violent death (1996) Brief prepared for the Commonwealth Police Ministers’ meeting of 10 May 1996. Australian Institute of Criminology, Canberra, Australia
Google Scholar
Alpers P, Morgan B (1995) Firearm Homicide in New Zealand: victims, perpetrators and their weapons 1992–94. A survey of NZ Police files presented to the National Conference of the Public Health Association of New Zealand. Knox College, Dunedin
Machin D, Campbell M, Walters S (2007) Medical Statistics, Fourth Edition - A Textbook for the Health Sciences. Wiley, Hoboken
Google Scholar
Gless S (2020) AI in the Courtroom: a comparative analysis of machine evidence in criminal trials. Georget J Int Law 51(2):195–253
Google Scholar

Download references

Funding

Open access funding provided by University of Oulu including Oulu University Hospital.

Author information

Authors and Affiliations

Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
Petteri Oura
Cancer and Translational Medicine Research Unit, University of Oulu, Oulu, Finland
Alina Junno & Juho-Antti Junno
Department of Archaeology, Faculty of Humanities, University of Oulu, Oulu, Finland
Alina Junno & Juho-Antti Junno
Faculty of Arts, University of Helsinki, Helsinki, Finland
Juho-Antti Junno

Authors

Petteri Oura
View author publications
You can also search for this author in PubMed Google Scholar
Alina Junno
View author publications
You can also search for this author in PubMed Google Scholar
Juho-Antti Junno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Petteri Oura.

Ethics declarations

Ethical approval

Not applicable.

Human and animal rights and informed consent

This study did not involve laboratory animals, living animals, or human cadavers. No piglets were harmed for this experiment, as all of them had died naturally in young age and were collected for this study afterwards. The carcasses were delivered for disposal according to local regulations immediately after the experiment.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Conflict of interest

The authors declare no competing interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Key points

• Deep learning applications have been scarce in forensic pathology despite the visual nature of the field.

• This proof-of-concept study tested the potential of trained neural network architectures to predict shooting distance on the basis of a gunshot wound photograph.

• A multilayer perceptron based model reached the highest testing accuracy of 98%.

• Forensic pathologists may benefit from deep learning-based tools in gunshot wound interpretation.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 20 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oura, P., Junno, A. & Junno, JA. Deep learning in forensic gunshot wound interpretation—a proof-of-concept study . Int J Legal Med 135, 2101–2106 (2021). https://doi.org/10.1007/s00414-021-02566-3

Download citation

Received: 22 November 2020
Accepted: 08 March 2021
Published: 06 April 2021
Issue Date: September 2021
DOI: https://doi.org/10.1007/s00414-021-02566-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Deep learning in forensic gunshot wound interpretation—a proof-of-concept study

Abstract

Similar content being viewed by others

Automated wound segmentation and classification of seven common injuries in forensic medicine

Development and evaluation of deep learning algorithms for assessment of acute burns and the need for surgery

Burns Depth Assessment Using Deep Learning Features

Introduction

Materials and methods

Material

Infliction of gunshot wounds

Photography of gunshot wounds

Experimental setting

Image curation and post-processing

Deep learning procedure

Results

Discussion

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Human and animal rights and informed consent

Consent to participate

Consent for publication

Conflict of interest

Additional information

Publisher's note

Key points

Supplementary Information

Supplementary file1 (DOCX 20 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Deep learning in forensic gunshot wound interpretation—a proof-of-concept study

Abstract

Similar content being viewed by others

Automated wound segmentation and classification of seven common injuries in forensic medicine

Development and evaluation of deep learning algorithms for assessment of acute burns and the need for surgery

Burns Depth Assessment Using Deep Learning Features

Introduction

Materials and methods

Material

Infliction of gunshot wounds

Photography of gunshot wounds

Experimental setting

Image curation and post-processing

Deep learning procedure

Results

Discussion

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Human and animal rights and informed consent

Consent to participate

Consent for publication

Conflict of interest

Additional information

Publisher's note

Key points

Supplementary Information

Supplementary file1 (DOCX 20 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation