Morph Creation and Vulnerability of Face Recognition Systems to Morphing

Ferrara, Matteo; Franco, Annalisa

doi:10.1007/978-3-030-87664-7_6

Matteo Ferrara¹⁶ &
Annalisa Franco¹⁶

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

12k Accesses
5 Citations

Abstract

Face recognition in controlled environments is nowadays considered rather reliable, and very good accuracy levels can be achieved by state-of-the-art systems in controlled scenarios. However, even under these desirable conditions, digital image alterations can severely affect the recognition performance. In particular, several studies show that automatic face recognition systems are very sensitive to the so-called face morphing attack, where face images of two individuals are mixed to produce a new face image containing facial features of both subjects. Face morphing represents nowadays a big security threat particularly in the context of electronic identity documents because it can be successfully exploited for criminal intents, for instance to fool Automated Border Control (ABC) systems thus overcoming security controls at the borders. This chapter will describe the face morphing process, in an overview ranging from the traditional techniques based on geometry warping and texture blending to the most recent and innovative approaches based on deep neural networks. Moreover, the sensitivity of state-of-the-art face recognition algorithms to the face morphing attack will be assessed using morphed images of different quality generated using various morphing methods to identify possible factors influencing the probability of success of the attack.

You have full access to this open access chapter, Download chapter PDF

Detection of Face Morphing Attacks by Deep Learning

A Novel Framework for Detection of Morphed Images Using Deep Learning Techniques

Humans Versus Deep Learning: Detection of Face Morphing as a Peril

1 Introduction

Face morphing is generally described as a seamless transition transforming a facial image into another. Morphing was initially proposed as an image generation technique for computer graphics applications [1] or psychological studies [2, 3]. However, only in recent years it has emerged as a potential and severe security thread for Face Recognition Systems (FRS). The main risk deriving from face morphing is especially related to the adoption of automatic face-based identity verification in various applications like civilian identity management, Machine Readable Travel Documents (eMRTD), or visa management. A possible attack in relation to the use of MRTD in Automated Border Control (ABC) gates has been firstly identified in [4] and later confirmed by several research works. Identity verification at an ABC relies on the comparison of a live captured probe face image with a digital face image stored in an eMRTD such as an e-passport. If a morphed image, which is similar enough to the face of the two parent subjects, can be included in an eMRTD, then two persons can share the document. In this scenario, a criminal could exploit the passport of an accomplice with no criminal records to overcome the security controls. In more details, the subject with no criminal records (i.e., the accomplice) could apply for an eMRTD by presenting the morphed face photo; if the image is not noticeably different from his/her face, the police officer accepts the photo and releases the document (see Fig. 6.1).

The attack will be successful if the morphed image contemporarily meets two conditions.

It is able to fool the human expert, i.e., the morphed face must be very similar to the accomplice who applies for the document and no elements (e.g., morphing artifacts) of the image should raise suspicions;
the image fools at the same time the FRS used for automatic identity verification, meaning that the morphed face can be successfully matched with both subjects (criminal and accomplice).

Some studies confirm that morphed faces can be very realistic and able to fool human experts [5,6,7]. It is well known, in fact, that unfamiliar face recognition is a hard task for humans and it becomes even harder when it has to be accomplished based on a small-size id photo such as the one used by the citizens to apply for an identity or travel document. This photo is generally obtained by printing a high-quality digital image on photographic paper (typical size is 3.5 cm × 4.5 cm) and is then scanned to be included into the document. This printing and scanning process (P&S) hides many small details of the image (e.g., artifacts introduced by the morphing process) thus making it more difficult for human examiners to spot the attack attempt.

Figure 6.2 shows two examples of morphing. In the first case (top row), the morphed image (b) is obtained with an almost equal contribution of the two subjects (a) and (c); the result is quite similar to subject (a) but a human expert could notice some differences. In the second example, the morphed image (e) has been generated from (d) and (f), but with a stronger contribution of subject (d). Visually the morphed image is almost indistinguishable from the accomplice (d) and is very unlikely that it would raise some suspicion by the officer. Both these morphed images, (b) and (d), contain enough information of the “criminal” subject to fool commercial FRSs.

It is worth noting that in case of successful attack, the document issued is perfectly regular; the attack does not consist of altering the document content but in deceiving the officer while issuing the document. The document released will thus pass all the integrity checks (optical and electronic) performed at the gates.

This attack is made possible in practice by the procedure adopted in several countries where there is no live enrolment for facial images and citizens apply for the document by providing an ID photo printed on photographic paper. The trust chain is thus broken since citizens could intentionally alter the image content by different possible digital image manipulations [5], even with criminal intents. Switching to live enrolment would certainly be the most effective solution, but its adoption by all the involved countries is very unlikely; moreover, we have to consider the huge number of documents already issued since the introduction of eMRTDs, which still represent a potential risk. In fact, governmental agencies already reported a few real morphing attack attempts and recent news confirm that the criticalities related to the morphing attack have reached a wide public audience [8,9,10]. Estimating the real extent of this phenomenon is hard, due to the practical impossibility of spotting the cases of successful attack. Unfortunately, the analysis of the vulnerability of FRSs to morphing attack, discussed later in this chapter, is not encouraging and confirm once again that designing effective countermeasures is quite urgent.

This chapter is organized as follows. Section 6.2 describes the face morphing generation algorithms, presenting both traditional landmark-based approaches, as well as innovative solutions based on deep learning. Section 6.3 analyzes and discusses the vulnerability of commercial FRSs to morphing attack; finally, Sect. 6.4 draws some concluding remarks.

2 Face Morphing Generation

Nowadays, the generation of a morphed image has become quite an easy and inexpensive task. Open-source solutions are publicly available, such as for instance general image processing software with specific plugins (e.g., the GAP plugin for GIMP [11]). Moreover, a number of free or commercial tools (e.g., FaceMorpher [12] or FantaMorph [13]), as well as applications for mobile devices or online services are available. Interested readers can refer to [14] for a comprehensive review of publicly available morphing tools. It is however worth noting that the images obtained with these fully automated systems are usually affected by the presence (more or less accentuate) of clearly visible artifacts that would probably cause a rejection of the image by the human officer during the document issuing process. As discussed later in this chapter, the creation of a high quality and credible morphed image usually requires an accurate manual intervention aimed at removing the most relevant defects and make the image undistinguishable from a bona fide one.

2.1 Landmark Based Morphing

Landmark-based approaches for face morphing allow synthesizing a fluid and gradual transformation from one image to another by exploiting facial landmark points in the involved images. Reference points usually correspond to prominent facial components such as mouth, nose, eyes or eyebrows, and approximately outline their shape. Such reference points can be either manually annotated or automatically determined using facial landmark detection algorithms such as Dlib [15], which is the most widely used for this purpose. Of course, the effort needed in the two cases is different, and manual annotation is a boring and time-consuming task; on the other hand, if properly executed, manual landmark labeling usually provides more precise landmark locations and achieves a better image coverage. Automatic landmark detection algorithms, in fact, usually adopt standard facial models that consider the central part of the face and the chin but ignore for instance the forehead region. As we will discuss later, the accuracy of landmark detection has a direct impact on the quality and effectiveness of the generated morphed images.

Starting from the facial landmarks, the morphing process can be generally described as follows. Let ${I}_{0}$ and ${I}_{1}$ be the two parent images to morph and let ${P}_{0}$ and ${P}_{1}$ be the two sets of correspondence points in ${I}_{0}$ and ${I}_{1}$, respectively. For most of the landmark-based approaches, the transformation between the two images is ruled by the so-called morphing factor, a parameter $\alpha $ representing a weighting factor for the two images. The morphing process is therefore generating a set of intermediate frames ${\mathbb{M}}=\left\{{I}_{\alpha },\alpha \in {\mathbb{R}},0<\alpha <1\right\}$ representing the transformation of the first image (${I}_{0}$) into the second one (${I}_{1}$) as shown in Fig. 6.3. Note that, to obtain realistic results, the two images need to be aligned in advance (e.g., by overlaying the eye centers).

In general, each frame is a weighted linear combination of ${I}_{0}$ and ${I}_{1}$ (based on $\alpha $ value), obtained by combining (i) geometric warping [16] of the two images based on correspondence points and (ii) texture blending.

Formally:

$${I}_{\alpha}\left(\mathbf{p}\right)=\left(1-\alpha \right)\cdot {I}_{0}\left({w}_{{P}_{\alpha}\to {P}_{0}}\left(\mathbf{p}\right)\right)+\alpha \cdot {I}_{1}\left({w}_{{P}_{\alpha}\to {P}_{1}}\left(\mathbf{p}\right)\right),$$

(6.1)

where

$\mathbf{p}$ is a generic pixel position;
$\alpha $ is the weight factor, representing the contribution of image ${I}_{1}$ to the morphing ($\alpha =0.3$ indicates that the morphed image will be obtained for the 30% from ${I}_{1}$ and 70% from ${I}_{0}$);
${P}_{\alpha}$ is the set of correspondence points aligned according to the weight factor $\alpha $;
${w}_{{P}_{B}\to {P}_{A}}\left(\mathbf{p}\right)$ is a warping function.

Several warping techniques have been proposed in the literature [17]. A common approach consists in representing the two sets of points (P_A and P_B) by means of topologically equivalent (i.e., no folding or discontinuities are permitted) triangular meshes (see Fig. 6.3) and computing local spatial transformations that map each warped triangle to the corresponding original one [18]. Note that the meshes are constrained to cover the whole images and not to cause self-intersection (i.e., each pixel position is contained in exactly one mesh). A triangular mesh can be derived from a set of points via Delaunay triangulation [19]. Given a generic pixel position $\mathbf{p}$ in the warped image, the transformation used to map $\mathbf{p}$ onto the original image $I$ is the local transformation corresponding to the warped triangle that contains $\mathbf{p}$ (see Fig. 6.4).

The set of aligned correspondence points ${P}_{\alpha}$ in Eq. (6.1) is computed as follows (see Fig. 6.5):

$${P}_{\alpha}=\left\{{\mathbf{r}}_{i}|{\mathbf{r}}_{i}=\left(1-\alpha \right)\cdot {\mathbf{u}}_{i}+\alpha \cdot {\mathbf{v}}_{i}, {\mathbf{u}}_{i}\in {P}_{0},{\mathbf{v}}_{i}\in {P}_{1}\right\}.$$

(6.2)

A more general formulation of the morphing process has been proposed in [20]; here geometric warping and image blending are ruled by two different factors. Equation (6.1) can be generalized as follows:

$${I}_{{\alpha}_{B},{\alpha}_{W}}\left(\mathbf{p}\right)=\left(1-{\alpha}_{B}\right)\cdot {I}_{0}\left({w}_{{P}_{{\alpha}_{W}}\to {P}_{0}}\left(\mathbf{p}\right)\right)+{\alpha}_{B}\cdot {I}_{1}\left({w}_{{P}_{{\alpha}_{W}}\to {P}_{1}}\left(\mathbf{p}\right)\right),$$

(6.3)

where ${\alpha}_{B}$ and ${\alpha}_{W}$ are the blending and warping factors, respectively.

The effects of blending and warping are shown in Fig. 6.7 where two very different subjects have been selected (see Fig. 6.6) to highlight the influence of ${\alpha}_{B}$ and ${\alpha}_{W}$. From a visual point of view, the result from different combinations is overall quite similar, but the effects produced on the probability of success of the attack by the possibility of acting separately on geometry warping and image blending have to be carefully considered. Several studies in fact show that, in the context of face recognition, humans are more sensitive to texture than to geometry [21]; the study [20] reveals that the same holds for FRSs, as confirmed by the experimental results reported in Sect. 3.2. Assigning different weighting factors to texture blending and geometry warping during the face morphing process significantly increases the chances of success, especially in the presence of look-alike subjects.

The automatic generation of morphed images can produce some visible artifacts that might be easily spotted by a human observer, thus drastically reducing the probability of success of the face morphing attack. The adoption of automatically detected facial landmarks, further increase the probability of artifacts in case of inaccurate point identification. The following visible artifacts are generally detectable:

Macroscopic ghost artifacts in the face surrounding area (see Fig. 6.8a). Facial landmarks are usually exclusively located in the facial region, and no reference points are considered for hairs, ears, and ecc. No accurate warping is therefore carried out for these regions, and the blending process produces therefore visible artifacts due to different characteristics (e.g., hair style or background) of the two contributing images.
Fig. 6.8
Morphed image obtained from the two subjects in Fig. 6.3 with macroscopic artifacts in the region around face; b morphed image in (a) after automatic background substitution
Full size image
Minor artifacts close to the facial reference points (eyes, eye brows, mouth, nose, chin, and nostrils) mainly due to insufficient or inaccurate landmarks. Typical patterns are double edges or double reflections on irises (see Fig. 6.9a).
Fig. 6.9
a Small artifacts in the eye region, with double edge effect and multiple light reflections in the iris; b eye region after manual post-processing for artifact removal
Full size image

A widely used solution to remove the macroscopic artifacts in the face surrounding area is background substitution; the background region is typically replaced by the corresponding region of one of the parent images (the one with the highest blending factor), after a proper alignment (see Fig. 6.8b). An additional step is recommended in this case, aimed at equalizing the skin color before background substitution. In fact, due to different illumination conditions or skin color between the two face images, the retouching result could be unsatisfactory, in particular when the facial landmarks do not include the forehead region, thus causing a strong edge with the central face region. To overcome this issue, the histogram matching method described in [22] could be applied.

The second category of artifacts is more difficult to address, and no effective automatic solutions have been identified so far. At present, only a very careful manual post-processing is able to remove them, with a combination of low-level image processing operations such small region cloning from the contributing images, direct painting or edge smoothing (see Fig. 6.9b). Of course, this manual intervention is not trivial and requires some practice to achieve a good result. However, manual post-processing is a key element for the success of the morphing attack, in particular to fool human experts, which could quite easily spot morphing artifacts if not carefully removed.

2.2 Deep Learning-Based Face Morph Generation

The face morphing approaches presented in the previous section provide a precise control on the morphing process in relation for instance to the contribution of the two subjects in the resulting image. On the other hand, since the process relies on facial landmarks, an inaccurate detection of such reference points, as well as the lack of reference points in specific face regions, determine in most cases the presence of some ghost artifacts in the morphed image, which a human expert observing the image could spot quite easily. As mentioned above, the realization of an “ideal” morphed image requires a difficult and time-consuming manual post-processing aimed at removing all visible artifacts. To overcome this limitation, some innovative solutions for face morphing generation have been recently proposed, with the aim of fully automating the generation process. In particular, a few recent works in the literature exploit the potential of Generative Adversarial Networks (GAN) to synthesize morphed images by sampling the two contributing facial images in the latent space, without requiring preliminary landmark extraction and alignment.

GANs are based on the combined action of two different agents, a generator and discriminator. The first one, the generator $G$, produces samples from a distribution which should be ideally indistinguishable from the training distribution. The discriminator $D$ is trained to determine if the incoming samples are drawn from the real set of training images or are fake samples generated by $G$. The training process gradually improves the samples produced by the generator $G$, which learns the most effective way to fool the discriminator.

The first approach for GAN-based face morphing generation, called MorGAN, was proposed in [23]. The network architecture is inspired by the work [24] where the Bidirectional Generative Adversarial Network (BiGAN) is introduced. In addition to the generator $G$ from the standard GAN framework BiGAN includes an encoder $E$ which maps data $\mathbf{x}$ to latent representations $\mathbf{z}$. The BiGAN discriminator $D$ discriminates not only in data space ($\mathbf{x}$ versus $G(\mathbf{z})$), but jointly in data and latent space (tuples ($\mathbf{x}$, $E(\mathbf{x})$) versus ($G(\mathbf{z}), \mathbf{z}$), where the latent component is either an encoder output $E(\mathbf{x})$ or a generator input $\mathbf{z}$. The idea is that the BiGAN encoder $E$ should learn to invert the generator $G$, even if the two modules cannot directly “communicate”. This architecture is adapted by the authors of [23] to the problem of face morph generation. The generator is split into two components, complementary inverse to each other, and the discriminator is trained to distinguish between joint pairs (samples from the encoder and samples from the decoder). The main limitation of the MorGAN approach is the limited size of the generated morphed images, 64 × 64 pixels, which is quite far from the resolution needed to fulfill the ISO/ICAO quality standards (minimum inter-eye distance of 90 pixels) and to successfully fool commercial FRSs. This last aspect is confirmed in [25] where the authors evaluate the vulnerability of state-of-the-art face recognition systems to MorGAN morphed images.

The same work [25] focuses on the generation of high-quality morphed images, with the aim of overcoming the key limitation of the MorGAN approach. In particular, the authors propose the adoption of StyleGAN [26] for morphing generation. Given the latent code ${L}_{1}$ of the face, StyleGAN maps the inputs to an intermediate latent space through the mapping network. The mapping layer consists of 8 fully connected layers serially connected. The approach synthesizes a data-subject-specific morphed face by forcing a strategy to embed the face image into the latent space. The subject-specific embedded latent space passes through the synthesis network consisting of 18 layers, thus obtaining a representation in 18 latent spaces (dimension 512) which is further concatenated. The loss function driving the embedding measures the similarity between the input image and the reconstructed image. The images of the two contributing subjects are both processed according to the procedure described above and a weighted average (to recall the idea of morphing factor) of the corresponding latent codes is computed to obtain the morphed image latent code, which is finally passed through the synthesis network to generate the high-resolution morphed image (1024 × 1024).

The morphing approach based on StyleGAN has been successively improved by the same authors in [27] where the MIPGAN (Morphing through Identity Prior driven GAN) approach is presented. The introduction of a loss function aimed at preserving the identity of the generated morphed image, through enforced identity priors represents the main element of novelty. Given the images of the two contributing subjects, the corresponding latent vectors are first computed using a latent prediction network. The morphed image latent vector is again obtained by a weighted average of the two input vectors and is finally passed through the synthesis network to obtain a morphed image of size 1024 × 1024. The last step consists of a final optimization stage based on the identity preserving loss function. The authors propose two different versions of MIPGAN, obtained using two versions of StyleGAN, [26] and [28], respectively. The MIPGAN approach achieves interesting results in terms of efficacy of the attack, as shown by the results reported in the next section.

Besides image resolution, another important aspect to consider is the similarity of the morphed image to the two contributing subjects. From this point of view, the landmark-based approaches certainly allow to better preserve the identity of the two contributing subjects and to control quite easily (via the morphing factor) the similarity of the resulting morphed images to one of the two individuals. GAN-based approaches seem to have less control on this aspect, even when an identity preserving loss function is adopted. Even if the morphed images generated using GAN-based approaches can fool automatic FRSs, we believe that further work is needed to make the generated images able to fool the human expert.

3 Vulnerability of Face Recognition Systems to Face Morphing

In this section, we describe the experiments carried out using three commercial face recognition SDKs (referred to as $SD{K}_{1}$, $SD{K}_{2},$ and $SD{K}_{3}$) which provided top performance in the “Face Recognition Vendor Test (FRVT)—1:1 Verification” [29, 30]; the names of the SDKs cannot be disclosed, and the results will be therefore presented in anonymous form.

In order to simulate a realistic attack to an ABC system, the operational threshold of the face recognition software have been fixed according to the Frontex guidelines [31]. In particular, for ABC systems operating in verification mode, the face recognition algorithm has to ensure a False Acceptance Rate (FAR) equal to 0.1% and a False Rejection Rate (FRR) lower than 5%. During the experimentation, for each SDK, the security threshold indicated in the corresponding documentation to achieve FAR = 0.1% has been used. Since we focus on morphing attacks, the performance is evaluated in terms of Mated Morph Presentation Match Rate (MMPMR) [32] with the aim to quantify the percentage of morphing attacks able to fool the SDKs. To this purpose the MMPMR for all SDKs have been measured by comparing morphed face images against probe images of both subjects involved in the generation of the morphed image.

3.1 Data Sets

The SDKs have been evaluated on five data sets:

BIOLAB-1.0 [5]: it contains 80 morphed images generated using the GIMP software [11, 33] after a manual labeling of the facial reference points and a first manual alignment based on eyes superimposition; a final manual retouch was carried out to remove visible artifacts. For each morphed image, it contains two probe images, one for each parent subject.
MorphDB [34]: the aim of this dataset is to reproduce the typical scenario where the ID photo is provided by the citizens printed on photographic paper and then scanned by the officer during the issuing process. It contains 100 morphed images generated using the Sqirlz Morph 2.1 software [35] with facial landmarks automatically detected and a morphing factor in the range [0.3;0.4]. After the generation, the morphed images have been manually retouched to remove visible artifacts introduced by the morphing procedure. The P&S images have been created by printing the digital version on high quality photographic paper by a professional photographer and scanned at 300 DPI. For each morphed image, it includes a variable number of probe images of the two parent subjects.
SOTAMD [36]: it contains 5748 high quality images for benchmarking under realistic conditions. The dataset consists of facial images from subjects of various ethnicities, age-groups, and both genders. After a careful subject pre-selection, the morphed images have been created using seven different morphing algorithms and applying manual post-processing to remove visible artifacts. Moreover, the images have been also printed and scanned. For each morphed image, it includes 10 probe images, for each contributing subject, captured under a simulated ABC gate operational scenario presenting more variations with respect to other datasets.
AMSL [37]: a dataset containing images from the Face Research Lab London Set [38]. 2175 morphed face images were generated using the morphing approach described in [39]. All images were modified in the way to comply with the requirements of the ICAO portrait quality standard for eMRTD [40] and to fit on a chip of an eMRTD including cropping, down-scaling, and JPEG compression. For each morphed image, it contains two probe images, one for each subject.
B&W [20]: a dataset containing morphed images automatically generated by separately varying the blending and the warping factors ${\alpha}_{B}$ and ${\alpha}_{W}$ to evaluate their importance in fooling face recognition systems. It contains 560 morphed images for each combination of ${\alpha}_{B}$ and ${\alpha}_{W}$ and for each of them, a probe image for each contributing subject.

BIOLAB-1.0, MorphDB, and SOTAMD datasets are available for testing on the Bologna Online Evaluation Platform (BOEP) [41] hosted in the FVC-onGoing framework [42, 43].

3.2 Results

Table 6.1 reports the single MMPMR of the three SDKs and their average on all datasets (except for B&W data set whose results are reported below).

Table 6.1 MMPMR of the three SDKs on different data sets

Full size table

For all SDKs, the most difficult datasets seem to be both BIOLAB-1.0 and AMSL with an average MMPMR of 95.0 and 92.7%, respectively. This is probably due to a combination of different elements:

morphing factor—both BIOLAB-1.0 and AMSL datasets contain symmetric morphed images (i.e., morphing factor equal to 0.5) while MorphDB dataset contains asymmetric morphed images generated with a morphing factor in the range [0.3;0.4] and SOTAMD dataset contains morphed images generated with two different morphing factors (0.3 and 0.5);
facial landmarks manually labeled—to generate BIOLAB-1.0 morphed images, the facial landmarks have been manually selected, while automatically detected facial landmarks have been used to generate MorphDB and SOTAMD morphed images;
forehead landmarks—BIOLAB-1.0 morphed images have been generated using also landmarks manually labeled on the hairline (see Fig. 11 in [5]) which have not been used to generate the other databases;
facial outer region substitution—as shown in Fig. 6.3, the intermediate morphed frames could present double exposure effects outside the facial region (e.g., background, hair, shoulders, and body). To make morphed images more realistic and therefore more difficult to be detected, usually a retouching is applied. MorphDB and SOTAMD morphed images have been automatically retouched by replacing the pixels outside the face region with those of the accomplice image, while BIOLAB-1.0 morphed images have been manually retouched.
probe images—to simulate an ABC gate operational scenario, in the SOTAMD database, the morphed images are compared against face images acquired using ABC gates. Such images present different lighting conditions, and some of them have been acquired as grayscale images. Such differences could decrease the chance to fool the SDKs.

As the SOTAMD dataset [36] presents meta-data regarding the characteristics of the parent subjects used for morphing (e.g., gender) and of the morphing generation pipeline (e.g., morphing approach), the MMPMR of the three SDKs and their average on different subsets are reported in Tables 6.2 and 6.3 (digital and P&S versions, respectively).

Table 6.2 MMPMR of the three SDKs on digital version of SOTAMD subsets

Full size table

Table 6.3 MMPMR of the three SDKs on P&S version of SOTAMD subsets

Full size table

Table 6.4 MMPMR of ${SDK}_{1}$ on B&W data set for each combination of ${\alpha}_{B}$ and ${\alpha}_{W}$. Different values are represented by different blue levels (the darker, the greater)

Full size table

Some interesting results can be observed, in relation to the main attributes characterizing the database images:

gender—the chance of fooling SDKs for female subjects looks on average higher than for male subjects (about 10% better on both digital and P&S versions).
post-processing—as expected manual retouching increases the probability of fooling the SDKs with respect to automatic post-processing, even if the difference is not so evident (about 5% better on both digital and P&S versions).
morphing algorithm—SDKs exhibit different behaviors as the morphing algorithm changes; algorithms C02 and C01 present a higher change to fool SDKs with respect to algorithms C06, C07, and C03. Please refer to [36] for a detailed description of the different morphing algorithms.
morphing factor—as expected symmetric morphing (morphing factor equals to 0.5) fools the SDKs more easily (more than 40% better on both digital and P&S versions) than asymmetric morphing (morphing factor equals to 0.3).
morph quality—as expected high quality morphs are more difficult to detect than low quality morphs (about 45% better on both digital and P&S versions).
image compression—the uncompressed images present a higher probability to fool SDKs with respect to the compressed version (about 15% better).

Tables 6.4, 6.5, 6.6, and 6.7 report the MMPMR of the three SDKs and their average on B&W data set. For all SDKs blending and warping present a very different impact on the probability of success of the attack, while geometric modifications obtained increasing the warping factor ${\alpha}_{W}$ do not heavily affect recognition accuracy (see ranges ${\alpha}_{B}\in \left[0;0.1\right]$, ${\alpha}_{W}\in \left[0.4;0.5\right]$), an opposite behavior is observed for the blending factor ${\alpha}_{B}$ (${\alpha}_{B}\in \left[0.4;0.5\right]$, ${\alpha}_{W}\in \left[0;0.1\right]$). Hence, for a criminal it would be much more convenient to create a morphed image with ${\alpha}_{B}=0.5$ and ${\alpha}_{W}\in \left[0;0.2\right]$ instead of using a balanced morphing factor in the range [0.2; 0.3] as stated in [34, 44]. This choice would increase the chances of successful attack at the border (from about 16–44 to 78–88%, on the average) keeping unaltered the chances of fooling the human officer during the document issuing process. In fact, a visual inspection of several generated morphs reveals that the difference between the two images is imperceptible, in particular when look-alike subjects are involved (see the example of Fig. 6.10). Moreover, we should always consider that human recognition capabilities are surprisingly error-prone in front of unfamiliar faces [45] and small appearance variations would probably be neglected. Finally, it is important to note that the MMPMR values could be even higher because, in a real scenario, a criminal would try to produce high quality morphed images, discarding the morphs with a low probability of success and applying manual retouching to remove unrealistic artifacts.

Table 6.5 MMPMR of ${SDK}_{2}$ on B&W data set for each combination of ${\alpha}_{B}$ and ${\alpha}_{W}$. Different values are represented by different blue levels (the darker, the greater)

Full size table

Table 6.6 MMPMR of ${SDK}_{3}$ on B&W data set for each combination of ${\alpha}_{B}$ and ${\alpha}_{W}$. Different values are represented by different blue levels (the darker, the greater)

Full size table

Table 6.7 Average MMPMR of the three SDKs on B&W data set for each combination of ${\alpha}_{B}$ and ${\alpha}_{W}$. Different values are represented by different blue levels (the darker, the greater). The green region represents the most promising combinations of blending and warping factors to successfully perpetrate the attack

Full size table

3.3 Deep Learning-Based Morphing Results

Currently no databases of morphed images generated by GANs are publicly available; therefore, the vulnerability assessment we did only focus on images generated by landmark-based approaches. However, as a reference, we think it is worth reporting the preliminary results reported by the authors of the GAN-based approaches in their paper [27].

Table 6.8 compares the MMPMR of a state-of-the-art FRS on morphed images generated by (i) GANs and (ii) a landmark-based morphing method [46]. While StyleGAN generates morphed images with a low chance to fool the FRS (about 60%), the MIPGAN approach achieves interesting results in terms of efficacy of the attack (about 90%) even if lower than the facial landmark method (about 98%).

Table 6.8 MMPMR of a face recognition system on morphed images generated by GANs as reported in [27]

Full size table

On the other hand, even if MIPGAN seems able to fool a FRS, some further efforts are necessary to improve the similarity with the contributing subjects thus increasing the effectiveness of the attack against human experts.

4 Conclusions

The general trust on automatic face recognition systems has recently been undermined by several possible kinds of attack, among which the face morphing is one of the most insidious and difficult to address. Dealing with face morphing is particularly complex in the context of ePassports; FRS are requested to work at fixed operational thresholds that guarantee a good trade-off between security and convenience in the use of ABC gates. Unfortunately, at these thresholds, it is very hard for FRSs to reject morphed images, thus making them quite vulnerable to the face morphing attack. This is particularly true when the morphed facial image is accurately prepared, with a manual intervention for facial landmark selection and artifact removal. Studies in the literature show that humans are easily fooled by accurate morphed images. Moreover, the high success rate measured in this chapter for landmark-based morphing techniques and the preliminary results reported in research papers for the GAN-based approaches confirm that face morphing is a real security threat. Recently, several research groups working on face recognition devoted significant efforts in designing face morphing attack detection techniques but, as discussed in a later chapter, further improvements are still needed to achieve good generalization capabilities.

References

Beier T (1992) Feature-based image metamorphosis. Comput Graph 26:35–42
Article Google Scholar
Steyvers M (1999) Morphing techniques for manipulating face images. Behav Res Meth Instrum Comput 359–369
Google Scholar
Jäger T, Seiler KH, Mecklinger A (2005) Picture database of morphed faces (MoFa): technical report. Experimental neuropsychology unit. Department of Psychology, Saarland University, Saarbrücken, Germany
Google Scholar
Ferrara M, Franco A, Maltoni D (2014) The magic passport. In: International joint conference on biometrics, clearwater (FL), pp 1–7
Google Scholar
Ferrara M, Franco A, Maltoni D (2016) On the effects of image alterations on face recognition accuracy. In: Face recognition across the imaging spectrum, pp 195–222
Google Scholar
Robertson DJ et al. (2018) Detecting morphed passport photos: a training and individual differences approach. Cogn Res Princ Implic 3(27)
Google Scholar
Robertson DJ (2020) Morphed passport photo detection by human observers. In: International conference on biometrics for borders. Warsaw
Google Scholar
Spiegel (2021) Aktivisten schmuggeln Fotomontage in Reisepass. https://www.spiegel.de/netzwelt/netzpolitik/biometrie-im-reisepass-peng-kollektiv-schmuggelt-fotomontage-in-ausweis-a-1229418.html
Monroy M (2021) Laws against morphing. https://digit.site36.net/2020/01/10/laws-against-morphing/
The Peng! Collective (2021) Mask.ID Part II—We send our passports to Libya. https://pen.gg/campaign/mask-id-2/
GIMP (2021) GIMP animation package. https://www.gimp.org/news/2009/06/05/gimp-animation-package-260-released/
Luxand (2021) FaceMorpher. http://www.facemorpher.com/
Abrosoft (2021) FantaMorph. https://www.fantamorph.com/
Scherhag U, Rathgeb C, Merkle J, Breithaupt R, Busch C (2019) Face recognition systems under morphing attacks: a survey. IEEE Access, pp 23012–23026
Google Scholar
(2021) Dlib C++ Library. http://dlib.net/
Wikipedia (2021) Image warping. http://en.wikipedia.org/wiki/Image_warping
Wolberg G (1994) Digital image warping, 1st edn. IEEE Computer Society Press, Los Alamitos, CA, USA
Google Scholar
Rogers DF, Adams JA (1989) Mathematical elements for computer graphics, 2nd ed. McGraw-Hill Higher Education
Google Scholar
Delaunay BN (1934) Sur la sphère vide. Bulletin de l’Académie des sciences de l’URSS, Classe des sciences mathématiques et naturelles 6:793–800
Google Scholar
Ferrara M, Franco A, Maltoni D (2019) Decoupling texture blending and shape warping in face morphing. In: International conference of the biometrics special interest group (BIOSIG), Darmstadt, pp 1–5
Google Scholar
Lai M, Oruç I, Barton JS (2013) The role of skin texture and facial shape in representations of age and identity. Cortex, pp 252–265
Google Scholar
Gonzalez RC, Woods RE (2017) Digital image processing, 4th ed. Pearson
Google Scholar
Damer N, Saladi AM, Braun A, Kuijper A (2018) Morgan: recognition vulnerability and attack detectability of face morphing attacks created by generative adversarial network. In: Internationa conference on biometrics theory, applications and systems, pp 1–10
Google Scholar
Donahue J, Krähenbühl P, Darrell T (2017) Adversarial feature learning. https://arxiv.org/abs/1605.09782
Venkatesh S et al. (2020) Can gan generated morphs threaten face recognition systems equally as landmark based morphs?—vulnerability and detection. In: 8th International workshop on biometrics and forensics (IWBF). Porto Portugal, pp 1–6
Google Scholar
Karras T, Laine S, Aila T (2019) A style-based generator architecture for generative adversarial networks. In: IEEE conference on computer vision and pattern recognition, pp 4401–4410
Google Scholar
Zhang H et al. (2020) MIPGAN—generating robust and high quality morph attacks using GAN. https://arxiv.org/abs/2009.01729
Karras T et al. (2020) Analysing and improving the image quality of StileGAN. In: IEEE/CVF conference on computer vision and pattern recognition, pp 8110–8119
Google Scholar
NIST (2021) Face recognition vendor test (FRVT) 1:1 verification. https://pages.nist.gov/frvt/html/frvt11.html
Grother P, Ngan M, Hanaoka K (2021) Ongoing face recognition vendor test (FRVT)—Part 1: verification. NIST, Gaithersburg, MD
Google Scholar
FRONTEX—R&D Unit (2015) Best practice technical guidelines for automated border control (ABC) systems. FRONTEX, Warsaw, Poland, ISBN: 978–92–95205–50–5. https://doi.org/10.2819/39041
Scherhag U et al. (2017) Biometric systems under morphing attacks: assessment of morphing techniques and vulnerability reporting. In: International conference of the biometrics special interest group (BIOSIG). Darmstadt, Germany
Google Scholar
GIMP (2021) GNU image manipulation program web site. http://www.gimp.org/
Ferrara M, Franco A, Maltoni D (2018) Face demorphing. IEEE Trans Inf Forensics Secur 13(4):1008–1017
Article Google Scholar
Xiberpix (2021) Sqirlz Morph 2.1 web site. http://www.xiberpix.net/SqirlzMorph.html
Raja K et al. (2020) Morphing attack detection - database, evaluation platform and benchmarking. In: IEEE transactions on information forensics and security (TIFS)
Google Scholar
(2021) AMSL face morph image data set. https://omen.cs.uni-magdeburg.de/disclaimer/index.php
DeBruine L, Jones B (2021) Face research lab London set. https://doi.org/10.6084/m9.figshare.5047666.v3
Neubert T, Makrushin A, Hildebrandt M, Kraetzer C, Dittmann J (2018) Extended stirtrace benchmarking of biometric and forensic qualities of morphed face images. IET Biometrics 7(4):325–332
Article Google Scholar
Wolf A (2016) ICAO: portrait quality (reference facial images for MRTD), version 0.7
Google Scholar
BioLab (2021) Bologna online evaluation platform web site. https://biolab.csr.unibo.it/fvcongoing/UI/Form/BOEP.aspx
Dorizzi B et al. (2009) Fingerprint and online signature verification competitions at ICB 2009. In: Proceedings 3rd IAPR/IEEE international conference on biometrics (ICB09). Alghero
Google Scholar
BioLab (2021) FVC-ongoing web site. http://biolab.csr.unibo.it/fvcongoing
Robertson DJ, Kramer RSS, Burton AM (2017) Fraudulent ID using face morphs: experiments on human and automatic recognition. PLoS ONE 12(3)
Google Scholar
Young AW, Burton AM (2017) Recognizing faces. Curr Dir Psychol Sci 26(3):212–217
Article Google Scholar
Raghavendra R, Raja KB, Venkatesh S, Busch C (2017) Face morphing versus face averaging: vulnerability and detection. In: IEEE international joint conference on biometrics (IJCB). Denver, CO, USA
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, University of Bologna, via dell’Università, 50, Cesena, Italy
Matteo Ferrara & Annalisa Franco

Authors

Matteo Ferrara
View author publications
You can also search for this author in PubMed Google Scholar
Annalisa Franco
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Matteo Ferrara .

Editor information

Editors and Affiliations

Department of Computer Science, Hochschule Darmstadt, Darmstadt, Germany
Christian Rathgeb
School of Engineering, Universidad Autonoma de Madrid, Madrid, Spain
Ruben Tolosana
School of Engineering, Universidad Autonoma de Madrid, Madrid, Spain
Ruben Vera-Rodriguez
Department of Information Security and Communication Technology, Norwegian University of Science and Technology, Gjøvik, Norway
Christoph Busch

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ferrara, M., Franco, A. (2022). Morph Creation and Vulnerability of Face Recognition Systems to Morphing. In: Rathgeb, C., Tolosana, R., Vera-Rodriguez, R., Busch, C. (eds) Handbook of Digital Face Manipulation and Detection. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-030-87664-7_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-87664-7_6
Published: 31 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87663-0
Online ISBN: 978-3-030-87664-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Morph Creation and Vulnerability of Face Recognition Systems to Morphing

Abstract

Similar content being viewed by others

Detection of Face Morphing Attacks by Deep Learning

A Novel Framework for Detection of Morphed Images Using Deep Learning Techniques

Humans Versus Deep Learning: Detection of Face Morphing as a Peril

1 Introduction