Keywords

1 Introduction

Face recognition was the process of determining the size, location, position and orientation of human face from the input image, the research of which can be traced back to the 1960s. And it has become increasingly mature after decades of tortuous development [1, 2]. Face recognition was an important biometric technique for personal identification, has been widely used in many departments as government, public security, banking, securities, taxation and army. Most previous research focused more on technology in accuracy and speed [3,4,5,6], such as Galton proposed the use of the key points and their distances of the human face to represent the feature vector of the human face.

However, many researchers revealed that objective stimuli and subjective experience tended to be always inconsistent [7, 8]. Less studies assessed the user subjective evaluation in the process of face recognition in order to achieve not only the goal of a fast and accurate identification but only a good user experience. On the basis of this problem, our research paid more attention on the following questions:

  1. 1.

    What was the objective performance of our and competing product in terms of pass rate and recognition length after contrasting test?

  2. 2.

    What was the user’s subjective evaluation of our and competing product? And was the subjective evaluation consistent with objective performance? If there is inconsistency, what was it and we needed to find out the reason in order to optimize UE/UI design.

  3. 3.

    In the face recognition, what was the most appropriate subjective duration from the user perspective?

The current purpose might provide a direction of the further optimization direction on the users’ subjective experience of face recognition through a comparison test and a rigorous experiment.

2 Research Method

2.1 Procedure

The whole study was divided into two parts. The first part was similar to usability test to obtain the objective performance of our and competing product. The second part was related to tolerance experiment, which was widely used in psychological experiment and aimed to obtain the intolerable length of time [9, 10].

Specifically, in the usability test we required our participants to conduct face recognition in our and competing products in sequential order. To balance order effect, half of the participants did our product first and another half did competing product first. Researchers recorded the pass rate and recognition length. And we also collected feedback of participants on the subjective experience during use.

In the tolerance experiment, participants were required to give an assessment on a five point Likert scale, namely 1-very unacceptable, 2-a little unacceptable, 3-moderate, 4-a little acceptable, 5-very acceptable, to a given duration of face recognition and then we calculated the duration interval of extremely well (4 points or more), moderately well (3–4 points), just passable (2–3 points) and completely intolerable (2 points or less). In order to facilitate targeted optimization, we divided the whole face recognition into three stages. The first stage was face detection, from the action of clicking the start button to the appearance on the page saying “face detected”. The second stage was blink detection, from the appearance saying “please blink” to the appearance saying “blink detected”. The third stage was taking a photo, from the appearance saying “please be ready to take photos” to the appearance saying “successfully completed” (see Fig. 1).

Fig. 1.
figure 1

The three stages of face recognition

We invited programmers to develop different demos in terms of duration for three stages respectively. As the first stage for example, it had 24 demos from 0.5 s to 15 s. The participant was given a demo at a time, the duration of which was programmed in advance, unrelated with actual behavior of the participant without his knowing. And he was required to evaluate his subjective acceptability for this given demo in a Likert scale. Every participant should evaluate all demos in an ascending or descending order. Similarly, half of them followed the former and the other half adhered to the latter to avoid sequential effect.

2.2 Material

Due to no need of develop demos for the first usability test, we adopted online version of our and competitive product to obtain comparable result in a real-world environment. However, the second part of tolerance experiment adopted demos which were scheduled time in advance. Before this, it was unknown what the appropriate quantity of demos and duration gap between the two adjacent demos were, which was determined in a pre-experiment. It was found out that 80% of participants cannot be tolerated if the duration exceeded 11 s, 9 s and 5 s for the three stages, namely face detection, blink detection and picture taken. As a precautionary measure, the duration for the three stages increased to 15 s, 11 s, 7 s. Duration gap between the two adjacent demos must fit in with the principle of neither so long to miss key turning point nor so short to unable to perceive discrepancy. Combining the previous studies which found out difference threshold distributed from 0.65 s to 1 s, we choose 0.5 s for the duration gap between the two adjacent demos.

Finally, the number of demos for first stage, face detection, from 0.5 s to 15 s, was 24. The number of demos for second stage, blink detection, from 0.5 s to 11 s, was 22. The number of demos for third stage, picture taken, from 0.5 s to 7 s, was 14. All demos in the same stage were equal excepted for duration (see Table 1).

Table 1. All demos for three stages.

2.3 Participants

We invited 10 participants for the pre-experiments and 28 for the formal experiment and usability test, whose occupations involved student, self-employed person and employee. Their age distributed from 18 to 25, with the mean value of 21. To avoid an impact of mobile phone models, we selected 16 IOS users and 12 android users.

2.4 Results

Objective Performance.

The accurate recognition rate of both our and competing products achieved to 90%. Our product had the advantage over the other in the whole objectively time consuming (see Table 2).

Table 2. The whole objective time consuming of our and competing products

However, the result that the subjective evaluation from users was worse than competing product was entirely unexpected. Further analysis revealed the most two important reasons were unsatisfactory interactive details and subjective perceived duration.

The former consisted of the following four details (see Table 3).

  • ① The prompt frame of face detection included too many outlines of face and shoulder for user to manipulate in the given frame.

  • ② Lack of clear and vivid hint, users cannot go forward according to our design.

  • ③ When confronted the failure of face detection, users were at a loss what to do due to the inappropriate time and content of the prompt.

  • ④ Our product was in short of interest, exquisite feeling and specialty.

Table 3. Unsatisfactory interactive details of our product

The latter illustrated the inconsistence between subjective comfortable duration and objective duration. So what was the most subjective comfortable time of face cognition?

2.5 The Standard of Comfortable Duration

Every participant evaluated all demos of different duration and so we can calculate the mean value of all participants at every duration. Furthermore, the percentage of the participants who felt extremely well, moderately well, just passable at every duration can be calculated respectively.

In consideration of the two results, we can come to an agreement of a standard duration.

The result revealed that users felt extremely well from 1 s to 2.5 s (see Fig. 2), in the interval of which 70% of participants perceived comfortable in the face detection stage (see Fig. 3).

Fig. 2.
figure 2

The interval of users felt extremely well in the face detection stage 1-very unacceptable, 2-a little unacceptable, 3-moderate, 4-a little acceptable, 5-very acceptable

Fig. 3.
figure 3

The percentage of users who felt comfortable, acceptable and just passable in the face detection stage

Similarly, users felt extremely well from 0.8 s to 1 s (see Fig. 4), in the interval of which 80% of participants perceived comfortable in the blink detection stage (see Fig. 5).

Fig. 4.
figure 4

The interval of users felt extremely well in the blink detection stage

Fig. 5.
figure 5

The percentage of users who felt comfortable, acceptable and just passable in the blink detection stage

And in the picture-taking stage the result was within 0.7 s (see Fig. 6). and 80% (see Fig. 7).

Fig. 6.
figure 6

The interval of users felt extremely well in the picture-taking stage

Fig. 7.
figure 7

The percentage of users who felt comfortable, acceptable and just passable in the picture-taking stage

3 Discussion

In the interaction design of user interface, designers needed to guide users to go forward in line with your expectation to achieve product goal through the way of making the focal key points stand out. However, there existed some problems in the initial design.

We expected that users interacted in the sequence of hits-frame-other, however, people focused all their attention on the frame (see Fig. 8).

Fig. 8.
figure 8

The expected sequence vs. the real sequence

So how to make the focal key points stand out was an important question. As our usual practice went, contents, color, size and distance might be in use (see Fig. 9). However, face recognition, as a novel Interactive mode, was difficult for users to understand. So we adopted more visual motion and sound in a subsequent improvement (see Fig. 10).

Fig. 9.
figure 9

The product of before vs. after (Color figure online)

Fig. 10.
figure 10

Visual motion and sound in the face recognition

And we found out that in the face and blink stage, the subjective experience of users and the objective duration were inconsistent. Our data (see Fig. 10) revealed that current status of our product cannot meet users’ demand in terms of degree of comfort. So we adjusted the time length of visual dynamic effect to increase the duration of detection by 1 s to push forward more participants’ subjective experience into the comfortable interval (see Figs. 11 and 12).

Fig. 11.
figure 11

In the face detection and blink detection before optimization

Fig. 12.
figure 12

Adjusting the time length of visual dynamic effect to increase the duration of detection by 1 s

In the third stage, the longer the duration was, the better users perceived. So the performance of the recognition should be promoted in respect of technology.

As a whole, the present study tried to build a standard duration of face recognition in terms of users’ subjective experience instead of objective technology, using usability test and tolerance experiment. We hope our result can give a reference to future optimization of related product.

However, we conducted the experiment in a normal environment, without considering circumstance differences, such as did the credit and financing circumstance do some different effect on the perceived duration? So we will conceive a more well-designed experiment to explore the subjective duration in more situations and more group division.