1 Introduction

Public address system can provide useful information for the audience, which are especially important at the railway station or the subway station. Effective public address can offer help to evacuate people at these places.

At present, most speech recognition researches are to predict and assess effects of hearing devices, to do relevant medical legal identification and evaluate the injury degree with low hearing function. In China, large public places basically broadcast through functional address system. It is very important for the public barrier free design to try to take the old people into account.

2 Experimental Method

2.1 Experiment Design

The experiment is conducted in the anechoic room of ergonomics laboratory. The audio files are the length of each sentence is about 4 s, the fade in and out time of each sentence is about 0.15 s and the interval of word front and back is 0.2 s and 0.3 s respectively.

44 subjects were divided into the old group and the young group. The young group includes 21 subjects with the average age of 25.4, and the old group includes 22 subjects with the average of 62.3.

2.2 Experimental Method

The speech recognition experiment adopts the method of subjective assessment of the public sound system [1]. The experiment is divided into two tests. The first one is the hearing level, and after that is the speech recognition test.

The hearing level test is to measure the hearing threshold of subjects to guarantee that their hearing belongs to the normal range of their age (Table 1).

Table 1. Definition of difficulty degree

3 Experiment Results and Discussion

3.1 Hearing Level

The hearing level test showed that the hearing attenuation of Chinese young group and the old group have no significant frequency differences with the data according to the international standards of ISO 7029 [2], except in the frequency of 1000 Hz and 4000 Hz for the old group.

3.2 Speech Recognition Threshold

Under the same 50 dB environmental noise, both for the young group and for the old group, a single factor analysis was conducted and there was a significant difference between the two kinds of voices for the two age groups. It is showed in Figs. 1 and 2 that under the same noisy environment, the threshold value of male voice is lower than that of female voice in the both young and old groups. It can be seen that threshold value for the old group for the male voice is lower than to the female voice.

Fig. 1.
figure 1figure 1

The speech recognition threshold of the young group under 50 dB environmental noise

Fig. 2.
figure 2figure 2

The speech recognition threshold of the old group under 50 dB environmental noise

Under the same 50 dB environmental noise, the average speech recognition threshold value for the young group to the male voice is 43 dB and the signal noise ratio is −7 dB while the average value to the female voice is 45 dB and the signal noise ratio is −5 dB. The average speech recognition threshold value for the old group to the male voice is 48 dB and the signal noise ratio is −2 dB while the average value to the female voice is 52 dB and the signal noise ratio is 2 dB. So under the same noisy environment, the signal noise ratio should be at least 2 dB to make sure that more than 50 % people can identify the language correctly.

3.3 The Relationship Between Speech Recognition Score and the Signal Noise Ratio

The speech recognition score refers to the percentage of the speech signals heard and understood by subjects. The hearing loss caused by the age influences the speech recognition score, which is a function of the signal noise ration.

From the Fig. 3, it could be found that when the signal noise ratio reaches 4 dB, the accuracy of the young group to words recognition exceeds 90 % and has no obvious change. When the signal noise ratio is under −4 dB, the difficulty of target words is over 80 % When the signal noise ratio reaches 8 dB and F(2, 35) = 2.111, there is no significant difference in statistics for the young group.

Fig. 3.
figure 3figure 3

The speech recognition accuracy and difficulty of the young group

From the Fig. 4, it could be found that when the signal noise ratio of the old group reaches over 8 dB, their accuracy to words recognition is over 90 % and has no significant change. For the old group, when the signal noise ratio reaches 12 dB and F(2, 37) = 2.408, there is no significant difference in statistics.

Fig. 4.
figure 4figure 4

The speech recognition accuracy and difficulty of the old group

When the environmental noise is 50 dB, in order to meet the requirements of speech recognition of the young people and to let the old people gain better hearing effects, in the sound pressure level design of audio equipment the signal noise ratio should reach 12 dB to make sure that most people can hear harmonious and beautiful sound level.