How Many Labeled License Plates Are Needed?

Wu, Changhao; Xu, Shugong; Song, Guocong; Zhang, Shunqing

doi:10.1007/978-3-030-03341-5_28

How Many Labeled License Plates Are Needed?

Changhao Wu²⁰,
Shugong Xu²⁰,
Guocong Song²¹ &
…
Shunqing Zhang²⁰

Conference paper
First Online: 02 November 2018

2317 Accesses
21 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11259))

Abstract

Training a good deep learning model often requires a lot of annotated data. As a large amount of labeled data is typically difficult to collect and even more difficult to annotate, data augmentation and data generation are widely used in the process of training deep neural networks. However, there is no clear common understanding on how much labeled data is needed to get satisfactory performance. In this paper, we try to address such a question using vehicle license plate character recognition as an example application. We apply computer graphic scripts and Generative Adversarial Networks to generate and augment a large number of annotated, synthesized license plate images with realistic colors, fonts, and character composition from a small number of real, manually labeled license plate images. Generated and augmented data are mixed and used as training data for the license plate recognition network modified from DenseNet. The experimental results show that the model trained from the generated mixed training data has good generalization ability, and the proposed approach achieves a new state-of-the-art accuracy on Dataset-1 and AOLP, even with a very limited number of original real license plates. In addition, the accuracy improvement caused by data generation becomes more significant when the number of labeled images is reduced. Data augmentation also plays a more significant role when the number of labeled images is increased.

You have full access to this open access chapter, Download conference paper PDF

1 Introduction

License plate recognition is one of the most important components of modern intelligent transportation systems. It has attracted the attention of many researchers. However, most existing algorithms [3, 13, 14, 18] can only work normally under certain conditions. For example, some recognition systems require sophisticated hardware to shoot high-quality images, while other systems require the vehicle to slowly pass through a fixed access opening or even stop. Accurately detecting license plates and recognizing characters in an open environment is a challenging task. The main difficulties are different license plate fonts and colors, character distortion caused by the image capture process and non-uniform illumination, and low-quality images caused by occlusion or motion blur.

In this paper, we propose a license plate recognition system, in which we cope with challenge such as, low light, low resolution, motion blur, and other harsh conditions. Figure 1 shows the license plates which can be correctly recognized by our proposed method. From top to bottom are the license plate images affected by the shooting angle, uneven illumination, low resolution, detection error and motion blur.

In general, supervised learning requires a large amount of labeled data in order to achieve good results. However, real data is not easy to obtain, the acquisition process is slow, and the data needs to be processed and annotated before it can be used for training. To achieve a higher accuracy of the annotation, manual inspection is also required.

However, the acquisition of a large amount of real data and manual annotations is very expensive. Therefore, data generation is very important for the training of license plate recognition network. We believe that the information contained in a small number of real license plates is sufficient to recognize most of the existing license plate images. However, there is no clear common understanding on how much labeled data is needed to get satisfactory performance. In this paper, we try to address such a question in vehicle license plate character recognition.

The main contributions of this paper can be summarized as the following three points:

1.
We propose various methods of data generation and data augmentation. As long as we have a few labeled license plate images, a large amount of generated data can be created. We can achieve and even exceed the recognition accuracy and results of systems trained only on real images.
2.
We compare the performance of various data generation and data augmentation methods to find that both data generation methods and data augmentation methods can significantly improve license plate recognition accuracy. Data augmentation plays a larger role in accuracy improvement when there are many labeled license plates but when the number of labelled license plates is small, data generation more significantly increases accuracy.
3.
We apply a network that is modified from DenseNet to license plate recognition to reduce network parameters and inference time and improve accuracy.

The rest of paper is arranged as follows. In Sect. 2 we review the related works briefly. In Sect. 3 we describe the details of networks used in our approach. Experimental results are provided in Sect. 4, and conclusions are drawn in Sect. 5.

2 Related Work

The section introduces previous work on license plate recognition and GANs.

2.1 License Plate Recognition

Existing license plate recognition systems are either text segmentation-based [3, 6], or non-segmentation-based [13]. Methods that depend on segmentation first preprocess the license plate image and then segment individual characters through image processing. After this, each character is classified by a convolutional neural network. This method is very dependent on the accuracy of text segmentation, and the recognition speed is slower. A recognition method that does not require segmentation is proposed by Li et al. [13]. It is composed of a deep convolutional network and a Long Short-Term Memory (LSTM), where the deep CNN is directly applied for feature extraction, and a bidirectional LSTM network is applied for sequence labeling. DenseNet [9] is a highly efficient convolutional neural network. Because of its low parameter number and fast inference time, DenseNet is widely used. Our method is also a segmentation-free approach based on the framework proposed by [9], where DenseNet is applied for feature extraction.

Data generation is used in license plate recognition to improve the accuracy of recognition. The labeled license plates generated by CycleGAN as a pre-training data set for the recognition network are used in [18], and the model is fine-tuned with the real license plate data set. This data generation method can significantly improve the recognition accuracy. License plate detection and recognition is combined in [14], and it finally improves the recognition speed and recognition accuracy of the system.

2.2 Generative Adversarial Networks

Generative adversarial networks(GANs) [2, 15] train a generator and discriminator alternatively. The output of the discriminator acts as a generator’s loss function. Zhu et al. [21] propose Cycle-Consistent Adversarial Networks(CycleGAN), which learns the mapping relationship from one domain to another and is mainly used for the style conversion of pictures. Wasserstein GANs(WGAN) [1] are proposed to improve the stability of GAN training. Applying Wasserstein loss to CycleGAN, creating CycleWGAN, also improves its training stability in [18]. Gradient penalties in WGAN(WGAN-GP) [5] are proposed to solve the WGAN generator weight distribution problem.

2.3 Data Generation for Training

A large number of real labeled images are often difficult to obtain, so the role of data generation is very significant [17]. The synthesized images are used to train scene text detection networks [7] and recognition networks [11]. The generated data is shown to improve the performance of person detection [20], font recognition [19], and semantic segmentation [16]. However, when the difference between the generated data and the real data is very large, the performance is poor when applied to a real scene. Therefore, [18] applies CycleGAN to convert the style of license plate generated by the script into a real license plate, which can greatly reduce the gap between the generated image and the real image. We apply data generation and data augmentation methods at the same time, and use the data generated by different methods directly as the training set for recognition network. Therefore we need very little real data.

3 License Plate Recognition Based on Data Generation and Augmentation

In this section, the pipeline of the proposed method is described. We train the GAN model using synthetic images and real images simultaneously. We then use the generated images to train a model modified from DenseNet.

3.1 CycleGAN

CycleGAN [21] learns to translate an image from a source domain X to a target domain Y in the absence of paired examples. Our goal is to train a mapping relationship G between the script license plate domain X and the real license plate domain Y. CycleGAN contains two mapping functions $G:X\rightarrow Y$ and $Y\rightarrow X$, and associated adversarial discriminators D$_Y$, D$_X$.

The techniques proposed in WGAN [1] are applied in CycleGAN, and CycleWGAN is proposed in [18]. WGAN points out why the traditional GAN is difficult to converge and improve during training, which greatly reduces the training difficulty and speeds up the convergence. There are two main improvements: the first one is to remove the log from the loss function, and the second is to perform weight clipping after each iteration to update the weight, and limit the weight to a range (e.g., the limit range is [−0.1, +0.1]. Outside weights are trimmed to −0.1 or +0.1). CycleWGAN solves the problem of training instability and collapse mode, which makes the result more diverse.

We apply the techniques in WGAN-GP [5] to CycleWGAN and propose the CycleWGAN-GP. WGAN-GP also proposes an improvement plan based on WGAN. WGAN reduces the training difficulty of GAN, but it is still difficult to converge in some conditions, and the generated pictures are worse than DCGAN. WGAN-GP applies gradient penalty, and solves the above problem along with the problems of vanishing gradient and exploding gradient during training. It also converges faster than CycleWGAN and produces higher quality pictures.

We apply a CycleGAN equipped with WGAN and WGAN-GP techniques to train the mapping relationship between the fake license plate and the real license plate. First of all, we apply OpenCV scripts to generate synthetic license plates as a source domain X, and then choose real license plates without labels as a target domain Y. Before the training of CycleWGAN-GP, these license plates are randomly cropped and randomly flipped horizontally or vertically.

3.2 Recognition Network Design

DenseNet is a densely connected convolutional neural network. In this network, there is a direct connection between any two layers. The input of each layer of the network is the union of the output of all previous layers, and the feature map learned by this layer is also directly transmitted to all subsequent layers. DenseNet allows the input of l$^{th}$ Layer to directly affect all subsequent layers. Its output is:

$$\begin{aligned} x_l=H_l([x_0,x_1,...,x_{l-1}]) \end{aligned}$$

(1)

where $H_l(\cdot )$ refers to a composite function of three consecutive operations: batch normalization (BN) [10], followed by a rectified linear unit (ReLU), and a 3 $\times $ 3 convolution (Conv). Additionally, since each layer contains the output information of all previous layers, it only needs a few feature maps, so the number of parameter of DenseNet is greatly reduced compared to other models.

Table 1. Construction of recognition network. The output size represents w $\times $ h $\times $ c. Note that each “conv” layer shown in the table corresponds the sequence BN-ReLU-Conv

Full size table

Our network structure is shown in Table 1, which is different from the network structure of [9], because the input license plate image is smaller and is a gray scale image of 136 $\times $ 36, so the network only has 3 dense blocks. The transition layers used in our network consist of a batch normalization layer and an 1 $\times $ 1 convolutional layer followed by a 2 $\times $ 2 average pooling layer. A 1 $\times $ 1 convolution can be introduced as bottleneck layer before each 3 $\times $ 3 convolution to reduce the number of input feature-maps. To improve model compactness, we reduce the number of feature-maps from 192 to 128 at transition layers 2.

The last DenseNet layer is followed by a fully-connected layer with 68 neurons for the 68 classes of label, including 31 Chinese characters, 26 letters, 10 digits and “blank”. We train the networks with stochastic gradient descent (SGD). The labelling loss is derived using Connectionist Temporal Classification (CTC) [4]. The optimization algorithm Adam [12] is then applied, as it converges quickly and does not require a complicated learning rate schedule. Another advantage of using the modified DenseNet network is that it does not require the Long Short-Term Memory(LSTM) networks. The use of LSTM complicates the solution and increases computational cost.

4 Experiment

In this section, we conduct experiments to verify the effectiveness of the proposed methods. Our network is implemented capitalizing keras. The experiments are trained on a NVIDIA Tesla P40 with 24 GB memory and are tested on a NVIDIA GTX745 GPU with 4 GB memory.

4.1 Dataset

The image in the Dataset-1 [18] are captured from a wide variety of real traffic monitoring scenes under various viewpoints, blurring and illumination. Dataset-1 contains a training set of 203,774 plates and a test set of 9,986 plates. The first character of Chinese license plates is a Chinese character which represents the province. While there are 31 abbreviations for all of the provinces, Dataset-1 contains 30 classes of them.

The second data set is the application-oriented license plate (AOLP) [8] benchmark database, which has 2049 images of Taiwan license plates. This database is categorized into three subsets: access control (AC) with 681 samples, traffic law enforcement (LE) with 757 samples, and road patrol (RP) with 611 samples.

4.2 Implementation Details

Network. The recognition network is shown in Table 1. We implement it with Keras. The images are resized to 136 $\times $ 36 and converted to gray scale and then fed to the recognition network. We change the last layer of fully connected layers to 68 neurons according to the 68 classes of characters-33 Chinese characters, 24 letters, 10 digits and “blank”. We train the networks with SGD and learning rate of 0.0001. The labelling loss is derived using CTC. We set the training batch size as 256 and predicting size as 1.

Evaluation Criterion. In this work, we evaluate the model’s performance in terms of recognition accuracy and character recognition accuracy, which is similar to Wang et al. [18]. Recognition accuracy is defined as :

$$\begin{aligned} RA = \frac{Number\ of\ correctly\ recognized\ license\ plates}{Number\ of\ all\ license\ plates} \end{aligned}$$

(2)

Character recognition accuracy is defined as:

$$\begin{aligned} CRA = \frac{Number\ of\ correctly\ recognized\ characters}{Number\ of\ all\ characters} \end{aligned}$$

(3)

GAN Training and Testing. Three data generation methods are shown in Fig. 2. To train CycleWGAN, first we use the OpenCV scripts to generate 1000 blue fake license plates as a source domain X, and then select 1000 real blue license plates from Dataset-1 as a target domain Y. We train the CycleWGAN model with these fake license plates and real license plates. The training real plates do not require character labels. All the images are resized to 143 $\times $ 143, cropped to 128 $\times $ 128 and randomly flipped for data augmentation. We use Adam with $L_{1}=0.9$, $L_{2}=0.999$ and learning rate of 0.0001. We stop training after 300,000 steps and save the model. When testing, first we use the OpenCV scripts to generate 40,000 blue fake license plates, and then we apply the last checkpoint to generate 40,000 license plates. The same goes for CycleWGAN-GP. Finally we get 80,000 blue license plates generated by CycleWGAN and CycleWGAN-GP.

Data Augmentation. The six data augmentation methods are proposed in order to increase the training data of the recognition network. The data was augmented through affine transformation, motion blurring, uneven lighting, stretching, erosion and dilation, downsampling and the application of gaussian noise. Examples of these transformations are shown in Fig. 3. A real license plate image randomly passes through the six data augmentation methods, allowing for the creation of much more training data. First, we select a small number of labeled real license plates from Dataset-1, such as 300. And then using data augmentation methods in Fig. 3, we generate 80,000 augmented license plates with these selected real license plates.

Mixed Training Data. Our mixed training data consists of four parts, including 40,000 license plates generated by OpenCV scripts, 40,000 license plates generated by CycleWGAN, 40,000 license plates generated by CycleWGAN-GP, and 80,000 license plates augmented from a small number of labeled real license plates. All 200,000 training images are generated with license plate character labels. The license plates that need manual labeling are only selected from Dataset-1. After converting the training data to gray scale, 400,000 more training images are obtained by flipping pixels in order to simulate gray images of yellow and green license plates. Then, these images are fed to the recognition network modified from DenseNet.

4.3 Performance Evaluation on Dataset-1

With the above methods, our mixed training data is generated from 300, 700, 3,333, 4,750 and 6,000 real license plates selected from Dataset-1 training set respectively. Our baseline is the [18] using the license plate images generated by the CycleWGAN pre-training recognition network, and then using 9,000, 50,000 and 200,000 real labeled license plate images in a fine-tuning model. From the results in Table 2, it is concluded that when data generation, data augmentation and DenseNet are used, we only need 300 real labeled license plates to achieve the effect of 200,000 real license plates. In the same way, when the number of real license plates reaches 4,750, the final recognition accuracy has reached 99.0%, an increase of 1.4%. When the number of real license plate images exceeds 4,750, license plate recognition accuracy and character recognition accuracy are not improving. We conjecture that 4,750 real images contain enough information to recognize most of the license plates. Thus, by increasing the number of real license plates, the total amount of information after data augmentation will not change, and the recognition accuracy will not increase any further.

Table 2. The accuracy of other methods compared with the proposed method on Dataset-1. Recognition accuracy (%) and character recognition accuracy (%) are listed.

Full size table

4.4 Performance Evaluation on Data Generation

In order to evaluate the effect of the data generated by different methods, we train the models using synthetic data generated by script, CycleGAN, CycleWGAN, and CycleWGAN-GP respectively. The results are shown in Table 3. When we only use the data set generated by script for training, the recognition accuracy on the test set of Dataset-1 is 42.2%. As shown in Fig. 2(a), our synthetic license plates generated by script also contain noise such as low light, low resolution, motion blur. The CycleGAN images achieve a recognition accuracy of 51.2%. Accuracy is not much improved because of the instability and lack of diversity in CycleGAN training. As shown in Fig. 2(b) and (c), the CycleWGAN and CycleWGAN-GP images display more various styles and colors, and part of them can not really distinguish from real images. The CycleWGAN and CycleWGAN-GP images achieve a recognition accuracy of 62.5% and 64.5% respectively. We also compare the impact of data generation and data augmentation on accuracy. When the number of real license plates is 3333, the recognition accuracy of the augmented data on Dataset-1 is 97.9%, far exceeding the recognition accuracy of the generated data.

Table 3. Single data augmentation recognition results with 3333 real images compare with [18]. Recognition accuracy (%), character recognition accuracy (%) are shown. “CRA-C” is the recognition accuracy (%) of the Chinese characters of the first character, and “CRA-NC” is the recognition accuracy (%) of the letters and numbers of the last six characters.

Full size table

Table 4. The effect of the number of plates and data generation on the results. Recognition accuracy (%), character recognition accuracy (%) are shown. RA(A) indicates the recognition accuracy of the data augmentation. CRA(A) indicates the character recognition accuracy of the data augmentation. RA(A+G) indicates the recognition accuracy of the training data composed of data augmentation and data generation. CRA(A+G) indicates the character recognition accuracy of the mixed training data composed of data augmentation and data generation.

Full size table

In order to understand how much number of real license plates improves recognition accuracy, we compare data augmentation results from 60 to 6000 real license plates. The result in Table 4 shows that the greater the number of real license plates, the higher the recognition accuracy obtained. Up to 4750, the highest recognition accuracy of the Dataset-1 is 99.0%. Even if the number of real license plates is increased from 4750, the result is no longer improved.

In order to understand the impact of GAN on recognition accuracy, we did some additional comparative experiments. It can also be seen in the Table 4 that training data composed of data augmentation and data generation can get better results than training data composed of only data augmentation. The conclusion is that the recognition accuracy of augmented data can be improved with data generation. In addition, the fewer real license plates, the more recognition accuracy increases contributed from generated data.

4.5 Performance Evaluation on AOLP

For the application-oriented license plate(AOLP) dataset, the experiments are carried out by using license plates from different sub-datasets for training and test. This data set is divided into three sub-datasets: access control (AC), traffic law enforcement (LE), and road patrol (RP). For example, in Table 5, we use the license plates from the LE and RP sub-datasets to train the DenseNet, and test its performance on the AC sub-dataset. Similarly, AC and RP are used for training and LE for test, and so on. Since there is no AOLP license plate font, only the data augmentation methods are used, without script and GAN generated license plates. In Table 5, through data augmentation and DenseNet, our method achieves the highest recognition accuracy on the AOLP dataset.

Table 5. The accuracy of other methods compared with the proposed method on dataset AOLP. Recognition accuracy (%), character recognition accuracy (%) are shown. AOLP is categorized into three subsets: access control (AC) with 681 samples, traffic law enforcement (LE) with 757 samples, and road patrol (RP) with 611 samples

Full size table

5 Conclusion

In this paper, we have investigated how many real labeled license plates are needed to train the license plate recognition system. We have proposed three data generation methods and six data augmentation methods in order to fully obtain all the information in a small number of images. The experimental results show that the proposed method only requires 300 real labeled license plates to achieve the effect achieved by 200,000 real license plates. The result shows that the greater the number of real license plates, the higher the recognition accuracy obtained. Up to 4750, the highest recognition accuracy of the Dataset-1 is 99.0%. Even if the number of real license plates is increased furthermore, the result is no longer improved. Additionally, training data composed of both augmented and generated data can achieve better results than training data composed of only augmented data. Furthermore, the fewer real license plates, the more recognition accuracy increases contributed from generated data.

References

Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein gan. arXiv preprint arXiv:1701.07875 (2017)
Goodfellow, I.J.: Generative adversarial nets. In: International Conference on Neural Information Processing Systems, pp. 2672–2680 (2014)
Google Scholar
Gou, C., Wang, K., Yao, Y., Li, Z.: Vehicle license plate recognition based on extremal regions and restricted Boltzmann machines. IEEE Trans. Intell. Transp. Syst. 17(4), 1096–1107 (2016)
Article Google Scholar
Graves, A., Fernández, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 369–376. ACM (2006)
Google Scholar
Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.C.: Improved training of wasserstein gans. In: Advances in Neural Information Processing Systems, pp. 5767–5777 (2017)
Google Scholar
Guo, J.M., Liu, Y.F.: License plate localization and character segmentation with feedback self-learning and hybrid binarization techniques. IEEE Trans. Veh. Technol. 57(3), 1417–1424 (2008)
Article Google Scholar
Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2315–2324 (2016)
Google Scholar
Hsu, G.S., Chen, J.C., Chung, Y.Z.: Application-oriented license plate recognition. IEEE Trans. Veh. Technol. 62(2), 552–561 (2013)
Article Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: CVPR, vol. 1, p. 3 (2017)
Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift, pp. 448–456 (2015)
Google Scholar
Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227 (2014)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. Computer Science (2014)
Google Scholar
Li, H., Shen, C.: Reading car license plates using deep convolutional neural networks and LSTMs. arXiv preprint arXiv:1601.05610 (2016)
Li, H., Wang, P., Shen, C.: Toward end-to-end car license plate detection and recognition with deep neural networks. IEEE Trans. Intell. Transp. Syst. 99, 1–11 (2018)
Google Scholar
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015)
Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.: The synthia dataset: a large collection of synthetic images for semantic segmentation of urban scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3234–3243 (2016)
Google Scholar
Shrivastava, A., Pfister, T., Tuzel, O., Susskind, J., Wang, W., Webb, R.: Learning from simulated and unsupervised images through adversarial training. In: CVPR. vol. 2, p. 5 (2017)
Google Scholar
Wang, X., Man, Z., You, M., Shen, C.: Adversarial generation of training examples: applications to moving vehicle license plate recognition. arXiv preprint arXiv:1707.03124 (2017)
Wang, Z., et al.: Deepfont: identify your font from an image. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 451–459. ACM (2015)
Google Scholar
Yu, J., Farin, D., Krüger, C., Schiele, B.: Improving person detection using synthetic training data. In: 2010 17th IEEE International Conference on Image Processing (ICIP), pp. 3477–3480. IEEE (2010)
Google Scholar
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. arXiv preprint (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, 200444, China
Changhao Wu, Shugong Xu & Shunqing Zhang
Playground Global, Palo Alto, USA
Guocong Song

Authors

Changhao Wu
View author publications
You can also search for this author in PubMed Google Scholar
Shugong Xu
View author publications
You can also search for this author in PubMed Google Scholar
Guocong Song
View author publications
You can also search for this author in PubMed Google Scholar
Shunqing Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shugong Xu .

Editor information

Editors and Affiliations

Sun Yat-sen University, Guangzhou, China
Jian-Huang Lai
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Cheng-Lin Liu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xilin Chen
Tsinghua University, Beijing, China
Jie Zhou
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Xi’an Jiaotong University, Xi’an, China
Nanning Zheng
Peking University, Beijing, China
Hongbin Zha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, C., Xu, S., Song, G., Zhang, S. (2018). How Many Labeled License Plates Are Needed?. In: Lai, JH., et al. Pattern Recognition and Computer Vision. PRCV 2018. Lecture Notes in Computer Science(), vol 11259. Springer, Cham. https://doi.org/10.1007/978-3-030-03341-5_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-03341-5_28
Published: 02 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03340-8
Online ISBN: 978-3-030-03341-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Abstract

1 Introduction

2 Related Work

2.1 License Plate Recognition

2.2 Generative Adversarial Networks

2.3 Data Generation for Training

3 License Plate Recognition Based on Data Generation and Augmentation

3.1 CycleGAN

3.2 Recognition Network Design

4 Experiment

4.1 Dataset

4.2 Implementation Details

4.3 Performance Evaluation on Dataset-1

4.4 Performance Evaluation on Data Generation

4.5 Performance Evaluation on AOLP

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation