Effective Barcode Hunter via Semantic Segmentation in the Wild

Ni, Feng; Cao, Xixin

doi:10.1007/978-3-030-37731-1_35

Feng Ni¹⁶ &
Xixin Cao¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11961))

Included in the following conference series:

International Conference on Multimedia Modeling

2748 Accesses

Abstract

Barcodes are popularly used for product identification in many scenarios. However, locating them on product images is challenging. Half-occlusion, distortion, darkness or targets being too small to recognize can often add to the difficulties using conventional methods. In this paper, we introduce a large-scale diverse barcode dataset and adopt a deep learning-based semantic segmentation approach to address these problems. Specifically, we use an efficient method to synthesize 30000 well-annotated images containing diverse barcode labels, and get Barcode-30 k, a large-scale dataset with accurate pixel-level annotated barcode in the wild. Moreover, to locate barcode more precisely, we further propose an Effective Barcode Hunter - BarcodeNet. It is a semantic segmentation model based on CNN (Convolutional Neural Network) and is mainly formed with two novel modules, Prior Pyramid Pooling Module (P3M) and Pyramid Refine Module (PRM). Additional ablation studies further demonstrate the effectiveness of BarcodeNet, and it yields a high mIoU result of 95.36% on the proposed synthetic Barcode-30 k validation-set. To prove the practical value of the whole system, we test the BarcodeNet trained on train-set of Barcode-30 k on a manually annotated testing set that only collected from cameras, it achieves mIoU of 90.3%, which is a very accurate result for practical applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for scene segmentation. TPAMI 39(12), 2481–2495 (2017)
Article Google Scholar
Caesar, H., Uijlings, J., Ferrari, V.: Region-based semantic segmentation with end-to-end training. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 381–397. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_23
Chapter Google Scholar
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. TPAMI 40(4), 834–848 (2018)
Article Google Scholar
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: ICLR (2015)
Google Scholar
Chen, L., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. CoRR. arXiv:1706.05587 (2017)
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: CVPR, pp. 3213–3223 (2016)
Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation, pp. 3992–4000 (2014)
Google Scholar
Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., Garcia-Rodriguez, J.: A review on deep learning techniques applied to semantic segmentation (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. TPAMI 37(9), 1904–16 (2015)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
Google Scholar
Johnson, M., Shotton, J., Cipolla, R.: Semantic texton forests for image categorization and segmentation 5(7), 1–8 (2008)
Google Scholar
Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001)
Google Scholar
Li, J.H., Wang, W.H., Rao, T.T., Zhu, W.B., Liu, C.J.: Morphological segmentation of 2-D barcode gray scale image. In: ICISAI, pp. 62–68 (2017)
Google Scholar
Li, J., Wang, Y.W., Chen, Y., Wang, G.: Adaptive segmentation method for 2-D barcode image base on mathematic morphological. Res. J. Appl. Sci. Eng. Technol. 6(18), 3335–3342 (2013)
Article Google Scholar
Li, J., Zhao, Q., Tan, X., Luo, Z., Tang, Z.: Using deep convnet for robust 1D barcode detection. In: ICIISA, pp. 261–267 (2017)
Google Scholar
Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation (2016)
Google Scholar
Liu, W., Rabinovich, A., Berg, A.C.: ParseNet: looking wider to see better. Computer Science (2015)
Google Scholar
Liu, Z., He, L., Liang, R.: Research on segmentation algorithm of 2D color barcode based on mobile phone. In: ICCEE (2012)
Google Scholar
Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation, pp. 1520–1528 (2015)
Google Scholar
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. TPAMI 39(4), 640–651 (2017)
Article Google Scholar
Thoma, M.: A survey of semantic segmentation (2016)
Google Scholar
Wachenfeld, S., Terlunen, S., Jiang, X.: Robust 1-D barcode recognition on camera phones and mobile product information display (2010)
Chapter Google Scholar
Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks, pp. 5987–5995 (2016)
Google Scholar
Xu, W., Mccloskey, S.: 2D barcode localization and motion deblurring using a flutter shutter camera. In: Applications of Computer Vision, pp. 159–165 (2011)
Google Scholar
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions (2015)
Google Scholar
Zamberletti, A., Gallo, I., Carullo, M., Binaghi, E.: Neural image restoration for decoding 1-D barcodes using common camera phones. In: VISAPP 2010 - ICCV, pp. 5–11 (2010)
Google Scholar
Zamberletti, A., Gallo, I., Albertini, S.: Robust angle invariant 1D barcode detection. In: ACPR, pp. 160–164 (2013)
Google Scholar
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_53
Chapter Google Scholar
Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR, pp. 6230–6239 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Software and Microelectronics, Peking University, Beijing, China
Feng Ni & Xixin Cao

Authors

Feng Ni
View author publications
You can also search for this author in PubMed Google Scholar
Xixin Cao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Feng Ni or Xixin Cao .

Editor information

Editors and Affiliations

Korea Advanced Institute of Science and, Daejeon, Korea (Republic of)
Yong Man Ro
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
National Cheng Kung University, Tainan City, Taiwan
Wei-Ta Chu
Tsinghua University, Beijing, China
Peng Cui
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jung-Woo Choi
National Tsing Hua University, Hsinchu, Taiwan
Min-Chun Hu
Ghent University, Ghent, Belgium
Wesley De Neve

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ni, F., Cao, X. (2020). Effective Barcode Hunter via Semantic Segmentation in the Wild. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science(), vol 11961. Springer, Cham. https://doi.org/10.1007/978-3-030-37731-1_35

Download citation

DOI: https://doi.org/10.1007/978-3-030-37731-1_35
Published: 24 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37730-4
Online ISBN: 978-3-030-37731-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics