Abstract
Barcodes are popularly used for product identification in many scenarios. However, locating them on product images is challenging. Half-occlusion, distortion, darkness or targets being too small to recognize can often add to the difficulties using conventional methods. In this paper, we introduce a large-scale diverse barcode dataset and adopt a deep learning-based semantic segmentation approach to address these problems. Specifically, we use an efficient method to synthesize 30000 well-annotated images containing diverse barcode labels, and get Barcode-30 k, a large-scale dataset with accurate pixel-level annotated barcode in the wild. Moreover, to locate barcode more precisely, we further propose an Effective Barcode Hunter - BarcodeNet. It is a semantic segmentation model based on CNN (Convolutional Neural Network) and is mainly formed with two novel modules, Prior Pyramid Pooling Module (P3M) and Pyramid Refine Module (PRM). Additional ablation studies further demonstrate the effectiveness of BarcodeNet, and it yields a high mIoU result of 95.36% on the proposed synthetic Barcode-30 k validation-set. To prove the practical value of the whole system, we test the BarcodeNet trained on train-set of Barcode-30 k on a manually annotated testing set that only collected from cameras, it achieves mIoU of 90.3%, which is a very accurate result for practical applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for scene segmentation. TPAMI 39(12), 2481–2495 (2017)
Caesar, H., Uijlings, J., Ferrari, V.: Region-based semantic segmentation with end-to-end training. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 381–397. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_23
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. TPAMI 40(4), 834–848 (2018)
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: ICLR (2015)
Chen, L., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. CoRR. arXiv:1706.05587 (2017)
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: CVPR, pp. 3213–3223 (2016)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation, pp. 3992–4000 (2014)
Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., Garcia-Rodriguez, J.: A review on deep learning techniques applied to semantic segmentation (2017)
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. TPAMI 37(9), 1904–16 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
Johnson, M., Shotton, J., Cipolla, R.: Semantic texton forests for image categorization and segmentation 5(7), 1–8 (2008)
Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001)
Li, J.H., Wang, W.H., Rao, T.T., Zhu, W.B., Liu, C.J.: Morphological segmentation of 2-D barcode gray scale image. In: ICISAI, pp. 62–68 (2017)
Li, J., Wang, Y.W., Chen, Y., Wang, G.: Adaptive segmentation method for 2-D barcode image base on mathematic morphological. Res. J. Appl. Sci. Eng. Technol. 6(18), 3335–3342 (2013)
Li, J., Zhao, Q., Tan, X., Luo, Z., Tang, Z.: Using deep convnet for robust 1D barcode detection. In: ICIISA, pp. 261–267 (2017)
Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation (2016)
Liu, W., Rabinovich, A., Berg, A.C.: ParseNet: looking wider to see better. Computer Science (2015)
Liu, Z., He, L., Liang, R.: Research on segmentation algorithm of 2D color barcode based on mobile phone. In: ICCEE (2012)
Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation, pp. 1520–1528 (2015)
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. TPAMI 39(4), 640–651 (2017)
Thoma, M.: A survey of semantic segmentation (2016)
Wachenfeld, S., Terlunen, S., Jiang, X.: Robust 1-D barcode recognition on camera phones and mobile product information display (2010)
Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks, pp. 5987–5995 (2016)
Xu, W., Mccloskey, S.: 2D barcode localization and motion deblurring using a flutter shutter camera. In: Applications of Computer Vision, pp. 159–165 (2011)
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions (2015)
Zamberletti, A., Gallo, I., Carullo, M., Binaghi, E.: Neural image restoration for decoding 1-D barcodes using common camera phones. In: VISAPP 2010 - ICCV, pp. 5–11 (2010)
Zamberletti, A., Gallo, I., Albertini, S.: Robust angle invariant 1D barcode detection. In: ACPR, pp. 160–164 (2013)
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_53
Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR, pp. 6230–6239 (2017)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Ni, F., Cao, X. (2020). Effective Barcode Hunter via Semantic Segmentation in the Wild. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science(), vol 11961. Springer, Cham. https://doi.org/10.1007/978-3-030-37731-1_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-37731-1_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37730-4
Online ISBN: 978-3-030-37731-1
eBook Packages: Computer ScienceComputer Science (R0)