1 Introduction

Object detection has long been a popular research topic in computer vision. In the past decade, traditional methods [26,27,28] reached a bottleneck and their accuracy remained unsatisfying. Image recognition based on Convolutional Neural Networks, however, has achieved remarkable results, and building on this progress, [3] made a breakthrough by introducing Convolutional Neural Networks into object detection. Subsequently, many methods [1,2,3, 7, 32,33,34] based on the idea of [3] have significantly improved both accuracy and speed. Most current deep-learning object detection pipelines can be divided into three stages: (1) extract region proposals, (2) classify and refine each region proposal, and (3) remove duplicate detection boxes that might belong to the same object. NMS is frequently used in stage (3) as an essential part of object detection and performs well in [1,2,3, 7, 8]. However, the traditional NMS algorithm suppresses every detection box whose IoU with the pre-selected detection p exceeds a given threshold T. Average precision drops as a result of the missed positives; because of this greedy suppression, traditional NMS is also called GreedyNMS. The state-of-the-art detector Faster RCNN, built on ImageNet models [10,11,12], extracts feature maps and generates region proposals with a Region Proposal Network (RPN) [2], then applies GreedyNMS to obtain the final detections.

For any IoU threshold, GreedyNMS trades off recall against precision. This paper is devoted to the misses caused by the choice of the threshold T. As a post-processing step, GreedyNMS aims to remove redundant detections; in fixing its defects we should not add any extra computation or parameters. [9] proposed the Tyrolean network (Tnet) to rescore all detections so that the network can choose the best detection. Although this method improves detection performance, it requires additional training.

Stages (1) and (2) produce multiple detection boxes for a static image (denoted as set P), each with a corresponding detection score (denoted as set C). GreedyNMS selects the detection box p with the highest score in P, removes p from P and appends it to set K (the final detection boxes set). It then computes the IoU of p with the remaining detection boxes and suppresses every detection whose IoU is higher than the threshold T, repeating the same procedure on the remaining boxes. The threshold T therefore plays an essential role in object detection: with a low threshold T, objects whose IoU with p exceeds T are missed; with a high threshold T, more false positives survive and average precision drops. Hence, GreedyNMS always compromises between recall and precision. For this reason we propose Penalty Non-Maximum Suppression (Penalty-NMS), which in each iteration penalizes all detection boxes except the one with the highest score. Instead of using a threshold T, it assigns a penalty coefficient that reduces each detection score according to its overlap. Where GreedyNMS suppresses any box whose overlap with p is greater than T, we use a quadratic function of the overlap to compute the penalty coefficient that decays the detection scores. Penalty-NMS obtains significant improvements in average precision for Faster RCNN and R-FCN on standard datasets such as PASCAL VOC and MS COCO.

2 Related Work

Although the NMS algorithm is a core part of object detection, most past studies have focused on feature design, classifier design, and object proposals. Surprisingly, few studies on NMS algorithms exist.

NMS was first employed in edge detection, where it performs edge thinning to remove spurious responses [35]. It was subsequently applied to face detection [29] and object detection [16]. [16] demonstrated that a greedy NMS algorithm, in which the detection box with the maximum score is selected and its neighboring boxes are suppressed using a threshold, improves performance over the approach used for face detection [29]. Since then, greedy NMS has been the de-facto algorithm in object detection [1,2,3, 7, 24, 32].

For clustering detections, principled clustering formulations have been proposed in [15, 16] that obtain good performance for object class detection. Several other clustering algorithms have been explored for the task of NMS: mean-shift clustering [16], agglomerative clustering [17] and affinity propagation clustering [18]. However, they have yet to surpass the performance of GreedyNMS. To link convnets and NMS, [26] trains an LSTM that directly generates a sparse set of detections, making NMS unnecessary. [21] designs a convnet that combines decisions of GreedyNMS with different overlap thresholds, allowing the network to choose the GreedyNMS operating point locally. [20] proposed a proposal subset optimization algorithm for detecting salient objects as an alternative to NMS. [22] proposed a true end-to-end learning algorithm that makes the classifier aware of the NMS procedure at test time by including GreedyNMS at training time. [23] operates on graphs, but requires a pre-processing step that defines a node ordering. [4, 30] tend to produce fewer spread-out double detections and improve overall detection quality. [19, 24] propose to detect pairs of objects instead of individual objects in order to handle strong occlusion, but single and double detections must then be handled separately.

In summary, most of the proposed algorithms can replace GreedyNMS, yet GreedyNMS still obtains the best performance for generic object detection. [21] obtains better results and performs NMS without being given a set of suppression alternatives to choose from and without a final suppression step, but it requires an additional deep network with a vast number of parameters and heavy computation. [25] proposed Soft-NMS; our algorithm is similar to it but does not add any parameters.

Although the traditional NMS algorithm performs well across several generations of detectors [1,2,3, 13, 14], it is still a greedy algorithm with obvious defects. This paper aims to improve the NMS algorithm without any additional parameters or computation.

3 Penalty-NMS

This section is divided into two parts. First, we review the details of traditional NMS (i.e. GreedyNMS) and analyze its shortcomings as a post-processing step in object detection. Then we introduce the proposed Penalty-NMS in detail.

3.1 GreedyNMS

When a test image passes through the detection system without post-processing, multiple detections are generated around each object, yet each object needs only one detection box. GreedyNMS is therefore used as a post-processing step to eliminate redundant detection boxes. The GreedyNMS procedure is as follows:

  1. Input: the detection boxes set \( P = \{p_1, p_2, \ldots, p_n\} \) and the corresponding scores set \( C = \{c_1, c_2, \ldots, c_n\} \).

  2. Choose the detection box \( p_{\max} \) with the maximum score and merge it into set \( K \) (the final detection boxes set). Then remove \( p_{\max} \) from \( P \) and calculate the IoU overlap of the remaining detections with \( p_{\max} \).

  3. Set the threshold \( T \). Remove from \( P \) all detections whose IoU overlap with \( p_{\max} \) is greater than \( T \).

  4. Repeat steps 2 and 3 until \( P \) is empty.

The threshold T in step 3 has a decisive influence on the final result of the entire detection system, but a universally suitable value is hard to find.
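To make the procedure above concrete, the following is a minimal NumPy sketch of GreedyNMS; the box layout ([x1, y1, x2, y2] per row) and the helper name `iou` are illustrative choices of ours, not part of the original paper.

```python
import numpy as np

def iou(box, boxes):
    """IoU of one box [x1, y1, x2, y2] with an (n, 4) array of boxes."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.maximum(0.0, x2 - x1) * np.maximum(0.0, y2 - y1)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)

def greedy_nms(boxes, scores, T=0.3):
    """Steps 1-4 above: keep the top-scoring box, suppress neighbors with IoU > T."""
    order = np.argsort(scores)[::-1]      # step 1: indices sorted by descending score
    keep = []
    while order.size > 0:
        i = order[0]                      # step 2: box with the maximum score
        keep.append(i)
        overlaps = iou(boxes[i], boxes[order[1:]])
        order = order[1:][overlaps <= T]  # step 3: remove boxes with IoU > T
    return keep                           # step 4: repeat until P is empty
```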

3.2 Penalty-NMS

Neighboring detection boxes have a high likelihood of detecting the same object. However, GreedyNMS always falls into a local optimum, since it removes every detection box whose IoU overlap is greater than the threshold T. Unlike GreedyNMS, we penalize all detection boxes that GreedyNMS would suppress (Piecewise Penalty-NMS), or penalize all detection boxes regardless of whether they overlap with detection box p (Continuous Penalty-NMS). Detection boxes with low scores are removed after multiple penalties.

Piecewise Penalty-NMS.

As shown in Eq. (1), Piecewise Penalty-NMS penalizes every detection whose IoU exceeds T, while detection boxes with an IoU below T keep their original scores. The penalty coefficient is \( \beta(1 - \text{overlap}^2) \).

$$ \mu_i = \begin{cases} 1, & \text{overlap} < T \\ \beta\left(1 - \text{overlap}^2\right), & \text{overlap} \ge T \end{cases} $$
(1)

Here \( \mu_i \) is the penalty coefficient for detection box i, overlap (0 ≤ overlap ≤ 1) is the IoU of the detection box \( p_i \) with the pre-selected detection box, and \( \beta \) (\( \beta > 0 \)) is a regulatory factor.

In Eq. (1) we change the score of the detection boxes whose IoU overlap exceeds the threshold T from 0 (as in GreedyNMS) to \( \beta(1 - \text{overlap}^2) \cdot \text{confidence} \) (the detection box's score). GreedyNMS can therefore be regarded as a special form of Piecewise Penalty-NMS. As analyzed in Sect. 3.1, GreedyNMS always misses some positive objects, so penalizing detections by decaying their scores is a better approach. Eq. (1) decreases the scores of the detection boxes whose IoU overlap is greater than the threshold T. However, Piecewise Penalty-NMS still requires the threshold T to be set manually, so its performance remains limited by T.
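As a brief illustration, Eq. (1) can be written as a one-line scoring rule; the function name and the default values of T and β below are only examples taken from the experiments in Sect. 4.

```python
def piecewise_penalty(overlap, T=0.3, beta=0.6):
    """Eq. (1): keep the score unchanged when IoU < T,
    otherwise decay it by the coefficient beta * (1 - overlap^2)."""
    return 1.0 if overlap < T else beta * (1.0 - overlap ** 2)

# A box's new score is its old confidence times the coefficient:
# score_i = piecewise_penalty(overlap_i) * score_i
```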

Continuous Penalty-NMS.

The Continuous Penalty-NMS algorithm is summarized in Fig. 1, with the penalty coefficients given in Eqs. (2) and (3). In Continuous Penalty-NMS we no longer use the threshold T but directly penalize all detection boxes. The penalties in Eqs. (2) and (3) grow in opposite ways as the overlap increases. In the following sections, Continuous Penalty-NMS1 and Continuous Penalty-NMS2 correspond to Eqs. (2) and (3) respectively.

$$ \mu_i = \beta\left(1 - \text{overlap}^2\right) $$
(2)
$$ \mu_i = \beta\left(\text{overlap} - 1\right)^2 $$
(3)

In Sect. 3.1 we analyzed how the performance of traditional NMS is strictly limited by the threshold T. Piecewise Penalty-NMS improves on GreedyNMS but still cannot get rid of the threshold T. We therefore further propose Continuous Penalty-NMS. Unlike [25], which does not penalize detection boxes having no overlap with the pre-selected box, Continuous Penalty-NMS penalizes all detections except the box with the maximum score in each iteration.

Quadratic functions are used as the penalty coefficients in this paper. The first derivative of Eqs. (1) and (2) is \( -2\beta \cdot \text{overlap} \) and that of Eq. (3) is \( 2\beta(\text{overlap} - 1) \); all are non-positive on [0, 1]. The penalty coefficient therefore becomes smaller as the IoU grows, decaying the score of the detection box more strongly. The second derivatives of Eqs. (1) and (2) are negative, so they are concave functions; Eq. (3) is convex, resembling the Gaussian penalty presented in [25]. The growth of the penalty in Eq. (2) accelerates as the IoU overlap increases, whereas the growth of the penalty in Eq. (3) slows as the IoU overlap increases.
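A small sketch of the two continuous coefficients, Eqs. (2) and (3), makes the opposite growth behaviors visible; the printed sweep over IoU values is purely illustrative.

```python
def continuous_penalty_1(overlap, beta=0.6):
    """Eq. (2): concave decay; the penalty grows fastest at high IoU."""
    return beta * (1.0 - overlap ** 2)

def continuous_penalty_2(overlap, beta=0.6):
    """Eq. (3): convex decay; the penalty grows fastest at low IoU."""
    return beta * (overlap - 1.0) ** 2

# Both coefficients start at beta (IoU = 0) and fall to 0 (IoU = 1),
# but they approach 0 from opposite sides:
for ov in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(f"IoU={ov:.2f}  mu1={continuous_penalty_1(ov):.3f}  "
          f"mu2={continuous_penalty_2(ov):.3f}")
```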

In each iteration, Penalty-NMS removes the detections whose scores fall below \( \sigma \). Compared with the threshold T in GreedyNMS, the parameters \( \beta \) and \( \sigma \) have far less influence on the performance of the algorithm. We can also set \( \beta \) to 1.0, in which case our algorithm adds no parameters. Moreover, the computational complexity of Penalty-NMS is \( O(n^2) \), the same as GreedyNMS, where n is the number of detection boxes. Although Penalty-NMS does not reach the globally optimal solution, it obtains a better sub-optimal solution than GreedyNMS without any additional computation and can easily be embedded in any detection system (Fig. 1).

Fig. 1. The pseudo code for the Penalty-NMS algorithm.
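Since the pseudo code of Fig. 1 is not reproduced here, the following sketch reconstructs the loop from the description above; `penalty_fn` stands for any of Eqs. (1)-(3), `iou` is the helper sketched in Sect. 3.1, and the default σ is one illustrative value from the sensitivity range studied in Sect. 4.

```python
import numpy as np

def penalty_nms(boxes, scores, penalty_fn, sigma=0.002):
    """Penalty-NMS: instead of suppressing neighbors outright, multiply their
    scores by a penalty coefficient; discard a box only when its (repeatedly
    penalized) score drops below sigma. Complexity is O(n^2), as with GreedyNMS."""
    scores = np.asarray(scores, dtype=float).copy()
    candidates = list(range(len(scores)))
    keep = []
    while candidates:
        i = max(candidates, key=lambda j: scores[j])   # pre-selected box p_max
        keep.append(i)
        candidates.remove(i)
        for j in candidates:
            ov = float(iou(boxes[i], boxes[j:j + 1])[0])
            scores[j] *= penalty_fn(ov)                # multiplicative score decay
        candidates = [j for j in candidates if scores[j] >= sigma]
    return keep

# Example: the parameter-free variant with beta fixed to 1.0 (Eq. (2)):
# keep = penalty_nms(boxes, scores, lambda ov: 1.0 - ov ** 2)
```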

4 Experiment

4.1 Experiment on PASCAL VOC

Our experiments are based on Faster RCNN on PASCAL VOC [4], which has 20 object categories; the backbone network is VGG16 [11]. We train the models on the VOC 2007 trainval set and evaluate on the VOC 2007 test set. Object detection accuracy is measured by mean Average Precision (mAP). In this section we analyze and compare the performance of Piecewise Penalty-NMS, Continuous Penalty-NMS1, Continuous Penalty-NMS2 and the traditional NMS algorithm, and analyze the sensitivity of the parameters \( \beta \) and \( \sigma \).

Penalty-NMS Performance Analysis.

In Table 1 we report the mAP of GreedyNMS and Piecewise Penalty-NMS at different thresholds T, and compare the performance of Piecewise Penalty-NMS for different values of \( \beta \). Table 1 shows that when the threshold T is 0.3 or 0.4, Piecewise Penalty-NMS improves mAP over GreedyNMS by up to 1.9% (higher than [25]). Although Piecewise Penalty-NMS adds a parameter \( \beta \), the red box in Table 1 shows that different values of \( \beta \) make no significant difference in mAP; the fluctuation range is only 0–0.7%. The blue box in Table 1 shows that Piecewise Penalty-NMS obtains no improvement when the threshold T ≥ 0.5. The main reason is that the positive detections that should be suppressed are kept directly; with such a high threshold the effect is similar to our algorithm, but the growth in true positives is much smaller than the growth in false positives, which leads to a drop in average precision.

Table 1. Results on the PASCAL VOC 2007 test set. PPenalty-NMS denotes Piecewise Penalty-NMS; 0.5, 0.6, 0.7, 0.8 denote the values of \( \beta \).

Table 2 shows that Continuous Penalty-NMS1 and Continuous Penalty-NMS2 still obtain significant improvements without using the threshold T; the highest gains are 1.9% and 1.8% respectively, equal to Piecewise Penalty-NMS. Empirically, detection boxes with higher overlap should be penalized more. The growth of the penalty in Eq. (2) accelerates as the IoU increases, whereas that in Eq. (3) slows. Although Continuous Penalty-NMS1 and Continuous Penalty-NMS2 thus embody two different modes of penalty growth, both obtain similar improvements as \( \beta \) varies. The growth mode of the penalty is therefore not the key factor affecting the performance of the algorithm. Rows 4 and 5 of Table 2 show that the mAP of the traditional NMS decreases continuously as the threshold T increases; its performance depends entirely on the setting of T, whereas Continuous Penalty-NMS1 and Continuous Penalty-NMS2 vary only within 0.3%–0.8%.

Table 2. Results on PASCAL VOC 2007 test set, CPenalty-NMS1 denotes Continuous Penalty-NMS1, CPenalty-NMS2 denotes Continuous Penalty-NMS2.

Sensitivity Analysis.

The parameter \( \sigma \) plays the same role as the threshold T in GreedyNMS, but its effect on the performance of the algorithm is far smaller. As shown in Fig. 2, Piecewise Penalty-NMS and Continuous Penalty-NMS1 obtain better performance when \( \sigma \) ranges from 0.001 to 0.004. The mAP of Piecewise Penalty-NMS (\( \beta = 0.6 \), T = 0.3) stays at 71.4%–71.9%; likewise, Continuous Penalty-NMS1 (\( \beta = 0.6 \)) stays at 71.6%–71.9%. The fluctuation is within 0.5%, less than that of GreedyNMS.

Fig. 2. Sensitivity analysis of the parameter \( \sigma \).

Fig. 3. Selected examples of object detection results on the PASCAL VOC 2007 test set using the Faster R-CNN system. The model is VGG-16 and the training data is 07 trainval. Left: the GreedyNMS algorithm; right: the Continuous Penalty-NMS1 algorithm.

Column 9 of Table 2 shows that Continuous Penalty-NMS1 and Continuous Penalty-NMS2 reach 71.1% and 71.8% respectively, also better than GreedyNMS. We therefore ran a further experiment setting \( \beta \) to 1.0, which means the parameter \( \beta \) is no longer used. As Table 3 shows, we still obtain an improvement of 1.0%–1.8% when \( \sigma \) ranges from 0.001 to 0.004. Although fixing \( \beta \) costs some average precision, Penalty-NMS not only improves significantly over the traditional NMS algorithm but also allows \( \sigma \) to be chosen freely, with only a slight influence on accuracy. Table 4 shows the detailed numbers.

Table 3. Sensitivity analysis when \( \beta \) is equal to 1.0. CPenalty-NMS1 denotes Continuous Penalty-NMS1 and CPenalty-NMS2 denotes Continuous Penalty-NMS2.
Table 4. Results on the PASCAL VOC 2007 test set with Faster R-CNN detectors and VGG-16.

4.2 Experiments on MS COCO

We evaluate Faster RCNN and R-FCN on the MS COCO dataset [5], which has 80 object categories. Our experiments use the 80k train set for training and the 40k val set for testing. We report the mAP averaged over IoU ∈ [0.5:0.05:0.95] (COCO's standard metric, denoted mAP@[0.5, 0.95]) and mAP@0.5 (PASCAL VOC's metric).
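For clarity, COCO's standard metric simply averages the mAP computed at ten IoU thresholds:

$$ \text{mAP@}[0.5, 0.95] = \frac{1}{10} \sum_{t \in \{0.5, 0.55, \ldots, 0.95\}} \text{mAP@}t $$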

The results are shown in Table 5. The single-scale R-FCN [24] baseline achieves 48.9%/27.6% on val, and the Faster RCNN baseline achieves 48.4%/27.2%. With the Piecewise Penalty-NMS algorithm, R-FCN improves by 1.1% and 1.0% on MS COCO's standard metric and PASCAL VOC's metric respectively, and Faster RCNN improves by 1.4% and 1.3%. On MS COCO's standard metric we further obtain improvements of 1.6% and 1.8% for R-FCN and Faster RCNN respectively, higher than the Soft-NMS [25] improvements of 1.1% for Faster RCNN and 1.3% for R-FCN.

Table 5. Results on the MS COCO dataset with ResNet-101. G denotes GreedyNMS, P denotes Piecewise Penalty-NMS and C denotes Continuous Penalty-NMS1.

5 Conclusion

As a greedy algorithm, NMS usually serves as a post-processing step. In this paper we analyzed in detail the limitations of GreedyNMS, which misses objects because of the setting of the threshold T. We proposed the Penalty-NMS algorithm, which penalizes detection boxes rather than directly suppressing them; moreover, our method removes these limitations without any additional computation or parameters. The experimental results show that Penalty-NMS achieves salient progress compared with the traditional NMS algorithm. Note that we do not obtain the globally optimal result and Penalty-NMS is still a greedy algorithm, but the penalty idea works well for NMS, and other functions could also serve as the penalty coefficient. Future work will focus on learning the penalty coefficient instead of adopting a fixed function.