Learning When to Say “I Don’t Know"

Kashani Motlagh, Nicholas; Davis, Jim; Anderson, Tim; Gwinnup, Jeremy

doi:10.1007/978-3-031-20713-6_15

Nicholas Kashani Motlagh ORCID: orcid.org/0000-0001-6229-6212¹⁶,
Jim Davis¹⁶,
Tim Anderson¹⁷ &
…
Jeremy Gwinnup¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13598))

Included in the following conference series:

International Symposium on Visual Computing

602 Accesses
2 Citations

Abstract

We propose a new Reject Option Classification technique to identify and remove regions of uncertainty in the decision space for a given neural classifier and dataset. Such existing formulations employ a learned rejection (remove)/selection (keep) function and require either a known cost for rejecting examples or strong constraints on the accuracy or coverage of the selected examples. We consider an alternative formulation by instead analyzing the complementary reject region and employing a validation set to learn per-class softmax thresholds. The goal is to maximize the accuracy of the selected examples subject to a natural randomness allowance on the rejected examples (rejecting more incorrect than correct predictions). We provide results showing the benefits of the proposed method over naïvely thresholding calibrated/uncalibrated softmax scores with 2-D points, imagery, and text classification datasets using state-of-the-art pretrained models. Source code is available at https://github.com/osu-cvl/learning-idk.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Ensemble diversified learning for image classification with noisy labels

Article 09 March 2021

Classification with Rejection: Concepts and Evaluations

Three–Way Classification: Ambiguity and Abstention in Machine Learning

References

Agresti, A., Coull, B.A.: Approximate is better than “exact" for interval estimation of binomial proportions. Am. Stat. 52(2) (1998)
Google Scholar
Bao, H., Dong, L., Piao, S., Wei, F.: BEiT: BERT pre-training of image transformers. In: ICLR (2022)
Google Scholar
Barbieri, F., Camacho-Collados, J., Neves, L., Espinosa-Anke, L.: TweetEval: unified benchmark and comparative evaluation for tweet classification. In: EMNLP (2020)
Google Scholar
Brown, L.D., Cai, T.T., DasGupta, A.: Interval estimation for a binomial proportion. Stat. Sci. 16(2) (2001)
Google Scholar
Chow, C.K.: On optimum recognition error and reject tradeoff. IEEE Trans. Inf. Theory 16(1), 41–46 (1970)
Article MATH Google Scholar
Clopper, C.J., Pearson, E.S.: The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26(4) (1934)
Google Scholar
Darlow, L.N., Crowley, E.J., Antoniou, A., Storkey, A.J.: CINIC-10 is not imagenet or CIFAR-10. arXiv preprint arxiv:1810.03505 (2018)
Davis, J., Frank, L.: Revisiting Batch Normalization. In: ECCV (2022)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR (2009)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Human Language Technologies (2019)
Google Scholar
El-Yaniv, R., Wiener, Y.: On the foundations of noise-free selective classification. J. Mach. Learn. Res. 11 (2010)
Google Scholar
Franc, V., Prusa, D., Voracek, V.: Optimal Strategies for Reject Option Classifiers. arXiv preprint arxiv:2101.12523 (2021)
Geifman, Y., El-Yaniv, R.: Selective classification for deep neural networks. In: NIPS (2017)
Google Scholar
Geifman, Y., El-Yaniv, R.: SelectiveNet: a deep neural network with an integrated reject option. In: ICML (2019)
Google Scholar
Guo, C., Pleiss, G., Sun, Y., Weinberger, K.Q.: On calibration of modern neural networks. In: ICML (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR (2016)
Google Scholar
Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009)
Google Scholar
Lu, Z., Sreekumar, G., Goodman, E., Banzhaf, W., Deb, K., Boddeti, V.: Neural Architecture Transfer. IEEE Trans. Pattern Anal. Mach. Intell. 43(09) (2021)
Google Scholar
Maas, A.L., Daly, R.E., Pham, P.T., Huang, D., Ng, A.Y., Potts, C.: Learning word vectors for sentiment analysis. In: Human Language Technologies (2011)
Google Scholar
Maji, S., Chicago, T., Rahtu, E., Kannala, J., Blaschkó, M., Vedaldi, A.: Fine-Grained Visual Classification of Aircraft. arXiv preprint arxiv:1306.5151 (2013)
Pietraszek, T.: Optimizing abstaining classifiers using ROC analysis. In: ICML (2005)
Google Scholar
Tortorella, F.: An optimal reject rule for binary classifiers. In: Advances in Pattern Recognition (2000)
Google Scholar
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: NIPS (2015)
Google Scholar
Zhao, Y., Chen, J., Oymak, S.: On the Role of Dataset Quality and Heterogeneity in Model Confidence. arXiv preprint arxiv:2002.09831 (2020)

Download references

Acknowledgements

This research was supported by the U.S. Air Force Research Laboratory under Contract #GRT00054740 (Release #AFRL-2022-3339).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Ohio State University, Columbus, USA
Nicholas Kashani Motlagh & Jim Davis
Air Force Research Laboratory, Wright-Patterson AFB, Dayton, USA
Tim Anderson & Jeremy Gwinnup

Authors

Nicholas Kashani Motlagh
View author publications
You can also search for this author in PubMed Google Scholar
Jim Davis
View author publications
You can also search for this author in PubMed Google Scholar
Tim Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Gwinnup
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicholas Kashani Motlagh .

Editor information

Editors and Affiliations

University of Nevada, Reno, NV, USA
George Bebis
University of Illinois Urbana-Champaign, Urbana, IL, USA
Bo Li
National University of Singapore, Singapore, Singapore
Angela Yao
Microsoft Research Asia, Beijing, China
Yang Liu
University of Missouri, Columbia, MO, USA
Ye Duan
City University of Hong Kong, Kowloon, Hong Kong
Manfred Lau
Idaho National Laboratory, Idaho Falls, ID, USA
Rajiv Khadka
Salesforce, Seattle, WA, USA
Ana Crisan
Tufts University, Medford, MA, USA
Remco Chang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kashani Motlagh, N., Davis, J., Anderson, T., Gwinnup, J. (2022). Learning When to Say “I Don’t Know". In: Bebis, G., et al. Advances in Visual Computing. ISVC 2022. Lecture Notes in Computer Science, vol 13598. Springer, Cham. https://doi.org/10.1007/978-3-031-20713-6_15

Download citation

DOI: https://doi.org/10.1007/978-3-031-20713-6_15
Published: 11 December 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20712-9
Online ISBN: 978-3-031-20713-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Learning When to Say “I Don’t Know"

Abstract

Access this chapter

Similar content being viewed by others

Ensemble diversified learning for image classification with noisy labels

Classification with Rejection: Concepts and Evaluations

Three–Way Classification: Ambiguity and Abstention in Machine Learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Learning When to Say “I Don’t Know"

Abstract

Access this chapter

Similar content being viewed by others

Ensemble diversified learning for image classification with noisy labels

Classification with Rejection: Concepts and Evaluations

Three–Way Classification: Ambiguity and Abstention in Machine Learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation