DiffRNN: Differential Verification of Recurrent Neural Networks

  • Conference paper
Formal Modeling and Analysis of Timed Systems (FORMATS 2021)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12860)

Abstract

Recurrent neural networks (RNNs) such as Long Short-Term Memory (LSTM) networks have become popular in a variety of applications such as image processing, data classification, speech recognition, and as controllers in autonomous systems. In practical settings, there is often a need to deploy such RNNs on resource-constrained platforms such as mobile phones or embedded devices. As the memory footprint and energy consumption of such components become a bottleneck, there is interest in compressing and optimizing such networks using a range of heuristic techniques. However, these techniques do not guarantee the safety of the optimized network, e.g., against adversarial inputs, or equivalence of the optimized and original networks. To address this problem, we propose DiffRNN, the first differential verification method for RNNs to certify the equivalence of two structurally similar neural networks. Existing work on differential verification for ReLU-based feed-forward neural networks does not apply to RNNs, where nonlinear activation functions such as Sigmoid and Tanh cannot be avoided. RNNs also pose unique challenges such as handling sequential inputs, complex feedback structures, and interactions between the gates and states. In DiffRNN, we overcome these challenges by bounding nonlinear activation functions with linear constraints and then solving constrained optimization problems to compute tight bounding boxes on nonlinear surfaces in a high-dimensional space. The soundness of these bounding boxes is then proved using the dReal SMT solver. We demonstrate the practical efficacy of our technique on a variety of benchmarks and show that DiffRNN outperforms state-of-the-art RNN verification tools such as Popqorn.
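To make the bounding step concrete, the following minimal Python sketch (ours, for illustration; it is not the authors' implementation) computes sound linear lower and upper bounds for the sigmoid on an interval [l, u]. It exploits the fact that sigmoid is convex on (-inf, 0] and concave on [0, inf): on a purely convex or concave piece, a tangent line and the chord bracket the curve, and on a mixed interval it falls back to constant bounds, which are sound because sigmoid is monotonic. Tanh can be handled the same way via the identity tanh(x) = 2*sigmoid(2x) - 1.

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def sigmoid_prime(x):
        s = sigmoid(x)
        return s * (1.0 - s)

    def linear_bounds_sigmoid(l, u):
        """Return (a_lo, b_lo, a_up, b_up) such that
        a_lo*x + b_lo <= sigmoid(x) <= a_up*x + b_up for all x in [l, u]."""
        assert l <= u
        if l == u:
            return 0.0, sigmoid(l), 0.0, sigmoid(u)
        k = (sigmoid(u) - sigmoid(l)) / (u - l)  # slope of the chord
        m = 0.5 * (l + u)                        # tangent point
        if u <= 0.0:
            # Convex piece: tangents lie below the curve, the chord lies above.
            return sigmoid_prime(m), sigmoid(m) - sigmoid_prime(m) * m, k, sigmoid(l) - k * l
        if l >= 0.0:
            # Concave piece: the chord lies below the curve, tangents lie above.
            return k, sigmoid(l) - k * l, sigmoid_prime(m), sigmoid(m) - sigmoid_prime(m) * m
        # Mixed curvature: constant bounds, sound since sigmoid is increasing.
        return 0.0, sigmoid(l), 0.0, sigmoid(u)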

J. V. Deshmukh and C. Wang—Equal contribution.


Notes

  1. dReal is implemented based on delta-complete decision procedures; it returns either unsat or delta-sat on the given input formulas, where delta is a user-defined error bound [11] (see the usage sketch after note 8).

  2. Gated recurrent units (GRUs) are structurally very similar to LSTMs, and the differential-verification hurdles for GRUs are the same as for LSTMs; thus, we omit GRUs from this paper for brevity.

  3. For a scalar input u, \( \sigma _\mathcal {S} (u) = \frac{e^{u}}{1+e^{u}}\) and \(\tanh (u) = \frac{e^{u}-e^{-u}}{e^{u}+e^{-u}}\).

  4. A many-to-one vanilla RNN differs from the vanilla RNN model shown above in one small way: for an input sequence of length T, the output is computed only at time T, i.e., the final output of the network is defined as \(\mathbf {y}(T)\) (see Fig. 6 in the Appendix of the arXiv version).
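    A minimal NumPy sketch of such a network follows (our illustration; the weight names W_h, W_x, W_y and their shapes are hypothetical, chosen only for this example): the hidden state is updated at every step, but a prediction is read out only from the final state.

        import numpy as np

        def many_to_one_rnn(xs, W_h, W_x, b, W_y, c):
            """Run a vanilla RNN over inputs x(1), ..., x(T); return y(T)."""
            h = np.zeros(W_h.shape[0])
            for x in xs:
                # Standard vanilla-RNN state update with a tanh activation.
                h = np.tanh(W_h @ h + W_x @ x + b)
            # Many-to-one: the output is computed only at the final step T.
            return W_y @ h + c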

  5. For non-monotonic activation functions, we can compute the maximum and minimum using off-the-shelf global optimization tools and then validate the computed bounds using SMT solvers.
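    As an illustration of this fallback (our own sketch, assuming SciPy is available; its differential evolution stands in for any off-the-shelf global optimizer), one can search for the extrema of a non-monotonic function such as swish, x * sigmoid(x), over an interval:

        import numpy as np
        from scipy.optimize import differential_evolution

        def swish(v):
            # A non-monotonic activation: x * sigmoid(x).
            x = v[0]
            return x / (1.0 + np.exp(-x))

        lo, hi = -5.0, 5.0
        # Candidate minimum of swish on [lo, hi] ...
        res_min = differential_evolution(swish, bounds=[(lo, hi)])
        # ... and candidate maximum, by minimizing -swish.
        res_max = differential_evolution(lambda v: -swish(v), bounds=[(lo, hi)])
        print("min ~", res_min.fun, "max ~", -res_max.fun)

    Since such optimizers offer no soundness guarantee, the resulting bounds (padded by a small epsilon) would then be validated with an SMT solver, as sketched after note 8.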

  6. One of our evaluation objectives is to extend the results of ReluDiff to general activation functions instead of ReLUs. This also allows us to validate our methodology in the simpler world of feedforward networks before tackling RNNs.

  7. For a fair comparison, we use the same MNIST benchmarks used by previous work (Popqorn) for the verification of single RNNs. Judging from that work, this is a challenging benchmark set due to its high input dimensionality.

  8. Popqorn evaluates bounds using numerical tools based on gradient descent; while the approach is sound in theory, it is susceptible to numerical-precision issues. Hence, we added an extra validation step using dReal to ensure the numerical precision of the bounds computed by Popqorn.
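    The validation step in notes 1 and 8 can be sketched with the dreal Python bindings (assumed installed via pip; the bound and delta below are illustrative, and the sigmoid is written with dReal's exp). To certify an upper bound ub on an activation over a box, we ask the solver whether any point violates it; unsat (None) proves the bound, while delta-sat returns a box containing a potential counterexample.

        from dreal import And, CheckSatisfiability, Variable, exp

        x = Variable("x")
        ub = 0.74  # slightly padded candidate upper bound on sigmoid over [-1, 1]
        sigmoid_x = 1 / (1 + exp(-x))

        # Does any x in [-1, 1] violate sigmoid(x) <= ub?
        violation = And(-1 <= x, x <= 1, sigmoid_x > ub)

        # delta = 1e-4: None means unsat (bound certified); otherwise dReal
        # returns a delta-sat box around a possible violation.
        result = CheckSatisfiability(violation, 1e-4)
        print("bound certified" if result is None else f"possible violation:\n{result}")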

References

  1. Anguita, D., Ghio, A., Oneto, L., Parra, X., Reyes-Ortiz, J.L.: A public domain dataset for human activity recognition using smartphones. In: ESANN (2013)

  2. Bastani, O., Ioannou, Y., Lampropoulos, L., Vytiniotis, D., Nori, A.V., Criminisi, A.: Measuring neural net robustness with constraints. In: Annual Conference on Neural Information Processing Systems, pp. 2613–2621 (2016)

  3. Bojarski, M., et al.: End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316 (2016)

  4. Carlini, N., Wagner, D.A.: Towards evaluating the robustness of neural networks. In: IEEE Symposium on Security and Privacy, pp. 39–57 (2017)

  5. Cheng, Y., Wang, D., Zhou, P., Zhang, T.: A survey of model compression and acceleration for deep neural networks. arXiv preprint arXiv:1710.09282 (2017)

  6. Cousot, P., Cousot, R.: Abstract interpretation: a unified lattice model for static analysis of programs by construction or approximation of fixpoints. In: ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages, pp. 238–252 (1977)

  7. Cousot, P., Halbwachs, N.: Automatic discovery of linear restraints among variables of a program. In: ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages, pp. 84–96 (1978)

  8. Dvijotham, K., Stanforth, R., Gowal, S., Mann, T.A., Kohli, P.: A dual approach to scalable verification of deep networks. In: International Conference on Uncertainty in Artificial Intelligence, pp. 550–559 (2018)

  9. Ehlers, R.: Formal verification of piece-wise linear feed-forward neural networks. In: Automated Technology for Verification and Analysis - 15th International Symposium, ATVA 2017, Pune, India, 3–6 October 2017, Proceedings, pp. 269–286 (2017)

  10. Fischer, M., Balunovic, M., Drachsler-Cohen, D., Gehr, T., Zhang, C., Vechev, M.T.: DL2: training and querying neural networks with logic. In: International Conference on Machine Learning, pp. 1931–1941 (2019)

  11. Gao, S., Kong, S., Clarke, E.M.: dReal: an SMT solver for nonlinear theories over the reals. In: International Conference on Automated Deduction, pp. 208–214. Springer (2013)

  12. Gehr, T., Mirman, M., Drachsler-Cohen, D., Tsankov, P., Chaudhuri, S., Vechev, M.T.: AI2: safety and robustness certification of neural networks with abstract interpretation. In: IEEE Symposium on Security and Privacy, pp. 3–18 (2018)

  13. Ghorbal, K., Goubault, E., Putot, S.: The zonotope abstract domain Taylor1+. In: International Conference on Computer Aided Verification, pp. 627–633. Springer (2009)

  14. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: International Conference on Learning Representations (2015)

  15. Gopinath, D., Katz, G., Pasareanu, C.S., Barrett, C.W.: DeepSafe: a data-driven approach for assessing robustness of neural networks. In: Automated Technology for Verification and Analysis - 16th International Symposium, ATVA 2018, Los Angeles, CA, USA, 7–10 October 2018, Proceedings, pp. 3–19 (2018)

  16. Han, S., Mao, H., Dally, W.J.: Deep compression: compressing deep neural network with pruning, trained quantization and Huffman coding. In: International Conference on Learning Representations (2016)

  17. Huang, X., Kwiatkowska, M., Wang, S., Wu, M.: Safety verification of deep neural networks. In: International Conference on Computer Aided Verification, pp. 3–29 (2017)

  18. Jia, R., Raghunathan, A., Göksel, K., Liang, P.: Certified robustness to adversarial word substitutions. arXiv preprint arXiv:1909.00986 (2019)

  19. Julian, K.D., Kochenderfer, M.J., Owen, M.P.: Deep neural network compression for aircraft collision avoidance systems. Journal of Guidance, Control, and Dynamics 42(3), 598–608 (2019)

  20. Katz, G., Barrett, C.W., Dill, D.L., Julian, K., Kochenderfer, M.J.: Reluplex: an efficient SMT solver for verifying deep neural networks. In: International Conference on Computer Aided Verification, pp. 97–117 (2017)

  21. Katz, G., et al.: The Marabou framework for verification and analysis of deep neural networks. In: International Conference on Computer Aided Verification, pp. 443–452 (2019)

  22. Ko, C.Y., Lyu, Z., Weng, T.W., Daniel, L., Wong, N., Lin, D.: Popqorn: quantifying robustness of recurrent neural networks. arXiv preprint arXiv:1905.07387 (2019)

  23. Kurakin, A., Goodfellow, I.J., Bengio, S.: Adversarial examples in the physical world. In: International Conference on Learning Representations (2017)

  24. LeCun, Y., Cortes, C.: MNIST handwritten digit database (2010). http://yann.lecun.com/exdb/mnist/

  25. Lyu, Z., Ko, C.Y., Kong, Z., Wong, N., Lin, D., Daniel, L.: Fastened crown: tightened neural network robustness certificates. arXiv preprint arXiv:1912.00574 (2019)

  26. Ma, L., et al.: DeepGauge: multi-granularity testing criteria for deep learning systems. In: IEEE/ACM International Conference on Automated Software Engineering, pp. 120–131. ACM (2018)

  27. Ma, S., Liu, Y., Lee, W., Zhang, X., Grama, A.: MODE: automated neural network model debugging via state differential analysis and input selection. In: Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, 04–09 November 2018, pp. 175–186 (2018)

  28. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: International Conference on Learning Representations (2018)

  29. Mirman, M., Gehr, T., Vechev, M.T.: Differentiable abstract interpretation for provably robust neural networks. In: International Conference on Machine Learning, pp. 3575–3583 (2018)

  30. Moore, R.E., Kearfott, R.B., Cloud, M.J.: Introduction to Interval Analysis, vol. 110. SIAM (2009)

  31. Moosavi-Dezfooli, S., Fawzi, A., Frossard, P.: DeepFool: a simple and accurate method to fool deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2574–2582 (2016)

  32. Nguyen, A.M., Yosinski, J., Clune, J.: Deep neural networks are easily fooled: high confidence predictions for unrecognizable images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 427–436 (2015)

  33. Odena, A., Goodfellow, I.: TensorFuzz: debugging neural networks with coverage-guided fuzzing. arXiv preprint arXiv:1807.10875 (2018)

  34. Paulsen, B., Wang, J., Wang, C.: ReluDiff: differential verification of deep neural networks. arXiv preprint arXiv:2001.03662 (2020)

  35. Paulsen, B., Wang, J., Wang, J., Wang, C.: NeuroDiff: scalable differential verification of neural networks using fine-grained approximation. arXiv preprint arXiv:2009.09943 (2020)

  36. Pei, K., Cao, Y., Yang, J., Jana, S.: DeepXplore: automated whitebox testing of deep learning systems. In: ACM Symposium on Operating Systems Principles, pp. 1–18 (2017)

  37. Price, K.V., Storn, R.M., Lampinen, J.A.: Differential Evolution: A Practical Approach to Global Optimization. LNCS, Springer, Heidelberg (2005). https://doi.org/10.1007/3-540-31306-0

  38. Raghunathan, A., Steinhardt, J., Liang, P.: Certified defenses against adversarial examples. In: International Conference on Learning Representations (2018)

  39. Ruan, W., Huang, X., Kwiatkowska, M.: Reachability analysis of deep neural networks with provable guarantees. In: International Joint Conference on Artificial Intelligence, pp. 2651–2659 (2018)

  40. Shi, Z., Zhang, H., Chang, K.W., Huang, M., Hsieh, C.J.: Robustness verification for transformers. arXiv preprint arXiv:2002.06622 (2020)

  41. Singh, G., Gehr, T., Püschel, M., Vechev, M.T.: An abstract domain for certifying neural networks. In: ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages, pp. 41:1–41:30 (2019)

  42. Singh, G., Gehr, T., Püschel, M., Vechev, M.T.: Boosting robustness certification of neural networks. In: International Conference on Learning Representations (2019)

  43. Stérin, T., Farrugia, N., Gripon, V.: An intrinsic difference between vanilla RNNs and GRU models. COGNITIVE 2017, 84 (2017)

  44. Sun, Y., Wu, M., Ruan, W., Huang, X., Kwiatkowska, M., Kroening, D.: Concolic testing for deep neural networks. In: Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, 3–7 September 2018, pp. 109–119 (2018)

  45. Szegedy, C., et al.: Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013)

  46. Tian, Y., Pei, K., Jana, S., Ray, B.: DeepTest: automated testing of deep-neural-network-driven autonomous cars. In: International Conference on Software Engineering, pp. 303–314 (2018)

  47. Wang, S., Pei, K., Whitehouse, J., Yang, J., Jana, S.: Efficient formal safety analysis of neural networks. In: Annual Conference on Neural Information Processing Systems, pp. 6369–6379 (2018)

  48. Wang, S., Pei, K., Whitehouse, J., Yang, J., Jana, S.: Formal security analysis of neural networks using symbolic intervals. In: USENIX Security Symposium, pp. 1599–1614 (2018)

  49. Weng, T., et al.: Towards fast computation of certified robustness for relu networks. In: International Conference on Machine Learning, pp. 5273–5282 (2018)

  50. Wicker, M., Huang, X., Kwiatkowska, M.: Feature-guided black-box safety testing of deep neural networks. In: International Conference on Tools and Algorithms for Construction and Analysis of Systems, pp. 408–426 (2018)

  51. Wong, E., Kolter, J.Z.: Provable defenses against adversarial examples via the convex outer adversarial polytope. In: International Conference on Machine Learning, pp. 5283–5292 (2018)

  52. Xie, X., et al.: DeepHunter: a coverage-guided fuzz testing framework for deep neural networks. In: Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 146–157 (2019)

  53. Xie, X., Ma, L., Wang, H., Li, Y., Liu, Y., Li, X.: DiffChaser: detecting disagreements for deep neural networks. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 5772–5778. AAAI Press (2019)

  54. Xu, W., Qi, Y., Evans, D.: Automatically evading classifiers: a case study on PDF malware classifiers. In: Network and Distributed System Security Symposium (2016)

  55. Zhang, H., Weng, T.W., Chen, P.Y., Hsieh, C.J., Daniel, L.: Efficient neural network robustness certification with general activation functions. In: Annual Conference on Neural Information Processing Systems, pp. 4939–4948 (2018)

Acknowledgments

We thank the anonymous reviewers for their comments. The authors also gratefully acknowledge the support by the National Science Foundation (NSF) under the Career Award SHF-2048094, the NSF FMitF award CCF-1837131, the NSF grant CNS-1813117, and a grant from Toyota R&D North America.

Author information

Corresponding author

Correspondence to Sara Mohammadinejad.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Mohammadinejad, S., Paulsen, B., Deshmukh, J.V., Wang, C. (2021). DiffRNN: Differential Verification of Recurrent Neural Networks. In: Dima, C., Shirmohammadi, M. (eds) Formal Modeling and Analysis of Timed Systems. FORMATS 2021. Lecture Notes in Computer Science, vol. 12860. Springer, Cham. https://doi.org/10.1007/978-3-030-85037-1_8

  • DOI: https://doi.org/10.1007/978-3-030-85037-1_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-85036-4

  • Online ISBN: 978-3-030-85037-1
