Search for Falsifications in Copies of Business Documents

Slavin, Oleg; Andreeva, Elena; Arlazarov, Vladimir V.

doi:10.1007/978-3-030-67892-0_16

Part of the book series: Studies in Systems, Decision and Control ((SSDC,volume 350))

533 Accesses

Abstract

The present article is concerned with methods of comparison of scanned copies of business documents. Such a problem arises when comparing two copies of business documents signed by two parties to detect possible changes made by one of the parties. This problem is relevant, for example, in the banking sector when concluding contracts in paper form. It considers the partial matching method for the flexible form that allows modifying text attributes and inadvertent modifications of common words. It proposes the method of comparison of two scanned images based on recognition and analyses of N-grams words sequences. The proposed method has been tested on its private data set. The proposed method has demonstrated high quality and reliability of searching for differences in two copies of the same Agreement document.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automatic Verification of Properly Signed Multi-page Document Images

Algorithms for detection of plain copy-move regions in digital images

Article 01 July 2015

On the correctness of electronic documents: studying, finding, and localizing inconsistency bugs in PDF readers and files

Article Open access 09 March 2018

References

Saha, R., Mondal, A., Jawahar, C.: Graphical Object Detection in Document Images, pp. 51–58. https://doi.org/10.1109/icdar.2019.00018 (2019)
Ray, A., Sharma, M., Upadhyay, A., Makwana, M., Chaudhury, S., Trivedi, A., Singh, A., Saini, A.: An End-to-End Trainable Framework for Joint Optimization of Document Enhancement and Recognition, pp. 59–64. https://doi.org/10.1109/icdar.2019.00019 (2019)
Jain, R., Wigington, C.: Multimodal Document Image Classification, pp. 71–77. https://doi.org/10.1109/icdar.2019.00021 (2019)
Qasim, S.R., Mahmood, H., Shafait, F.: Rethinking Table Recognition using Graph Neural Networks, pp. 142–147. https://doi.org/10.1109/icdar.2019.00031 (2019)
Moysset, B., Kermorvant, C., Wolf, C.: Learning to detect, localize and recognize many text objects in document images from few examples. IJDAR 21, 161–175 (2018). https://doi.org/10.1007/s10032-018-0305-2
Article Google Scholar
Nagy, G.: Document analysis systems that improve with use. IJDAR 23, 13–29 (2020). https://doi.org/10.1007/s10032-019-00344-x
Article Google Scholar
Sidère, N., Cruz, F., Coustaty M., Ogier, J.-M.: A dataset for forgery detection and spotting in document images. In: Proceeding of Seventh International Conference on Emerging Security Technologies (EST). https://doi.org/10.1109/est.2017.8090394, https://sci-hub.tw/10.1109/EST.2017.8090394 (2017)
Bertrand, R., Terrades, O., Gomez-Kramer, R., Franco, P., Ogier, J.: A conditional random field model for font forgery detection. In: 13th International Conference on Document Analysis and Recognition, Nancy, France. [Online] Available: https://doi.org/10.1109/icdar.2013.29 (2015)
Beusekom, J., Shafait, F., Breuel, T.M.: Automated OCR ground truth generation. In: Proceeding of the 8th IAPR Workshop on Document Analysis Systems, pp. 111–117. Nara, Japan, September. https://sci-hub.tw/10.1109/DAS.2008.59 (2008)
Beusekom, J., Shafait, F., Breuel, T.M.: Document signature using intrinsic features for counterfeit detection. In: Proceedings of the 2nd international workshop on Computational Forensics, ser. IWCF ’08, pp. 47–57. Springer-Verlag, Berlin, Heidelberg. https://link.springer.com/content/pdf/10.1007%2F978–3-540-85303-9_5.pdf (2008)
Ahmed, A.G.H., Shafait, F.: Forgery detection based on intrinsic document contents. 11th IAPR International Workshop on Document Analysis Systems. https://doi.org/10.1109/das.2014.26 (2014)
Andreeva, E., Arlazarov, V.V., Manzhikov, T., Slavin, O.: Comparison of the scanned pages of the contractual documents. In: The 10th International Conference on Machine Austria (ICMV 2017), November 13–15. Vienna, Austria. https://doi.org/10.1117/12.2309458 (2017)
Slavin, O.A.: Using special text points in the recognition of documents. In: Studies in Systems, Decision and Control, vol. 259, pp. 43–53. Springer Nature Switzerland AG. http://doi.org/10.1007/978-3-030-32579-4_4 (2020)
Rodehorst, V., Koschan, A.: Comparison and evaluation of feature point detectors (2006)
Google Scholar
Lukoyanov, A., Nikolaev, D., Konovalenko, I.: Modification of YAPE keypoint detection algorithm for wide local contrast range images. In: Tenth International Conference on Machine Vision (ICMV 2017), vol. 10696. International Society for Optics and Photonics, vol. 10696. https://doi.org/10.1117/12.2310243 (2018)
Badino, H., Kanade, T.A.: Head-Wearable “Short-Baseline Stereo System for the Simultaneous Estimation of Structure and Motion”. In: Proceedings of MVA, pp. 185–189 (2011)
Google Scholar
Skoryukina, N., Farajev, I., Bulatov, K., Arlazarov, V.V.: Impact of geometrical restrictions in RANSAC sampling on the ID document classification. In: Osten, W., Nikolaev, D., Zhou, J. (ed.) ICMV 2019, 11433 ed., vol. 11433, pp. 1–7. ISSN 0277-786X, ISBN 978-15-10636-43-9. https://doi.org/10.1117/12.2559306 (2020)
Bezmaternykh, P.V., Nikolaev, D.P.: A document skew detection method using fast Hough transform. In: Proceedings Volume 11433, Twelfth International Conference on Machine Vision (ICMV 2019); 114330 J. [Online] Available: https://doi.org/10.1117/12.2559069 (2020)
Smart IDReader: Document Recognition in Video Stream. In: Bulatov, K., Arlazarov, V., Chernov, T., Slavin, O., Nikolaev, D. 14th IAPR International Conference on Document Analysis and Recognition, vol. 6, pp. 39–44. IEEE. https://doi.org/10.1109/icdar.2017.347 (2017)
Chernyshova, Y.S., Sheshkus, A.V., Arlazarov, V.V.: Two-step CNN framework for text line recognition in camera-captured images. IEEE Access 8, 32587–32600 (2020). https://doi.org/10.1109/ACCESS.2020.2974051
Article Google Scholar
Tesseract OCR. Documentation. [Online] Available: https://tesseract-ocr.github.io. Accessed 26 Oct 2020

Download references

Acknowledgements

The research is carried out with partial financial support of The Russian Foundation for Basic Research (projects: 17-29-03170, 18-07-01384).

Author information

Authors and Affiliations

Federal Research Center “Computer Sciences and Control” Russian Academy of Sciences, 9 Prosp. 60-Letiya Oktyabrya, Moscow, 117312, Russia
Oleg Slavin
Moscow Institute of Physics and Technology (State University)—MIPT, Institutskiy Per 9, Dolgoprudny, Moscow Region, 141701, Russia
Oleg Slavin
LLC “Smart Engines Service”, 9 Prosp. 60-Letiya Oktyabrya, Moscow, 117312, Russia
Elena Andreeva & Vladimir V. Arlazarov
Federal Publicly Funded Institution of Science, Institute for Information Transmission Problems n.a. A.A. Kharkevich of Russian Academy of Science, 19 Bolshoy Karetny Per, Moscow, 127051, Russia
Vladimir V. Arlazarov

Authors

Oleg Slavin
View author publications
You can also search for this author in PubMed Google Scholar
Elena Andreeva
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir V. Arlazarov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oleg Slavin .

Editor information

Editors and Affiliations

Volgograd State Technical University, Volgograd, Russia
Alla G. Kravets
Peter the Great St. Petersburg Polytechn University, St. Petersburg, Russia
Alexander A. Bolshakov
Volgograd State Technical University, Volgograd, Russia
Maxim V. Shcherbakov

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Slavin, O., Andreeva, E., Arlazarov, V.V. (2021). Search for Falsifications in Copies of Business Documents. In: Kravets, A.G., Bolshakov, A.A., Shcherbakov, M.V. (eds) Cyber-Physical Systems. Studies in Systems, Decision and Control, vol 350. Springer, Cham. https://doi.org/10.1007/978-3-030-67892-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-67892-0_16
Published: 14 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67891-3
Online ISBN: 978-3-030-67892-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Search for Falsifications in Copies of Business Documents

Abstract

Access this chapter

Similar content being viewed by others

Automatic Verification of Properly Signed Multi-page Document Images

Algorithms for detection of plain copy-move regions in digital images

On the correctness of electronic documents: studying, finding, and localizing inconsistency bugs in PDF readers and files

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Search for Falsifications in Copies of Business Documents

Abstract

Access this chapter

Similar content being viewed by others

Automatic Verification of Properly Signed Multi-page Document Images

Algorithms for detection of plain copy-move regions in digital images

On the correctness of electronic documents: studying, finding, and localizing inconsistency bugs in PDF readers and files

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation