Skip to main content

Enhancing Enterprise Business Processes Through AI Based Approach for Entity Extraction – An Overview of an Application

  • Conference paper
  • First Online:
Recent Trends in Image Processing and Pattern Recognition (RTIP2R 2020)

Abstract

While Industries are growing strong with their digital transformation, advanced analytics are making them stronger through data driven decisions. At the same time traditional automation is getting matured and emerging as cognitive automation. In the era of Industry 4.0, handshake of business process automation, advance analytics and cognitive services have laid down a strong platform for ‘Cognitive Bots’. Enterprises can leverage the advent of powerful technologies and approaches, anticipating ultimate goal of the business through more adaptive, self-learning, and contextual applications. This paper explains one of such cognitive bot for finance department where invoices are of utmost importance for the function. The mentioned bot is intended for amount detection and verification; additionally, it can also extract various entities like organization name, location and date which contributes to perform analytics to a great extent. The application reward business in reducing turnaround time and human errors. The accuracy of specially customized trained neural model has achieved state of the art results on the current set of learning data. The proposed framework makes use of Optical Character Recognition and PDFMiner for text extraction from scanned invoices. A Quality Classifier that will reject hand written invoices for any further processing. A spaCy’s Name-Entity-Recognition predicts the amount, date, organization name and location from extracted unstructured text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ming, D., Liu, J., Tian, J.: Research on Chinese financial invoice recognition technology. Pattern Recogn. Lett. 24(1), 489–497 (2003)

    Article  Google Scholar 

  2. Emambakhsh, M., He, Y., Nabney, I.: Handwritten and machine-printed text discrimination using a template matching approach. In: Proceedings of the 12th IAPR International Workshop on Document Analysis Systems DAS, vol. 2016, no. 101779, pp. 399–404 (2016)

    Google Scholar 

  3. Emambakhsh, M., He, Y., Nabney, I.: Handwritten and Machine-Printed Text Discrimination Using a Template Matching Approach (2016). https://doi.org/10.1109/DAS.2016.22.

  4. Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004). https://doi.org/10.1023/B:VISI.0000029664.99615.94

    Article  Google Scholar 

  5. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)

    Article  Google Scholar 

  6. Albawi, S., Mohammed, T.A., Al-Zawi, S.: Understanding of a convolutional neural network. In: 2017 International Conference on Engineering and Technology (ICET), Antalya, pp. 1–6 (2017). https://doi.org/10.1109/ICEngTechnol.2017.8308186

  7. Vincent, L.: Announcing Tesseract OCR (2006). https://googlecode.blogspot.com/2006/08/announcing-tesseract-ocr.html. Accessed 30 Aug 2006

  8. Alginahi, Y.: Preprocessing Techniques in Character Recognition (2010). https://doi.org/10.5772/9776

  9. Abdu, A.: Enhanced radon transform skew estimation and correction algorithm for scanned multiple-choice forms, pp. 444–454 (2019). https://doi.org/10.15405/epsbs.2019.05.02.44.

  10. Honnibal, M.: Introducing spaCy (2015). https://explosion.ai/blog/introducing-spacy. Accessed 19 Feb 2015

  11. Hovy, E., Marcus, M., Palmer, M., Ramshaw, L., Weischedel, R.: OntoNotes: the 90% solution. In: Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, NAACL-Short 2006, pp. 57–60. Association for Computational Linguistics, Stroudsburg (2006)

    Google Scholar 

  12. Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP, pp. 1532–1543 (2014)

    Google Scholar 

  13. Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A survey on deep transfer learning. arXiv:1808.01974 (2018)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ankit Dwivedi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dwivedi, A., Vijayan, P., Gupta, R., Ramdasi, P. (2021). Enhancing Enterprise Business Processes Through AI Based Approach for Entity Extraction – An Overview of an Application. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1380. Springer, Singapore. https://doi.org/10.1007/978-981-16-0507-9_32

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-0507-9_32

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-0506-2

  • Online ISBN: 978-981-16-0507-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics