Automatic Extraction of Business Logic from Digital Documents
Executable papers have been attracting the attention of scientific publishers, as they may improve the validation of paper content by reviewers. This paper reports on ongoing research into making regular PDF submissions executable without the need to build sophisticated Web systems, while allowing authors to use their customary authoring tools. A unified approach has been developed to extract useful information from a PDF document, whether it contains scanned printouts or born-digital content, and is illustrated with experiments involving BPMN diagrams.
Keywords: Image Recognition, Automatic Extraction, Business Logic, Hough Transform, Authoring Tool