A Controlled Natural Language for Tax Fraud Detection

  • Aaron Calafato
  • Christian Colombo
  • Gordon J. Pace
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9767)


Tax fraud is being taken increasingly seriously, but most attempts to uncover it rely on human fraud experts to identify and audit suspicious cases. To identify such cases, the experts come up with patterns, which an IT team then implements to extract matching instances. This process, which involves communicating the patterns to the developers, debugging the implemented code, and refining the rules, is a lengthy and error-prone iterative methodology. In this paper, we present a framework in which the fraud expert is empowered to design tax fraud patterns independently through a controlled natural language implemented in GF (Grammatical Framework), with immediate feedback reported back to the expert. This allows the rules to be refined repeatedly until they are optimised, all in a timely manner. The approach has been evaluated by a number of fraud experts working with the Maltese Inland Revenue Department.
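To make the idea of an expert-written pattern with immediate feedback concrete, the following is a minimal sketch in Python. It is not the grammar or implementation described in the paper, which uses GF. The sentence form ("flag taxpayers where ..."), the comparators, the field names `declared_income` and `vehicles_owned`, and the sample records are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass

# Hypothetical toy grammar (an assumption, not the paper's language):
#   rule      := "flag taxpayers where" condition ("and" condition)*
#   condition := field ("is above" | "is below" | "equals") number
COMPARATORS = {
    "is above": lambda a, b: a > b,
    "is below": lambda a, b: a < b,
    "equals": lambda a, b: a == b,
}
CONDITION = re.compile(
    r"^(?P<field>[a-z_]+) (?P<op>is above|is below|equals) (?P<value>\d+(?:\.\d+)?)$"
)
PREFIX = "flag taxpayers where "


@dataclass(frozen=True)
class Condition:
    field: str
    op: str
    value: float

    def holds(self, record):
        # Compare the record's field against the threshold in the rule.
        return COMPARATORS[self.op](record[self.field], self.value)


def parse_rule(sentence):
    """Parse a controlled sentence into conditions, or raise ValueError."""
    text = sentence.strip().lower()
    if not text.startswith(PREFIX):
        raise ValueError(f"rule must start with {PREFIX!r}")
    conditions = []
    for part in text[len(PREFIX):].split(" and "):
        match = CONDITION.match(part.strip())
        if match is None:
            raise ValueError(f"cannot parse condition: {part!r}")
        conditions.append(
            Condition(match["field"], match["op"], float(match["value"]))
        )
    return conditions


def matching_cases(sentence, records):
    """Return the records that satisfy every condition in the rule."""
    conditions = parse_rule(sentence)
    return [r for r in records if all(c.holds(r) for c in conditions)]


# Made-up sample data standing in for taxpayer records.
RECORDS = [
    {"id": "T1", "declared_income": 9000, "vehicles_owned": 3},
    {"id": "T2", "declared_income": 45000, "vehicles_owned": 1},
    {"id": "T3", "declared_income": 8000, "vehicles_owned": 0},
]

rule = "flag taxpayers where declared_income is below 10000 and vehicles_owned is above 2"
flagged = [r["id"] for r in matching_cases(rule, RECORDS)]
print(flagged)
```

In this sketch, a sentence outside the toy grammar is rejected with a `ValueError` naming the offending part, which stands in for the kind of immediate feedback that lets an expert rephrase and rerun a rule without involving a developer.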



Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Aaron Calafato (1)
  • Christian Colombo (1)
  • Gordon J. Pace (1)
  1. University of Malta, Msida, Malta
