Abstract
The Amount of legal information that is being produced on a daily basis in courts is increasing enormously. The processing of such data has been receiving considerate attention thanks to their availability in an electronic form and the progress made in Artificial Intelligence application. Indeed, deep learning has shown promising results when used in the field of natural language processing (NLP). Neural Networks such as convolutional neural networks and recurrent neural network have been used for different NLP tasks like information retrieval, sentiment analysis and document classification. In this work, we propose a Neural Network based model with a dynamic input length for French legal text classification. The proposed approach, tested over real legal cases, outperforms baseline methods.
ICIKS 2021.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Doan, T.M., Jacquenet, F., Largeron, C., Bernard, M.: A study of text summarization techniques for generating meeting minutes. In: Dalpiaz, F., Zdravkovic, J., Loucopoulos, P. (eds.) RCIS 2020. LNBIP, vol. 385, pp. 522–528. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-50316-1_33
Stead, C., Smith, S., Busch, P., Vatanasakdakul, S.: Towards an academic abstract sentence classification system. In: Dalpiaz, F., Zdravkovic, J., Loucopoulos, P. (eds.) RCIS 2020. LNBIP, vol. 385, pp. 562–568. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-50316-1_39
Li, Y., Hao, Z.B., Lei, H.: Survey of convolutional neural network. J. Comput. Appl. 36, 2508–2515 (2016)
Albawi, S., Mohammed, T.A., Al-Zawi, S.: Understanding of a convolutional neural network. In: International Conference on Engineering and Technology, ICET, pp. 1–6 (2017)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Joachims, T.: Text categorization with Support Vector Machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683
McCallum, A., Nigam, K.: A comparison of event models for Naive Bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, pp. 41–48 (1998)
Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 427–431 (2017)
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746–1751 (2014)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Proceedings of the 29th Conference on Neural Information Processing Systems, NIPS 2015. Advances in Neural Information Processing Systems, pp. 649–657 (2015)
Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 1107–1116 (2017)
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, ICML, pp. 160–167 (2008)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324 (1998)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference on Learning Representations (2013)
Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 655–665 (2014)
Arora, M., Kansal, V.: Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis. Soc. Netw. Anal. Min. 9(1), 1–14 (2019). https://doi.org/10.1007/s13278-019-0557-y
Koomsubha, T., Vateekul, P.: A character-level convolutional neural network with dynamic input length for Thai text categorization. In: Proceedings of the 9th International Conference on Knowledge and Smart Technology, KST, pp. 101–105 (2017)
Kim, Y., Jernite, Y., Sontag, D., Rush, A.M.: Character-aware neural language models. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (2016)
Radford, A., Jozefowicz, R., Sutskever, I.: Learning to generate reviews and discovering sentiment. In: International Conference on Learning Representations, ICLR (2018)
Socher, R., et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, pp. 1631–1642 (2013)
Nallapati, R., Manning, C.D.: Legal docket classification: where machine learning stumbles. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 438–446 (2008)
Sulea, O.M., Zampieri, M., Malmasi, S., Vela, M., Dinu, L.P., van Genabith, J.: Exploring the use of text classification in the legal domain. In: Proceedings of 2nd Workshop on Automated Semantic Analysis of Information in Legal Texts, ASAIL (2017)
Wei, F., Qin, H., Ye, S., Zhao, H.: Empirical study of deep learning for text classification in legal document review. In: International Conference on Big Data, pp. 3317–3320 (2018)
Undavia, S., Meyers, A., Ortega, J.E.: A comparative study of classifying legal documents with neural networks. In: Federated Conference on Computer Science and Information Systems, FedCSIS, pp. 515–522 (2018)
Da Silva, N.C., et al.: Document type classification for Brazil’s supreme court using a convolutional neural network. In: Proceedings of the Tenth International Conference on Forensic Computer Science and Cyber Law-ICoFCS, pp. 7–11 (2018)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (2015)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from over fitting. J. Mach. Learn. Res. JMLR 1929–1958 (2014)
Yoo, J.-Y., Yang, D.: Classification scheme of unstructured text document using TF-IDF and Naive Bayes classifier. The Journal of Machine Learning Research, Proceedings of 3rd International Conference on Computer and Computing Science, COMCOMS (2015)
Pranckevičius, T., Marcinkevičius, V.: Comparison of Naive Bayes, random forest, decision tree, support vector machines, and logistic regression classifiers for text reviews classification. Baltic J. Mod. Comput. 5, 221 (2017)
Acknowledgments
This paper has been done under the contract PREMATTAJ 2017–2019 of the Occitanie region which is greatly acknowledged. The decisions used in this paper have been annotated by Professor Guillaume Zambrano of the University of Nîmes.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Hammami, E., Faiz, R., Akermi, I. (2021). A Dynamic Convolutional Neural Network Approach for Legal Text Classification. In: Saad, I., Rosenthal-Sabroux, C., Gargouri, F., Arduin, PE. (eds) Information and Knowledge Systems. Digital Technologies, Artificial Intelligence and Decision Making. ICIKS 2021. Lecture Notes in Business Information Processing, vol 425. Springer, Cham. https://doi.org/10.1007/978-3-030-85977-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-85977-0_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85976-3
Online ISBN: 978-3-030-85977-0
eBook Packages: Computer ScienceComputer Science (R0)