Abstract
With the abundance of electronic lawsuits already implemented throughout Brazil, courts have a valuable source of information in text format that constitute attractive bases for the application of Artificial Intelligence (AI) and machine learning (ML). In this research, supervised learning approaches were explored for the automatic classification of types of documents in electronic court proceedings of the Court of Justice of Minas Gerais (TJMG). The methodology is composed of cross-validation within the specific corpus of the legal domain, comparing traditional classifiers and more recent methods based on neural networks and deep learning models, using Glove word vectors generated for the Portuguese Language and Convolutional Neural Network (CNN). This work achieved high precision in the results and if implemented in the courts it can provide significant savings in financial and human resources, allowing lawsuits classification activities, currently done manually by employees, to be performed in seconds by the machine. The result of this experiment shows that the hit rates for the CNN and SVM classifiers exceed 93% and is considered a high result. Based on the assumption that Glove brings extra semantic resources that can help in classifying texts from court proceedings, this work demonstrates Glove’s effectiveness by showing that a CNN with Glove surpasses SVM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
The first instance is the first hierarchical jurisdiction, i.e., the first body of Justice to which the citizen must address a dispute resolution request.
- 2.
Legal term referring to the manifestations in processes such as: Initial Petition, Contestation (defense), Embargos, Sentence.
- 3.
Judicial circumscription, under the jurisdiction of one or more judges of law.
- 4.
The Natural Language Toolkit is a set of libraries and programs for symbolic and statistical processing of natural language written in the Python programming language.
- 5.
Open source machine learning library for the Python programming language.
- 6.
References
Evaluation: from Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation (2011)
Sulea, O.-M., Zampieri, M., Malmasi, S., Vela, M., Dinu, L.P., van Genabith, J.: Exploring the Use of Text Classification in the Legal Domain (2017)
Mastropaolo, A., Pallante, F., Radicioni, D.P.: Legal documents categorization by compression. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law - ICAIL’13, p. 92. ACM Press, Rome (2013)
Francesconi, E., Passerini, A.: Automatic classification of provisions in legislative texts. Artif. Intell. Law 15, 1–17 (2007). https://doi.org/10.1007/s10506-007-9038-0
Lilleberg, J., Zhu, Y., Zhang, Y.: Support vector machines and Word2vec for text classification with semantic features. In: 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC), pp. 136–140. IEEE, Beijing (2015)
Wei, F., Qin, H., Ye, S., Zhao, H.: Empirical study of deep learning for text classification in legal document review. In: 2018 IEEE International Conference on Big Data (Big Data), pp. 3317–3320 (2018). https://doi.org/10.1109/BigData.2018.8622157
Howe, J.S.T., Khang, L.H., Chai, I.E.: Legal Area Classification: A Comparative Study of Text Classifiers on Singapore Supreme Court Judgments (2019). arXiv:1904.06470 [cs]
Chalkidis, I., Fergadiotis, M., Malakasiotis, P., Aletras, N., Androutsopoulos, I.: Extreme Multi-Label Legal Text Classification: A case study in EU Legislation (2019). arXiv:1905.10892 [cs]
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Kim, Y.: Convolutional neural networks for sentence classification (2014). arXiv preprint arXiv:1408.5882
Agarwal, A., Negahban, S., Wainwright, M.: A simple way to prevent neural networks from overfitting. Ann. Stat. 40, 1171–1197 (2012)
Hartmann, N., Fonseca, E., Shulby, C., Treviso, M., Rodrigues, J., Aluisio, S.: Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks (2017). arXiv:1708.06025 [cs]
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)
Abadi, M., et al.: Tensorflow: A system for large-scale machine learning. In: Symposium on Operating Systems Design and Implementation, pp. 265–283 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Silva, A.C., Maia, L.C.G. (2020). The Use of Machine Learning in the Classification of Electronic Lawsuits: An Application in the Court of Justice of Minas Gerais. In: Cerri, R., Prati, R.C. (eds) Intelligent Systems. BRACIS 2020. Lecture Notes in Computer Science(), vol 12319. Springer, Cham. https://doi.org/10.1007/978-3-030-61377-8_43
Download citation
DOI: https://doi.org/10.1007/978-3-030-61377-8_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61376-1
Online ISBN: 978-3-030-61377-8
eBook Packages: Computer ScienceComputer Science (R0)