Appellate Court Modifications Extraction for Portuguese

Fernandes, William Paulo Ducca; Silva, Luiz José Schirmer; Frajhof, Isabella Zalcberg; de Almeida, Guilherme da Franca Couto Fernandes; Konder, Carlos Nelson; Nasser, Rafael Barbosa; de Carvalho, Gustavo Robichez; Barbosa, Simone Diniz Junqueira; Lopes, Hélio Côrtes Vieira

doi:10.1007/s10506-019-09256-x

Appellate Court Modifications Extraction for Portuguese

Original Research
Published: 12 July 2019

Volume 28, pages 327–360, (2020)
Cite this article

Artificial Intelligence and Law Aims and scope Submit manuscript

William Paulo Ducca Fernandes ORCID: orcid.org/0000-0001-7692-9666¹,
Luiz José Schirmer Silva¹,
Isabella Zalcberg Frajhof¹,
Guilherme da Franca Couto Fernandes de Almeida¹,
Carlos Nelson Konder¹,
Rafael Barbosa Nasser¹,
Gustavo Robichez de Carvalho¹,
Simone Diniz Junqueira Barbosa¹ &
…
Hélio Côrtes Vieira Lopes¹

802 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

Appellate Court Modifications Extraction consists of, given an Appellate Court decision, identifying the proposed modifications by the upper Court of the lower Court judge’s decision. In this work, we propose a system to extract Appellate Court Modifications for Portuguese. Information extraction for legal texts has been previously addressed using different techniques and for several languages. Our proposal differs from previous work in two ways: (1) our corpus is composed of Brazilian Appellate Court decisions, in which we look for a set of modifications provided by the Court; and (2) to automatically extract the modifications, we use a traditional Machine Learning approach and a Deep Learning approach, both as alternative solutions and as a combined solution. We tackle the Appellate Court Modifications Extraction task, experimenting with a wide variety of methods. In order to train and evaluate the system, we have built the KauaneJunior corpus, using public data disclosed by the Appellate State Court of Rio de Janeiro jurisprudence database. Our best method, which is a Bidirectional Long Short-Term Memory network combined with Conditional Random Fields, obtained an \(F_{\beta = 1}\) score of 94.79%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Preserving the rule of law in the era of artificial intelligence (AI)

Article Open access 17 July 2021

Stanley Greenstein

Information extraction from electronic medical documents: state of the art and future research directions

Article 08 November 2022

Mohamed Yassine Landolsi, Lobna Hlaoua & Lotfi Ben Romdhane

Code is law: how COMPAS affects the way the judiciary handles the risk of recidivism

Article Open access 09 February 2024

Christoph Engel, Lorenz Linhardt & Marcel Schubert

Notes

There are recent works discussing Brazil’s Supreme Court’s (Supremo Tribunal Federal) prominent role in describing the contemporary Brazilian judiciary system. One of these papers offers a brief description of judicial review in Brazilian law, which the reader may find useful to verify some of the claims made here. See da Silva (2013).
On the importance of previous judicial decisions as a constraint on legal decision-making, see Marmor (2012).
We used an in-house BILSTM-based tagger trained for Portuguese.
https://gate.ac.uk/sale/tao/splitch8.html.
https://gate.ac.uk/.
http://legislation.di.uoa.gr.
https://www.rechtspraak.nl.
http://iate.europa.eu.
We used a BILSTM based tagger trained for Portuguese.

References

Angelidis I, Chalkidis I, Koubarakis M (2018) Named entity recognition, linking and generation for greek legislation. In: JURIX, pp 1–10
Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw 5(2):157–166
Article Google Scholar
Bevan R, Torrisi A, Atkinson K, Bollegala D, Coenen F (2018) Efficient and effective case reject-accept filtering: a study using machine learning. In: JURIX, pp 171–175
Boella G, Di Caro L, Ruggeri A, Robaldo L (2014) Learning from syntax generalizations for automatic semantic annotation. J Intell Inf Syst 43(2):231–246
Article Google Scholar
Brasil (2008) Código civil brasileiro e legislação correlata, 2nd edn. Senado Federal, Subsecretaria de Edições Técnicas, [Online]. Accessed 28 Sept 2018
Carletta J (1996) Assessing agreement on classification tasks: the kappa statistic. Comput. Linguist. 22(2):249–254
Google Scholar
Chalkidis I, Androutsopoulos I (2017) A deep learning approach to contract element extraction. In: JURIX, pp 155–164
Chalkidis I, Androutsopoulos I, Michos A (2017) Extracting contract elements. In: Proceedings of the 16th edition of the international conference on artificial intelligence and law. ACM, pp 19–28
Chiarcos C (2012) Powla: modeling linguistic corpora in OWL/DL. In: Extended semantic web conference. Springer, pp 225–239
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:14061078
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:14123555
Cohn T, Blunsom P (2005) Semantic role labelling with tree conditional random fields. In: Proceedings of the ninth conference on computational natural language learning. Association for Computational Linguistics, pp 169–172
Curran JR, Clark S, Bos J (2007) Linguistically motivated large-scale NLP with c&c and boxer. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions. Association for Computational Linguistics, pp 33–36
da Silva VA (2013) Deciding without deliberating. Int J Const Law 11(3):557–584
Google Scholar
de Araujo DA, Rigo SJ, Barbosa JLV (2017) Ontology-based information extraction for juridical events with case studies in Brazilian legal realm. Artif Intell Law 25(4):379–396
Article Google Scholar
de Justiça CN (2017) Justiça em números 2017: ano-base 2016. http://www.cnj.jus.br/files/conteudo/arquivo/2017/12/b60a659e5d5cb79337945c1dd137496c.pdf/, [Online]. Accessed 14 Apr 2018
Dozat T, Manning CD (2017) Deep biaffine attention for neural dependency parsing. In: Proceedings of the 5th international conference on learning representations
Dragoni M, Villata S, Rizzi W, Governatori G (2016) Combining NLP approaches for rule extraction from legal documents. In: 1st workshop on mining and reasoning with legal texts (MIREL 2016)
Dyer C, Ballesteros M, Ling W, Matthews A, Smith NA (2015) Transition-based dependency parsing with stack long short-term memory. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (volume 1: long papers), no. 1, pp 334–343
Fillmore CJ (1977) Scenes-and-frames semantics. Linguist Struct Process 59:55–88
Google Scholar
García-Constantino M, Atkinson K, Bollegala D, Chapman K, Coenen F, Roberts C, Robson K (2017) CLIEL: context-based information extraction from commercial law documents. In: Proceedings of the 16th edition of the international conference on artificial intelligence and law. ACM, pp 79–87
Gianfelice D, Lesmo L, Palmirani M, Perlo D, Radicioni DP (2013) Modificatory provisions detection: a hybrid NLP approach. In: Proceedings of the fourteenth international conference on artificial intelligence and law. ACM, pp 43–52
Gimpel K, Schneider N, O’Connor B, Das D, Mills D, Eisenstein J, Heilman M, Yogatama D, Flanigan J, Smith NA (2011) Part-of-speech tagging for twitter: annotation, features, and experiments. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies: short papers, vol 2. Association for Computational Linguistics, pp 42–47
Gonçalves CR (2018) Direito civil brasileiro v. 6–Responsabilidade civil. Editora Saraiva
Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural Netw. 18(5–6):602–610
Article Google Scholar
Graves A, Mohamed Ar, Hinton G (2013) Speech recognition with deep recurrent neural networks. In: 2013 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 6645–6649
Hermann KM, Kočiský T, Grefenstette E, Espeholt L, Kay W, Suleyman M, Blunsom P (2015) Teaching machines to read and comprehend. In: Advances in neural information processing systems, pp 1693–1701
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article Google Scholar
Huang Z, Xu W, Yu K (2015) Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:150801991
Klein D, Manning CD (2003) Accurate unlexicalized parsing. In: Proceedings of the 41st annual meeting of the association for computational linguistics
Lafferty J, McCallum A, Pereira FC (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, pp 282–289
Ma X, Hovy E (2016) End-to-end sequence labeling via bi-directional LSTM–CNNS–CRF. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers), no. 1, pp 1064–1074
Marmor A (2012) The Routledge companion to philosophy of law. Routledge, London
Book Google Scholar
McCallum A, Li W (2003) Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of the seventh conference on natural language learning at HLT-NAACL 2003, vol 4. Association for Computational Linguistics, pp 188–191
Mikolov T, Chen K, Corrado G, Dean J (2013a) Efficient estimation of word representations in vector space. arXiv preprint arXiv:13013781
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013b) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26, pp 3111–3119
Nanda R, John AK, Caro LD, Boella G, Robaldo L (2017a) Legal information retrieval using topic clustering and neural networks. In: COLIEE@ ICAIL, pp 68–78
Nanda R, Siragusa G, Di Caro L, Theobald M, Boella G, Robaldo L, Costamagna F (2017b) Concept recognition in European and national law. In: JURIX, pp 193–198
Nguyen TS, Nguyen LM, Tojo S, Satoh K, Shimazu A (2018) Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts. Artif Intell Law 26(2):169–199
Article Google Scholar
Peters ME, Ammar W, Bhagavatula C, Power R (2017) Semi-supervised sequence tagging with bidirectional language models. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume 1: long papers), no. 1, pp 1756–1765
Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer LS (2018b) Deep contextualized word representations. In: NAACL HLT 2018: 16th annual conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1, pp 2227–2237
Peters ME, Neumann M, Zettlemoyer L, Yih WT (2018a) Dissecting contextual word embeddings: architecture and representation. arXiv preprint arXiv:1808.08949
Ramshaw L, Marcus M (1995) Text chunking using transformation-based learning. In: Proceedings of the 3rd ACL workshop on very large Corpora, pp 82–94
Sha F, Pereira F (2003) Shallow parsing with conditional random fields. In: Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology, vol 1. Association for Computational Linguistics, pp 134–141
Strubell E, Verga P, Belanger D, McCallum A (2017) Fast and accurate entity recognition with iterated dilated convolutions. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 2670–2680
Trompper M, Winkels R (2016) Automatic assignment of section structure to texts of Dutch court judgments. In: JURIX, pp 167–172
Wen TH, Gasic M, Mrkšić N, Su PH, Vandyke D, Young S (2015) Semantically conditioned LSTM-based natural language generation for spoken dialogue systems. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 1711–1721. 10.18653/v1/D15-1199, http://www.aclweb.org/anthology/D15-1199
Yang Z, Yang D, Dyer C, He X, Smola A, Hovy E (2016) Hierarchical attention networks for document classification. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1480–1489

Download references

Acknowledgements

Funding was provided by “Conselho Nacional de Desenvolvimento Científico e Tecnológico” and “Coordenação de Aperfeiçoamento de Pessoal de Nível Superior”.

Author information

Authors and Affiliations

Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio), Rua Marquês de São Vicente, 225 - Gávea, Rio de Janeiro, RJ, 22451-045, Brazil
William Paulo Ducca Fernandes, Luiz José Schirmer Silva, Isabella Zalcberg Frajhof, Guilherme da Franca Couto Fernandes de Almeida, Carlos Nelson Konder, Rafael Barbosa Nasser, Gustavo Robichez de Carvalho, Simone Diniz Junqueira Barbosa & Hélio Côrtes Vieira Lopes

Authors

William Paulo Ducca Fernandes
View author publications
You can also search for this author in PubMed Google Scholar
Luiz José Schirmer Silva
View author publications
You can also search for this author in PubMed Google Scholar
Isabella Zalcberg Frajhof
View author publications
You can also search for this author in PubMed Google Scholar
Guilherme da Franca Couto Fernandes de Almeida
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Nelson Konder
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Barbosa Nasser
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo Robichez de Carvalho
View author publications
You can also search for this author in PubMed Google Scholar
Simone Diniz Junqueira Barbosa
View author publications
You can also search for this author in PubMed Google Scholar
Hélio Côrtes Vieira Lopes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to William Paulo Ducca Fernandes.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fernandes, W.P.D., Silva, L.J.S., Frajhof, I.Z. et al. Appellate Court Modifications Extraction for Portuguese. Artif Intell Law 28, 327–360 (2020). https://doi.org/10.1007/s10506-019-09256-x

Download citation

Published: 12 July 2019
Issue Date: September 2020
DOI: https://doi.org/10.1007/s10506-019-09256-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Appellate Court Modifications Extraction for Portuguese

Abstract

Access this article

Similar content being viewed by others

Preserving the rule of law in the era of artificial intelligence (AI)

Information extraction from electronic medical documents: state of the art and future research directions

Code is law: how COMPAS affects the way the judiciary handles the risk of recidivism

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Appellate Court Modifications Extraction for Portuguese

Abstract

Access this article

Similar content being viewed by others

Preserving the rule of law in the era of artificial intelligence (AI)

Information extraction from electronic medical documents: state of the art and future research directions

Code is law: how COMPAS affects the way the judiciary handles the risk of recidivism

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation