Abstract
Extracting related or exactly the same data from such a great amount of available data is considered a tedious task. After which relating it to the query and the processing it based on the input query that to varying with the user sounds superficial. But information retrieval systems (IRS), chatbots, and question-answering (QA) systems have developed extensively through the decade. Natural language understanding (NLU) and natural language processing (NLP) are the techniques which made it easy for computers to interpret and process the high-level human query language. Through this paper we suggest a model through comparative analysis of bagOfwords (BOW), Word2Vec, TF-IDF, and BERT for selecting an appropriate IPC section based on the input text provided by the user. The proposed approach is based on syntactical analysis followed by semantic analysis. After performing semantic analysis, we perform feature extraction followed by text classification and categorization based on the resemblance with the dataset of the IPC sections. We worked on different models to test the accuracy and efficiency, for bagOfwords (BOW), Word2Vec, TF-IDF, and BERT.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
M.K. Dhami, I. Belton, Statistical analyses of court decisions: the example of multilevel models of sentencing. Law Method 10, 247–266 (2016)
M. Dymitruk, Ethical artificial intelligence in judiciary. Research Gate (2019)
O. Koshorek, A. Cohen, N. Mor, M. Rotman, J. Berant, Text segmentation supervised learning task, in Proceedings of NAACL-HLT, 2018
G. Veena, D. Gupta, A. Anil, S. Akhil, An ontology driven question answering system for legal documents, in 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, 2019
T. Tengvall, A method for automatic question answering in Swedish based on BERT
G. Veena, D. Gupta, A. Anil, S. Akhil, An ontology driven question answering system for legal documents, in 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019; 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICSSIT 2020
M.-F.M. Raquel Mochales, Study on the structure of argumentation in case law. Research Gate (2008)
M.M. Masha Medvedeva, Using machine learning to predict decisions of the European court of human rights (Springer, 2020)
D.M. Katz, M.J. Bommarito, J. Blackman, A general approach for predicting the behavior of the supreme court of the United States. PloS ONE 12(4), e0174698 (2017)
S. Qaiser, R. Ali, Text mining: use of TF-IDF to examine the relevance of words to documents
C. Liu, Y. Sheng, Research of text classification based on improved TF-IDF algorithm (2018)
C. Mingtsung, L. Shuling, Research on the application of artificial intelligence technology in the field of justice, in ICAACE 2020 (2020)
S. Kayalvizhi, D. Thenmozhi, C. Aravindan, Legal assistance using word embeddings (2019)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Harode, B., Prajapat, S., Bhurre, S. (2022). Text Processor for IPC Prediction. In: Singh, P.K., Singh, Y., Kolekar, M.H., Kar, A.K., Gonçalves, P.J.S. (eds) Recent Innovations in Computing. Lecture Notes in Electrical Engineering, vol 832. Springer, Singapore. https://doi.org/10.1007/978-981-16-8248-3_9
Download citation
DOI: https://doi.org/10.1007/978-981-16-8248-3_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8247-6
Online ISBN: 978-981-16-8248-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)