Abstract
This paper investigates automatic query generation from legal decisions, along with contributing a test collection for the evaluation of case law retrieval. For a sentence or paragraph within a legal decision that cites another decision, queries were automatically generated from a proportion of the terms in that sentence or paragraph. Manually generated queries were also created as a ground to empirically compare automatic methods. Automatically generated queries were found to be more effective than the average Boolean queries from experts. However, the best keyword and Boolean queries from experts significantly outperformed automatic queries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The doctrine of precedent requires, broadly speaking, that like circumstances are considered in a like fashion; a case that considers a certain set of factual circumstances therefore must be followed for any future circumstances that are analogous.
- 2.
The obligation of parties to litigation to disclose all documents relevant to issues between them.
- 3.
A keynumber system of categorised areas and subareas of law. Areas of law can be searched or browsed by number.
- 4.
Decisions were downloaded from http://courtlistener.com.
- 5.
A statement by the Court as to whether it would grant review of a lower court’s decision.
- 6.
- 7.
\(\lambda \) is responsible for smoothing between the background language model (the legal collection), and the foreground language model (the sentence or paragraph).
References
Bailey, P., Moffat, A., Scholer, F., Thomas, P.: User variability and ir system evaluation. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 625–634. ACM (2015)
Baron, J.R., Lewis, D.D., Oard, D.W.: Trec 2006 legal track overview. In: The Fifteenth Text REtrieval Conference (TREC 2006) Proceedings (2006)
Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 491–498. ACM (2008)
Galgani, F., Compton, P., Hoffmann, A.: Combining different summarization techniques for legal text. In: Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, pp. 115–123. Association for Computational Linguistics (2012)
Grabmair, M., Ashley, K.D., Chen, R., Sureshkumar, P., Wang, C., Nyberg, E., Walker, V.R.: Introducing luima: an experiment in legal conceptual retrieval of vaccine injury decisions using a uima type system and tools. In: Proceedings of the 15th International Conference on Artificial Intelligence and Law, ICAIL 2015, pp. 69–78 (2015)
Hiemstra, D., Robertson, S., Zaragoza, H.: Parsimonious language models for information retrieval. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 178–185. ACM (2004)
Kim, M.-Y., Xu, Y., Lu, Y., Goebel, R.: Legal question answering using paraphrasing and entailment analysis. In: Tenth International Workshop on Juris-Informatics (JURISIN) (2016)
Koniaris, M., Anagnostopoulos, I., Vassiliou, Y.: Multi-dimension diversification in legal information retrieval. In: Cellary, W., Mokbel, M.F., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2016. LNCS, vol. 10041, pp. 174–189. Springer, Cham (2016). doi:10.1007/978-3-319-48740-3_12
Koniaris, M., Anagnostopoulos, I., Vassiliou, Y.: Evaluation of diversification techniques for legal information retrieval. Algorithms 10(1), 22 (2017)
Koopman, B., Cripwell, L., Zuccon, G.: Generating clinical queries from patient narratives. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (to appear, 2017)
Kumaran, G., Carvalho, V.R.: Reducing long queries using query quality predictors. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 564–571. ACM (2009)
Lastres, S.A.: Rebooting legal research in a digital age. Technical report, LexisNexis (2013)
Peñas, A., et al.: Overview of ResPubliQA 2009: question answering evaluation over european legislation. In: Peters, C., Di Nunzio, G.M., Kurimo, M., Mandl, T., Mostefa, D., Peñas, A., Roda, G. (eds.) CLEF 2009. LNCS, vol. 6241, pp. 174–196. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15754-7_21
Poje, J.: Legal research. American Bar Association Techreport 2014 (2014)
Salton, G.: Automatic Information Organization and Retrieval. McGraw Hill Text, New York (1968)
Schweighofer, E., Geist, A.: Legal query expansion using ontologies and relevance feedback. In: CEUR Workshop Proceedings, vol. 321, pp. 149–160 (2007)
Tomokiyo, T., Hurst, M.: A language model approach to keyphrase extraction. In: Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, vol. 18, pp. 33–40. Association for Computational Linguistics (2003)
Turtle, H.: Natural language vs. boolean query evaluation: a comparison of retrieval performance. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 212–220 (1994)
Turtle, H.: Text retrieval in the legal world. Artif. Intell. Law 3(1), 5–54 (1995)
van Opijnen, M.: Citation analysis and beyond: in search of indicators measuring case law importance. In: JURIX, vol. 250, pp. 95–104 (2012)
Verberne, S., Sappelli, M., Hiemstra, D., Kraaij, W.: Evaluation and analysis of term scoring methods for term extraction. Inform. Retrieval J. 19(5), 510–545 (2016)
Zuccon, G., Palotti, J., Hanbury, A.: Query variations and their effect on comparing information retrieval systems. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp. 691–700. ACM (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Locke, D., Zuccon, G., Scells, H. (2017). Automatic Query Generation from Legal Texts for Case Law Retrieval. In: Sung, WK., et al. Information Retrieval Technology. AIRS 2017. Lecture Notes in Computer Science(), vol 10648. Springer, Cham. https://doi.org/10.1007/978-3-319-70145-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-70145-5_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70144-8
Online ISBN: 978-3-319-70145-5
eBook Packages: Computer ScienceComputer Science (R0)