Skip to main content

Building Legal Knowledge Map Repository with NLP Toolkits

  • Conference paper
  • First Online:
The 12th Conference on Information Technology and Its Applications (CITA 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 734))

Included in the following conference series:


Today, the legal document system is increasingly strict with different levels of influence and affects activities in many different fields. The increasing number of legal documents interwoven with each other also leads to difficulties in searching and applying in practice. The construction of knowledge maps that involve one or a group of legal documents is an effective approach to represent actual knowledge domains. A legal knowledge graph constructed from laws and legal documents can enable a number of applications, such as question answering, document similarity, and search. In this paper, we describe the process of building a system of knowledge maps for the Vietnamese legal system from the source of about 325,000 legal documents that span all fields of social life. This study also proposes an integrated ontology to represent the legal knowledge from legal documents. This model integrates the ontology of relational knowledge and the graph of key phrases and entities in the form of a concept graph. It can express the semantics of the content of a given legal document. In addition, this study also describes the process of building and exploiting natural language processing tools to build a VLegalKMaps system, which is a repository of Vietnamese legal knowledge maps. We also highlight open challenges in the realization of knowledge graphs in a technical legal system that enables this approach at scale.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions


  1. 1.

  2. 2.

  3. 3.

  4. 4.

  5. 5.

  6. 6.

  7. 7.

  8. 8.

  9. 9.

  10. 10.

  11. 11.

  12. 12.

    Law Code 23/2008/QH12: Vietnam Law of Road Traffic 2008.


  1. Altinok, D.: Mastering SpaCy: An End-to-End Practical Guide to Implementing NLP Applications Using the Python Ecosystem. Packt Publishing Ltd. (2021)

    Google Scholar 

  2. Assembly, V.N.: Law on Road Traffic, Law No. 23/2008/QH12 (2008)

    Google Scholar 

  3. Assembly, V.N.: Law on Promulgation of Legislative Documents, No. 80/2015/QH13 (2015)

    Google Scholar 

  4. Constantin, A., Peroni, S., Pettifer, S., Shotton, D., Vitali, F.: The document components ontology (DoCo). Semant. Web 7(2), 167–181 (2016)

    Article  Google Scholar 

  5. Crotti Junior, A., et al.: Knowledge graph-based legal search over German court cases. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12124, pp. 293–297. Springer, Cham (2020).

    Chapter  Google Scholar 

  6. Dang, D., Nguyen, H., Ngo, H.E.A.: Information retrieval from legal documents with ontology and graph embeddings approach. In: 36th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE 2023), Shanghai, China (2023, in press)

    Google Scholar 

  7. Dhani, J.S., Bhatt, R., Ganesan, B., Sirohi, P., Bhatnagar, V.: Similar cases recommendation using legal knowledge graphs. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2021)

    Google Scholar 

  8. Filtz, E., Kirrane, S., Polleres, A.: Interlinking legal data. In: SEMANTiCS Posters & Demos (2018)

    Google Scholar 

  9. Vietnam Government: Decree on Administrative penalties for road traffic and rail transport offences, No. 100/2019/ND-CP (2019)

    Google Scholar 

  10. Kaltenboeck, M., Boil, P., Verhoeven, P., Sageder, C., Montiel-Ponsoda, E., Calleja-Ibáñez, P.: Using a legal knowledge graph for multilingual compliance services in labor law, contract management, and geothermal energy. In: Curry, E., Auer, S., Berre, A.J., Metzger, A., Perez, M.S., Zillner, S. (eds.) Technologies and Applications for Big Data Value, pp. 253–271. Springer, Cham (2022).

  11. Lettieri, N.: Knowledge machineries. Introducing the instrument-enabled future of legal research and practice. In: Knowledge of the Law in the Big Data Age, pp. 10–23. IOS Press (2019)

    Google Scholar 

  12. Montiel-Ponsoda, E., Rodríguez-Doncel, V.: Lynx: building the legal knowledge graph for smart compliance services in multilingual Europe. In: Proceedings of the 1st Workshop on LREC (Language Resources and Technologies for the Legal Knowledge Graph) Workshop, pp. 19–22 (2018)

    Google Scholar 

  13. Ngo, Q.H., Dien, D., Winiwarter, W.: Building English-Vietnamese named entity corpus with aligned bilingual news articles. In: Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing, pp. 85–93 (2014)

    Google Scholar 

  14. Ngo, Q.H., Kechadi, T., Le-Khac, N.-A.: OAK: ontology-based knowledge map model for digital agriculture. In: Dang, T.K., Küng, J., Takizawa, M., Chung, T.M. (eds.) FDSE 2020. LNCS, vol. 12466, pp. 245–259. Springer, Cham (2020).

    Chapter  Google Scholar 

  15. Ngo, Q.H., Kechadi, T., Le-Khac, N.A.: Knowledge representation in digital agriculture: a step towards standardised model. Comput. Electron. Agric. 199, 107127 (2022)

    Article  Google Scholar 

  16. Nguyen, H.D., Pham, V.T., Le, T.T., Tran, D.H.: A mathematical approach for representing knowledge about relations and its application. In: Proceedings of 7th International Conference on Knowledge and Systems Engineering (KSE 2015), pp. 324–327. IEEE (2015)

    Google Scholar 

  17. Nguyen, P.K.: How to conduct research in Vietnamese law: overview of the legal system of the socialist republic of Vietnam. Int. J. Leg. Inf. 27(3), 307–331 (1999)

    Article  Google Scholar 

  18. Nguyen, P.T., Vu, X.L., Nguyen, T.M.H., Le, H.P., et al.: Building a large syntactically-annotated corpus of Vietnamese. In: The Third Linguistic Annotation Workshop-The LAW III, p. 6p (2009)

    Google Scholar 

  19. Nguyen, T.H., Nguyen, H.D., Pham, V.T., Tran, D.A., Selamat, A.: Legal-Onto: an ontology-based model for representing the knowledge of a legal document. In: Proceedings of 17th Evaluation of Novel Approaches to Software Engineering (ENASE 2022), Online Streaming, pp. 426–434 (2022)

    Google Scholar 

  20. Pham, V.T., Nguyen, H.D., Le, T., et al.: Ontology-based solution for building an intelligent searching system on traffic law documents. In: Proceedings of 15th International Conference on Agents and Artificial Intelligence (ICAART 2023), Lisbon, Portugal, pp. 217–224 (2023)

    Google Scholar 

  21. Sovrano, F., Palmirani, M., Vitali, F.: Legal knowledge extraction for knowledge graph based question-answering. In: Legal Knowledge and Information Systems, pp. 143–153. IOS Press (2020)

    Google Scholar 

  22. Thelen, S.: The ontology-based approach of the publications office of the EU for document accessibility and open data services. In: Kő, A., Francesconi, E. (eds.) Electronic Government and the Information Systems Perspective: 4th International Conference, EGOVIS 2015, Valencia, Spain, 1–3 September 2015, Proceedings, vol. 9265, p. 29. Springer, Cham (2015).

  23. Wang, Z., et al.: IFlyLegal: a Chinese legal system for consultation, law searching, and document analysis. In: Proceedings of EMNLP-IJCNLP, pp. 97–102 (2019)

    Google Scholar 

  24. Yu, H., Li, H., et al.: A knowledge graph construction approach for legal domain. Tehnički vjesnik 28(2), 357–362 (2021)

    MathSciNet  Google Scholar 

Download references


This research is funded by Vietnam National University Ho Chi Minh City (VNU-HCM) under grant number DS2023-26-04.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Hien D. Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ngo, H.Q., Nguyen, H.D., Le-Khac, NA. (2023). Building Legal Knowledge Map Repository with NLP Toolkits. In: Nguyen, N.T., Le-Minh, H., Huynh, CP., Nguyen, QV. (eds) The 12th Conference on Information Technology and Its Applications. CITA 2023. Lecture Notes in Networks and Systems, vol 734. Springer, Cham.

Download citation

Publish with us

Policies and ethics