Skip to main content

Advertisement

Log in

Ontology-driven relational data mapping for constructing a knowledge graph of porphyry copper deposits

  • Research
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Geoscience knowledge graph has become a popular topic in recent years. A series of studies have been reported to introduce the construction and application of geoscience knowledge graphs from different views. The relational geoscience dataset with high knowledge density and data quality is an important digital heritage of geoscience. The relational dataset has not been taken seriously in the geoscience knowledge graph research. In this study, we proposed a quick method of building a geoscience knowledge graph using relational data mapping to triples. First, the use-case-driven method was applied to design the ontology of porphyry copper deposits. Second, the mapping rules were built based on the porphyry copper ontology. Third, the knowledge graph of the porphyry copper deposit was constructed based on relational data mapping and knowledge fusion. Based on the resulting knowledge graph, several exploratory cases were conducted to make knowledge reasoning and discovery. It is indicated that the solution proposed in this study is a fast batch-processing geoscience knowledge graph construction method. The experiences from this study can benefit the construction of knowledge graphs in other geoscience disciplines and promote knowledge discovery.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The dataset of porphyry copper deposits can be obtained from US Geological Survey (https://mrdata.usgs.gov/porcu/). The file of mapping rules is available on GitHub at https://github.com/wangcug/DataMapping.

Code availability

The mapping rules file is available on GitHub at https://github.com/wangcug/DataMapping.

Notes

  1. http://d2rq.org/d2rq-language#propertybridge.

  2. https://mrdata.usgs.gov/porcu/.

References

  • Bergen KJ, Johnson PA, de Hoop MV, Beroza GC (2019) Machine learning for data-driven discovery in solid Earth geoscience. Science 363(6433):eaau0323. https://doi.org/10.1126/science.aau0323

    Article  CAS  Google Scholar 

  • Cerans K, Būmans G (2015) RDB2OWL: A Language and Tool for Database to Ontology Mapping. (Paper presented at the CAISE 2015 Forum)

  • Chen Q, Yao H, Li S, Li X, Kang X, Lai W, Kuang J (2023) Fact-condition statements and super relation extraction for geothermic knowledge graphs construction. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101412

  • Chhaya P, Lee K-H, Shin K-s, Choi C-H, Cho W-S, Lee Y-S (2016) ‘Using D2RQ and Ontop to publish relational database as Linked Data’ 2016 Eighth International Conference on Ubiquitous and Future Networks (ICUFN). IEEE, pp. 694–698

  • Cox SJ, Richard S (2015) A geologic timescale ontology and service. Earth Sci Inf 8:5–19

    Article  Google Scholar 

  • Devi R, Singh R, Singh VP (2018a) Comparative study of RDB to RDF Mapping using D2RQ and R2RML mapping languages. Int J Inform Sci Application 10(1):23–36

    Google Scholar 

  • Devi R, Singh R, Singh VP (2018b) Comparative study of RDB to RDF Mapping using D2RQ and R2RML mapping languages. Int J Inform Sci Application 10(1):23–26

    Google Scholar 

  • Enkhsaikhan M (2021) Geological knowledge graph construction from Mineral Exploration text. The University of Western Australia

  • Fan R, Wang L, Yan J, Song W, Zhu Y, Chen X (2019) Deep learning-based named Entity Recognition and Knowledge Graph Construction for Geological hazards. ISPRS Int J Geo-Information 9(1). https://doi.org/10.3390/ijgi9010015

  • Fensel D, Şimşek U, Angele K, Huaman E, Kärle E, Panasiuk O et al (2020) Introduction: what is a knowledge graph? Knowl Graphs 1–10. https://doi.org/10.1007/978-3-030-37439-6_1

  • Gil Y, Pierce SA, Babaie H, Banerjee A, Borne K, Bust G et al (2018) Intelligent systems for geosciences. Commun ACM 62(1):76–84. https://doi.org/10.1145/3192335

    Article  Google Scholar 

  • Hu X, Ma X, Ma C et al (2023a) The geoscience knowledge system, ontology and knowledge graph for data-driven discovery: Preface. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2023.101592

  • Hu X, Xu Y, Ma X, Yunqiang Z, Chao M, Chao L et al (2023b) Knowledge System, Ontology, and knowledge graph of the Deep-Time Digital Earth (DDE): Progress and Perspective. J Earth Sci 34(5):1323–1327. https://doi.org/10.1007/s12583-023-1930-1

    Article  Google Scholar 

  • Husson J, Peters S, Ross I, Czaplewski J (2016) (2016) Macrostrat and GeoDeepDive: A Platform for Geological Data Integration and Deep-Time Research, AGU Fall Meeting Abstracts. pp. IN23F-04

  • Jaccard P (1912) The distribution of the flora in the alpine zone. New Phytol 11(2):37–50

    Article  Google Scholar 

  • Koskela R, Ramamurthy M, Pearlman J, Lehnert K, Ahern T, Fredericks J et al (2017) Earthcube: A community-driven cyberinfrastructure for the geosciences, EGU General Assembly Conference Abstracts. p. 5884

  • Kumar Gond A, Dey S, Zong K, Liu Y, Anand R, Mitra A, Mitra A (2023) A better understanding of Archean crustal evolution: exploring the sedimentary archive of the Singhbhum Craton, eastern India. J Asian Earth Sci 251. https://doi.org/10.1016/j.jseaes.2023.105630

  • Li S, Chen J, Liu C, Wang Y (2021) Mineral Prospectivity Prediction via Convolutional neural networks based on geological Big Data. J Earth Sci 32(2):327–347. https://doi.org/10.1007/s12583-020-1365-z

    Article  Google Scholar 

  • Lv X, Xie Z, Xu D, Jin X, Ma K, Tao L et al (2022) Chinese Named Entity Recognition in the Geoscience Domain based on BERT. Earth Space Sci 9(3). https://doi.org/10.1029/2021ea002166

  • Ma X (2022) Knowledge graph construction and application in geosciences: a review. Comput Geosci 161:105082. https://doi.org/10.1016/j.cageo.2022.105082

    Article  Google Scholar 

  • Ma X, Ma C, Wang C (2020) A new structure for representing and tracking version information in a deep time knowledge graph. Comput Geosci 145:104620

    Article  Google Scholar 

  • Ma C, Morrison SM, Muscente AD, Wang C, Ma X (2022) Incorporate temporal topology in a deep-time knowledge base to facilitate data‐driven discovery in geoscience. Geosci Data J. https://doi.org/10.1002/gdj3.171

    Article  Google Scholar 

  • Ma C, Kale AS, Zhang J, Ma X (2023) A knowledge graph and service for regional geologic time standards. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101453

  • Michel F, Montagnat J, Zucker CF (2013) ‘A survey of RDB to RDF translation approaches and tools’. https://hal.archives-ouvertes.fr/hal-00903568v1

  • Normile D (2019) Earth scientists plan a ‘geological Google’. Science 363(6430):917. https://doi.org/10.1126/science.363.6430.917

    Article  CAS  Google Scholar 

  • Parsons MA, Duerr R, Godøy Ø (2023) The evolution of a geoscience standard: an instructive tale of science keyword development and adoption. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101400

  • Peters SE, Husson JM, Wilcots J (2017) The rise and fall of stromatolites in shallow marine environments. Geology 45(6):487–490. https://doi.org/10.1130/g38931.1

    Article  Google Scholar 

  • Qiu Q, Xie Z, Wu L, Tao L (2019a) GNER: a generative model for geological named entity recognition without labeled data using deep learning. Earth Space Sci 6(6):931–946. https://doi.org/10.1029/2019ea000610

    Article  Google Scholar 

  • Qiu Q, Xie Z, Wu L, Tao L, Li W (2019b) BiLSTM-CRF for geological named entity recognition from the geoscience literature. Earth Sci Inf 12(4):565–579. https://doi.org/10.1007/s12145-019-00390-3

    Article  Google Scholar 

  • Qiu Q, Ma K, Lv H, Tao L, Xie Z (2023a) Construction and application of a knowledge graph for iron deposits using text mining analytics and a deep learning algorithm. Math Geosci 55(3):423–456

    Article  Google Scholar 

  • Qiu Q, Wang B, Ma K, Lü H, Tao L, Xie Z (2023b) A practical Approach to constructing a geological knowledge graph: a case study of Mineral Exploration Data. J Earth Sci 34(5):1374–1389. https://doi.org/10.1007/s12583-023-1809-3

    Article  Google Scholar 

  • Reichstein M, Camps-Valls G, Stevens B, Jung M, Denzler J, Carvalhais N, Prabhat (2019) Deep learning and process understanding for data-driven Earth system science. Nature 566(7743):195–204. https://doi.org/10.1038/s41586-019-0912-1

    Article  CAS  Google Scholar 

  • Tang X, Feng Z, Xiao Y, Wang M, Ye T, Zhou Y et al (2023) Construction and application of an ontology-based domain-specific knowledge graph for petroleum exploration and development. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101426

  • Wang C, Ma X, Chen J (2018a) Ontology-driven data integration and visualization for exploring regional geologic time and paleontological information. Comput Geosci 115:12–19. https://doi.org/10.1016/j.cageo.2018.03.004

    Article  Google Scholar 

  • Wang C, Ma X, Chen J, Chen J (2018b) Information extraction and knowledge graph construction from geoscience literature. Comput Geosci 112:112–120

    Article  Google Scholar 

  • Wang C, Hazen RM, Cheng Q, Stephenson MH, Zhou C, Fox P et al (2021) The deep-time Digital Earth program: data-driven discovery in geosciences. Natl Sci Rev 8(9):nwab027

    Article  CAS  Google Scholar 

  • Wang C, Li Y, Chen J (2023a) Text mining and knowledge graph construction from geoscience literature legacy: A review. In X. Ma, M. Mookerjee, L. Hsu, & D. Hills (Eds.), Recent Advancement in Geoinformatics and Data Science (pp. 11–28). Geological Society of America. https://doi.org/10.1130/2022.2558(02)

  • Wang C, Li Y, Chen j, Ma X (2023b) Named entity annotation schema for geological literature mining in the domain of porphyry copper deposits. Ore Geol Rev 152:105243. https://doi.org/10.1016/j.oregeorev.2022.105243

    Article  Google Scholar 

  • Wang S, Zhu Y, Qi Y, Hou Z, Sun K, Li W et al (2023c) A unified framework of temporal information expression in geosciences knowledge system. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101465

  • Xu H, Zhao Y, Huang H, Dong S, Shi Y, Huang C et al (2023) A comprehensive construction of the domain ontology for stratigraphy. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101461

  • Yu C, Zhang L, Hou M, Yang J, Zhong H, Wang C (2023) Climate paleogeography knowledge graph and deep time paleoclimate classifications. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101450

  • Zhang C (2015) DeepDive: a data management system for automatic knowledge base construction. The University of Wisconsin-Madison

  • Zhou X-G, Gong R-B, Shi F-G, Wang Z-F (2020) PetroKG: construction and application of knowledge graph in Upstream Area of PetroChina. J Comput Sci Technol 35(2):368–378. https://doi.org/10.1007/s11390-020-9966-7

    Article  Google Scholar 

  • Zhou C, Wang H, Wang C, Hou Z, Zheng Z, Shen S et al (2021) Geoscience knowledge graph in the big data era. Sci China Earth Sci 64(7):1105–1114. https://doi.org/10.1007/s11430-020-9750-4

    Article  Google Scholar 

  • Zhu Y, Zhou W, Xu Y, Liu J, Tan Y (2017) Intelligent Learning for Knowledge Graph towards Geological Data. Sci Program 2017:1–13. https://doi.org/10.1155/2017/5072427

    Article  Google Scholar 

Download references

Funding

This study was funded by the National Key R&D Program of China (2022YFF0801202, 2022YFF0801200), National Natural Science Foundation of China (41902305), Knowledge Innovation Program of Wuhan-Shuguang (2023010201020332).

Author information

Authors and Affiliations

Authors

Contributions

ChengbinWang: writing-original draft, conceptualization, data validation, writing-review and editing, project administration, funding acquisition. Liangquan Tan: building the mapping rules. Yuanjun Li: knowledge graph visualization and application. Mingguo Wang: writing-review and editing. Xiaogang Ma: writing-review and editing, data validation. Jianguo Chen: funding acquisition, data validation, writing-review, and editing.

Corresponding author

Correspondence to Chengbin Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Communicated by H. Babaie.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, C., Tan, L., Li, Y. et al. Ontology-driven relational data mapping for constructing a knowledge graph of porphyry copper deposits. Earth Sci Inform (2024). https://doi.org/10.1007/s12145-024-01307-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12145-024-01307-5

Keywords

Navigation