Skip to main content
Log in

Geoscience knowledge graph in the big data era

  • Research Paper
  • Published:
Science China Earth Sciences Aims and scope Submit manuscript

Abstract

Since the beginning of the 21st century, the geoscience research has been entering a significant transitional period with the establishment of a new knowledge system as the core and with the drive of big data as the means. It is a revolutionary leap in the research of geoscience knowledge discovery from the traditional encyclopedic discipline knowledge system to the computer-understandable and operable knowledge graph. Based on adopting the graph pattern of general knowledge representation, the geoscience knowledge graph expands the unique spatiotemporal features to the Geoscience knowledge, and integrates geoscience knowledge elements, such as map, text, and number, to establish an all-domain geoscience knowledge representation model. A federated, crowd intelligence-based collaborative method of constructing the geoscience knowledge graph is developed here, which realizes the construction of high-quality professional knowledge graph in collaboration with global geo-scientists. We also develop a method for constructing a dynamic knowledge graph of multi-modal geoscience data based on in-depth text analysis, which extracts geoscience knowledge from massive geoscience literature to construct the latest and most complete dynamic geoscience knowledge graph. A comprehensive and systematic geoscience knowledge graph can not only deepen the existing geoscience big data analysis, but also advance the construction of the high-precision geological time scale driven by big data, the compilation of intelligent maps driven by rules and data, and the geoscience knowledge evolution and reasoning analysis, among others. It will further expand the new directions of geoscience research driven by both data and knowledge, break new ground where geoscience, information science, and data science converge, realize the original innovation of the geoscience research and achieve major theoretical breakthroughs in the spatiotemporal big data research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ansari G A, Saha A, Kumar V, Bhambhani M, Sankaranarayanan K, Chakrabarti S. 2019. Neural program induction for KBQA without gold programs or query annotations. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence. Macao: AAAI Press. 4890–4896

    Google Scholar 

  • Aubry M P, Berggren W A, Van Couvering J A, Steininger F. 1999. Problems in chronostratigraphy: Stages, series, unit and boundary stratotypes, global stratotype section and point and tarnished golden spikes. Earth-Sci Rev, 46: 99–148

    Article  Google Scholar 

  • Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z. 2007. DBpedia: A nucleus for a web of open data. In: Aberer K, Choi K-S, Noy N, Allemang D, Lee K-I, Nixon L, Golbeck J, Mika P, Maynard D, Mizoguchi R, Schreiber G, Cudré-Mauroux P, eds. The Semantic Web. ISWC 2007, ASWC 2007. Lecture Notes in Computer Science, vol 4825. Berlin, Heidelberg: Springer. 722–735

    Google Scholar 

  • Ballatore A, Bertolotto M, Wilson D. 2015. A structural-lexical measure of semantic similarity for geo-knowledge graphs. ISPRS Int J Geo-Inform, 4: 471–492

    Article  Google Scholar 

  • Boyack K W, Klavans R, Börner K. 2005. Mapping the backbone of science. Scientometrics, 64: 351–374

    Article  Google Scholar 

  • Carlson A, Betteridge J, Kisiel B, Settles B, Hruschka Jr. E R, Mitchell T M. 2010. Toward an architecture for never-ending language learning. In: Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010. Atlanta, Georgia, USA, July 11–15, 2010. 1306–1313

  • Chah N. 2017. Freebase-triples: A methodology for processing the freebase data dumps

  • Chen J, Chen J. 2018. GlobeLand30: Operational global land cover mapping and big-data analysis. Sci China Earth Sci, 61: 1533–1534

    Article  Google Scholar 

  • Chen Y, Liu Z, Chen J, Hou J. 2008a. History and theory of mapping knowledge domains (in Chinese). Stud Sci Sci, 26: 449–460

    Google Scholar 

  • Chen Y, Zhang S, Peng X, Zhao W. 2008b. A collaborative ontology construction tool with conflicts detection, In: Fourth International Conference on Semantics, Knowledge and Grid. Los Alamitos: IEEE Computer Society. 12–19

    Chapter  Google Scholar 

  • Davydov V I. 2020. Shift in the paradigm for GSSP boundary definition. Gondwana Res, 86: 266–286

    Article  Google Scholar 

  • Dong X, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, Strohmann T, Sun S, Zhang W. 2014. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 601–610

  • Gangemi A, Presutti V, Catenacci C, Lehmann J, Nissim M. 2007. C-ODO: An OWL meta-model for collaborative ontology design. In: Proceedings of the Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at the 16th International World Wide Web Conference (WWW2007). Banff

  • Guo H D. 2017a. Big data drives the development of Earth science. Big Earth Data, 1: 1–3

    Article  Google Scholar 

  • Guo H D. 2017b. Big Earth data: A new frontier in Earth and information sciences. Big Earth Data, 1: 4–20

    Article  Google Scholar 

  • Guo H D, Wang L, Chen F, Liang D. 2014. Scientific big data and Digital Earth. Chin Sci Bull, 59: 5066–5073

    Article  Google Scholar 

  • Guo R, Ying S. 2017. The rejuvennation of cartograph in ICT era (in Chinese). Acta Geodaet Cartograph Sin, 46: 1274–1283

    Google Scholar 

  • Hoffart J, Suchanek F M, Berberich K, Weikum G. 2013. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intell, 194: 28–61

    Article  Google Scholar 

  • Lenat D B. 1995. CYC: A large-scale investment in knowledge infrastructure. Commun ACM, 38: 33–38

    Article  Google Scholar 

  • Lu F, Yu L, Qiu P. 2017. On geographic knowledge graph (in Chinese). J Geo-inform Sci, 19: 723–734

    Google Scholar 

  • Lucas S G. 2018. The GSSP method of chronostratigraphy: A critical review. Front Earth Sci, 6: 191

    Article  Google Scholar 

  • Ma X G, Ma C, Wang C B. 2020. A new structure for representing and tracking version information in a deep time knowledge graph. Comput Geosci, 145: 104620

    Article  Google Scholar 

  • Mitraka E, Waagmeester A, Su A, Good B. 2015. Wikidata: A central hub for linked open life science data. In: the Biocuration 2015 Conference. Beijing

  • Nakashole N, Theobald M, Weikum G. 2011. Scalable knowledge harvesting with high precision and high recall. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining. 227–236

  • NSTC (National Science and Technology Council). 2018. Open Knowledge Network. 8

  • OECD (Organization for Economic Cooperation and Development). 1996. The Knowledge-based Economy. Paris: OECD

    Google Scholar 

  • Oramas S, Ostuni V C, Noia T D, Serra X, Sciascio E D. 2017. Sound and music recommendation with knowledge graphs. ACM Trans Intell Syst Technol, 8: 1–21

    Article  Google Scholar 

  • Singhal A. 2012. Introducing the Knowledge Graph: Things, not strings. Google Blog. https://www.blog.google/products/search/introducing-knowledge-graph-things-not/

  • Sun H. 2017. Encyclopedia of Geoscience (in Chinese). Beijing: Science Press

    Google Scholar 

  • Tang J. 2020. On the next decade of artificial intelligence (in Chinese). CAAI Trans Intell Syst, 15: 193–198

    Google Scholar 

  • Tansley S, Tolle K. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond, WA: Microsoft Research

    Google Scholar 

  • Tay Y, Luu A T, Hui S C. 2017. Non-Parametric estimation of multiple embeddings for link prediction on dynamic knowledge graphs. In: Proceedings of the Thirty First Conference on Artificial Intelligence (AAAI). Menlo Park: AAAI. 1243–1249

    Google Scholar 

  • Walsh S, Gradstein F, Ogg J. 2004. History, philosophy, and application of the Global Stratotype Section and Point (GSSP). Lethaia, 37: 201–218

    Article  Google Scholar 

  • Wang P, Jian Z. 2019. Exploring the deep South China Sea: Retrospects and prospects. Sci China Earth Sci, 62: 1473–1488

    Article  Google Scholar 

  • Wang S, Zhang X, Ye P, Du M, Lu Y, Xue H. 2019. Geographic Knowledge Graph (GeoKG): A formalized geographic knowledge representation. ISPRS Int J Geo-Inform, 8: 184

    Article  Google Scholar 

  • Wang Z, Li J. 2016. Text-Enhanced representation learning for knowledge graph. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. Menlo Park: AAAI. 1293–1299

    Google Scholar 

  • Xiong W. 2019. Influence of artificial intelligence on the development of some fields of surveying and mapping tech (in Chinese). Geomat Inform Sci Wuhan Univ, 44: 101–105

    Google Scholar 

  • Xu J, Pei T, Yao Y. 2010. Conceptual framework and representation of geographic knowledge map. Geo-Inf Sci, 12: 496–502

    Google Scholar 

  • Zhai G, Yang S, Chen N. 2018. Big Data epoch: Challenges and opportunities for geology (in Chinese). Bull Chin Acad Sci, 8: 825–831

    Google Scholar 

  • Zhang X, Zhang C, Wu M, Lv G. 2020. Spatiotemporal features based geographical knowledge graph construction. Sci Sin-Inf, 50: 1019–1032

    Article  Google Scholar 

Download references

Acknowledgements

This paper benefitted from the guidances and supports from many domestic and overseas colleagues. The theme of this paper comes from our collective thinking. So this paper should be a piece of collective work. The authors want to specially thank the experts and scholars who participated in the Shuang Qing Forum themed “Data-driven Geoscience: from Tradition to the Data Age” in 2019 and Deep Time Digital Earth Series Symposiums. The authors appreciate Prof. Yupeng Yao and Chaolin Zhao from the Geoscience Department of NSFC for their guidance on the overall design and research directions in the original exploration project of NSFC. Many thanks are also given to the anonymous peer reviewers for their valuable comments and suggestions. This work was supported by the National Natural Science Foundation of China (Grant Nos. 41421001, 42050101, and 42050105).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Chenghu Zhou or Hua Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, C., Wang, H., Wang, C. et al. Geoscience knowledge graph in the big data era. Sci. China Earth Sci. 64, 1105–1114 (2021). https://doi.org/10.1007/s11430-020-9750-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11430-020-9750-4

Keywords

Navigation