Abstract
Since the beginning of the 21st century, the geoscience research has been entering a significant transitional period with the establishment of a new knowledge system as the core and with the drive of big data as the means. It is a revolutionary leap in the research of geoscience knowledge discovery from the traditional encyclopedic discipline knowledge system to the computer-understandable and operable knowledge graph. Based on adopting the graph pattern of general knowledge representation, the geoscience knowledge graph expands the unique spatiotemporal features to the Geoscience knowledge, and integrates geoscience knowledge elements, such as map, text, and number, to establish an all-domain geoscience knowledge representation model. A federated, crowd intelligence-based collaborative method of constructing the geoscience knowledge graph is developed here, which realizes the construction of high-quality professional knowledge graph in collaboration with global geo-scientists. We also develop a method for constructing a dynamic knowledge graph of multi-modal geoscience data based on in-depth text analysis, which extracts geoscience knowledge from massive geoscience literature to construct the latest and most complete dynamic geoscience knowledge graph. A comprehensive and systematic geoscience knowledge graph can not only deepen the existing geoscience big data analysis, but also advance the construction of the high-precision geological time scale driven by big data, the compilation of intelligent maps driven by rules and data, and the geoscience knowledge evolution and reasoning analysis, among others. It will further expand the new directions of geoscience research driven by both data and knowledge, break new ground where geoscience, information science, and data science converge, realize the original innovation of the geoscience research and achieve major theoretical breakthroughs in the spatiotemporal big data research.
Similar content being viewed by others
References
Ansari G A, Saha A, Kumar V, Bhambhani M, Sankaranarayanan K, Chakrabarti S. 2019. Neural program induction for KBQA without gold programs or query annotations. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence. Macao: AAAI Press. 4890–4896
Aubry M P, Berggren W A, Van Couvering J A, Steininger F. 1999. Problems in chronostratigraphy: Stages, series, unit and boundary stratotypes, global stratotype section and point and tarnished golden spikes. Earth-Sci Rev, 46: 99–148
Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z. 2007. DBpedia: A nucleus for a web of open data. In: Aberer K, Choi K-S, Noy N, Allemang D, Lee K-I, Nixon L, Golbeck J, Mika P, Maynard D, Mizoguchi R, Schreiber G, Cudré-Mauroux P, eds. The Semantic Web. ISWC 2007, ASWC 2007. Lecture Notes in Computer Science, vol 4825. Berlin, Heidelberg: Springer. 722–735
Ballatore A, Bertolotto M, Wilson D. 2015. A structural-lexical measure of semantic similarity for geo-knowledge graphs. ISPRS Int J Geo-Inform, 4: 471–492
Boyack K W, Klavans R, Börner K. 2005. Mapping the backbone of science. Scientometrics, 64: 351–374
Carlson A, Betteridge J, Kisiel B, Settles B, Hruschka Jr. E R, Mitchell T M. 2010. Toward an architecture for never-ending language learning. In: Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010. Atlanta, Georgia, USA, July 11–15, 2010. 1306–1313
Chah N. 2017. Freebase-triples: A methodology for processing the freebase data dumps
Chen J, Chen J. 2018. GlobeLand30: Operational global land cover mapping and big-data analysis. Sci China Earth Sci, 61: 1533–1534
Chen Y, Liu Z, Chen J, Hou J. 2008a. History and theory of mapping knowledge domains (in Chinese). Stud Sci Sci, 26: 449–460
Chen Y, Zhang S, Peng X, Zhao W. 2008b. A collaborative ontology construction tool with conflicts detection, In: Fourth International Conference on Semantics, Knowledge and Grid. Los Alamitos: IEEE Computer Society. 12–19
Davydov V I. 2020. Shift in the paradigm for GSSP boundary definition. Gondwana Res, 86: 266–286
Dong X, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, Strohmann T, Sun S, Zhang W. 2014. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 601–610
Gangemi A, Presutti V, Catenacci C, Lehmann J, Nissim M. 2007. C-ODO: An OWL meta-model for collaborative ontology design. In: Proceedings of the Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at the 16th International World Wide Web Conference (WWW2007). Banff
Guo H D. 2017a. Big data drives the development of Earth science. Big Earth Data, 1: 1–3
Guo H D. 2017b. Big Earth data: A new frontier in Earth and information sciences. Big Earth Data, 1: 4–20
Guo H D, Wang L, Chen F, Liang D. 2014. Scientific big data and Digital Earth. Chin Sci Bull, 59: 5066–5073
Guo R, Ying S. 2017. The rejuvennation of cartograph in ICT era (in Chinese). Acta Geodaet Cartograph Sin, 46: 1274–1283
Hoffart J, Suchanek F M, Berberich K, Weikum G. 2013. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intell, 194: 28–61
Lenat D B. 1995. CYC: A large-scale investment in knowledge infrastructure. Commun ACM, 38: 33–38
Lu F, Yu L, Qiu P. 2017. On geographic knowledge graph (in Chinese). J Geo-inform Sci, 19: 723–734
Lucas S G. 2018. The GSSP method of chronostratigraphy: A critical review. Front Earth Sci, 6: 191
Ma X G, Ma C, Wang C B. 2020. A new structure for representing and tracking version information in a deep time knowledge graph. Comput Geosci, 145: 104620
Mitraka E, Waagmeester A, Su A, Good B. 2015. Wikidata: A central hub for linked open life science data. In: the Biocuration 2015 Conference. Beijing
Nakashole N, Theobald M, Weikum G. 2011. Scalable knowledge harvesting with high precision and high recall. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining. 227–236
NSTC (National Science and Technology Council). 2018. Open Knowledge Network. 8
OECD (Organization for Economic Cooperation and Development). 1996. The Knowledge-based Economy. Paris: OECD
Oramas S, Ostuni V C, Noia T D, Serra X, Sciascio E D. 2017. Sound and music recommendation with knowledge graphs. ACM Trans Intell Syst Technol, 8: 1–21
Singhal A. 2012. Introducing the Knowledge Graph: Things, not strings. Google Blog. https://www.blog.google/products/search/introducing-knowledge-graph-things-not/
Sun H. 2017. Encyclopedia of Geoscience (in Chinese). Beijing: Science Press
Tang J. 2020. On the next decade of artificial intelligence (in Chinese). CAAI Trans Intell Syst, 15: 193–198
Tansley S, Tolle K. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond, WA: Microsoft Research
Tay Y, Luu A T, Hui S C. 2017. Non-Parametric estimation of multiple embeddings for link prediction on dynamic knowledge graphs. In: Proceedings of the Thirty First Conference on Artificial Intelligence (AAAI). Menlo Park: AAAI. 1243–1249
Walsh S, Gradstein F, Ogg J. 2004. History, philosophy, and application of the Global Stratotype Section and Point (GSSP). Lethaia, 37: 201–218
Wang P, Jian Z. 2019. Exploring the deep South China Sea: Retrospects and prospects. Sci China Earth Sci, 62: 1473–1488
Wang S, Zhang X, Ye P, Du M, Lu Y, Xue H. 2019. Geographic Knowledge Graph (GeoKG): A formalized geographic knowledge representation. ISPRS Int J Geo-Inform, 8: 184
Wang Z, Li J. 2016. Text-Enhanced representation learning for knowledge graph. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. Menlo Park: AAAI. 1293–1299
Xiong W. 2019. Influence of artificial intelligence on the development of some fields of surveying and mapping tech (in Chinese). Geomat Inform Sci Wuhan Univ, 44: 101–105
Xu J, Pei T, Yao Y. 2010. Conceptual framework and representation of geographic knowledge map. Geo-Inf Sci, 12: 496–502
Zhai G, Yang S, Chen N. 2018. Big Data epoch: Challenges and opportunities for geology (in Chinese). Bull Chin Acad Sci, 8: 825–831
Zhang X, Zhang C, Wu M, Lv G. 2020. Spatiotemporal features based geographical knowledge graph construction. Sci Sin-Inf, 50: 1019–1032
Acknowledgements
This paper benefitted from the guidances and supports from many domestic and overseas colleagues. The theme of this paper comes from our collective thinking. So this paper should be a piece of collective work. The authors want to specially thank the experts and scholars who participated in the Shuang Qing Forum themed “Data-driven Geoscience: from Tradition to the Data Age” in 2019 and Deep Time Digital Earth Series Symposiums. The authors appreciate Prof. Yupeng Yao and Chaolin Zhao from the Geoscience Department of NSFC for their guidance on the overall design and research directions in the original exploration project of NSFC. Many thanks are also given to the anonymous peer reviewers for their valuable comments and suggestions. This work was supported by the National Natural Science Foundation of China (Grant Nos. 41421001, 42050101, and 42050105).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Zhou, C., Wang, H., Wang, C. et al. Geoscience knowledge graph in the big data era. Sci. China Earth Sci. 64, 1105–1114 (2021). https://doi.org/10.1007/s11430-020-9750-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11430-020-9750-4