Abstract
The existing Scientific Data Management Systems (SDMSs) usually focus on a single domain and the interaction pattern of each subsystem is complex. What’s more, the heterogeneity and multi-source of Scientific Big Data (SBD), resulting in a wide variety of databases, scientific devices and functional areas, make the incompatibility and conflict between system modules inevitable. In this context, the paper focuses on the design and technology requirements of a multi-domain and sub-role oriented software architecture. Through integrating multiple databases, third-party systems and related tools, this architecture realizes both the storage and the sharing of multi-domain and multi-type SBD. Particularly, this architecture is divided into four independent functional areas and corresponding roles are designed, which enhances the decoupling and extensibility of the architecture. In addition, this paper has a formal description of the partition design from the perspective of role. On this basis, this paper also shows the typical application scenarios under different roles. The rationality and comprehensiveness of the proposed architecture are proved by describing the architectures design and technology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Andreeva, J., Campana, S., Fanzago, F., Herrala, J.: High-energy physics on the grid: the atlas and CMS experience. J. Grid Comput. 6(1), 3–13 (2008)
Bengtssonpalme, J., et al.: Strategies to improve usability and preserve accuracy in biological sequence databases. Proteomics 16(18), 2454–2460 (2016)
Cook, C.E., Bergman, M.T., Cochrane, G., Apweiler, R., Birney, E.: The European bioinformatics institute in 2017: data coordination and integration. Nucleic Acids Res. 46(D1), D21 (2018)
Dewitt, D.J., Kabra, N., Luo, J., Patel, J.M., Yu, J.B.: Client–server paradise. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 558–569 (1994)
Dozier, J., Stonebraker, M., Frew, J.: Sequoia 2000: a next-generation information system for the study of global change. In: Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Towards Distributed Storage and Data Management Systems, pp. 47–53 (1994)
Gao, W., et al.: Data motif-based proxy benchmarks for big data and AI workloads. In: IISWC 2018 (2018)
Gao, W., et al.: Data motifs: a lens towards fully understanding big data and AI workloads. In: 27th International Conference on Parallel Architectures and Compilation Techniques, PACT 2018 (2018)
Ivanova, M., Nes, N., Goncalves, R., Kersten, M.: MonetDB/SQL meets skyserver: the challenges of a scientific database. In: International Conference on Scientific and Statistical Database Management, p. 13 (2007)
Ivezic, Z., et al.: LSST: from science drivers to reference design and anticipated data products. Am. Astron. Soc. 41, 366 (2008)
Jia, Z., et al.: Understanding big data analytics workloads on modern processors. IEEE Trans. Parallel Distrib. Syst. 28(6), 1797–1810 (2017)
Jun, C., Wen, W., Zi-yang, L., An, L.: Landsat 5 satellite overview. Remote Sens. Inf. 43(3), 85–89 (2007)
Stonebraker, M.: Scientific data bases at scale and SciDB. Anal. Proc. 4, 199–206 (2013)
Suchanek, F.M., Weikum, G.: Knowledge bases in the age of big data analytics. VLDB Endowment (2014)
Szalay, A.S., Gray, J., Fekete, G., Kunszt, P.Z., Kukol, P., Thakar, A.: Indexing the sphere with the hierarchical triangular mesh. Microsoft Research (2007)
Team, C.T.P.: Paradise: a database system for gis applications. In: ACM SIGMOD International Conference on Management of Data, p. 485 (1995)
Wang, L., et al.: Bigdatabench: a big data benchmark suite from internet services. In: IEEE International Symposium on High Performance Computer Architecture, HPCA 2014 (2014)
Acknowledgement
This work is supported by the National Key Research and Development Plan of China (Grant No. 2016YFB1000600 and 2016YFB1000601).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sun, Q., Liu, Y., Tian, W., Guo, Y., Lu, J. (2019). Multi-domain and Sub-role Oriented Software Architecture for Managing Scientific Big Data. In: Ren, R., Zheng, C., Zhan, J. (eds) Big Scientific Data Benchmarks, Architecture, and Systems. SDBA 2018. Communications in Computer and Information Science, vol 911. Springer, Singapore. https://doi.org/10.1007/978-981-13-5910-1_10
Download citation
DOI: https://doi.org/10.1007/978-981-13-5910-1_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-5909-5
Online ISBN: 978-981-13-5910-1
eBook Packages: Computer ScienceComputer Science (R0)