Abstract
Since the utilization of data analytics in oil field industry, data mining has become increasingly important. Various decision-making algorithms derived from data are closely related to the quality of data, which makes data quality assessment an indispensable part of the intelligent construction of oilfield. General data quality assessment models are not suitable for centralized oilfield scenarios because the quality of datasets depends on their usage rather than a simple stacking of individual data units. For example, datasets containing data units with good quality yet serious homogeneity cannot meet the data requirements in deep learning. This paper is based on the theoretical model of process measurement and adopts the second-level fuzzy comprehensive evaluation model. We calculate the member-ship degree of each factor set based on the business demand by the AHP. The oriented oilfield structured data quality assessment model is then established. This model provides theoretical basis and technical support for oilfield data preprocessing, decision-making and staged evaluation of data governance.
Copyright 2023, IFEDC Organizing Committee
This paper was prepared for presentation at the 2023 International Field Exploration and Development Conference in Wuhan, China, 20-22 September 2023.
This paper was selected for presentation by the IFEDC Committee following review of information contained in an abstract submitted by the author(s). Contents of the paper, as presented, have not been reviewed by the IFEDC Technical Team and are subject to correction by the author(s). The material does not necessarily reflect any position of the IFEDC Technical Committee its members. Papers presented at the Conference are subject to publication review by Professional Team of IFEDC Technical Committee. Electronic reproduction, distribution, or storage of any part of this paper for commercial purposes without the written consent of IFEDC Organizing Committee is prohibited. Permission to reproduce in print is restricted to an abstract of not more than 300 words; illustrations may not be copied. The abstract must contain conspicuous acknowledgment of IFEDC. Contact email: paper@ifedc.org.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liang, W.F.: Constructed achievements and prospects of the intelligent application system for the oilfield development. Daqing Petrol. Geol. Dev. 38(5), 283–289 (2019)
Liu, S.F., Gao, W.X., Xu, W.Y.: Research and improvement of a software process measurement model. Microcomput. Inform. 26(33), 17–19 (2010)
La, Z.: A Fuzzy-set-theoretic approach to the compositionality of meaning: propositions, dispositions and canonical forms. J. Seman. 2(3–4) (1983)
Saaty, R.W.: The analytic hierarchy process—what it is and how it is used. Mathem. Model. 9(3), 161–176 (1987)
Chen, S.H.: Research on application of image enhancement and recognition technology based on deep learning in oilfield work area. University of Electronic Science and Technology of China (2021)
Wang, Q., Li, A.R., Ren, J.W., et al.: Exploration of deep learning technology in remaining oil prediction of Y oilfield. Surv. Mapping Geol. 4(3), 15–18 (2022)
Wang, H.L., Mu, L.X., Shi, F.G., et al.: Production prediction at ultra-high water cut stage via recurrent neural network. Petrol. Explor. Dev. 2020107 (2020)
Zhong, Y.H., Wang, S.N., Luo, L., et al.: Knowledge mining for oilfield development index prediction model using deep learning. J. Southwest Petrol. Univ. 42(6), 63 (2020)
Wang, R.Y.: A product perspective on total data quality management. Commun. ACM 41(2), 58–65 (1998)
Laws, R., Gillespie, S., Puro, J., et al.: The community health applied research network (CHARN) data warehouse: a resource for patient-centered outcomes research and quality improvement in underserved, safety net populations. EGEMS 2(3) (2014)
Fang, Y.L., Yang, D.Q., Tang, S.W., et al.: Data quality managements in data warehouse. Comput. Eng. Appl. 39(13), 1–4 (2003)
Fang, Y.L., Yang, D.Q., Tang, S.W., et al.: Data schedule serialization in data transformations. Comput. Eng. Appl. 39(17), 4–6 (2003)
Liu, Z.H., Zhang, Q.L.: Research overview of big data technology. J. Zhejiang Univ. 48(6), 957–972 (2014)
Constantinescu, R., Iacob, I.M.: Capability maturity model integration. J. Appl. Quantit. Methods 2(1), 31–37 (2007)
Batini, C., Cabitza, F., Cappiello, C., et al.: A comprehensive data quality methodology for web and structured data. In: Proceedings of the 2006 1st International Conference on Digital Information Management. IEEE (2006)
Boretti, E.: AIB-ISTAT statistics: the first time for Italian public libraries. Perform. Measur. Metrics 6(1), 32–38 (2005)
Joseph, K., Mmath, J.F.: Validation of perinatal data in the Discharge Abstract Database of the Canadian Institute for Health Information. Chronic Diseases Injur. Canada 29(3) (2009)
Lee, Y.W., Strong, D.M., Kahn, B.K., et al.: AIMQ: a methodology for information quality assessment. Inform. Manag. Commun. ACM 40(2), 133–146 (2002)
Marchetti, C., Mecella, M., Scannapieco, M., et al.: Data Quality in Cooperative Information Systems. Encyclopedia of Data Warehousing and Mining, pp. 297–301. IGI Global (2005)Â
Pipino, L.L., Lee, Y.W., Wang, R.Y.: Data quality assessment. Commun. ACM. 45(4), 211–218 (2002)
Sitawati, H.D., Ruldeviyani, Y., Hidayanto, A.N., et al.: Data quality improvement: case study financial regulatory authority reporting. In: proceedings of the 2021 International Seminar on Machine Learning, Optimization, and Data Science (ISMODE). IEEE (2022)
Su, Z., Jin, Z.: A methodology for information quality assessment in the designing and manufacturing processes of mechanical products. Information Quality Management: Theory and Applications, pp. 190–220. IGI Global (2007)
Woodall, P., Parlikad, A.K., Lebrun, L.: Approaches to information quality management: State of the practice of uk asset-intensive organisations. Asset Condition Informat. Syst. Decision Models, 1–18 (2012)
Chen, C., Chen, H., Zhang, Y., et al.: TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13(8), 1194–1202 (2020)
Liu, H.: The research on key issues of data quality management, assessment and detection in big data environment. Jilin University (2019)
Acknowledgments
The research is supported by Sinopec Shengli Oilfield Intelligent Engine for Oilfield Safety and Environmental Protection project under grant number YKB2302.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Su, Xs., Mei, W., Song, Hf., Liu, J., Huang, S. (2024). Oriented Oilfield Structured Data Quality Assessment Model. In: Lin, J. (eds) Proceedings of the International Field Exploration and Development Conference 2023. IFEDC 2023. Springer Series in Geomechanics and Geoengineering. Springer, Singapore. https://doi.org/10.1007/978-981-97-0272-5_28
Download citation
DOI: https://doi.org/10.1007/978-981-97-0272-5_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0271-8
Online ISBN: 978-981-97-0272-5
eBook Packages: EngineeringEngineering (R0)