Abstract
We present a data-driven and physics-informed algorithm for drilling accident forecasting. The core machine-learning algorithm uses the data from the drilling telemetry representing the time-series. We have developed a Bag-of-features representation of the time series that enables the algorithm to predict the probabilities of six types of drilling accidents in real-time. The machine-learning model is trained on the 125 past drilling accidents from 100 different Russian oil and gas wells. Validation shows that the model can forecast 70% of drilling accidents with a false positive rate equals to 40%. The model addresses partial prevention of the drilling accidents at the well construction.
Similar content being viewed by others
Data Availability
Data is not available by the non-disclosure agreements with companies.
Code Availability
Code is not available by the non-disclosure agreements with companies.
References
Abanda A, Mori U, Lozano JA (2019) A review on distance based time series classification. Data Min Knowl Disc 33(2):378–412
Abu-Abed F (2010) Application of neural network modeling tools for analysis of pre-emergency situations on drilling sites. Softw Products Syst 3(3):136–139
Abu-Abed F (2015) Automated system for detection of pre-emergency situations at oil and gas industry facilities. Logger 5(251):48–61
Ali D, Frimpong S (2020) Artificial intelligence, machine learning and process automation: existing knowledge frontier and way forward for mining sector. Artif Intell Rev 53(8):6025–6042
Aljubran M, Ramasamy J, Bassam M, Magana-Mora A (2021) Deep learning and time-series analysis for the early detection of lost circulation incidents during drilling operations. IEEE Access
Antipova K, Klyuchnikov N, Zaytsev A, Gurina E, Romanenkova E, Koroteev D, et al. (2019) Data-driven model for the drilling accidents prediction. In: SPE Annual technical conference and exhibition, society of petroleum engineers
Bangert P (2021) Machine learning and data science in the oil and gas industry: Best practices, tools, and case studies. Gulf Professional Publishing
Begum N, Keogh E (2014) Rare time series motif discovery from unbounded streams. Proc VLDB Endow 8(2):149–160
Bergstra J, Yamins D, Cox D (2013) Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In: International conference on machine learning, pp 115–123
Bostrom A, Bagnall A (2017) Binary shapelet transform for multiclass time series classification. In: Transactions on large-scale data-and knowledge-centered systems XXXII, Springer, pp 24–46
Burnaev E, Erofeev P, Papanov A (2015) Influence of resampling on accuracy of imbalanced classification. In: Eighth international conference on machine vision (ICMV 2015), international society for optics and photonics, vol 9875, p 987521
Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: A survey. ACM Comput Surv (CSUR) 41(3):15
Claesen M, De Moor B (2015) Hyperparameter search in machine learning. arXiv:150202127
Fawaz HI, Forestier G, Weber J, Idoumghar L, Muller PA (2019) Deep learning for time series classification: a review. Data Min Knowl Disc 33(4):917–963
Ferreira APL, Carvalho DJ, Rodrigues RM, Schnell DM, Thomson IJ, Baptista RC, Alves SB, et al. (2015) Automated decision support and expert collaboration avoid stuck pipe and improve drilling operations in offshore Brazil subsalt well. In: Offshore technology conference, offshore technology conference
Friedman J, Hastie T, Höfling H, Tibshirani R, et al. (2007) Pathwise coordinate optimization. Ann Appl Stat 1(2):302–332
Tc F (2011) A review on time series data mining. Eng Appl Artif Intel 24(1):164–181
Grabocka J, Schilling N, Wistuba M, Schmidt-Thieme L (2014) Learning time-series shapelets. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 392–401
Grace RD (2017) Blowout and well control handbook. Gulf Professional Publishing
Gurina E, Klyuchnikov N, Zaytsev A, Romanenkova E, Antipova K, Simon I, Makarov V, Koroteev D (2020) Application of machine learning to accidents detection at directional drilling. J Pet Sci Eng 184:106519
Hajizadeh Y (2019) Machine learning in oil and gas; a swot analysis approach. J Pet Sci Eng 176:661–663
Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (roc) curve. Radiology 143(1):29–36
Hatami N, Gavet Y, Debayle J (2019) Bag of recurrence patterns representation for time-series classification. Pattern Anal Applic 22(3):877–887
Karim F, Majumdar S, Darabi H, Harford S (2019) Multivariate lstm-fcns for time series classification. Neural Netw 116:237–245
Kozlovskaia N, Zaytsev A (2017) Deep ensembles for imbalanced classification. In: 2017 16th IEEE international conference on machine learning and applications (ICMLA). IEEE, pp 908–913
Kumar N (2021) Recent issues with machine vision applications for deep network architectures. In: Cognitive computing systems. Apple Academic Press, pp 267–284
Li H, Liu J, Yang Z, Liu RW, Wu K, Wan Y (2020) Adaptively constrained dynamic time warping for time series classification and clustering. Inform Sci 534:97–116
Malhotra P, TV V, Vig L, Agarwal P, Shroff G (2017) Timenet: Pre-trained deep recurrent neural network for time series classification. arXiv:170608838
Milon MH (2017) Comparison on fourier and wavelet transformation for an ecg signal. Am J Eng Res(AJER) 6(8):1–7
Mohri M, Rostamizadeh A, Talwalkar A (2012) Foundations of machine learning. The MIT Press
Qodirov S, Shestakov A (2020) Development of artificial neural network for predicting drill pipe sticking in real-time well drilling process. In: 2020 Global smart industry conference (GloSIC). IEEE, pp 139–144
Rakthanmanon T, Keogh E (2013) Fast shapelets: A scalable algorithm for discovering time series shapelets. In: proceedings of the 2013 SIAM international conference on data mining, SIAM, pp 668–676
Sadlier A, Says I, Hanson R (2013) Automated decision support to enhance while-drilling decision making: Where does it fit within drilling automation?. In: SPE/IADC Drilling conference, OnePetro
Sagheer A, Kotb M (2019) Unsupervised pre-training of a deep lstm-based stacked autoencoder for multivariate time series forecasting problems. Sci Rep 9(1):1–16
Samhitha BK, Priya MS, Sanjana C, Mana SC, Jose J (2020) Improving the accuracy in prediction of heart disease using machine learning algorithms. In: 2020 International conference on communication and signal processing (ICCSP). IEEE, pp 1326–1330
Schlumberger (2020) Oilfield glossary. https://www.glossary.oilfield.slb.com/en
Serra J, Arcos JL (2014) An empirical evaluation of similarity measures for time series classification. Knowl-Based Syst 67:305–314
Sifuzzaman M, Islam M, Ali M (2009) Application of wavelet transform and its advantages compared to fourier transform. J Phys Sci
Tzanetakis G, Essl G, Cook P (2001) Audio analysis using the discrete wavelet transform. In: Proc conf. in acoustics and music theory applications, Citeseer, vol 66
Wang J, Liu P, She MF, Nahavandi S, Kouzani A (2013) Bag-of-words representation for biomedical time series classification. Biomed Signal Process Control 8(6):634–644
Wang X, Lin J, Senin P, Oates T, Gandhi S, Boedihardjo AP, Chen C, Frankenstein S (2016) Rpm: Representative pattern mining for efficient time series classification. In: EDBT
Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: A strong baseline. In: 2017 International joint conference on neural networks (IJCNN). IEEE, pp 1578–1585
Yoon J, Jarrett D, van der Schaar M (2019) Time-series generative adversarial networks. Adv Neural Inf Process Syst 32(NeurIPS 2019)
Zheng Y, Liu Q, Chen E, Ge Y, Zhao JL (2014) Time series classification using multi-channels deep convolutional neural networks. In: International conference on web-age information management. Springe1r, pp 298–310
Funding
The authors did not receive support from any organisation for the submitted work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A
In Fig. 13 one may see the example of mud telemetry data used during the drilling accident forecasting problem. After the feature generation procedure, the presented part of the time-series is transformed into a histogram, presented in Fig. 14.
Appendix B
Table 6 shows the results of the first stage of an optimization procedure for the top 40 parameter sets.
Rights and permissions
About this article
Cite this article
Gurina, E., Klyuchnikov, N., Antipova, K. et al. Forecasting the abnormal events at well drilling with machine learning. Appl Intell 52, 9980–9995 (2022). https://doi.org/10.1007/s10489-021-03013-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-03013-x