Abstract
Rock classification provides vital information to geosciences and geological engineering practices. Reaping the benefits of the advances of computer vision-based deep learning artificial intelligence (AI) technology, this study aims to develop a next-generation convolutional neural network (CNN) to perform automatic rock classification. Two major challenging issues have been particularly addressed. First, most of the previous rock classifications are simply transfer learning of CNNs that are trained by life-like scenarios. Second, classifying rock types with similar textures leads to severe overfitting of CNNs. In this study, a novel CNN called HKUDES_Net is proposed and implemented to classify seven common Hong Kong rock types, namely fine-grained granite, medium-grained granite, coarse-grained granite, coarse ash tuff, fine ash tuff, feldsparphyric rhyolite, and granodiorite. With the aid of dynamic expansion, squeeze and excitation, and other strategies, HKUDES_Net can classify rock types with similar texture patterns/colors but different grain sizes. As compared with the other ten landmark CNNs and seven feature-based algorithms, HKUDES_Net has the best performance in precision (90.9%), recall (90.4%), and f1-score (90.5%). By implementing the alerting level, which restricts the training loss hovering above a small constant and prevents the validation loss from rising, the overfitting has been efficiently eliminated. The proposal and implementation of HKUDES_Net highlight the value of interdisciplinary research and will continuously pave the way for better coupling AI and geosciences.
Highlights
-
A next-generation convolutional neural network (CNN) called HKUDES_Net is developed to perform automatic rock classification.
-
HKUDES_Net can classify rock types with similar texture patterns/colors but different grain sizes.
-
By implementing the alerting level in HKUDES_Net, the overfitting has been efficiently eliminated.
Plain Language Summary
Rock classification provides vital information to geosciences and geological engineering practices. However, the traditional way of rock classification is to a certain degree experience-based. Computer vision-based artificial intelligence (AI), especially convolutional neural network (CNN), a sub-branch of AI, may help people automatically classify rock types with the hope to train AI models (CNNs), which can instantly and precisely perceive rock images similar to how human beings observe rocks. In other words, by putting a rock image into a CNN, the CNN will read the information from the image and tell people what rock type in the image is. This study develops and implements a next-generation CNN called HKUDES_Net to classify seven different rock types. Adopting different computational strategies, HKUDES_Net can classify rock types of different grain sizes with similar texture patterns/colors, which is challenging for other CNNs. As compared with the other landmark CNNs and machine learning algorithms, HKUDES_Net has much a better prediction accuracy performance. The proposal and implementation of HKUDES_Net exemplify an interdisciplinary research study, and we are confident that this paper will be instrumental to pave the way for further coupling AI and geosciences.
Similar content being viewed by others
Data availability
The data used to support the findings of this study are available from the corresponding author upon reasonable request.
References
Alférez GH, Vázquez EL, Ardila AMM, Clausen BL (2021) Automatic classification of plutonic rocks with deep learning. Appl Comput Geosci 10:100061
Baraboshkin E, Ismailova L, Orlov D, Zhukovskaya E, Kalmykov G, Khotylev O, Baraboshkin EY, Koroteev D (2020) Deep convolutions for in-depth automated rock typing. Comput Geosci 135:104330
Barton N, Lien R, Lunde J (1974) Engineering classification of rock masses for the design of tunnel support. Rock Mech Rock Eng 6:189–236
Benaouda D, Wadge G, Whitmarsh RB, Rothwell RG, Macleod C (1999) Inferring the lithology of borehole rocks by applying neural networks classifiers to downhole logs: an example from the Ocean Drilling Program. Geophys J Int 136(2):477–491
Bergen KJ, Johnson PA, de Hoop MV, Beroza GC (2019) Machine learning for data-driven discovery in solid earth geoscience. Science 363:1299
Bianconi F, Fernández A (2007) Evaluation of the effects of Gabor filter parameters on texture classification. Pattern Recogn 40(12):3325–3335
Bianconi F, Fernández A (2014) An appendix to “Texture databases—a comprehensive survey.” Pattern Recogn Lett 45:33–38
Bianconi F, González E, Fernández A, Saetta SA (2012) Automatic classification of granite tiles through colour and texture features. Expert Syst Appl 39(12):11212–11218
Bianconi F, González E, Fernández A (2015) Dominant local binary patterns for texture classification: labelled or unlabelled? Pattern Recogn Lett 65:8–14
Bianconi F, Fernández A, Smeraldi F, Pascoletti G (2021) Colour and texture descriptors for visual recognition: a historical overview. J Imaging 7(11):245
Cai J, Zhao J, Hudson J (1998) Computerization of rock engineering systems using neural networks with an expert system. Rock Mech Rock Eng 31:135–152
Chatterjee S (2013) Vision-based rock-type classification of limestone using multi-class support vector machine. Appl Intell 39:14–27
Chen L, Zhang HW, Xiao J, Nie L, Shao J, Liu W, Chua T (2017) SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2017, pp 5659–5667
Chollet F (2017) Xception: deep learning with depthwise separable convulutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2017, pp 1251–1258
Cui X, Wong LNY (2021) A 3D thermo-hydro-mechanical coupling model for enhanced geothermal systems. Int J Rock Mech Min Sci 143:104744
Deng J, Dong W, Socher R et al (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, Miami, FL, pp 248–255
Donskoi E, Suthers SP, Fradd SB, Young JM, Campbell JJ, Raynlyn TD, Clout JMF (2007) Utilization of optical image analysis and automatic texture classification for iron ore particle characterization. Miner Eng 20(5):461–471
Dunlop H (2006) Automatic rock detection and classification in natural scenes, PhD thesis. Carnegie Mellon University
Elsken T, Metzen JH, Hutter F (2019) Neural architecture search: a survey. J Mach Learn Res 20(55):1–21
Erguler ZA, Ulusay R (2009) Assessment of physical disintegration characteristics of clay-bearing rocks: disintegration index test and a new durability classification chart. Eng Geol 105(1–2):11–19
Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vision 88:303–338
Everingham M, Eslami SMA, Van Gool L, Williams CKI, Winn J, Zisserman A (2015) The Pascal visual object classes challenge: a retrospective. Int J Comput Vision 111:98–136
Fan G, Chen F, Chen D, Dong Y (2020) Recognizing multiple types of rocks quickly and accurately based on lightweight CNNs model. IEEE Access 8:55269–55278
Fernández A, Ghita O, González E, Bianconi F, Whelan PF (2011) Evaluation of robustness against rotation of LBP, CCR and ILBP features in granite texture classification. Mach vis Appl 22(6):913–926
Fernández A, Álvarez MX, Bianconi F (2013) Texture description through histograms of equivalent patterns. J Math Imaging vis 45(1):76–102
Ferreira A, Giraldi G (2017) Convolutional neural network approaches to granite tiles classification. Expert Syst Appl 84:1–11
Goodfellow I, Bengio Y, Hinton G (2016) Deep learning. MIT Press, Cambridge
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. California Institute of Technology Technical Report
He KM, Zhang XY, Ren SQ (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2016, pp 770–778
Hossain S, Serikawa S (2013) Texture databases—a comprehensive survey. Pattern Recogn Lett 34(15):2007–2022
Howard A, Sandler M, Chu G, Chen L, Chen B, Tan M, Wang W, Zhu Y, Pang R, Vasudevan V, Le V, Adam QH (2019) Searching for MobileNetV3. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2019, pp 1314–1324
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2018, pp 7132–7141
Huang Y, Wanstedt S (1998) The introduction of neural network system and its application in rock engineering. Eng Geol 49(3–4):253–260
Hudson JA, Harrison JP (1997) Engineering rock mechanics: an introduction to the principles. Imperial College of Science, Technology and Medicine, University of London
Izadi H, Sadri J, Bayati M (2017) An intelligent system for mineral identification in thin sections based on a cascade approach. Comput Geosci 99:37–49
Jeong J, Park E, Han WS, Kim KY (2014) A novel data assimilation methodology for predicting lithology based on sequence labelling algorithms. J Geophys Res Solid Earth 119(10):7503–7520
Jeong J, Park E, Emelyanova I, Pervukhina M, Esteban L, Yun ST (2020) Interpreting the subsurface lithofacies at high lithological resolution by integrating information from well-log data and rock-core digital images. J Geophys Res Solid Earth 125(2):e2019JB018204
Karimpouli S, Tahmasebi P (2019) Image-based velocity estimation of rock using Convolutional Neural Networks. Neural Netw 111:89–97
Kim CY, Bae GJ, Hong SW, Park CH, Moon HK, Shin HS (2001) Neural network based prediction of ground surface settlements due to tunnelling. Comput Geotech 28(6–7):517–547
Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25 (NIPS 2012), pp 1106–1114
Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images. Technical report, University of Toronto, vol 21, p 561
Lampert CH, Nickisch H (2009) Learning to detect unseen object classes by between-class attribute transfer. In: 2009 IEEE conference on computer vision and pattern recognition, Miami, FL, pp 951–958
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), New York, NY, pp 2169–2178
Lecun Y, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Lee GR, Gommers F, Wasilewski F, Wohlfahrt K, O’Leary A (2019) PyWavelets: a Python package for wavelet analysis. J Open Source Softw 4(36):1237
Lepisto L, Kunttu I, Visa A (2005) Rock image classification using color features in Gabor space. J Electron Imaging 14(4):040503
Lepisto L, Kunttu I, Autio J, Visa A (2003) Rock image classification using nonhomogenous textures and spectral imaging. In: The 11th international conference in central europe on computer graphics, visualization and computer vision, vol 3(7), pp 82–86
Li N, Hao H, Gu Q, Wang D, Hu X (2017) A transfer learning method for automatic identification of sandstone microscopic images. Comput Geosci 103:111–121
Li FF, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: 2004 conference on computer vision and pattern recognition workshop, Washington, DC, USA, p 178
Liao S, Law MWK, Chung ACS (2009) Dominant local binary patterns for texture classification. IEEE Trans Image Process 18(5):1107–1118
Lin M, Chen Q, Yan SC (2014) Network in Network. In: 2014 ICLR. https://arxiv.org/abs/1312.4400
Maiti S, Tiwari RK (2010) Neural network modeling and an uncertainty analysis in Bayesian framework: a case study from the KTB borehole site. J Geophys Res 115:B10208. https://doi.org/10.1029/2010JB000864.
Meng FZ, Song J, Wong LNY, Wang ZQ, Zhang CQ (2021) Characterization of roughness and shear behavior of thermally treated granite fractures. Eng Geol 293:106287
Młynarczuk M, Górszczyk A, Ślipek B (2013) The application of pattern recognition in the automatic classification of microscopic rock images. Comput Geosci 60:126–133
Netzer Y, Wang T, Coates A, Bissacco A, Wu B, Ng AY (2011) Reading digits in natural images with unsupervised feature learning. In: NIPS workshop on deep learning and unsupervised feature learning
Ng AY, Jordan MI (2001) On discriminative vs. generative classifiers: a comparison of logistic regression and naïve Bayes. Adv Neural Inf Process Syst 14 (NIPS 2001) 14:605–610
Nilsback ME, Zisserman A (2008) Automated flower classification over a large number of classes. In: 2008 Sixth Indian conference on computer vision, graphics & image processing, Bhubaneswar, pp 722–729
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830
Perez CA, Estévez PA, Vera PA, Castillo LE, Aravena CM, Schulz DA, Medina LE (2011) Ore grade estimation by feature selection and voting using boundary detection in digital image analysis. Int J Miner Process 101(1–4):28–36
Raiche A (1991) A pattern recognition approach to geophysical inversion using neural nets. Geophys J Int 105(3):629–648
Ramachandranm P, Zoph B, Le VQ (2017) Searching for activation functions. https://arxiv.org/abs/1710.05941v2
Ran XJ, Xue LF, Zhang YY, Liu ZY, Sang XJ, He JX (2019) Rock classification from field image patches analyzed using a deep convolutional neural network. Mathematics 7(8):755
Ribeiro MT, Singh S, Guestrin C (2016) “Why should I trust you?”: explaining with the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
Sandler M, Howard A, Zhu ML, Zhmoginov A, Chen LC (2018) MobileNetV2: inverted residual and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 4510–4520
Sharif H, Ralchenko M, Samson C, Ellery A (2015) Autonomous rock classification using bayesian image analysis for rover-based planetary exploration. Comput Geosci 83:153–167
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. https://arxiv.org/abs/1409.1556
Singh N, Singh TN, Tiwary A, Sarkar KM (2010) Textural identification of basaltic rock mass using image processing and neural network. Comput Geosci 14(2):301–310
Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9:293–300
Szegedy C, Liu W, Jia YQ, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2015, pp 1–9
Tessier J, Duchesne C, Bartolacci G (2007) A machine vision approach to online estimation of run-of-mine ore composition on conveyor belts. Miner Eng 20(12):1129–1144
van der Walt S, Schönberger JL, Nunez-Iglesias J, Boulogne F, Warner JD, Yager N, Gouillart E, Yu T (2014) scikit-image: Image processing in Python. Technical Report PeerJ 2:e453. https://doi.org/10.7717/peerj.453
Wong LNY, Zhou YM (2021) Boulder falls in Hong Kong-insights from power law relationships and supervised machine learning. Landslides 18:3227–3253
Wong LNY, Guo TY, Lam WK, Ng JYH (2019) Experimental study of cracking characteristics of Kowloon granite based on three mode I fracture toughness methods. Rock Mech Rock Eng 52(11):4217–4235
Xiao J, Hays J, Ehinger KA, Oliva A, Torralba A (2010) SUN database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE computer society conference on computer vision and pattern recognition, San Francisco, CA, pp 3485–3492
Yang Y, Zhang Q (1997) A hierarchical analysis for rock engineering using artificial neural networks. Rock Mech Rock Eng 30:207–222
Zhang YH, Wong LNY, Chan KK (2019) An extended grain-based model accounting for microstructures in rock deformation. J Geophys Res Solid Earth 124(1):125–148
Zhao Q, Glaser SD (2020) Relocating acoustic emission in rocks with unknown velocity structure with machine learning. Rock Mech Rock Eng 53:2053–2061
Zhou YM, Zhao C, Zhao CF, Ma CC, Xie JF (2018) Experimental study on the fracturing behaviors and mechanical properties of cracks under coupled hydro-mechanical effects in rock-like specimens. Water 10(10):1355
Zhu XH, Chen MQ, Liu WJ, Luo YX, Hu H (2022) The fragmentation mechanism of heterogeneous granite by high-voltage electrical pulses. Rock Mech Rock Eng 55:4351–4372
Acknowledgements
The first author acknowledges the Postgraduate Scholarship from the University of Hong Kong. Mr Hui Zong, PhD candidate at Tongji University in China and Mr Yuhan Ping, PhD candidate at Department of Computer Science, the University of Hong Kong, are also acknowledged for discussing the architecture of the proposed HKUDES_Net in great detail. The authors also acknowledge Mr. Frankie Lo and his colleagues from GEO for constructive discussions on data collection and CNN training plans involved in this study. The rock image database is not publicly available at present, and it will be made open source at a later stage. Other data related to the present study are available online (at https://doi.org/10.6084/m9.figshare.21768689.v1).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhou, Y., Wong, L.N.Y. & Tse, K.K.C. Novel Rock Image Classification: The Proposal and Implementation of HKUDES_Net. Rock Mech Rock Eng 56, 3825–3841 (2023). https://doi.org/10.1007/s00603-023-03235-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00603-023-03235-0