Skip to main content
Log in

Novel Rock Image Classification: The Proposal and Implementation of HKUDES_Net

  • Original Paper
  • Published:
Rock Mechanics and Rock Engineering Aims and scope Submit manuscript

Abstract

Rock classification provides vital information to geosciences and geological engineering practices. Reaping the benefits of the advances of computer vision-based deep learning artificial intelligence (AI) technology, this study aims to develop a next-generation convolutional neural network (CNN) to perform automatic rock classification. Two major challenging issues have been particularly addressed. First, most of the previous rock classifications are simply transfer learning of CNNs that are trained by life-like scenarios. Second, classifying rock types with similar textures leads to severe overfitting of CNNs. In this study, a novel CNN called HKUDES_Net is proposed and implemented to classify seven common Hong Kong rock types, namely fine-grained granite, medium-grained granite, coarse-grained granite, coarse ash tuff, fine ash tuff, feldsparphyric rhyolite, and granodiorite. With the aid of dynamic expansion, squeeze and excitation, and other strategies, HKUDES_Net can classify rock types with similar texture patterns/colors but different grain sizes. As compared with the other ten landmark CNNs and seven feature-based algorithms, HKUDES_Net has the best performance in precision (90.9%), recall (90.4%), and f1-score (90.5%). By implementing the alerting level, which restricts the training loss hovering above a small constant and prevents the validation loss from rising, the overfitting has been efficiently eliminated. The proposal and implementation of HKUDES_Net highlight the value of interdisciplinary research and will continuously pave the way for better coupling AI and geosciences.

Highlights

  • A next-generation convolutional neural network (CNN) called HKUDES_Net is developed to perform automatic rock classification.

  • HKUDES_Net can classify rock types with similar texture patterns/colors but different grain sizes.

  • By implementing the alerting level in HKUDES_Net, the overfitting has been efficiently eliminated.

Plain Language Summary

Rock classification provides vital information to geosciences and geological engineering practices. However, the traditional way of rock classification is to a certain degree experience-based. Computer vision-based artificial intelligence (AI), especially convolutional neural network (CNN), a sub-branch of AI, may help people automatically classify rock types with the hope to train AI models (CNNs), which can instantly and precisely perceive rock images similar to how human beings observe rocks. In other words, by putting a rock image into a CNN, the CNN will read the information from the image and tell people what rock type in the image is. This study develops and implements a next-generation CNN called HKUDES_Net to classify seven different rock types. Adopting different computational strategies, HKUDES_Net can classify rock types of different grain sizes with similar texture patterns/colors, which is challenging for other CNNs. As compared with the other landmark CNNs and machine learning algorithms, HKUDES_Net has much a better prediction accuracy performance. The proposal and implementation of HKUDES_Net exemplify an interdisciplinary research study, and we are confident that this paper will be instrumental to pave the way for further coupling AI and geosciences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

The data used to support the findings of this study are available from the corresponding author upon reasonable request.

References

  • Alférez GH, Vázquez EL, Ardila AMM, Clausen BL (2021) Automatic classification of plutonic rocks with deep learning. Appl Comput Geosci 10:100061

    Google Scholar 

  • Baraboshkin E, Ismailova L, Orlov D, Zhukovskaya E, Kalmykov G, Khotylev O, Baraboshkin EY, Koroteev D (2020) Deep convolutions for in-depth automated rock typing. Comput Geosci 135:104330

    Google Scholar 

  • Barton N, Lien R, Lunde J (1974) Engineering classification of rock masses for the design of tunnel support. Rock Mech Rock Eng 6:189–236

    Google Scholar 

  • Benaouda D, Wadge G, Whitmarsh RB, Rothwell RG, Macleod C (1999) Inferring the lithology of borehole rocks by applying neural networks classifiers to downhole logs: an example from the Ocean Drilling Program. Geophys J Int 136(2):477–491

    Google Scholar 

  • Bergen KJ, Johnson PA, de Hoop MV, Beroza GC (2019) Machine learning for data-driven discovery in solid earth geoscience. Science 363:1299

    Google Scholar 

  • Bianconi F, Fernández A (2007) Evaluation of the effects of Gabor filter parameters on texture classification. Pattern Recogn 40(12):3325–3335

    Google Scholar 

  • Bianconi F, Fernández A (2014) An appendix to “Texture databases—a comprehensive survey.” Pattern Recogn Lett 45:33–38

    Google Scholar 

  • Bianconi F, González E, Fernández A, Saetta SA (2012) Automatic classification of granite tiles through colour and texture features. Expert Syst Appl 39(12):11212–11218

    Google Scholar 

  • Bianconi F, González E, Fernández A (2015) Dominant local binary patterns for texture classification: labelled or unlabelled? Pattern Recogn Lett 65:8–14

    Google Scholar 

  • Bianconi F, Fernández A, Smeraldi F, Pascoletti G (2021) Colour and texture descriptors for visual recognition: a historical overview. J Imaging 7(11):245

    Google Scholar 

  • Cai J, Zhao J, Hudson J (1998) Computerization of rock engineering systems using neural networks with an expert system. Rock Mech Rock Eng 31:135–152

    Google Scholar 

  • Chatterjee S (2013) Vision-based rock-type classification of limestone using multi-class support vector machine. Appl Intell 39:14–27

    Google Scholar 

  • Chen L, Zhang HW, Xiao J, Nie L, Shao J, Liu W, Chua T (2017) SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2017, pp 5659–5667

  • Chollet F (2017) Xception: deep learning with depthwise separable convulutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2017, pp 1251–1258

  • Cui X, Wong LNY (2021) A 3D thermo-hydro-mechanical coupling model for enhanced geothermal systems. Int J Rock Mech Min Sci 143:104744

    Google Scholar 

  • Deng J, Dong W, Socher R et al (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, Miami, FL, pp 248–255

  • Donskoi E, Suthers SP, Fradd SB, Young JM, Campbell JJ, Raynlyn TD, Clout JMF (2007) Utilization of optical image analysis and automatic texture classification for iron ore particle characterization. Miner Eng 20(5):461–471

    Google Scholar 

  • Dunlop H (2006) Automatic rock detection and classification in natural scenes, PhD thesis. Carnegie Mellon University

  • Elsken T, Metzen JH, Hutter F (2019) Neural architecture search: a survey. J Mach Learn Res 20(55):1–21

    Google Scholar 

  • Erguler ZA, Ulusay R (2009) Assessment of physical disintegration characteristics of clay-bearing rocks: disintegration index test and a new durability classification chart. Eng Geol 105(1–2):11–19

    Google Scholar 

  • Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vision 88:303–338

    Google Scholar 

  • Everingham M, Eslami SMA, Van Gool L, Williams CKI, Winn J, Zisserman A (2015) The Pascal visual object classes challenge: a retrospective. Int J Comput Vision 111:98–136

    Google Scholar 

  • Fan G, Chen F, Chen D, Dong Y (2020) Recognizing multiple types of rocks quickly and accurately based on lightweight CNNs model. IEEE Access 8:55269–55278

    Google Scholar 

  • Fernández A, Ghita O, González E, Bianconi F, Whelan PF (2011) Evaluation of robustness against rotation of LBP, CCR and ILBP features in granite texture classification. Mach vis Appl 22(6):913–926

    Google Scholar 

  • Fernández A, Álvarez MX, Bianconi F (2013) Texture description through histograms of equivalent patterns. J Math Imaging vis 45(1):76–102

    Google Scholar 

  • Ferreira A, Giraldi G (2017) Convolutional neural network approaches to granite tiles classification. Expert Syst Appl 84:1–11

    Google Scholar 

  • Goodfellow I, Bengio Y, Hinton G (2016) Deep learning. MIT Press, Cambridge

    Google Scholar 

  • Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. California Institute of Technology Technical Report

  • He KM, Zhang XY, Ren SQ (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2016, pp 770–778

  • Hossain S, Serikawa S (2013) Texture databases—a comprehensive survey. Pattern Recogn Lett 34(15):2007–2022

    Google Scholar 

  • Howard A, Sandler M, Chu G, Chen L, Chen B, Tan M, Wang W, Zhu Y, Pang R, Vasudevan V, Le V, Adam QH (2019) Searching for MobileNetV3. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2019, pp 1314–1324

  • Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2018, pp 7132–7141

  • Huang Y, Wanstedt S (1998) The introduction of neural network system and its application in rock engineering. Eng Geol 49(3–4):253–260

    Google Scholar 

  • Hudson JA, Harrison JP (1997) Engineering rock mechanics: an introduction to the principles. Imperial College of Science, Technology and Medicine, University of London

  • Izadi H, Sadri J, Bayati M (2017) An intelligent system for mineral identification in thin sections based on a cascade approach. Comput Geosci 99:37–49

    Google Scholar 

  • Jeong J, Park E, Han WS, Kim KY (2014) A novel data assimilation methodology for predicting lithology based on sequence labelling algorithms. J Geophys Res Solid Earth 119(10):7503–7520

    Google Scholar 

  • Jeong J, Park E, Emelyanova I, Pervukhina M, Esteban L, Yun ST (2020) Interpreting the subsurface lithofacies at high lithological resolution by integrating information from well-log data and rock-core digital images. J Geophys Res Solid Earth 125(2):e2019JB018204

    Google Scholar 

  • Karimpouli S, Tahmasebi P (2019) Image-based velocity estimation of rock using Convolutional Neural Networks. Neural Netw 111:89–97

    Google Scholar 

  • Kim CY, Bae GJ, Hong SW, Park CH, Moon HK, Shin HS (2001) Neural network based prediction of ground surface settlements due to tunnelling. Comput Geotech 28(6–7):517–547

    Google Scholar 

  • Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25 (NIPS 2012), pp 1106–1114

  • Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images. Technical report, University of Toronto, vol 21, p 561

  • Lampert CH, Nickisch H (2009) Learning to detect unseen object classes by between-class attribute transfer. In: 2009 IEEE conference on computer vision and pattern recognition, Miami, FL, pp 951–958

  • Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), New York, NY, pp 2169–2178

  • Lecun Y, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Google Scholar 

  • Lee GR, Gommers F, Wasilewski F, Wohlfahrt K, O’Leary A (2019) PyWavelets: a Python package for wavelet analysis. J Open Source Softw 4(36):1237

    Google Scholar 

  • Lepisto L, Kunttu I, Visa A (2005) Rock image classification using color features in Gabor space. J Electron Imaging 14(4):040503

    Google Scholar 

  • Lepisto L, Kunttu I, Autio J, Visa A (2003) Rock image classification using nonhomogenous textures and spectral imaging. In: The 11th international conference in central europe on computer graphics, visualization and computer vision, vol 3(7), pp 82–86

  • Li N, Hao H, Gu Q, Wang D, Hu X (2017) A transfer learning method for automatic identification of sandstone microscopic images. Comput Geosci 103:111–121

    Google Scholar 

  • Li FF, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: 2004 conference on computer vision and pattern recognition workshop, Washington, DC, USA, p 178

  • Liao S, Law MWK, Chung ACS (2009) Dominant local binary patterns for texture classification. IEEE Trans Image Process 18(5):1107–1118

    Google Scholar 

  • Lin M, Chen Q, Yan SC (2014) Network in Network. In: 2014 ICLR. https://arxiv.org/abs/1312.4400

  • Maiti S, Tiwari RK (2010) Neural network modeling and an uncertainty analysis in Bayesian framework: a case study from the KTB borehole site. J Geophys Res 115:B10208. https://doi.org/10.1029/2010JB000864.

  • Meng FZ, Song J, Wong LNY, Wang ZQ, Zhang CQ (2021) Characterization of roughness and shear behavior of thermally treated granite fractures. Eng Geol 293:106287

    Google Scholar 

  • Młynarczuk M, Górszczyk A, Ślipek B (2013) The application of pattern recognition in the automatic classification of microscopic rock images. Comput Geosci 60:126–133

    Google Scholar 

  • Netzer Y, Wang T, Coates A, Bissacco A, Wu B, Ng AY (2011) Reading digits in natural images with unsupervised feature learning. In: NIPS workshop on deep learning and unsupervised feature learning

  • Ng AY, Jordan MI (2001) On discriminative vs. generative classifiers: a comparison of logistic regression and naïve Bayes. Adv Neural Inf Process Syst 14 (NIPS 2001) 14:605–610

    Google Scholar 

  • Nilsback ME, Zisserman A (2008) Automated flower classification over a large number of classes. In: 2008 Sixth Indian conference on computer vision, graphics & image processing, Bhubaneswar, pp 722–729

  • Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830

    Google Scholar 

  • Perez CA, Estévez PA, Vera PA, Castillo LE, Aravena CM, Schulz DA, Medina LE (2011) Ore grade estimation by feature selection and voting using boundary detection in digital image analysis. Int J Miner Process 101(1–4):28–36

    Google Scholar 

  • Raiche A (1991) A pattern recognition approach to geophysical inversion using neural nets. Geophys J Int 105(3):629–648

    Google Scholar 

  • Ramachandranm P, Zoph B, Le VQ (2017) Searching for activation functions. https://arxiv.org/abs/1710.05941v2

  • Ran XJ, Xue LF, Zhang YY, Liu ZY, Sang XJ, He JX (2019) Rock classification from field image patches analyzed using a deep convolutional neural network. Mathematics 7(8):755

    Google Scholar 

  • Ribeiro MT, Singh S, Guestrin C (2016) “Why should I trust you?”: explaining with the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144

  • Sandler M, Howard A, Zhu ML, Zhmoginov A, Chen LC (2018) MobileNetV2: inverted residual and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 4510–4520

  • Sharif H, Ralchenko M, Samson C, Ellery A (2015) Autonomous rock classification using bayesian image analysis for rover-based planetary exploration. Comput Geosci 83:153–167

    Google Scholar 

  • Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. https://arxiv.org/abs/1409.1556

  • Singh N, Singh TN, Tiwary A, Sarkar KM (2010) Textural identification of basaltic rock mass using image processing and neural network. Comput Geosci 14(2):301–310

    Google Scholar 

  • Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9:293–300

    Google Scholar 

  • Szegedy C, Liu W, Jia YQ, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2015, pp 1–9

  • Tessier J, Duchesne C, Bartolacci G (2007) A machine vision approach to online estimation of run-of-mine ore composition on conveyor belts. Miner Eng 20(12):1129–1144

    Google Scholar 

  • van der Walt S, Schönberger JL, Nunez-Iglesias J, Boulogne F, Warner JD, Yager N, Gouillart E, Yu T (2014) scikit-image: Image processing in Python. Technical Report PeerJ 2:e453. https://doi.org/10.7717/peerj.453

    Article  Google Scholar 

  • Wong LNY, Zhou YM (2021) Boulder falls in Hong Kong-insights from power law relationships and supervised machine learning. Landslides 18:3227–3253

    Google Scholar 

  • Wong LNY, Guo TY, Lam WK, Ng JYH (2019) Experimental study of cracking characteristics of Kowloon granite based on three mode I fracture toughness methods. Rock Mech Rock Eng 52(11):4217–4235

    Google Scholar 

  • Xiao J, Hays J, Ehinger KA, Oliva A, Torralba A (2010) SUN database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE computer society conference on computer vision and pattern recognition, San Francisco, CA, pp 3485–3492

  • Yang Y, Zhang Q (1997) A hierarchical analysis for rock engineering using artificial neural networks. Rock Mech Rock Eng 30:207–222

    Google Scholar 

  • Zhang YH, Wong LNY, Chan KK (2019) An extended grain-based model accounting for microstructures in rock deformation. J Geophys Res Solid Earth 124(1):125–148

    Google Scholar 

  • Zhao Q, Glaser SD (2020) Relocating acoustic emission in rocks with unknown velocity structure with machine learning. Rock Mech Rock Eng 53:2053–2061

    Google Scholar 

  • Zhou YM, Zhao C, Zhao CF, Ma CC, Xie JF (2018) Experimental study on the fracturing behaviors and mechanical properties of cracks under coupled hydro-mechanical effects in rock-like specimens. Water 10(10):1355

    Google Scholar 

  • Zhu XH, Chen MQ, Liu WJ, Luo YX, Hu H (2022) The fragmentation mechanism of heterogeneous granite by high-voltage electrical pulses. Rock Mech Rock Eng 55:4351–4372

    Google Scholar 

Download references

Acknowledgements

The first author acknowledges the Postgraduate Scholarship from the University of Hong Kong. Mr Hui Zong, PhD candidate at Tongji University in China and Mr Yuhan Ping, PhD candidate at Department of Computer Science, the University of Hong Kong, are also acknowledged for discussing the architecture of the proposed HKUDES_Net in great detail. The authors also acknowledge Mr. Frankie Lo and his colleagues from GEO for constructive discussions on data collection and CNN training plans involved in this study. The rock image database is not publicly available at present, and it will be made open source at a later stage. Other data related to the present study are available online (at https://doi.org/10.6084/m9.figshare.21768689.v1).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Louis Ngai Yuen Wong.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 4545 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, Y., Wong, L.N.Y. & Tse, K.K.C. Novel Rock Image Classification: The Proposal and Implementation of HKUDES_Net. Rock Mech Rock Eng 56, 3825–3841 (2023). https://doi.org/10.1007/s00603-023-03235-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00603-023-03235-0

Keywords

Navigation