Abstract
Mineral prospectivity mapping (MPM) is a fundamental task in mineral exploration. In recent years, the random forest (RF), which is recognized as a significant model for ensemble machine learning algorithms, has been widely used in MPM due to its advantages of high performance and robustness. Nevertheless, the RF method does not fully consider the correlation between variables. In this regard, a hybrid model, namely, the projection pursuit RF (PPRF), has been proposed. In this study, for validation and comparison purposes, the PPRF and ordinary RF methods were employed for the MPM of porphyry Cu–Mo deposits in the Eastern Tianshan orogenic belt, northwestern China, under the same conditions. In the case study, the performance of these two methods was critically investigated. The overall prediction accuracies of the PPRF and RF methods were evaluated by the area under the receiver operating characteristic curve and prediction–area plot approaches. To analyze the sensitivity of the PPRF and RF methods to unbalanced data caused by the limited number of known deposits in the study area, the synthetic minority oversampling technique was used to generate a balanced dataset. The results demonstrated that the PPRF method not only outperforms the ordinary RF method in the overall accuracy of MPM but is also less sensitive to unbalanced data than the RF method. Therefore, the PPRF method provides a novel and useful data-driven method with excellent performance for MPM. It utilizes the combination of predictor variables in the projection feature space rather than the variables themselves to identify the inherent characteristics of a given dataset and may thus discover deep hidden features that the ordinary RF method cannot find.
Similar content being viewed by others
References
Abedi M, Norouzi GH, Bahroudi A (2012) Support vector machine for multi-classification of mineral prospectivity areas. Comput Geosci 46:272–283. https://doi.org/10.1016/j.cageo.2011.12.014
Agterberg FP (1989) Computer programs for mineral exploration. Science 245:76–81. https://doi.org/10.1126/science.245.4913.76
Bickel PJ, Kur G, Nadler B (2018) Projection pursuit in high dimensions. Proc Natl Acad Sci 115:9151–9156. https://doi.org/10.1073/pnas.1801177115
Bradley AE (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition 30:1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Carranza EJM (2011) Geocomputation of mineral exploration targets. Comput Geosci 37:1907–1916. https://doi.org/10.1016/j.cageo.2011.11.009
Carranza EJM, Laborte AG (2015a) Data-driven predictive mapping of gold prospectivity, Baguio district, Philippines: application of random forests algorithm. Ore Geol Rev 71:777–787. https://doi.org/10.1016/j.oregeorev.2014.08.010
Carranza EJM, Laborte AG (2015b) Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines). Comput Geosci 74:60–70. https://doi.org/10.1016/j.cageo.2014.10.004
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/jair.953
Chen YC, Liu DQ, Tang YL, Wang DH, Zhou RH, Wang JL, Li HQ, Wang XD (2007) Research on large scale ore concentration area of strategic mineral resources in Xinjiang. Geological Publishing House, Beijing (in Chinese with English abstract)
Chen YC, Liu DQ, Tang YL, Wang DH, Dong LH, Xu X, Wang XD (2008) Mineral resources and mineralization system in Tianshan. Geological Publishing House, Beijing (in Chinese with English abstract)
Chen CH, He BB, Zeng Z (2013) A method for mineral prospectivity mapping integrating C4.5 decision tree, weights-of-evidence and m-branch smoothing techniques: a case study in the eastern Kunlun Mountains, China. Earth Sci Inform 7:13–24. https://doi.org/10.1007/s12145-013-0128-0
Cheng QM, Agterberg FP (1999) Fuzzy weights of evidence method and its application in mineral potential mapping. Nat Resour Res 8:27–35. https://doi.org/10.1023/A:1021677510649
Da Silva N, Cook D, Lee E-K (2021) A projection pursuit forest algorithm for supervised classification. J Comput Graph Stat 30:1168–1180. https://doi.org/10.1080/10618600.2020.1870480
Deng H, Zheng Y, Chen J, Wei YF, Mao XC (2020) Deep learning-based 3D prediction model for the Dayingezhuang gold deposit, Shandong province. Acta Geosci Sin 41:157–165. https://doi.org/10.3975/cagsb.2020.020501. (in Chinese with English abstract)
Egan JP (1975) Signal detection theory and ROC analysis. Academic Press, New York
Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874. https://doi.org/10.1016/j.patrec.2005.10.010
Han CM, Xiao WJ, Zhao GC, Mao JW, Li SZ, Yan Z, Mao QG (2006) Major types, characteristics and geodynamic mechanism of upper Paleozoic copper deposits in northern Xinjiang, northwestern China. Ore Geol Rev 28:308–328. https://doi.org/10.1016/j.oregeorev.2005.04.002
Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36. https://doi.org/10.1148/radiology.143.1.7063747
Hariharan S, Tirodkar S, Porwal A, Bhattacharya A, Joly A (2017) Random forest-based prospectivity modelling of greenfield terrains using sparse deposit data: an example from the Tanami Region, western Australia. Nat Resour Res 26:489–507. https://doi.org/10.1007/s11053-017-9335-6
Harris DP, Pan GC (1999) Mineral favorability mapping: a comparison of artificial neural networks, logistic regression, and discriminant analysis. Nat Resour Res 8:93–109. https://doi.org/10.1023/A:1021886501912
Lee EK (2018) PPtreeviz: an R package for visualizing projection pursuit classification trees. J Stat Softw 83:1–30
Lee YD, Cook D, Park JW, Lee EK (2013) PPtree: projection pursuit classification tree. Electron J Stat 7:1369–1386. https://doi.org/10.1214/13-EJS810
Leite EP, de Souza CR (2009) Artificial neural networks applied to mineral potential mapping for copper-gold mineralizations in the Carajas Mineral Province, Brazil. Geophys Prospect 57:1049–1065. https://doi.org/10.1111/j.1365-2478.2008.00779.x
Mao JW, Yang JM, Han CM, Wang ZL (2002) Metallogenic systems of polymetallic copper and gold deposits and related metallogenic geodynamic model in Eastern Tianshan, Xinjiang. Earth Sci J China Univ Geosci 27:413–424. https://doi.org/10.3321/j.issn:1000-2383.2002.04.010. (in Chinese with English abstract)
Mihalasky MJ, Bonham-Carter GF (2001) Lithodiversity and its spatial association with metallic mineral sites, Great Basin of Nevada. Nat Resour Res 10:209–226. https://doi.org/10.1023/A:1012569225111
Porwal A, González-Álvarez I, Markwitz V, McCuaig TC, Mamuse A (2010) Weights-of-evidence and logistic regression modeling of magmatic nickel sulfide prospectivity in the Yilgarn craton, Western Australia. Ore Geol Rev 38:184–196. https://doi.org/10.1016/j.oregeorev.2010.04.002
Prado EMG, de Souza Filho CR, Carranza EJM, Motta JG (2020) Modeling of Cu–Au prospectivity in the Carajás mineral province (Brazil) through machine learning: dealing with imbalanced training data. Ore Geol Rev 124:103611. https://doi.org/10.1016/j.oregeorev.2020.103611
Rahimi H, Abedi M, Yousefi M, Bahroudi A, Elyasi GR (2021) Supervised mineral exploration targeting and the challenges with the selection of deposit and non-deposit sites thereof. Appl Geochem 128:104940. https://doi.org/10.1016/j.apgeochem.2021.104940
Rodriguez-Galiano V, Sanchez-Castillo M, Chica-Olmo M, Chica-Rivas M (2015) Machine learning predictive models for mineral prospectivity: an evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geol Rev 71:804–818. https://doi.org/10.1016/j.oregeorev.2015.01.001
Singer D, Menzie WD (2010) Quantitative mineral resource assessments: an integrated approach. Oxford University Press, New York
Wang SC (2010) The new development of theory and method of synthetic information mineral resources prognosis. Geol Bull China 29:1399–1403. https://doi.org/10.3969/j.issn.1671-2552.2010.10.002. (in Chinese with English abstract)
Wang ZG, Zhu XQ, Bi H, Wang YL, Wu BQ, Zou TR, Zhao ZH, Gui XT (2006) Granites in Xinjiang. Geological Publishing House, Beijing (in Chinese with English abstract)
Windley BF, Allen MB, Zhang C, Zhao ZY, Wang GR (1990) Paleozoic accretion and Cenozoic redeformation of the Chinese Tianshan range, central Asia. Geology 18:128–131. https://doi.org/10.1130/0091-7613(1990)018%3C0128:PAACRO%3E2.3.CO;2
Witten IH, Eibe F, Hall MA (2011) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, San Francisco
Xia QL, Zhao MY, Wang XC, Leng S, Li TF, Xiong SL (2021) Quantitative prediction of molybdenum-copper polymetallic mineral resources in the Xindalai grassland-covered area of Inner Mongolia based on geological anomalies. Earth Sci Front 28:56–66. https://doi.org/10.13745/j.esf.sf.2021.1.16. (in Chinese with English abstract)
Xiao F, Chen JG (2012) Fractal projection pursuit classification model applied to geochemical survey data. Comput Geosci 45:75–81. https://doi.org/10.1016/j.cageo.2011.10.019
Xiao F, Chen JG (2017) Application of PPC model combined with RCGA to identify and extract geochemial anomaly. J Jilin Univ (earth Sci) 47(4):1319–1330. https://doi.org/10.13278/j.cnki.jjuese.201704307. (in Chinese with English abstract)
Xiao F, Wang ZH (2017) Geological interpretation of Bouguer gravity and aeromagnetic data from the Gobi-desert covered area, eastern Tianshan, China: implications for porphyry Cu–Mo polymetallic deposits exploration. Ore Geol Rev 80:1042–1055. https://doi.org/10.1016/j.oregeorev.2016.08.034
Xiao XC, Tang YQ, Feng YM, Zhu BQ, Li JY, Zhao M (1990) On the tectonic evolution of the northern Xinjiang, northwest China. Geosci Xinjiang 1:47–68 (in Chinese with English abstract)
Xiao WJ, Zhang LC, Qin KZ, Sun S, Li JL (2004) Paleozoic accretionary and collisional tectonics of the Eastern Tianshan (China): implications for the continental growth of central Asia. Am J Sci 304:370–395. https://doi.org/10.2475/ajs.304.4.370
Xiao F, Chen JG, Agterberg F, Wang CB (2014) Element behavior analysis and its implications for geochemical anomaly identification: a case study for porphyry Cu–Mo deposits in eastern Tianshan, China. J Geochem Explor 145:1–11. https://doi.org/10.1016/j.gexplo.2014.04.008
Xiao F, Wang KQ, Hou WS, Erten O (2020a) Identifying geochemical anomaly through spatially anisotropic singularity mapping: a case study from silver-gold deposit in Pangxidong district, SE China. J Geochem Explor 210:106453. https://doi.org/10.1016/j.gexplo.2019.106453
Xiao F, Wang KQ, Hou WS, Wang ZH, Zhou YZ (2020b) Prospectivity mapping for porphyry Cu–Mo mineralization in the eastern Tianshan, Xinjiang, northwestern China. Nat Resour Res 29:89–113. https://doi.org/10.1007/s11053-019-09486-5
Xiao F, Chen WL, Wang J, Erten O (2021) A hybrid logistic regression: gene expression programming model and its application to mineral prospectivity mapping. Nat Resour Res 31:2041–2064. https://doi.org/10.1007/s11053-021-09918-1
Xiong YH, Zuo RG (2018) GIS-based rare events logistic regression for mineral prospectivity mapping. Comput Geosci 111:18–25. https://doi.org/10.1016/j.cageo.2017.10.005
Yang FF, Wang ZY, Zuo RG, Sun SQ, Zhou B (2023) Quantification of uncertainty associated with evidence layers in mineral prospectivity mapping using direct sampling and convolutional neural network. Nat Resour Res 32:79–98. https://doi.org/10.1007/s11053-022-10144-6
Yeomans CM, Shail RK, Grebby S, Nykänen V, Middleton M, Lusty PAJ (2020) A machine learning approach to tungsten prospectivity modelling using knowledge-driven feature extraction and model confidence. Geosci Front 11:2067–2081. https://doi.org/10.1016/j.gsf.2020.05.016
Yin BJ, Zuo RG, Sun SQ (2023) Mineral prospectivity mapping using deep self-attention model. Nat Resour Res 32:37–56. https://doi.org/10.1007/s11053-022-10142-8
Yousefi M, Carranza EJM (2015) Prediction-area (P-A) plot and C-A fractal analysis to classify and evaluate evidential maps for mineral prospectivity modeling. Comput Geosci 79:69–81. https://doi.org/10.1016/j.cageo.2015.03.007
Yousefi M, Hronsky JMA (2023) Translation of the function of hydrothermal mineralization-related focused fluid flux into a mappable exploration criterion for mineral exploration targeting. Appl Geochem 149:105561. https://doi.org/10.1016/j.apgeochem.2023.105561
Yousefi M, Nykänen V (2016) Data-driven logistic-based weighting of geochemical and geological evidence layers in mineral prospectivity mapping. J Geochem Explor 164:94–106. https://doi.org/10.1016/j.gexplo.2015.10.008
Yousefi M, Kreuzer OP, Nykänen V, Hronsky JMA (2019) Exploration information systems—a proposal for the future use of GIS in mineral exploration targeting. Ore Geol Rev 111:103005. https://doi.org/10.1016/j.oregeorev.2019.103005
Yousefi M, Carranza EJM, Kreuzer OP, Nykänen V, Hronsky JMA, Mihalasky MJ (2021) Data analysis methods for prospectivity modelling as applied to mineral exploration targeting: state-of-the-art and outlook. J Geochem Explor 229:106839. https://doi.org/10.1016/j.gexplo.2021.106839
Yu ZB, Liu BL, Xie M, Wu YX, Kong YH, Li C, Chen GD, Gao YX, Zha S, Zhang HY, Wang L, Tang R (2022) 3D mineral prospectivity mapping of Zaozigou gold deposit, west Qinling, China: deep learning-based mineral prediction. Minerals 12:1382. https://doi.org/10.3390/min12111382
Zhang S, Xiao KY, Carranza EJM, Yang F (2018) Maximum entropy and random forest modeling of mineral potential: analysis of gold prospectivity in the Hezuo-Meiwu district, west Qinling orogen, China. Nat Resour Res 28:645–664. https://doi.org/10.1007/s11053-018-9425-0
Zhang S, Carranza EJM, Wei HT, Xiao KY, Yang F, Xiang J, Zhang SH, Xu Y (2021) Data-driven mineral prospectivity mapping by joint application of unsupervised convolutional auto-encoder network and supervised convolutional neural network. Nat Resour Res 30:1011–1031. https://doi.org/10.1007/s11053-020-09789-y
Zhao PD (2007) Quantitative mineral prediction and deep mineral exploration. Earth Sci Front 14(5):1–10. https://doi.org/10.3321/j.issn:1005-2321.2007.05.001. (in Chinese with English abstract)
Zhou TF, Yuan F, Zhang DY, Fan Y, Liu SA, Peng MX, Zhang JD (2010) Geochronology, tectonic setting and mineralization of granitoids in Jueluotage area, eastern Tianshan, Xinjiang. Acta Petrol Sin 26:478–502. https://doi.org/10.0000/9fb98c962dc4463386e058fa62838216. (in Chinese with English abstract)
Zuo RG, Carranza EJM (2011) Support vector machine: a tool for mapping mineral prospectivity. Comput Geosci 37:1967–1975. https://doi.org/10.1016/j.cageo.2010.09.014
Zuo RG, Xu Y (2023) Graph deep learning model for mapping mineral prospectivity. Math Geosci 55:1–21. https://doi.org/10.1007/s11004-022-10015-z
Zuo RG, Xiong YH, Wang J, Carranza EJM (2019) Deep learning and its application in geochemical mapping. Earth Sci Rev 192:1–14. https://doi.org/10.1016/j.earscirev.2019.02.023
Zuo RG, Luo ZJ, Xiong YH, Yin BJ (2022) A geologically constrained variational autoencoder for mineral prospectivity mapping. Nat Resour Res 31:1121–1133. https://doi.org/10.1007/s11053-022-10050-x
Acknowledgements
Thanks due to the two anonymous referees for their constructive comments and suggestions to improve this study. This research was financially supported by the Ministry of Science and Technology of China (Nos. 2022YFF0801201, 2021YFC2900300), National Natural Science Foundation of China (Nos. 41872245, U1911202) and Guangdong Basic and Applied Basic Research Foundation (No. 2020A1515010666).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, M., Xiao, F. Projection Pursuit Random Forest for Mineral Prospectivity Mapping. Math Geosci 55, 963–987 (2023). https://doi.org/10.1007/s11004-023-10070-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11004-023-10070-0