Skip to main content

Advertisement

Log in

Revealing Geochemical Patterns Associated with Mineralization Using t-Distributed Stochastic Neighbor Embedding and Random Forest

  • Special Issue
  • Published:
Mathematical Geosciences Aims and scope Submit manuscript

Abstract

The identification of multivariate geochemical anomalies is critical in mineral exploration. Machine learning algorithms have been successfully employed in the recognition of multivariate geochemical anomalies in support of mineral exploration, owing to their strong ability to learn the complex relationship between geochemical characteristics and mineralization. However, applications of machine learning algorithms suffer from data redundancy and the curse of dimensionality. In this study, a hybrid model combining t-distributed stochastic neighbor embedding (t-SNE) and random forest (RF) was used to solve the aforementioned problems in geochemical mapping for gold exploration in the northwestern Hubei Province of China. Specifically, t-SNE was used for dimension reduction and feature extraction from the major and trace elements of geochemical survey data, and RF was used for probabilistic classification of geochemical patterns related to gold deposits. A comparative study demonstrated that the hybrid model of t-SNE + RF possesses stronger generalization ability than that of PCA + RF and pure RF. Specifically, after 15 experiments, the mean area under the receiver operator characteristic curve (AUC) values of t-SNE + RF, PCA + RF, and pure RF were 0.83, 0.65, and 0.75, respectively. These results suggest that the hybrid model combining t-SNE and RF can more efficiently recognize geochemical anomalies associated with gold mineralization. Compared with PCA, t-SNE can more effectively identify hidden information in complex and nonlinear geochemical survey data. In addition, it can reduce information redundancy and further improve the efficiency of RF for processing multidimensional geochemical survey data. The high-probability areas obtained by t-SNE + RF showed a strong spatial correlation with known gold deposits, which can provide critical clues for further prospecting in the study area.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

Download references

Acknowledgements

We are grateful Dr. Guoxiong Chen (guest editor) and two anonymous reviewers for their valuable comments and suggestions which improved this study. This study was supported by the National Natural Science Foundation of China (Nos. 42172326 and 41972303).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Renguang Zuo.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shi, Z., Zuo, R., Xiong, Y. et al. Revealing Geochemical Patterns Associated with Mineralization Using t-Distributed Stochastic Neighbor Embedding and Random Forest. Math Geosci 55, 321–344 (2023). https://doi.org/10.1007/s11004-022-10024-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11004-022-10024-y

Keywords

Navigation