Classification of Gold-Bearing Particles Using Visual Cues and Cost-Sensitive Machine Learning
- 329 Downloads
- 3 Citations
Abstract
Ore sorting increases the grade of an ore feed stream by separating very low-grade particles (‘waste’) from those containing higher concentrations of the desired mineral (‘ore’), thus economically reducing the amount of material processed in further mineral concentration steps. This paper reports a preliminary study that aims to develop an automated method for discriminating waste and gold-bearing particles. The study used both hyperspectral measurements and RGB images of waste and gold-bearing particles from the Sunrise Dam Gold Mine as input to the discriminating method. Advanced feature extraction methods were employed to capture visual cues such as texture and colour from the RGB images, which were combined with hyperspectral features to give nine types of representative features. Feature selection was applied to groups of the representative features and resulting feature subsets were evaluated using three machine learning algorithms, namely a support vector machine, a naïve Bayes classifier, and a majority decision table, to identify a highly informative subset of features. Cost-sensitive training was used to minimise the nominal profit lost due to sorting error based on real cost values from the milling process, with the aim of economically balancing the ore acceptance rate with the waste rejection rate. A cost-blind support vector machine achieved an ore acceptance rate of 84 % and a waste rejection rate of 87 %, which resulted in $0.98 nominal profit lost per tonne of crushed rock particles. Cost-sensitive training reduced the nominal profit lost to $0.34 per tonne, undercutting the costs associated with refining all particles by $0.24 per tonne.
Keywords
Gold Ore classification Support vector machine MetaCost Hyperspectral reflectance Feature selectionNotes
Acknowledgments
We would like to thank the funding body AngloGold Ashanti Australia, and note that a co-author, John Vann, was VP Mineral Resources for this company at the time the work was done.
References
- ASD Inc (2008) TerraSpec user manual: 600541 rev. gGoogle Scholar
- Berman M, Bischof L, Huntington J (1999) Algorithms and software for the automated identification of minerals using field spectra or hyperspectral imagery. In: 13th international conference on applied geologic remote sensing, pp 222–232Google Scholar
- Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit 30:1145–1159CrossRefGoogle Scholar
- Chatterjee S, Bandopadhyay S, Machuca D (2010) Ore grade prediction using a genetic algorithm and clustering based ensemble neural network model. Math Geosci 42(3):309–326. doi: 10.1007/s11004-010-9264-y CrossRefGoogle Scholar
- Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. doi: 10.1023/A:1022627411411 Google Scholar
- Domingos P (1999) MetaCost: a general method for making classifiers cost-sensitive. In: Proceedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, New York, KDD ’99, pp 155–164. doi: 10.1145/312129.312220
- Domingos P, Pazzani M (1997) On the optimality of the simple Bayesian classifier under zero-one loss. Mach Learn 29(2–3):103–130CrossRefGoogle Scholar
- Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27(8):861–874. doi: 10.1016/j.patrec.2005.10.010 CrossRefGoogle Scholar
- Gama J, de Carvalho A (2009) Machine learning. IGI Global, book section, Hershey, pp 2462–2468. doi: 10.4018/978-1-60566-026-4.ch392
- Hall MA (1999) Correlation-based feature selection for machine learning. PhD thesis, The University of WaikatoGoogle Scholar
- Hall M, Holmes G (2003) Benchmarking attribute selection techniques for discrete class data mining. IEEE Trans Knowl Data Eng 15(6):1437–1447. doi: 10.1109/TKDE.2003.1245283 CrossRefGoogle Scholar
- Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRefGoogle Scholar
- Holden EJ, Fu SC, Kovesi P, Dentith M, Bourne B, Hope M (2011) Automatic identification of responses from porphyry intrusive systems within magnetic data using image analysis. J Appl Geophys 74(4):255–262. doi: 10.1016/j.jappgeo.2011.06.016 CrossRefGoogle Scholar
- Holden EJ, Wong JC, Kovesi P, Wedge D, Dentith M, Bagas L (2012) Identifying structural complexity in aeromagnetic data: an image analysis approach to greenfields gold exploration. Ore Geol Rev 46:47–59. doi: 10.1016/j.oregeorev.2011.11.002 CrossRefGoogle Scholar
- Huang J, Ling CX (2005) Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 17:299–310CrossRefGoogle Scholar
- Hunt EB, Marin J, Stone PJ (1966) Experiments in induction. Academic Press, New YorkGoogle Scholar
- John GH, Kohavi R, Pfleger K (1994) Irrelevant features and the subset selection problem. In: Proceedings of the 11th international conference on machine learning, Morgan Kaufmann, Burlington, pp 121–129Google Scholar
- Kistner M, Jemwa GT, Aldrich C (2013) Monitoring of mineral processing systems by using textural image analysis. Miner Eng 52:169–177. doi: 10.1016/j.mineng.2013.05.022 CrossRefGoogle Scholar
- Kohavi R (1995) The power of decision tables. In: Lavrac N, Wrobel S (eds) Machine learning: ECML-95. Lecture notes in computer science, vol 912. Springer, Berlin, pp 174–189. doi: 10.1007/3-540-59286-5_57
- Kononenko I (1994) Estimating attributes: analysis and extensions of RELIEF. In: Proceedings of the European conference on machine learning on machine learning, ECML-94, pp 171–182. Springer, New YorkGoogle Scholar
- Lewis DD (1998) Naive (Bayes) at forty: the independence assumption in information retrieval. In: Machine learning: ECML-98, pp 4–15. Springer, New YorkGoogle Scholar
- Liu H, Motoda H (2007) Computational methods of feature selection. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series, Taylor & Francis, New YorkGoogle Scholar
- Liu H, Setiono R (1996) A probabilistic approach to feature selection—a filter solution. In: ICML96, pp 319–327Google Scholar
- Loy G, Zelinsky A (2003) Fast radial symmetry for detecting points of interest. IEEE Trans Pattern Anal Mach Intell 25(8):959–973CrossRefGoogle Scholar
- Maron ME (1961) Automatic indexing: an experimental inquiry. J ACM 8(3):404–417. doi: 10.1145/321075.321084 CrossRefGoogle Scholar
- Marsden JO, House CI (2006) Chemistry of gold extraction. SME, EnglewoodGoogle Scholar
- Matheron G (1984) The selectivity of the distributions and “the second principle of geostatistics”. In: Verly G, David M, Journel A, Marechal A (eds) Geostatistics for natural resources characterization, pp 421–433. Springer, The Netherlands. doi: 10.1007/978-94-009-3699-7_24
- Nugus M, Briggs M, Tombs S, Elms P, Erickson M (2013) Sunrise dam gold mine, AngloGold Ashanti. In: Australasian mining and metallurgical operating practices: the Sir Maurice Mawby memorial volume. Monograph, vol 2, 3rd edn. The Australian Institute of Mining and Metallurgy, Carlton, p 1920Google Scholar
- Ojala T, Pietikinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recognit 29(1):51–59. doi: 10.1016/0031-3203(95)00067-4 CrossRefGoogle Scholar
- Perez CA, Estvez PA, Vera PA, Castillo LE, Aravena CM, Schulz DA, Medina LE (2011) Ore grade estimation by feature selection and voting using boundary detection in digital image analysis. Int J Miner Process 101(14):28–36. doi: 10.1016/j.minpro.2011.07.008 CrossRefGoogle Scholar
- Platt JC (1999) Fast training of support vector machines using sequential minimal optimization. In: Advances in kernel methods. MIT Press, Cambridge, pp 185–208Google Scholar
- Rendu JM (2008) Introduction to cut-off grade estimation. Society for Mining, Metallurgy, and Exploration (SME), EnglewoodGoogle Scholar
- Singh V, Rao SM (2005) Application of image processing and radial basis neural network techniques for ore sorting and ore classification. Miner Eng 18(15):1412–1420. doi: 10.1016/j.mineng.2005.03.003 CrossRefGoogle Scholar
- Tahmasebi P, Hezarkhani A (2012) A hybrid neural networks-fuzzy logic–genetic algorithm for grade estimation. Comput Geosci 42:18–27. doi: 10.1016/j.cageo.2012.02.004 CrossRefGoogle Scholar
- Tessier J, Duchesne C, Gauthier C, Dufour G (2008) Estimation of alumina content of anode cover materials using multivariate image analysis techniques. Chem Eng Sci 63(5):1370–1380. doi: 10.1016/j.ces.2007.07.028 CrossRefGoogle Scholar