Skip to main content

Advertisement

Log in

Customized decision tree-based approach for classification of soil on cloud environment

  • Regular Paper
  • Published:
Computing Aims and scope Submit manuscript

Abstract

Agriculture is the economic backbone and the main means of livelihood in numerous developing countries. Numerous challenges related to farming and agriculturists exist. Cultivators face crop loss due to inappropriate selection of crops, inappropriate use of fertilizers, alterations in soil, ambiguous conditions in climate, and so on. The type of soil forms a crucial element in agriculture.The class of soil plays an important role in identifying what kind of crop should be planted along with the manure type to be applied. Classification of soil is essential to make effective use of the resources of soil. The texture of the soil has a major impact on crop growth. The role played by soil texture in determining the type of crop to be grown is significant. It is also employed in soil labs for determining the categories of soil. Soil texture plays a major role in determining the suitability of crops and handling famines. Soil chemical properties include “Electrical Conductivity” (\(E_C\)), “Organic Carbon” (\(O_C\)), “Phosphorous” (P), “Potassium” (K), “Power of Hydrogen” (\(P_H\)), “Zinc” (\(Z_n\)), “Boron” (B), and “Sulphur” (S). The crop growth is heavily influenced by the soil’s chemical composition. Keeping these considerations in mind, this work develops a customised decision tree (\(C_{DT}\)) that serves as a soil classifier (SC). A predictive framework is then devised that utilises the \(C_{DT}\) to perform the soil classification based on the texture of the soil and its chemical properties. Extensive experiments on several real-world soil datasets from Karnataka, India; and benchmark agricultural datasets such as seeds, Urban Land Cover (ULC), Satellite Image of Land Data (LS), and Forest Cover Type (FCT) were conducted. The results demonstrated that the designed \(C_{DT}\) classifier outperformed existing classifiers such as k-Nearest Neighbor (\(K_{NN}\)), Logistic Regression (\(L_R\)), Artificial Neural Network (\(A_{NN}\)), Classification and Regression Trees (\(C_{ART}\)), \(C_{4.5}\)), traditional SVM (\(S_{VM}\)), and Random Forest (\(R_F\)) in terms of Accuracy (\(A_{cc}\)), Sensitivity (\(S_{ens}\)), Specificity (\(S_{pec}\)), Precision (\(P_{rec}\)), and F-Score (\(F_S\)) on these datasets. The devised SC was deployed on the Heroku (Hk) cloud for effective access. Effective access in terms of end-user availability at all times was provided. An expert system for soil classification was built to provide information about soil classification round the clock using an internet-enabled device to the stakeholders of agriculture, such as cultivators and agricultural organizations. The agricultural raw data was stored in the form of blob objects on Amazon S3 (AS3).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  1. Pardey PG, Alston JM, Piggott RR (2009) Agricultural R &D in the developing world: Too little, too late? Int Food Policy Res Inst 47:109–112

    Google Scholar 

  2. Neenu S, Biswas AK, Subba Rao A (2013) Impact of climatic factors on crop production—a review. Agric Rev 34:97–106

    Google Scholar 

  3. Seth A, Ganguly K (2017) Digital technologies transforming Indian agriculture. Technical report, The Global Innovation Index

  4. Mucherino A, Papajorgji PJ, Pardalos PM (2009) Data mining in agriculture, 1st edn. Springer Publishing Company, Incorporated

  5. Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd ed. Morgan Kaufmann Publishers Inc

  6. Chung S-O, Cho K-H, Kong J-W, Sudduth KA, Jung K-Y (2010) Soil texture classification algorithm using RGB characteristics of soil images. IFAC Proc Vol 43:34–38

    Article  Google Scholar 

  7. Wu W, Li A-D, He X-H, Ma R, Liu H-B, Lv J-K (2018) A comparison of support vector machines, artificial neural network, and classification tree for identifying soil texture classes in southwest china. Comput Electron Agric 144:86–93

    Article  Google Scholar 

  8. Davis ROE, Bennett HH (1927) Grouping of soils on the basis of mechanical analysis. https://archive.org/details/groupingofsoilso419davi

  9. Suhas W, Kanwar S, Sarvesh KV, Baburao M, Krishnappa K (2011) Soil fertility atlas for Karnataka, India, 1st ed. International Crops Research Institute for the Semi-Arid Tropics

  10. Delgado A, Gómez José A (2016) The soil. Physical, chemical and biological properties. Springer

  11. Suman BBN (2013) Soil classification and fertilizer recommendation using WEKA. Int J Comput Sci Manag Stud 13:142–146

    Google Scholar 

  12. Gholap J, Ingole A, Gohil J, Gargade S, Attar V (2012) Soil data analysis using classification techniques and soil attribute prediction. Int J Comput Sci Iss 9:415–418

    Google Scholar 

  13. Kisan S (2016) http://www.kisansuvidha.com/

  14. Sitton JD, Zeinali Y, Story BA (2017) Rapid soil classification using artificial neural networks for use in constructing compressed earth blocks. Constr Build Mater 138:214–221

    Article  Google Scholar 

  15. Hassannejad H, Pakbaz MS, Mehdizadeh R (2015) Comparison and evaluation of artificial neural network (ANN) training algorithms in predicting soil type classification. Bull Environ Pharmacol Life Sci 4:212–218

    Google Scholar 

  16. Zhai Y, Thomasson JA, Boggess JE, Sui R (2006) Soil texture classification with artificial neural networks operating on remote sensing data. Comput Electron Agric 54:53–68

    Article  Google Scholar 

  17. Goktepe AB, Altun S, Sezer A (2005) Soil clustering by fuzzy c-means algorithm. Adv Eng Softw 36:691–698

    Article  Google Scholar 

  18. Bhargavi P, Jyothi S (2011) Soil classification using data mining techniques: a comparative study. Int J Eng Trends Technol 1:55–59

    Google Scholar 

  19. Manjula E, Djodiltachoumy S (2017) Data mining technique to analyse soil nutrients based on hybrid classification. Int J Adv Res Comput Sci 8:505–510

    Article  Google Scholar 

  20. Awasthi N, Bansal A (2017) Application of data mining classification techniques on soil data using R. Int J Adv Electron Comput Sci 4:33–37

    Google Scholar 

  21. Hemageetha N, Nasira GM (2016) Analysis of soil condition based on pH value using classification techniques. IOSR J Comput Eng 18:50–54

    Google Scholar 

  22. Karthik D, Vijayarekha K, Manickkam V (2014) Land characterizations based on soil properties using clustering techniques. World Appl Sci J 29:60–64

    Google Scholar 

  23. Kovacevic M, Bajat B, Gajic B (2010) Soil type classification and estimation of soil properties using support vector machines. Geoderma 154:340–347

    Article  Google Scholar 

  24. Bhattacharya B, Solomatine DP (2006) Machine learning in soil classification. Neural Netw 19:186–195

    Article  MATH  Google Scholar 

  25. Taneja S, Arora R, Kaur S (2012) Mining of soil data using unsupervised learning technique. Int J Appl Eng Res 7:1–7

    Google Scholar 

  26. Harlianto PA, Adji TB, Setiawan NA (2017) Agriculture yield prediction using predictive analytic techniques. In: 3rd IEEE International conference on science and technology - computer (ICST), pp 7–10

  27. Paul M, Vishwakarma SK, Verma A (2015) Analysis of soil behaviour and prediction of crop yield using data mining approach. In: IEEE international conference on computational intelligence and communication networks (CICN), pp 766–771

  28. Smith D, Peng W (2009) Machine learning approaches for soil classification in a multi-agent deficit irrigation control system. In: IEEE international conference on industrial technology, pp 1–6

  29. Hot E, Popovic-Bugarin V (2015) Soil data clustering by using k-means and fuzzy k-means algorithm. In: 23rd IEEE telecommunications forum Telfor (TELFOR), pp 890-893

  30. Mutalib S, Jamian SNF, Abdul-Rahman S, Mohamed A(2010) Soil classification: An application of self-organising map and k-means. In: 10th IEEE international conference on intelligent systems design and applications, pp 439–444

  31. Vibha L, HarshaVardhan GM, Prashanth SJ, Shenoy PD, Venugopal KR, Patnaik LM (2007) A hybrid clustering and classification technique for soil data mining. In: 2007 IET-UK international conference on information and communication technology in electrical sciences (ICTES 2007), pp 1090–1095

  32. Samundeeswari JK, Srinivasan K Dr (2017) Data mining techniques in agriculture prediction of soil fertility. Int J Sci Eng Res 8:45–51

  33. Exploring soil texture (2011) http://flux.aos.wisc.edu/adesai/documents/cmn/2011/activities/Soiltexture.pdf

  34. Sirsat MS, Cernadas E, Fernandez Delgado M, Khan R (2017) Classification of agricultural soil parameters in India. Comput Electron Agric 135:269–279

    Article  Google Scholar 

  35. Ribeiro Mariana V, Cunha Luiz Manoel S, Camargo Heloisa A, Rodrigues Luiz Henrique A (2014) Applying a fuzzy decision tree approach to soil classification. In: Information processing and management of uncertainty in knowledge-based systems. Springer, pp 87–96

  36. Sumangala K, Nithya G (2012) Comparative study on bio-inspired approach for soil classification. Int J Comput Appl 38:32–37

    Google Scholar 

  37. Costa Eduardo P, Lorena Ana C, Carvalho Andre CPLF, Freitas Alex A (2007) A review of performance evaluation measures for hierarchical classifiers. In: Evaluation methods for machine learning II: papers from the AAAI-2007 Workshop, AAAI Technical Report WS-07-05, 1-6. AAAI Press

  38. Charytanowicz M, Niewczas J, Kulczycki P, Kowalski PA, Kukasik S, Zak S (2010) Complete gradient clustering algorithm for features analysis of x-ray images. In: Pietka E, Kawa J (eds) Information technologies in biomedicine. Springer, Berlin, pp 15–24

    Chapter  Google Scholar 

  39. Symeonaki E, Arvanitis K, Piromalis D (2017) Review on the trends and challenges of cloud computing technology in climate—smart agriculture. In: Proceedings of the 8th international conference on information and communication technologies in agriculture, food and environment, pp 21–24

  40. Hiestermann J, Ferreira SL (2017) Cloud-based agricultural solution: a case study of near real-time regional agricultural crop growth information in South Africa. In: International archives of the photogrammetry, remote sensing & spatial information sciences, pp 79–82

  41. Gill SS, Chana I, Buyya R (2017) IoT based agriculture as a cloud and big data service: the beginning of digital India. J Org End User Comput 29:1–23

    Article  Google Scholar 

  42. Bhagyashri Deyannavar S, Rajpurohit Vijay S, Sambrekar Kuldeep P (2017) A survey on cloud-based framework for providing updates on agricultural products. Int J Emerg Technol Comput Sci Electron (IJETCSE) 14(2):425–428

    Google Scholar 

  43. Kaloxylos A, Groumas A, Sarris V, Katsikas L, Magdalinos P, Antoniou E, Politopoulou Z, Wolfert S, Brewster C, Eigenmann R, Maestre TC (2014) A cloud-based farm management system: architecture and implementation. Comput Electron Agric 100:168–179

    Article  Google Scholar 

  44. Tuli A, Hasteer N, Sharma M, Bansal A (2014) Framework to leverage cloud for the modernization of the indian agriculture system. In: IEEE International Conference on Electro/Information Technology. IEEE, pp 109-115

  45. Venkataramana K, Padmavathamma M (2012) A design of framework for agricloud. IOSR J Comput Eng (IOSRJCE) 4(5):1–6

    Article  Google Scholar 

  46. Murakami Y, Utomo SKT, Hosono K, Umezawa T, Osawa N (2013) Ifarm: development of cloud-based system of cultivation management for precision agriculture. In: IEEE 2nd global conference on consumer electronics (GCCE). IEEE, pp 233–234

  47. Prasad S, Peddoju SK, Ghosh D (2013) Agromobile: a cloud-based framework for agriculturists on mobile platform. Int J Adv Sci Technol 59:41–52

    Article  Google Scholar 

  48. Wakhare MM, Choure AG (2017) Design of framework for agriculture web service. Int J Adv Electron Comput Sci 4(1):38–41

    Google Scholar 

  49. Janssen SJC, Porter CH, Moore AD, Athanasiadis IN, Foster I, Jones JW, Antle JM (2017) Towards a new generation of agricultural system data, models and knowledge products: information and communication technology. Agric Syst 155:200–212

    Article  Google Scholar 

  50. Tan L (2016) Cloud-based decision support and automation for precision agriculture in orchards. In: 5th IFAC conference on sensing. Control and automation technologies for agriculture agricontrol 49(16):330–335

  51. Balaji Prabhu BV, Dakshayini M (2016) A novel cloud based data analytics framework for effective crop management. Int J Control Theory Appl 9:257–264

    Google Scholar 

  52. Awuor F et al (2016) Building e-agriculture framework in Kenya. J Agric Inf 7(1):75–93

    Google Scholar 

  53. Khattab A, Abdelgawad A, Yelmarthi K (2016) Design and implementation of a cloud-based IOT scheme for precision agriculture. In: 28th International conference on microelectronics (ICM). IEEE, pp 201–204

  54. Praveen B, Vishwesh M (2013) Agriculture updates via sms-a cloud computing approach. Int J Innov Manag Technol 4(5):523–527

    Article  Google Scholar 

  55. Cho Y, Cho K, Shin C, Park J, Lee E-S (2012) An agricultural expert cloud for a smart farm. Future Information Technology. Application, and Service. Springer, Netherlands, pp 657–662

  56. Aditya Shastry K, Sanjay HA, Deexith G (2017) Quadratic-radial-basis function- kernel for classifying multi-class agricultural datasets with continuous attributes. Appl Soft Comput 58:65–74

    Article  Google Scholar 

  57. Aci M, Inan C, Avci M (2010) A hybrid classification method of nearest neighbor, Bayesian methods and genetic algorithm. Expert Syst Appl 37(7):5061–5067

    Article  Google Scholar 

  58. Zhang Z, Zhu Q, Xie Y (2012) A novel image matting approach based on naive bayes classifier. In Intelligent Computing Technology. Springer, Berlin, pp 433–441

  59. National bureau of soil survey and land use planning (2014). https://www.nbsslup.in//

  60. Socio-economic statistical information about India (2015) http://www.indiastat.com/agriculture/

  61. Klibi S, Tounsi K, Rebah ZB, Solaiman B, Farah IR (2016) Soil salinity prediction using a machine learning approach through hyperspectral satellite image. In: 2020 5th International conference on advanced technologies for signal and image processing (ATSIP), pp 1–6. https://doi.org/10.1109/ATSIP49331.2020.9231870.

  62. Maagorzata C, Jerzy N, Piotr K, Kowalski PA, Szymon L, Slawomir Z (2012) UCI repository of machine learning, databases

  63. Blackard Jock A, Dean DJ, Anderson CW (1998) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/covertype

  64. Ashwin S (1993) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/Statlog+(Landsat+Satellite)

  65. Brian J (2014) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/Urban+Land+Cover

  66. Patro SG, Sahu KK (2015) Normalization: a pre-processing stage. IARJSET, 03

  67. How heroku works (2014) https://devcenter.heroku.com/articles/how-heroku-works

  68. Yasrab R (2018) Platform-as-a-service (paas): the next hype of cloud computing, pp 1-21. CoRR arXiv:1804.10811

  69. Slug compiler (2014) https://devcenter.heroku.com/articles/slug-compiler

  70. Zhu J (2010) Cloud computing technologies and applications. Springer, pp 21–45

  71. Sarna DEY (2010) Implementing and developing cloud computing applications, 1st ed. Auerbach Publications

  72. Sahu AK (2013) Java web deployment in cloud computing. Int J Comput Appl 75(15):31–34

    Google Scholar 

  73. Vaadin docs (2015) https://vaadin.com/docs/v8

  74. Amazon s3 (2015) https://aws.amazon.com/s3/

  75. Yasrab Robail (2018) Paas cloud: the business perspective. CoRR arXiv:1803.07680

  76. Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45(4):427–437

    Article  Google Scholar 

  77. Sanchez MA, Castillo O, Castro JR, Melin P (2014) Fuzzy granular gravitational clustering algorithm for multivariate data. Inf Sci 279:498–511

    Article  MathSciNet  MATH  Google Scholar 

  78. Hussain L, Ajaz RH (2015) Seed classification using machine learning techniques. J Multidiscip Eng Sc Technol (JMEST) 02:1098–1102

    Google Scholar 

  79. Wang X, Bai Y (2016) The global minmax k-means algorithm. SpringerPlus 5:1. https://doi.org/10.1186/s40064-016-3329-4

    Article  Google Scholar 

  80. Hammami I, Mercier G, Hamouda A, Dezert J (2016) Kohonena’s map approach for the belief mass modeling. IEEE Trans Neural Netw Learn Syst 27(10):2060–2071

    Article  MathSciNet  Google Scholar 

  81. Furnkranz J (2001) Round robin rule learning. In: Proceedings of the 18th international conference on machine learning, ICML ’01, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc., pp 146–153

  82. Gupta A, Jagadeesh R, Harshad S, Aniruddha Z (2015) Classifying forest categories using cartographic variables. Technical report, Indian Institute of Technology

  83. Alwadi MD, Chetty G (2015) Energy efficient data mining scheme for high dimensional data. Procedia Comput Sci 46(12):483–490. https://doi.org/10.1016/j.procs.2015.02.047

    Article  Google Scholar 

  84. Au A, Eccles J, Haynes A, Thatcher T, Zhang Y (2012) A comparison of classification methods for forest cover type. http://www.andrehaynes.me/static/cover-type.pdf

  85. Ghosh S, Biswas D, Biswas S, Sarkat D, Sarkar PP (2016) Soil classification from large imagery databases using a neuro-fuzzy classifier. Can J Electr Comput Eng 39(12):333–343. https://doi.org/10.1109/CJECE.2016.2596767

    Article  Google Scholar 

  86. Sjoqvist H (2017) Classifying forest cover type with cartographic variables via the support vector machine, naive bayes and random forest classifiers, Master’s thesis, School of Business and Economics, Spring

  87. Tetko IV (2002) Associative neural network. Neural Process Lett 16(2):187–199

    Article  MATH  Google Scholar 

  88. Al-Obeidat F, Al-Taani AT, Belacel N, Feltrin L, Banerjee N (2015) A fuzzy decision tree for processing satellite images and landsat data. Procedia Comput Sci 52:1192–1197

    Article  Google Scholar 

  89. Michie Donald, Spiegelhalter DJ, Taylor CC, Campbell J (eds) (1994) Machine learning, neural and statistical classification, Ellis Horwood, Upper Saddle River

  90. Asofur YR, Carpenter GA, Grossberg S (1995) Landsat satellite image segmentation using the fuzzy artmap neural network

  91. Liu FT, Ting KM, Zhou Z-H (2008) Isolation forest. In: Proceedings of the 2008 8th IEEE international conference on data mining, ICDM ’08, Washington, DC, USA. IEEE Computer Society, pp 413–422. https://doi.org/10.1109/ICDM.2008.17

  92. Ting KM (2009) Adaptive anomaly detection using isolation forest. Technical report, Gippsland School of Information Technology

  93. Wang S, Mathew A, Chen Y, Xi L, Ma L, Lee J (2009) Empirical analysis of support vector machine ensemble classifiers. Expert Syst Appl 36(3):6466–6476

    Article  Google Scholar 

  94. Iglesias JA, Ledezma A, Sanchis A (2013) Ensemble method based on individual evolving classifiers. In: 2013 IEEE conference on evolving and adaptive intelligent systems (EAIS), pp 56–61. https://doi.org/10.1109/EAIS.2013.6604105

  95. Johnson B, Xie Z (2013) Classifying a high-resolution image of an urban area using super-object information. ISPRS J Photogramm Remote Sens 83(9):40–49

    Article  Google Scholar 

  96. Johnson B (2013) High-resolution urban land-cover classification using a competitive multi- scale object-based approach. Remote Sens Lett 4(2):131–140. https://doi.org/10.1080/2150704X.2012.705440

    Article  Google Scholar 

  97. Giaccom Ribeiro BM, Garcia Fonseca LM (2013) Urban land cover classification using worldview-2 images and c4.5 algorithm. In Joint Urban Remote Sensing Event 250–253:2013. https://doi.org/10.1109/JURSE.2013.6550712

    Article  Google Scholar 

  98. Oszust M (2015) Evaluation of subspace clustering using internal validity measures. Adv Electr Comput Eng 15(8):141–146

    Article  Google Scholar 

  99. Abro MA, Lashari H, Abro WA (2016) Performance analysis of dissimilar classification methods using rapidminer. Sindh Univ Res J (Sci Ser) 48(3):185–188

    Google Scholar 

  100. Blackard JA, Dean DJ (1999) Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables. Comput Electron Agric 24(3):131–151

    Article  Google Scholar 

  101. Zhang Z (2016) Introduction to machine learning: k-nearest neighbors. Ann Transl Med 4(11):1–7

    Article  Google Scholar 

  102. Rish I (2001) An empirical study of the naive bayes classifier. IJCAI 2001 Work Empir Methods Artif Intell 3(1):41–46

  103. The platform for open innovation and collaboration (2016). http://www.eclipse.org/

  104. Li X, Fan P, Li Z, Chen G, Qiu H, Hou G (2021) Soil classification based on deep learning algorithm and visible near-infrared spectroscopy. J Spectrosc 2021:11. https://doi.org/10.1155/2021/1508267

    Article  Google Scholar 

  105. Srivastava P, Shukla A, Bansal A (2021) A comprehensive review on soil classification using deep learning and computer vision techniques. Multimed Tools Appl 80:14887–14914. https://doi.org/10.1007/s11042-021-10544-5

    Article  Google Scholar 

  106. Eyo EU, Abbey SJ, Lawrence TT, Tetteh FK (2022) Improved prediction of clay soil expansion using machine learning algorithms and meta-heuristic dichotomous ensemble classifiers. Geosci Front 13(1):1. https://doi.org/10.1016/j.gsf.2021.101296

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Aditya Shastry.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shastry, K.A., Sanjay, H.A. Customized decision tree-based approach for classification of soil on cloud environment. Computing 105, 1295–1336 (2023). https://doi.org/10.1007/s00607-022-01144-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-022-01144-8

Keywords

Mathematics Subject Classification

Navigation