Abstract
Agriculture is the economic backbone and the main means of livelihood in numerous developing countries. Numerous challenges related to farming and agriculturists exist. Cultivators face crop loss due to inappropriate selection of crops, inappropriate use of fertilizers, alterations in soil, ambiguous conditions in climate, and so on. The type of soil forms a crucial element in agriculture.The class of soil plays an important role in identifying what kind of crop should be planted along with the manure type to be applied. Classification of soil is essential to make effective use of the resources of soil. The texture of the soil has a major impact on crop growth. The role played by soil texture in determining the type of crop to be grown is significant. It is also employed in soil labs for determining the categories of soil. Soil texture plays a major role in determining the suitability of crops and handling famines. Soil chemical properties include “Electrical Conductivity” (\(E_C\)), “Organic Carbon” (\(O_C\)), “Phosphorous” (P), “Potassium” (K), “Power of Hydrogen” (\(P_H\)), “Zinc” (\(Z_n\)), “Boron” (B), and “Sulphur” (S). The crop growth is heavily influenced by the soil’s chemical composition. Keeping these considerations in mind, this work develops a customised decision tree (\(C_{DT}\)) that serves as a soil classifier (SC). A predictive framework is then devised that utilises the \(C_{DT}\) to perform the soil classification based on the texture of the soil and its chemical properties. Extensive experiments on several real-world soil datasets from Karnataka, India; and benchmark agricultural datasets such as seeds, Urban Land Cover (ULC), Satellite Image of Land Data (LS), and Forest Cover Type (FCT) were conducted. The results demonstrated that the designed \(C_{DT}\) classifier outperformed existing classifiers such as k-Nearest Neighbor (\(K_{NN}\)), Logistic Regression (\(L_R\)), Artificial Neural Network (\(A_{NN}\)), Classification and Regression Trees (\(C_{ART}\)), \(C_{4.5}\)), traditional SVM (\(S_{VM}\)), and Random Forest (\(R_F\)) in terms of Accuracy (\(A_{cc}\)), Sensitivity (\(S_{ens}\)), Specificity (\(S_{pec}\)), Precision (\(P_{rec}\)), and F-Score (\(F_S\)) on these datasets. The devised SC was deployed on the Heroku (Hk) cloud for effective access. Effective access in terms of end-user availability at all times was provided. An expert system for soil classification was built to provide information about soil classification round the clock using an internet-enabled device to the stakeholders of agriculture, such as cultivators and agricultural organizations. The agricultural raw data was stored in the form of blob objects on Amazon S3 (AS3).
Similar content being viewed by others
References
Pardey PG, Alston JM, Piggott RR (2009) Agricultural R &D in the developing world: Too little, too late? Int Food Policy Res Inst 47:109–112
Neenu S, Biswas AK, Subba Rao A (2013) Impact of climatic factors on crop production—a review. Agric Rev 34:97–106
Seth A, Ganguly K (2017) Digital technologies transforming Indian agriculture. Technical report, The Global Innovation Index
Mucherino A, Papajorgji PJ, Pardalos PM (2009) Data mining in agriculture, 1st edn. Springer Publishing Company, Incorporated
Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd ed. Morgan Kaufmann Publishers Inc
Chung S-O, Cho K-H, Kong J-W, Sudduth KA, Jung K-Y (2010) Soil texture classification algorithm using RGB characteristics of soil images. IFAC Proc Vol 43:34–38
Wu W, Li A-D, He X-H, Ma R, Liu H-B, Lv J-K (2018) A comparison of support vector machines, artificial neural network, and classification tree for identifying soil texture classes in southwest china. Comput Electron Agric 144:86–93
Davis ROE, Bennett HH (1927) Grouping of soils on the basis of mechanical analysis. https://archive.org/details/groupingofsoilso419davi
Suhas W, Kanwar S, Sarvesh KV, Baburao M, Krishnappa K (2011) Soil fertility atlas for Karnataka, India, 1st ed. International Crops Research Institute for the Semi-Arid Tropics
Delgado A, Gómez José A (2016) The soil. Physical, chemical and biological properties. Springer
Suman BBN (2013) Soil classification and fertilizer recommendation using WEKA. Int J Comput Sci Manag Stud 13:142–146
Gholap J, Ingole A, Gohil J, Gargade S, Attar V (2012) Soil data analysis using classification techniques and soil attribute prediction. Int J Comput Sci Iss 9:415–418
Kisan S (2016) http://www.kisansuvidha.com/
Sitton JD, Zeinali Y, Story BA (2017) Rapid soil classification using artificial neural networks for use in constructing compressed earth blocks. Constr Build Mater 138:214–221
Hassannejad H, Pakbaz MS, Mehdizadeh R (2015) Comparison and evaluation of artificial neural network (ANN) training algorithms in predicting soil type classification. Bull Environ Pharmacol Life Sci 4:212–218
Zhai Y, Thomasson JA, Boggess JE, Sui R (2006) Soil texture classification with artificial neural networks operating on remote sensing data. Comput Electron Agric 54:53–68
Goktepe AB, Altun S, Sezer A (2005) Soil clustering by fuzzy c-means algorithm. Adv Eng Softw 36:691–698
Bhargavi P, Jyothi S (2011) Soil classification using data mining techniques: a comparative study. Int J Eng Trends Technol 1:55–59
Manjula E, Djodiltachoumy S (2017) Data mining technique to analyse soil nutrients based on hybrid classification. Int J Adv Res Comput Sci 8:505–510
Awasthi N, Bansal A (2017) Application of data mining classification techniques on soil data using R. Int J Adv Electron Comput Sci 4:33–37
Hemageetha N, Nasira GM (2016) Analysis of soil condition based on pH value using classification techniques. IOSR J Comput Eng 18:50–54
Karthik D, Vijayarekha K, Manickkam V (2014) Land characterizations based on soil properties using clustering techniques. World Appl Sci J 29:60–64
Kovacevic M, Bajat B, Gajic B (2010) Soil type classification and estimation of soil properties using support vector machines. Geoderma 154:340–347
Bhattacharya B, Solomatine DP (2006) Machine learning in soil classification. Neural Netw 19:186–195
Taneja S, Arora R, Kaur S (2012) Mining of soil data using unsupervised learning technique. Int J Appl Eng Res 7:1–7
Harlianto PA, Adji TB, Setiawan NA (2017) Agriculture yield prediction using predictive analytic techniques. In: 3rd IEEE International conference on science and technology - computer (ICST), pp 7–10
Paul M, Vishwakarma SK, Verma A (2015) Analysis of soil behaviour and prediction of crop yield using data mining approach. In: IEEE international conference on computational intelligence and communication networks (CICN), pp 766–771
Smith D, Peng W (2009) Machine learning approaches for soil classification in a multi-agent deficit irrigation control system. In: IEEE international conference on industrial technology, pp 1–6
Hot E, Popovic-Bugarin V (2015) Soil data clustering by using k-means and fuzzy k-means algorithm. In: 23rd IEEE telecommunications forum Telfor (TELFOR), pp 890-893
Mutalib S, Jamian SNF, Abdul-Rahman S, Mohamed A(2010) Soil classification: An application of self-organising map and k-means. In: 10th IEEE international conference on intelligent systems design and applications, pp 439–444
Vibha L, HarshaVardhan GM, Prashanth SJ, Shenoy PD, Venugopal KR, Patnaik LM (2007) A hybrid clustering and classification technique for soil data mining. In: 2007 IET-UK international conference on information and communication technology in electrical sciences (ICTES 2007), pp 1090–1095
Samundeeswari JK, Srinivasan K Dr (2017) Data mining techniques in agriculture prediction of soil fertility. Int J Sci Eng Res 8:45–51
Exploring soil texture (2011) http://flux.aos.wisc.edu/adesai/documents/cmn/2011/activities/Soiltexture.pdf
Sirsat MS, Cernadas E, Fernandez Delgado M, Khan R (2017) Classification of agricultural soil parameters in India. Comput Electron Agric 135:269–279
Ribeiro Mariana V, Cunha Luiz Manoel S, Camargo Heloisa A, Rodrigues Luiz Henrique A (2014) Applying a fuzzy decision tree approach to soil classification. In: Information processing and management of uncertainty in knowledge-based systems. Springer, pp 87–96
Sumangala K, Nithya G (2012) Comparative study on bio-inspired approach for soil classification. Int J Comput Appl 38:32–37
Costa Eduardo P, Lorena Ana C, Carvalho Andre CPLF, Freitas Alex A (2007) A review of performance evaluation measures for hierarchical classifiers. In: Evaluation methods for machine learning II: papers from the AAAI-2007 Workshop, AAAI Technical Report WS-07-05, 1-6. AAAI Press
Charytanowicz M, Niewczas J, Kulczycki P, Kowalski PA, Kukasik S, Zak S (2010) Complete gradient clustering algorithm for features analysis of x-ray images. In: Pietka E, Kawa J (eds) Information technologies in biomedicine. Springer, Berlin, pp 15–24
Symeonaki E, Arvanitis K, Piromalis D (2017) Review on the trends and challenges of cloud computing technology in climate—smart agriculture. In: Proceedings of the 8th international conference on information and communication technologies in agriculture, food and environment, pp 21–24
Hiestermann J, Ferreira SL (2017) Cloud-based agricultural solution: a case study of near real-time regional agricultural crop growth information in South Africa. In: International archives of the photogrammetry, remote sensing & spatial information sciences, pp 79–82
Gill SS, Chana I, Buyya R (2017) IoT based agriculture as a cloud and big data service: the beginning of digital India. J Org End User Comput 29:1–23
Bhagyashri Deyannavar S, Rajpurohit Vijay S, Sambrekar Kuldeep P (2017) A survey on cloud-based framework for providing updates on agricultural products. Int J Emerg Technol Comput Sci Electron (IJETCSE) 14(2):425–428
Kaloxylos A, Groumas A, Sarris V, Katsikas L, Magdalinos P, Antoniou E, Politopoulou Z, Wolfert S, Brewster C, Eigenmann R, Maestre TC (2014) A cloud-based farm management system: architecture and implementation. Comput Electron Agric 100:168–179
Tuli A, Hasteer N, Sharma M, Bansal A (2014) Framework to leverage cloud for the modernization of the indian agriculture system. In: IEEE International Conference on Electro/Information Technology. IEEE, pp 109-115
Venkataramana K, Padmavathamma M (2012) A design of framework for agricloud. IOSR J Comput Eng (IOSRJCE) 4(5):1–6
Murakami Y, Utomo SKT, Hosono K, Umezawa T, Osawa N (2013) Ifarm: development of cloud-based system of cultivation management for precision agriculture. In: IEEE 2nd global conference on consumer electronics (GCCE). IEEE, pp 233–234
Prasad S, Peddoju SK, Ghosh D (2013) Agromobile: a cloud-based framework for agriculturists on mobile platform. Int J Adv Sci Technol 59:41–52
Wakhare MM, Choure AG (2017) Design of framework for agriculture web service. Int J Adv Electron Comput Sci 4(1):38–41
Janssen SJC, Porter CH, Moore AD, Athanasiadis IN, Foster I, Jones JW, Antle JM (2017) Towards a new generation of agricultural system data, models and knowledge products: information and communication technology. Agric Syst 155:200–212
Tan L (2016) Cloud-based decision support and automation for precision agriculture in orchards. In: 5th IFAC conference on sensing. Control and automation technologies for agriculture agricontrol 49(16):330–335
Balaji Prabhu BV, Dakshayini M (2016) A novel cloud based data analytics framework for effective crop management. Int J Control Theory Appl 9:257–264
Awuor F et al (2016) Building e-agriculture framework in Kenya. J Agric Inf 7(1):75–93
Khattab A, Abdelgawad A, Yelmarthi K (2016) Design and implementation of a cloud-based IOT scheme for precision agriculture. In: 28th International conference on microelectronics (ICM). IEEE, pp 201–204
Praveen B, Vishwesh M (2013) Agriculture updates via sms-a cloud computing approach. Int J Innov Manag Technol 4(5):523–527
Cho Y, Cho K, Shin C, Park J, Lee E-S (2012) An agricultural expert cloud for a smart farm. Future Information Technology. Application, and Service. Springer, Netherlands, pp 657–662
Aditya Shastry K, Sanjay HA, Deexith G (2017) Quadratic-radial-basis function- kernel for classifying multi-class agricultural datasets with continuous attributes. Appl Soft Comput 58:65–74
Aci M, Inan C, Avci M (2010) A hybrid classification method of nearest neighbor, Bayesian methods and genetic algorithm. Expert Syst Appl 37(7):5061–5067
Zhang Z, Zhu Q, Xie Y (2012) A novel image matting approach based on naive bayes classifier. In Intelligent Computing Technology. Springer, Berlin, pp 433–441
National bureau of soil survey and land use planning (2014). https://www.nbsslup.in//
Socio-economic statistical information about India (2015) http://www.indiastat.com/agriculture/
Klibi S, Tounsi K, Rebah ZB, Solaiman B, Farah IR (2016) Soil salinity prediction using a machine learning approach through hyperspectral satellite image. In: 2020 5th International conference on advanced technologies for signal and image processing (ATSIP), pp 1–6. https://doi.org/10.1109/ATSIP49331.2020.9231870.
Maagorzata C, Jerzy N, Piotr K, Kowalski PA, Szymon L, Slawomir Z (2012) UCI repository of machine learning, databases
Blackard Jock A, Dean DJ, Anderson CW (1998) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/covertype
Ashwin S (1993) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/Statlog+(Landsat+Satellite)
Brian J (2014) UCI repository of machine learning databases. https://archive.ics.uci.edu/ml/datasets/Urban+Land+Cover
Patro SG, Sahu KK (2015) Normalization: a pre-processing stage. IARJSET, 03
How heroku works (2014) https://devcenter.heroku.com/articles/how-heroku-works
Yasrab R (2018) Platform-as-a-service (paas): the next hype of cloud computing, pp 1-21. CoRR arXiv:1804.10811
Slug compiler (2014) https://devcenter.heroku.com/articles/slug-compiler
Zhu J (2010) Cloud computing technologies and applications. Springer, pp 21–45
Sarna DEY (2010) Implementing and developing cloud computing applications, 1st ed. Auerbach Publications
Sahu AK (2013) Java web deployment in cloud computing. Int J Comput Appl 75(15):31–34
Vaadin docs (2015) https://vaadin.com/docs/v8
Amazon s3 (2015) https://aws.amazon.com/s3/
Yasrab Robail (2018) Paas cloud: the business perspective. CoRR arXiv:1803.07680
Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45(4):427–437
Sanchez MA, Castillo O, Castro JR, Melin P (2014) Fuzzy granular gravitational clustering algorithm for multivariate data. Inf Sci 279:498–511
Hussain L, Ajaz RH (2015) Seed classification using machine learning techniques. J Multidiscip Eng Sc Technol (JMEST) 02:1098–1102
Wang X, Bai Y (2016) The global minmax k-means algorithm. SpringerPlus 5:1. https://doi.org/10.1186/s40064-016-3329-4
Hammami I, Mercier G, Hamouda A, Dezert J (2016) Kohonena’s map approach for the belief mass modeling. IEEE Trans Neural Netw Learn Syst 27(10):2060–2071
Furnkranz J (2001) Round robin rule learning. In: Proceedings of the 18th international conference on machine learning, ICML ’01, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc., pp 146–153
Gupta A, Jagadeesh R, Harshad S, Aniruddha Z (2015) Classifying forest categories using cartographic variables. Technical report, Indian Institute of Technology
Alwadi MD, Chetty G (2015) Energy efficient data mining scheme for high dimensional data. Procedia Comput Sci 46(12):483–490. https://doi.org/10.1016/j.procs.2015.02.047
Au A, Eccles J, Haynes A, Thatcher T, Zhang Y (2012) A comparison of classification methods for forest cover type. http://www.andrehaynes.me/static/cover-type.pdf
Ghosh S, Biswas D, Biswas S, Sarkat D, Sarkar PP (2016) Soil classification from large imagery databases using a neuro-fuzzy classifier. Can J Electr Comput Eng 39(12):333–343. https://doi.org/10.1109/CJECE.2016.2596767
Sjoqvist H (2017) Classifying forest cover type with cartographic variables via the support vector machine, naive bayes and random forest classifiers, Master’s thesis, School of Business and Economics, Spring
Tetko IV (2002) Associative neural network. Neural Process Lett 16(2):187–199
Al-Obeidat F, Al-Taani AT, Belacel N, Feltrin L, Banerjee N (2015) A fuzzy decision tree for processing satellite images and landsat data. Procedia Comput Sci 52:1192–1197
Michie Donald, Spiegelhalter DJ, Taylor CC, Campbell J (eds) (1994) Machine learning, neural and statistical classification, Ellis Horwood, Upper Saddle River
Asofur YR, Carpenter GA, Grossberg S (1995) Landsat satellite image segmentation using the fuzzy artmap neural network
Liu FT, Ting KM, Zhou Z-H (2008) Isolation forest. In: Proceedings of the 2008 8th IEEE international conference on data mining, ICDM ’08, Washington, DC, USA. IEEE Computer Society, pp 413–422. https://doi.org/10.1109/ICDM.2008.17
Ting KM (2009) Adaptive anomaly detection using isolation forest. Technical report, Gippsland School of Information Technology
Wang S, Mathew A, Chen Y, Xi L, Ma L, Lee J (2009) Empirical analysis of support vector machine ensemble classifiers. Expert Syst Appl 36(3):6466–6476
Iglesias JA, Ledezma A, Sanchis A (2013) Ensemble method based on individual evolving classifiers. In: 2013 IEEE conference on evolving and adaptive intelligent systems (EAIS), pp 56–61. https://doi.org/10.1109/EAIS.2013.6604105
Johnson B, Xie Z (2013) Classifying a high-resolution image of an urban area using super-object information. ISPRS J Photogramm Remote Sens 83(9):40–49
Johnson B (2013) High-resolution urban land-cover classification using a competitive multi- scale object-based approach. Remote Sens Lett 4(2):131–140. https://doi.org/10.1080/2150704X.2012.705440
Giaccom Ribeiro BM, Garcia Fonseca LM (2013) Urban land cover classification using worldview-2 images and c4.5 algorithm. In Joint Urban Remote Sensing Event 250–253:2013. https://doi.org/10.1109/JURSE.2013.6550712
Oszust M (2015) Evaluation of subspace clustering using internal validity measures. Adv Electr Comput Eng 15(8):141–146
Abro MA, Lashari H, Abro WA (2016) Performance analysis of dissimilar classification methods using rapidminer. Sindh Univ Res J (Sci Ser) 48(3):185–188
Blackard JA, Dean DJ (1999) Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables. Comput Electron Agric 24(3):131–151
Zhang Z (2016) Introduction to machine learning: k-nearest neighbors. Ann Transl Med 4(11):1–7
Rish I (2001) An empirical study of the naive bayes classifier. IJCAI 2001 Work Empir Methods Artif Intell 3(1):41–46
The platform for open innovation and collaboration (2016). http://www.eclipse.org/
Li X, Fan P, Li Z, Chen G, Qiu H, Hou G (2021) Soil classification based on deep learning algorithm and visible near-infrared spectroscopy. J Spectrosc 2021:11. https://doi.org/10.1155/2021/1508267
Srivastava P, Shukla A, Bansal A (2021) A comprehensive review on soil classification using deep learning and computer vision techniques. Multimed Tools Appl 80:14887–14914. https://doi.org/10.1007/s11042-021-10544-5
Eyo EU, Abbey SJ, Lawrence TT, Tetteh FK (2022) Improved prediction of clay soil expansion using machine learning algorithms and meta-heuristic dichotomous ensemble classifiers. Geosci Front 13(1):1. https://doi.org/10.1016/j.gsf.2021.101296
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shastry, K.A., Sanjay, H.A. Customized decision tree-based approach for classification of soil on cloud environment. Computing 105, 1295–1336 (2023). https://doi.org/10.1007/s00607-022-01144-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-022-01144-8