Skip to main content

Intelligent Data Analytics Approaches for Predicting Dissolved Oxygen Concentration in River: Extremely Randomized Tree Versus Random Forest, MLPNN and MLR

  • Chapter
  • First Online:
Intelligent Data Analytics for Decision-Support Systems in Hazard Mitigation

Abstract

This chapter designs intelligent data analytic approaches for predicting dissolved oxygen concentration in river utilizing extremely randomized tree versus random forest, MLPNN and MLR. Dissolved oxygen concentration (DO) in river, lake and stream can be measured directly in situ. However, mathematical models based on intelligent data analytic technique can provide a reasonably good alternative by linking several water quality variables to the concentration of DO at different time scale. Recent studies conducted worldwide have successfully demonstrated that models using intelligent data analytics contribute to accurately estimate dissolved oxygen with high precision. Here, we applied the extremely randomized tree (ERT) to develop a robust and computationally simple model for predicting dissolved oxygen concentration in river. Results obtained using the proposed ERT were compared to those obtained using the random forest (RF), the multilayer perceptron neural networks (MLPNN) and the standard multiple linear regression (MLR). The proposed models were developed using several inputs variables, e.g. water temperature, specific conductance, water pH and phycocyanin pigment concentration. Several inputs combinations were considered and compared to find the best inputs variables for predicting DO. All the proposed models were applied and compared using data collected from two rivers located in the USA. The accuracy of the models was evaluated using coefficient of correlation (R), Nash–Sutcliffe efficiency (NSE), root mean squared error (RMSE) and mean absolute error (MAE). Results were evaluated based on several input combinations and they showed that the RF provided the most effective estimation of DO concentration amongst the all the proposed models, while the ERT was ranked in the second place slightly less than the RF, the MLPNN ranked thirdly and the MLR model provided the worst accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Salim Heddam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Heddam, S. (2021). Intelligent Data Analytics Approaches for Predicting Dissolved Oxygen Concentration in River: Extremely Randomized Tree Versus Random Forest, MLPNN and MLR. In: Deo, R., Samui, P., Kisi, O., Yaseen, Z. (eds) Intelligent Data Analytics for Decision-Support Systems in Hazard Mitigation. Springer Transactions in Civil and Environmental Engineering. Springer, Singapore. https://doi.org/10.1007/978-981-15-5772-9_5

Download citation

Publish with us

Policies and ethics