Skip to main content
Log in

Prediction of missing temperature data using different machine learning methods

  • Original Paper
  • Published:
Arabian Journal of Geosciences Aims and scope Submit manuscript

Abstract

Temperature data is one of the basic inputs of meteorological, hydrological and climatic studies. The completeness of this data is of great importance for reliability in research. This study aimed to compare the performances of various machine learning methods such as support vector machines (SVM), adaptive neuro-fuzzy inference system (ANFIS) and decision tree (DT) to infill missing air temperature data. Monthly average temperature data from 1968 to 2017 (50 years) was used to develop the models. In the established model, the data is divided as 80/20% (1968–2007 training/2008–2017 testing). Neighbouring stations, like Sarıkamış, Tortum and Ağrı, which have a high correlation with Horasan, were used as inputs to estimate the temperature data of the Horasan station. The most suitable machine learning method was chosen according to the mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE) and determination coefficients (R2) of the training and test results. The ANFIS model with four sub-sets, triangular membership function, hybrid learning algorithm and 300 iterations was selected as the most suitable model. It was recommended using ANFIS to estimate monthly air temperatures in the northeastern part of Turkey and perhaps in other semi-arid climatic regions around the world.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data and material availability

Not applicable.

References

Download references

Acknowledgements

The authors thank the General Directorate of Meteorology of Turkey for the observed monthly temperature data provided, the Editor and the anonymous reviewers for their contributions to the content and development of this paper.

Author information

Authors and Affiliations

Authors

Contributions

This is a single author paper, and the author, O. M. Katipoğlu, solely made the study conception, analysis, and manuscript preparation.

Corresponding author

Correspondence to Okan Mert Katipoğlu.

Ethics declarations

Ethical approval

The manuscript complies with all the ethical requirements; the paper was not published in any journal.

Consent to participate

Not applicable.

Consent to for publication

Not applicable.

Conflict of interest

The author declares no competing interests.

Additional information

Responsible Editor: Zhihua Zhan

Part of this work was presented orally at the IV. International Conference on Data Science and Applications 2021

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Katipoğlu, O. Prediction of missing temperature data using different machine learning methods. Arab J Geosci 15, 21 (2022). https://doi.org/10.1007/s12517-021-09290-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12517-021-09290-7

Keywords

Navigation