Skip to main content
Log in

Enhancing Rainfall Data Consistency and Completeness: A Spatiotemporal Quality Control Approach and Missing Data Reconstruction Using MICE on Large Precipitation Datasets

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

Accurate and complete data are crucial for climate, environmental, water, and agricultural research. Any record of data that is contaminated with errors should be considered missing and reconstructed. Pollution in climate data can lead to systematic errors, such as polluted outlier data. Simply removing outlier data is not a reliable method, and it is important to perform quality control checks to determine the reliability of the data. While methods for detecting outlier data have received significant attention from researchers, less investigation has been conducted on determining the pollution of outlier data. We propose methods for quality control and reconstruction of incomplete rainfall data using data from 141 stations in the Qaraqhum basin in northeastern Iran. We performed checks for gross errors, temporal consistency, and outlier data. As we observed that the probability distribution of monthly precipitation had a skewness shape, we utilized a robust 3σ-rule to detect outlier values. We propose the use of information such as the number of daily precipitation events per month, maximum monthly rainfall, and standardized monthly rainfall (based on robust 3σ-rule) to detect pollution of outlier values. Additionally, we performed a spatial–temporal comparison to determine the difference between no record and no occurrence of precipitation. For data reconstruction, we used the "mice" package in R, which imputes data using chain equations. We investigated the performance of five functions available in the mice package, and the results showed that the "norm.nob" method had the best performance, while the "sample" and "mean" methods had the weakest performance.

Graphical Abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Availability of Data and Materials

We are pleased to submit our manuscript entitled “Enhancing Rainfall Data Consistency and Completeness: A Spatiotemporal Quality Control Approach and Missing Data Reconstruction Using MICE on Large Precipitation Datasets” to be considered for publication as an original paper. The data that support the findings of this study are available from [Meteorological Organization (https://www.irimo.ir/) and Ministry of Energy] but restrictions apply to the availability some of daily rainfall, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of [Meteorological Organization (https://www.irimo.ir/) and Ministry of Energy].

Notes

  1. Median of Absolute Deviation.

References

Download references

Acknowledgements

The guidance and encouragement of our dear professor, Hojat Rezaee Pazhand are thanked and her memory is cherished.

Author information

Authors and Affiliations

Authors

Contributions

Nafiseh SeyyedNezhad and Mahboobeh Farzandi performed the study conception, material preparation and design. Nafiseh Seyyed Nezhad performed data collection and analysis and wrote the first draft of the manuscript. All authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Nafiseh Seyyed Nezhad Golkhatmi.

Ethics declarations

Ethical Approval

Not applicable.

Consent to Participate

Not applicable.

Consent to Publish

Not applicable.

Competing Interests

“The authors have no relevant financial or non-financial interests to disclose.” We certify that there is no actual or potential conflict of interest in relation to this article. The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Highlights

• Not all outlier data is corrupted and should not be immediately removed.

• In arid climates, monthly precipitation often has a skewed probability distribution.

• A robust 3σ-rule is recommended for detecting outlier data in monthly precipitation.

• "norm.nob" function is the best method of mice package to reconstruct missing data.

• "sample" and "mean" functions of mice package have the weakest performance.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Golkhatmi, N.S.N., Farzandi, M. Enhancing Rainfall Data Consistency and Completeness: A Spatiotemporal Quality Control Approach and Missing Data Reconstruction Using MICE on Large Precipitation Datasets. Water Resour Manage 38, 815–833 (2024). https://doi.org/10.1007/s11269-023-03567-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-023-03567-0

Keywords

Navigation