Skip to main content
Log in

Prediction of homogeneous region over Bangladesh based on temperature: a non-hierarchical clustering approach

  • Original Paper
  • Published:
Theoretical and Applied Climatology Aims and scope Submit manuscript

Abstract

Bangladesh is one of the countries in the north-eastern part of South Asia, and it covers an area of about one hundred forty-eight thousand (1.48 × 105) square kilometers. It is located in the subtropical monsoon climate regime and is one of the most climate-vulnerable countries in the world. Due to the high impact of climate change, climate information is highly demandable to the government and policymakers. The different four types of temperature data from thirty-four locations in Bangladesh are considered in this study to find the homogeneous region and the distributional patterns of the temperatures are revealed as negatively skewed. The non-hierarchical k-means clustering technique is one of the unsupervised machine learning techniques is applied to reveal homogeneous regions in Bangladesh based on different types of temperature (dew point temperature, minimum temperature, maximum temperature, and temperature) separately, and jointly in the present research study. The elbow algorithm is used to find the optimal number of clusters, and the optimal number of clusters is obtained as nine with the different cluster solutions for different temperatures in the present study. The clustering solution produces different clustering results for dew point temperature, minimum temperature, maximum temperature, and temperature. The − regions (Dinajpur, Rangpur, Saidpur) from the northern part of Bangladesh always are the members of a cluster for four types of temperatures. Also, similar results are found for the regions Chuadanga, Ishurdi, and Rajshahi. Besides, the area Mymensingh forms cluster with different members for different temperatures, which indicates that the unique solution is not found. To find a significant and homogeneous cluster solution before applying the non-hierarchical clustering method, the linear principal component analysis is applied to produce a score based on dew point temperature, minimum temperature, maximum temperature, and temperature for each location to find a similar region. The northern regions (Bogra, Dinajpur, Rangpur, and Saidpur) and west-middle regions (Chuadanga, Ishurdi, Jessore, and Rajshahi) construct unique clusters with nearest regions for principal component analysis (PCA) scores. Similarly, the southern area (Cox’s Bazar and Teknaf) constructs a cluster for PCA scores or linear combination of different temperatures in this study. The map for the clustering solution based on PCA scores shows that the nearest and similar regions like hill regions, coastal regions, and plain lands have formed significant nine clusters. Finally, the clustering results based on the first principal component scores based on temperatures for each location are the final clustering solution in the present research work. In addition, the Ward linkage clustering algorithm is brought into play to the clustering solutions of PCA scores and found very significant results. The seven regions from the northern part of Bangladesh (Bogra, Dinajpur, Mymensingh, Rangpur, Saidpur, Sylhet, Tangail) are found in a cluster, and the nearest locations from the middle-west regions (Chandpur, Chuadanga, Ishardi, Jessore, Khulna, Maijdi Court, Mongla, Rajshahi, Satkhira) have formed another cluster with second largest members.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  • Abu-Jamous B, Fa R, Nandi AK (2015) Integrative cluster analysis in bioinformatics. Wiley, NY

    Book  Google Scholar 

  • Anderberg MR (1973) Cluster Analysis for Applications. Academic Press, New York

    Google Scholar 

  • Asakereh H, Shadman H (2018) On the relationship between tropospheric conditions and widespread hot days in iran. Theor Appl Climatol 131(1-2):805–817

    Article  Google Scholar 

  • Cattell RB (1943) The description of personality: Basic traits resolved into clusters. J Abnorm Psychol 38(4):476

    Google Scholar 

  • Driver H, Kroeber A (1932) Quantitative expression of cultural relationships. Univ Cal Publ Am Archeol Ethnol 31(4):211–256

    Google Scholar 

  • Ertöz L, Steinbach M, Kumar V (2003) Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data. In: Proceedings of the 2003 SIAM International conference on data mining SIAM, p 47–58

  • Everitt BS, Landau S, Leese M, Stahl D (2011) Cluster analysis. Wiley, NY

    Book  Google Scholar 

  • Forgy EW (1965) Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 21:768–769

    Google Scholar 

  • Gerstengarbe FW, Werner PC, Fraedrich K (1999) Applying non-hierarchical cluster analysis algorithms to climate classification:some problems and their solution. Theor Appl Climatol 64(3-4):143–150

    Article  Google Scholar 

  • Hartigan JA (1975) Clustering algorithms. Wiley, New York

    Google Scholar 

  • Hartigan JA, Wong MA (1979) Algorithm as 136: a k-means clustering algorithm. J R Stat Soc Ser C Appl Stat 28(1):100–108

    Google Scholar 

  • Huth R, Nemesova I, Klimperová N (1993) Weather categorization based on the average linkage clustering technique: an application to european mid-latitudes. Int J Climatol 13(8):817–835

    Article  Google Scholar 

  • Iyigun C, Türkeş M, Batmaz İ, Yozgatligil C, Purutçuoğlu V, Koç EK, Öztürk MZ (2013) Clustering current climate regions of Turkey by using a multivariate statistical method. Theor Appl Climatol 114(1-2):95–106

    Article  Google Scholar 

  • Johnson RA, Wichern DW (2014) Applied multivariate statistical analysis, 6th ed., Pearson.

  • Kakade S, Kulkarni A (2016) Prediction of summer monsoon rainfall over India and its homogeneous regions. Meteorol Appl 23(1):1–13

    Article  Google Scholar 

  • Kassomenos P, Vardoulakis S, Borge R, Lumbreras J, Papaloukas C, Karakitsios S (2010) Comparison of statistical clustering techniques for the classification of modelled atmospheric trajectories. Theor Appl Climatol 102(1-2):1–12

    Article  Google Scholar 

  • Kaufman L, Rousseeuw PJ (2009) Finding groups in data: an introduction to cluster analysis, vol 344. Wiley, NY

    Google Scholar 

  • Korkmaz S, Goksuluk D, Zararsiz G (2014) MVN: An R package for assessing multivariate normality. The R Journal 6(2):151–162

    Article  Google Scholar 

  • Kulkarni A (2017) Homogeneous clusters over India using probability density function of daily rainfall. Theor Appl Climatol 129(1-2):633–643

    Article  Google Scholar 

  • Legendre P, Legendre LF (2012) Numerical ecology (3rd ed.) Elsevier, NY

    Google Scholar 

  • Littmann T (2000) An empirical classification of weather types in the mediterranean basin and their interrelation with rainfall. Theor Appl Climatol 66(3-4):161–171

    Article  Google Scholar 

  • Liu Z, George R (2005) Mining weather data using fuzzy cluster analysis, Fuzzy Modeling with Spatial Information for Geographic Problems, Springer, p 105–119

  • Lloyd S (1982) Least squares quantization in pcm. IEEE Trans Inf Theory 28(2):129–137

    Article  Google Scholar 

  • Mace A, Sommariva R, Fleming Z, Wang W (2011) Adaptive k-means for clustering air mass trajectories, International Conference on Intelligent Data Engineering and Automated Learning, Springer, p 1–8

  • MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, Oakland, CA, USA p 281–297

  • Mardia KV (1970) Measures of multivariate skewness and kurtosis with applications. Biometrika 57(3):519–530

    Article  Google Scholar 

  • Mongi C, Langi Y, Montolalu C, Nainggolan N (2019) Comparison of hierarchical clustering methods (case study: data on poverty influence in north sulawesi), IOP Conference series: Materials Science and Engineering, vol 567 IOP Publishing, p 012048

  • Montazeri M (2011) A cluster analysis of thermal seasons of iran. Geogr Res 26(2(101)):173–198

    Google Scholar 

  • Murtagh F, Legendre P (2014) Ward’s hierarchical agglomerative clustering method: which algorithms implement ward’s criterion? J Classif 31(3):274–295

    Article  Google Scholar 

  • Netzel P, Stepinski T (2016) On using a clustering approach for global climate classification. J Clim 29(9):3387–3401

    Article  Google Scholar 

  • Rahman MH (2017) Clustering of pre-monsoon precipitation of Bangladesh: A ward’s hierarchical agglomerative clustering approach. Res Rev J Stat 6(2):1–7

    Google Scholar 

  • Rahman MH, Matin M, Salma U (2018) Analysis of precipitation data in Bangladesh through hierarchical clustering and multidimensional scaling. Theor Appl Climatol 134(1-2):689– 705

    Article  Google Scholar 

  • Roushangar K, Alizadeh F (2018) A multiscale spatio-temporal framework to regionalize annual precipitation using k-means and self-organizing map technique. J Mt Sci 15(7):1481–1497

    Article  Google Scholar 

  • Saha M, Mitra P (2015) Co-clustering based approach for indian monsoon prediction. Procedia Comput Sci 51:2938–2942

    Article  Google Scholar 

  • Shirin AH S, Thomas R (2016) Regionalization of rainfall in kerala state. Procedia Technol 24:15–22

    Article  Google Scholar 

  • Steinhaeuser K, Chawla NV, Ganguly AR (2011) Comparing predictive power in climate data: Clustering matters, International symposium on spatial and temporal databases, Springer, p 39–55

  • Steinhaus H (1956) Sur la division des corp materiels en parties. Bull Acad Polon Sci 4(12):801–804

    Google Scholar 

  • Tenenbaum JB, De Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323

    Article  Google Scholar 

  • Thorndike RL (1953) Who belongs in the family?. Psychometrika 18(4):267–276

    Article  Google Scholar 

  • Tian W, Zheng Y, Yang R, Ji S, Wang J (2014) A survey on clustering based meteorological data mining. Int J Grid Distrib Comput 7(6):229–240

    Article  Google Scholar 

  • Tryon RC (1939) Cluster analysis: Correlation profile and orthometric (factor) analysis for the isolation of unities in mind and personality. edwards brother, incorporated, Ann Arbor.

  • Ward JJH (1963) Hierarchical grouping to optimize an objective function. J Am Stat Assoc J Am Stat Assoc 58(301):236–244

    Article  Google Scholar 

  • Yarnal B (1993) Synoptic climatology in environmental analysis: a primer, Belhaven

  • Yarnal B, Comrie AC, Frakes B, Brown DP (2001) Developments and prospects in synoptic climatology. Int J Climatol: J R Meteorol Soc 21(15):1923–1950

    Article  Google Scholar 

  • Yokoi S, Takayabu YN, Nishii K, Nakamura H, Endo H, Ichikawa H, Inoue T, Kimoto M, Kosaka Y, Miyasaka T et al (2011) Application of cluster analysis to climate model performance metrics. J Appl Meteorol Climatol 50(8):1666–1675

    Article  Google Scholar 

  • Zubin J (1938) A technique for measuring like-mindedness. J Abnorm Psychol 33(4):508

    Google Scholar 

Download references

Acknowledgements

The author is greatly thankful to the Bangladesh Meteorological Department for providing the data necessary for this study. Also the author is thankful to the Jahangirnagar University for support the fund. The author especially thanks the Editor and anonymous reviewers for their valuable suggestions that improved the quality of this manuscript.

Funding

This research work was supported by Jahangirnagar University, Bangladesh. The author also declare that funds and other support were received from Jahangirnagar University during the preparation of this manuscript. The institute has no disagreement for publishing this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Habibur Rahman.

Ethics declarations

Ethics approval

Not applicable. Also, the author declare that the paper is not currently being considered for publication elsewhere.

Consent for publication

The author gives consent to the publication of all details of the manuscript, including texts, figures, and tables.

Conflict of interest

The author declares no competing interests.

Additional information

Author contribution

The entire research work is conducted by a single author (or corresponding author). Only one author of the study contributed to the study’s conception, design, material preparation, data analysis, and the first draft of the manuscript was written. The author or corresponding author of this research work read the manuscript and approved the final manuscript.

Availability of data and material

The data sets used during the current study are available from the corresponding author on reasonable request.

Code availability

Available on reasonable request.

Consent to participate

Not applicable

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rahman, M.H. Prediction of homogeneous region over Bangladesh based on temperature: a non-hierarchical clustering approach. Theor Appl Climatol 148, 1127–1149 (2022). https://doi.org/10.1007/s00704-022-03955-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00704-022-03955-3

Keywords

Navigation