1 Introduction

The estimation of magnitudes and frequencies of extreme hydrometeorological events such as daily maximum rainfall is central in the design of hydraulic structures, flood plain zoning and economic estimation of flood protection projects (Noto and La Loggia 2009; Sarkar et al. 2009). Often the interest is in the very rare events with return periods (T) of above 50 or 100 years. This mainly owes to their destructive nature to life and infrastructures. However, reliable estimation of such extreme events requires very long station records if single station data are to be used. Availability and quality of such data are often a challenge in many parts of the world, especially in the data scarce regions of Africa. The Southern Africa region is one such region as it is considered especially vulnerable to and ill-equipped (in terms of adaptation) for extreme events such as rainfall, droughts and flooding. This is due to a number of factors including extensive poverty, famine, disease and political instability (Williams et al. 2009). Regional frequency analysis (RFA) is a commonly used and practical means of providing information at sites with little or no data available (Zhang and Hall 2004).

Various regionalisation techniques have been developed and can be broadly classified into those used for prediction in data scarce areas and those used for RFA (Chen et al. 2006; Durrans and Tomic 1996; Mazvimavi et al. 2004; Sivapalan et al. 2003; Hosking and Wallis 1997). Techniques which have been widely applied in rainfall regionalisation include: linkage analysis (e.g. Jackson 1972); spatial correlation analysis (Gadgil et al. 1993); common factor analysis (e.g. Barring 1988); empirical orthogonal function analysis (e.g. Kulkarni et al. 1992); principal component analysis (PCA) (e.g. Baeriswyl and Rebetez 1996; Singh and Singh 1996); cluster analysis (e.g. Easterling 1989; Venkatesh and Jose 2007); combination of PCA and cluster analysis (e.g. Dinpashoh et al. 2004); L-moments in association with cluster analysis (e.g. Schaefer 1990; Guttman 1993; Wallis et al. 2007; Satyanarayana and Srinivas 2008) and a combination of L-moments and generalised least squares regression (Haddad et al. 2010).

Past rainfall regionalisation studies in Southern Africa include that of Jackson (1972), who used the spatial correlation based simple linkage analysis on rainfall data from 30 stations in Tanzania between 1930 and 1960. In that study, six rainfall homogenous regions were identified. PCA based studies include: Barring (1988) who identified five main rainfall regions in Kenya based on daily rainfall; Van Regenmortel (1995) in which Botswana was categorised into a single rainfall region based on a correlation matrix of 49 daily rainfall stations; and Unganai and Mason (2001) who used summer rainfall characteristics in Zimbabwe to identify two major rainfall regions. Parida and Moalafhi (2008) applied the L-moments approach to analyse annual rainfall series for 11 stations in Botswana for the period 1960–2003. The study established that the whole of Botswana behaved homogeneously and the Generalized Extreme Value (GEV) distribution was accepted as the best model for the data. In South Africa, Smithers and Schulze (2000a, b, 2001, 2003) also applied the L-moments approach in various regional rainfall frequency analysis.

Very few studies on extreme rainfall events for Malawi and specifically our study domain in Southern Malawi, an economically very important region, are documented in the literature. The most notable work on rainfall extremes in Malawi is probably that of Drayton (1980) who analysed 1-day annual maximum rainfall from 38 stations across Malawi using the Gumbel (Extreme Value I) distribution to derive estimates of point daily rainfall with return periods T = 20 and 50 years. New et al. (2006), however, reported that there is some evidence of increasing trends in regionally averaged rainfall on extreme precipitation days and in annual 1- and 5-day maximum rainfalls in Southern Africa. This apparent need to provide updated knowledge of rainfall extremes using well recognised regionalisation tools in a data scarce region partly motivated this study. Drayton (1980) also recommended the application of more robust computer based frequency analysis procedures.

The main aims of this study therefore, are (1) to improve the understanding of the regional rainfall characteristics in Southern Malawi as a key factor for regional flood estimation to support flood risk management, and (2) to provide a well designed and verified procedure for operational use for RFA of rainfall extremes in data scarce region of African countries. The aims will be achieved through the following specific objectives (1) to perform a RFA of 1-, 3-, 5- and 7-day annual maximum rainfall series (AMS1, AMS3, AMS5, AMS7 hereafter) in Southern Malawi using the well known L-moments approach; (2) to develop regional index rainfalls for the various annual maximum rainfall series; and (3) to evaluate the accuracy of the regional L-moments approach in estimating design rainfall at sites in the region through uncertainty assessments. To our best knowledge, the RFA approach used in this study, with the most up to date data available, is the first of its kind for rainfall extremes in Malawi.

2 Study area and data set

2.1 Study area

Southern Malawi has a total population of 5.3 million making it the most densely populated region in Malawi (Sajidu et al. 2008). The northern part of the area borders the southern shore of Lake Malawi in Mangochi district (Fig. 1). The Shire river, a tributary of the Zambezi river, is the major river in the area and drains Lake Malawi southward to the Zambezi (Jury and Gwazantini 2002; Jury and Mwafulirwa 2002). The Shire Highlands dominate the topography of the eastern part of the region with major relief features like Zomba and Mulanje mountain massifs. These two mountains rise to above 2,000 m above mean sea level (m a.s.l.) in the Lake Chilwa catchment area of the Chilwa-Phalombe plain. To the south of the region is the lower Shire river area, a low lying flood plain.

Fig. 1
figure 1

Map of Central Africa (left) and Malawi (middle) showing the study area and stations in Southern Malawi (right). The arrow shows the location of Southern Malawi on the map of Malawi

The climate of Southern Malawi is tropical wet and dry, commonly known as Savanna. The main rain bearing system is the Inter-Tropical Convergence Zone (ITCZ), where the north easterly monsoon and south easterly trade winds converge. A distinct rainy season is experienced between November and April when over 80% of the annual rainfall occurs. Tropical cyclones originating from the Indian Ocean, frequently occur during the rainy season bringing very intense rainfall over few days. Annual rainfall varies from 700 mm in the low lying areas to 2,500 mm in highlands of Mulanje and Zomba. There is considerable sporadic winter rainfall locally called chiperone in the highlands during the period from May to August. The winter rains originate from an influx of cool moist south-eastern winds. Monthly average temperatures are around 10–16°C in the highlands and 21–30°C along the lower Shire valley (British Geological Survey 2004). Figure 1 shows the location of the study area and the geographic distribution of the rainfall stations used in this study.

Southern Malawi is economically very important. Over 95% of Malawi’s hydropower is generated along the river Shire towards its confluence with the Zambezi River. Further, the Malawi National Contingency Plan (MNCP 2009) identified flooding triggered by heavy rainfall as the cause of more than 40% of disasters in Malawi since 1940. Recent flooding events as a result of heavy rainfall occurred in 1996/1997, 1998/1999, 2000/2001, and 2002/2003. The flooding often results in the loss of life, crops, property and vital infrastructure. Up to date information about rainfall extremes of the area is therefore crucial for ensuring the security of human lives and properties (Yang et al. 2010a).

2.2 Data availability

The study analysed AMS1, AMS3, AMS5 and AMS7 of rainfall derived from daily readings at 23 rainfall stations in Southern Malawi covering the period 1978–2007. These were sourced from the Malawi Department of Climate Change and Meteorological Services. Table 1 lists the stations including their location and elevation. Although some stations had gaps, none had a record less than 13 years.

Table 1 Location and elevation of the rainfall stations in Southern Malawi

3 Methodology

This section presents the regionalisation procedure applied. Initial data screening procedures for serial independence of the at-site data and stationarity of the at-site means are firstly presented. This is followed by a spatial independence check among the stations using Moran’s I coefficient. The K-means clustering procedure, for the determination of the possible number of homogeneous regions, is then presented. Finally, a summary of the four steps in RFA using the L-moments approach is presented.

According to Fowler and Kilsby (2003), extreme value theory recommends the peak-over-threshold (POT) approach, which includes all events above some chosen threshold, for rainfall frequency estimation. In this study, however, AMS were chosen mainly to preserve the number of stations and station years available. Some sites had missing values for considerable time periods. POT analysis requires a common period of analysis which would imply a reduction in the available number of stations and station years. Alternatively, missing values would have to be filled in. These conditions were considered challenging to meet due to low spatial correlations among stations in this sparsely gauged region (Ngongondo et al. 2011). The AMS procedures used in this study do not have to satisfy any of the above conditions.

3.1 Data screening

Frequency analysis requires that the at-site data are independent (without serial correlation and trends) and identically distributed (from the same population) (i.i.d.). Serial correlation within a time series reduces the effective sample size of the series compared with independent data (Matalas and Langbein 1962; Tallaksen et al. 2004). Independence was tested using lag-1 to lag-5 autocorrelation coefficients. Trend was determined using the non-parameteric Mann–Kendall (MK) test (Mann 1945; Kendall 1975) as recommended by the World Meteorological Organisation (WMO 1988). For stations with data that is serially correlated and having trends, von Storch (1995) recommends pre-whitening of the series.

Another requirement is that the AMS1 at different stations in a homogeneous region should be spatially independent. High spatial cross-correlation between stations gives a lower degree of additional regional information to the site being studied than uncorrelated sites. The magnitude of cross-correlations therefore provides a measure of the amount of independent information contained in the regional data relative to the amount of station years. If not taken into consideration, cross-correlation leads to estimates that are less accurate than they would be if the samples were spatially independent (Stedinger 1983; Schaefer 1990; Yue and Wang 2002). Moran’s I test (Moran 1950) was used to test for spatial independence. The test is based on cross products of deviations from the mean and is computed from:

$$ \beta = \frac{N}{S}\;{\frac{{\sum\nolimits_{i = 1}^{N} {\sum\nolimits_{j}^{N} {w_{ij} \left( {x_{i} - \bar{x}} \right)\left( {x_{j} - \bar{x}} \right)} } }}{{\sum\nolimits_{i = 1}^{N} {\left( {x_{i} - \bar{x}} \right)^{2} } }}} $$
(1)

where x i is the observed value at station i, x j is the observed value at station j, N is the number of stations, and w ij are the elements of the weight and \( S = \sum\nolimits_{i = 1}^{N} {\sum\nolimits_{i = 1}^{N} {w_{ij} \left( {i \ne j} \right)} }. \) Two neighbouring stations will have w ij  = 1 and 0 otherwise. Randomly arranged and uncorrelated values over space have Moran’s I equal to its expected value \( \left[ { - 1/\left( {n - 1} \right)} \right] \) where n is the total number of locations (Khalili et al. 2007). A significance level of α = 0.05 was used in these data screening tests.

3.2 Regional rainfall frequency analysis

The K-means cluster algorithm, a supervised approach, and Ward’s hierarchical unsupervised technique were applied to identify homogeneous regions. The two clustering techniques were chosen owing to their successful application in other rainfall regionalisation studies (e.g. McQueen 1967; Hosking and Wallis 1997; Ward 1963; Ramos 2001). The L-moments algorithm (Hosking and Wallis 1997) and the index-flood procedure (Dalrymple 1960) were used for the RFA. L-moments, a linear combination of PWMs (Hosking 1986, 1990), are considered robust over the use of conventional moments in certain aspects. Firstly, they are relatively insensitive to outliers and do not have sample size related bounds as do conventional moments. Secondly, parameter estimations are more reliable and less biased than the conventional method of moment estimates, particularly for small samples. Thirdly, they are usually computationally more tractable than maximum likelihood estimates and fourthly, estimators of L-moments are virtually unbiased (Hosking 1990). Vogel and Fennessey (1993) further highlight that the use of product moment ratio estimators of the coefficient of variation \( \left( {Cv} \right) \), skewness \( \left( \gamma \right) \) and kurtosis \( \left( k \right) \) exhibit substantial bias for samples less than 100, which is very common in hydrology. L-moments, however, are more robust for both large and small samples.

3.2.1 Homogeneous clusters

The number of homogeneous regions and their stations composition were initially defined using the K-means and Ward’s clustering methods. As presented by Satyanarayana and Srinivas (2008), a data set is classified by K-means clustering through a certain number of clusters fixed a priori, assumed K clusters. If \( Y = \left\{ {y_{i} /i = 1 \ldots N} \right\} \) is a set of N feature vectors (rain gauges in this study) in n-dimensional attribute space:

$$ \left\{ {y_{i} = \left[ {y_{i1,} \ldots ,y{}_{ij} \ldots ,y_{in} } \right] < n \in \Re^{n} } \right\} $$
(2)

where \( y_{ij} \) is the value of attribute j in ith feature vector \( y_{i} \). Each feature vector represents one of the N sites (rain gauges) in the study region. Variables influencing precipitation at a site or their principal components and its geographical location attributes are used. To avoid dominance of feature vectors with large absolute values (e.g. altitude), each feature vector is rescaled as:

$$ x_{ij} = {\frac{{\left( {y_{ij} - \overline{y}_{j} } \right)}}{{\sigma_{j} }}}\;\;\;\; {\text{for}}\;1 \le j \le n, $$
(3)

where \( x_{ij} \) are the rescaled values of \( y_{ij} \), \( \sigma_{j} \) is the standard deviation of attribute j, and \( \overline{y}_{j} \) is the mean value of attribute j over all the N feature vectors. Through an iterative procedure, the K-means algorithm move the feature vectors from one cluster to another to minimize the objective function, F defined as:

$$ F = \sum\limits_{k = 1}^{K} {\sum\limits_{j = 1}^{n} {\sum\limits_{i = 1}^{{N_{k} }} {d^{2} } \left( {x_{ij}^{k} - x_{j}^{k} } \right)} } $$
(4)

where the number of clusters K is set a priori; \( N_{k} \) is the number of feature vectors in cluster k; \( x_{ij}^{k} \) denotes the rescaled value of attribute in the feature vector i assigned to cluster k; \( x_{j}^{k} \) is the mean value of attribute j for cluster k, computed as:

$$ x_{j}^{k} = {\frac{{\sum\nolimits_{i = 1}^{{N_{k} }} {x_{ij}^{k} } }}{{N_{k} }}} $$
(5)

Each feature vector i.e. rain gauge, is allocated to a cluster by minimising F in (4) whereby the distance of each feature vector from the center of the cluster (centroid) to which it belongs is minimized. The number of clusters was determined using the PCA based scree plot. Total loadings across the abscissa show separation (also called a break) in fraction of total variance where the ‘most important’ components cease and the ‘least important’ components begin. On the other hand, Ward’s clustering approach is unsupervised requiring no a priori setting of the number of clusters. The same attributes used in the K-means approach were used. The method has been widely applied in rainfall cluster analysis (e.g. Muñoz-Díaz and Rodrigo 2004).

The criteria for choosing clustering variables largely depend on the major factors that influence rainfall in an area. Satyanarayana and Srinivas (2008) in India used 15 large-scale atmospheric variables of air temperature, geopotential height, specific humidity, zonal and meridional wind velocities, precipitable water and surface pressure in addition to latitude/longitude location, elevation and mean annual rainfall (MAR). Schaefer (1990) used at-site Mean Annual Precipitation (MAP) to define homogeneous areas in the regionalisation of precipitation annual maxima in Washington State, USA. Schaffer’s assumption was that the MAP is numerically descriptive of arid versus semi-arid environments. In this study, the approach by Yang et al. (2010b), which used station latitude/longitude location, elevation and MAP in defining rainfall regions in the Pearl River Basin, China, was adopted. These four variables present a fair balance of commonly used attributes and were readily available for Southern Malawi. The variables were scaled to values between \( - 1 \) and 1 to avoid bias of those variables with large absolute values. Using the K-means clustering algorithm, five simulations were performed with the number of clusters set between two and six.

Two cluster validity indices, i.e. Hubert’s gamma coefficient and the Dunn index (Satyanarayana and Srinivas 2008) were applied to determine optimal partition provided by the K-means clustering algorithm. This was done by pair-wise comparisons of the compositions of the different clusters between two and six.

3.2.2 Discordancy measure \( D_{i} \)

The discordancy measure \( \left( {D_{i} } \right) \) is used to screen data within the identified homogenous region(s) (Hosking and Wallis 1997). Stations in a homogenous region having either gross errors in their data or not belonging to the region, will have \( D_{i} \ge 3 \). The case of gross errors in the data requires careful scrutinisation of the data. If gross errors are not detected in the data, but \( D_{i} \ge 3 \), it is recommended to relocate such stations to other regions (Hosking and Wallis 1997; Adamowski 2000). Further details of the discordancy measure are described by Hosking and Wallis (1997).

3.2.3 Heterogeneity measure

The heterogeneity measures \( H_{n} \left( {n = 1,2,3} \right) \) are used to verify whether the proposed sites make up a spatially homogeneous region with the same underlying distribution apart from a site-specific scale factor (Hosking and Wallis 1997). The check is based on observed and simulated dispersion of L-moments for a group of sites under consideration. A region is considered ‘acceptably homogenous’ if \( H_{n} < 1 \); ‘possibly heterogeneous’ if \( 1 < H_{n} < 2 \); and ‘definitely heterogeneous’ if \( H_{n} \ge 2 \). A large positive value of \( H_{1} \) indicates that the observed L-moments are more dispersed than what is consistent with the hypothesis of homogeneity. \( H_{2} \) indicates whether the at-site and regional estimates are close to each other. A large value of \( H_{2} \) indicates a large deviation between regional and at-site estimates. \( H_{3} \) indicates whether the at-site and the regional estimates agree. Large values of \( H_{3} \) suggest a large deviation between at-site estimates and observed data. However, \( H_{1} \) is the primary measure for the heterogeneity test as both \( H_{2} \) and \( H_{3} \) rarely yield values larger than 2 even for grossly heterogeneous regions (Hosking and Wallis 1997; Yang et al. 2010b). Computation details for the calculation of \( H_{n} \) are given in Hosking and Wallis (1997).

3.2.4 Distribution selection

The best fitted distributions for each homogeneous regions can be identified by several means: visually on the L-moment ratio diagram, a plot of sample L-skewness (τ3) versus L-kurtosis (τ4) where the best distribution fits evenly through the cloud of points; the goodness of fit measure \( \left| {Z^{\text{Dist}} } \right| \le 1.64 \); and the difference between sample kurtosis and distribution kurtosis \( \left( {\left| {\tau_{{4{\text{sample}}}} - \tau_{{4({\text{DIST}})}} } \right|} \right) \), which should be the minimum. Hosking and Wallis (1997) presented details for the calculation the \( \left| {Z^{\text{Dist}} } \right| \) statistics.

Six commonly applied distributions namely the Generalized Logistic (GLO), Generalized Extreme Value (GEV), Generalized Normal (GNO), Generalized Pareto (GPA), Pearson type III (PE3) and Wakeby (WAK) were tested. The five-parameter Wakeby distribution is included in case the choice of the candidate distributions is inconclusive. This normally occurs when the region is misspecified as being homogeneous. The Wakeby therefore offers the best option for frequency analysis in such cases as it is more robust (Hosking and Wallis 1997).

3.2.5 Derivation of regional rainfall quantiles

Rainfall quantiles of the best frequency distribution were derived for each region using the index rainfall method, first introduced for floods (Dalrymple 1960). The procedure assumes that the frequency distributions of all sites in a homogenous region are identical, except for a site-specific scale factor. The quantile estimates \( \hat{Q}\left( F \right) \), with non-exceedance probability F, at a site in a region with N sites is then computed by: \( Q_{i} \left( F \right) = l_{1} q\left( F \right) \); where q is a common dimensionless quantile function (growth curve) and \( l_{1} \) is a site-specific scaling factor, also called the index rainfall value, representing the T-year quantile of the normalized regional distribution. In this study, the mean \( \left( {\mu_{i} } \right) \) was used as the site specific scale factor at a given location. At-site AMS1, AMS3, AMS5 and AMS7 values with return periods T = 2, 10, 100 and 1,000 years were estimated from the regional quantiles.

The accuracy of the estimated rainfall quantiles were assessed using Monte Carlo simulations (Hosking and Wallis 1997). According to the 5T guide (Robinson and Reed 1999), the T-year rainfall estimate in each of the regions will only be accurate up to 5T station-years of data available. To further assess the accuracy of the regional rainfall quantiles, regionally derived at-site rainfall estimates were firstly compared with those derived from fitting the best distribution to the at-site data. Secondly, at-site and regional based estimates for three validation stations were compared. Alumenda station has a 10 year record from 1998 to 2007, Toleza Farm has a 49 year record from 1941 to 1989, whereas Zomba Plateau has a 10 year record from 1983 to 1993. Alumenda and Zomba Plateau were selected for this assessment because of their short, but up to date record. On the other hand, the Toleza Farm record was long but not up to date because of a 10-year gap in the 1990s. The at-site and regional based RMSE values from Monte Carlo simulations were then compared.

Finally, at-site and regional based estimates for the AMS1 were compared to those derived for Malawi by Drayton (1980) (see Sect. 1). In this comparison, both the mean and the median were used as the site specific scale factor since Drayton (1980) used the median rainfall. However, differences can in this case be attributed to a range of factors such as different data periods and thus extreme events, different regionalisation approaches and possibly measurement practices.

For all the procedures presented above, various packages and macros of the free statistical software R were used (R Development Core Team 2008). The L-moments approach used the package lmomRFA (http://cran.r-project.org/web/packages/lmomRFA/index.html) by Hosking (2009) in R Software (R Development Core Team 2008).

4 Results and discussions

4.1 Stationarity and serial independence check

The basic statistics for the stations, including the MK statistic at α = 0.05% significance level, are presented in Table 2. The overall average for AMS1, AMS3, AMS5 and AMS7 were 86.8, 130, 160 and 182 mm, respectively. The highest mean AMS1-AMS7 rainfall during the period were 108.4, 169, 202 and 231 mm all, recorded at Mimosa located in the eastern highlands which has the highest average annual rainfall in Malawi (Ngongondo et al. 2011).

Table 2 Basic statistics of the annual maximum 1-day rainfall series for Southern Malawi

None of the AMS1 MK trend statistics were significant at the α = 0.05 level, and thus it can be assumed that all stations had stationary series in terms of mean values. However, statistically significant positive trends were exhibited by three stations for the AMS3 (Makhanga, Ngabu and Ntaja), four stations for the AMS5 (Bvumbwe, Makhanga, Ngabu and Satemwa), and three stations for the AMS7 series (Makhanga, Nchalo, and Thyolo). However, as most of the stations in the region did not have statistically significant trends, we could not reject the null hypothesis that the station trends in the region were homogenous. Hence, it was concluded that the significant trends were not significant at the regional level. We therefore accept that the region has statistically stationary trends. Annual, seasonal and monthly series analysed by Ngongondo et al. (2011) for the whole of Malawi also revealed statistically non-significant trends at α = 0.05 level.

Further, the absolute values of the autocorrelation coefficients for lags 1 and 5, were not significant at the α = 0.05 level. For a time series with n observations, the critical values at α = 0.05 can be calculated from \( 1.96/\sqrt n \) (Douglas et al. 2000). We therefore accepted that the series were independently, identically distributed. Moran’s I coefficient for all series suggested that cross-correlation among the stations were not statistically significant at α = 0.05 for all series. The stations were therefore considered spatially independent in further computations.

4.2 Identification of homogenous regions

Prior to cluster analysis, the region was treated as one large homogeneous group and tested using the discordancy measure \( D_{i} \) and the heterogeneity measures \( H_{1} \), \( H_{2} \) and \( H_{3} \). The region did not pass this homogeneity test because Liwonde station had a value \( D_{i} \ge 3 \) (the critical value for the 23 station grouping). A check of the data at the Liwonde station did not reveal any obvious inconsistencies. The heterogeneity values for the whole of Southern Malawi were: \( H_{1} = 1.64,\;H_{2} = - 0.66,\;H_{3} = - 0.57 \). The region should therefore be considered as possibly heterogeneous since \( 1 < H_{1} < 2 \), although the \( H_{2} \) and \( H_{3} \) values classified the area as acceptably homogeneous. The region did not pass the tests even after the removal of the Liwonde station and cluster analysis was therefore applied.

In the cluster analysis, the number of possible clusters was initially determined from a scree plot using a 15 cluster solution (figure not shown). Total loadings of the scree plot suggested between two and six possible clusters. The Hubert’s gamma coefficients and the Dunn index cluster validity tests both suggested a three region partitioning with four, nine and ten stations. The results of Ward’s clustering (dendrogram not shown) also agreed with the three clusters solution and station membership. However, in the suggested region 2, Chileka and Naminjiwa stations had discordancy values \( D_{i} \ge 3 \). These were subsequently relocated to region 3 based on similarities in their MAR and altitudes with stations in region 3. Drayton (1980) suggested three rainfall regions in Malawi as follows: Group 1 consisting of all areas where high rainfalls can be caused by convection over land surfaces; Group 2 areas where high rainfall events can be caused by both convection and orographic rainfall processes; and finally Group 3 where high rainfalls can be caused by either moist air convection from the lake surface or strong orographic barriers or a combination of both factors. The suggested regions in this study do not entirely agree with those suggested by Drayton (1980) in terms of their spatial distribution. Figure 2 shows stations in each group, whereas the D i values and H n measures for the three groups are presented in Table 3.

Fig. 2
figure 2

Location of sites in the three rainfall clusters in southern Malawi

Table 3 Discordance and heterogeneity fit results for southern Malawi

The first region (G1 hereafter) was unique in all cluster simulations. The G1 region had four stations namely Chikwawa, Makhanga, Nchalo and Ngabu. These stations are located in the predominantly semi-arid low lying Lower Shire valley. The region is in the southern arm of the Malawi Rift Valley with an average altitude of 84 m a.s.l. MAR for 1978–2007 was around 782 mm with average AMS1, AMS3, AMS5 and AMS7 series respectively of 80, 116, 143 and 163 mm. Rainfall in the region is strongly affected by rain shadow effects due to its low altitude. Intense rainfall in this region is most likely caused by convection over land surfaces (Drayton 1980).

The second region (G2 hereafter) was comprised of seven stations located along Lake Malawi, the upper Shire river basin and the surrounding medium altitude and plain areas with average altitude of 632 m a.s.l. The region had a MAR of around 901 mm with average AMS1, AMS3, AMS5 and AMS7 respectively of 85, 127, 155 and 176 mm. A combination of convective processes over land and adjacent water bodies (e.g. Lake Malawi and rivers) and rain shadow effects are major influences of intense rainfall in the region.

The third region (G3 hereafter) had 12 stations mostly located in the Southern Highlands with average altitude above 1,000 m a.s.l. MAR was 1,193 mm and average AMS1, AMS3, AMS5 and AMS7 of 90, 136, 168 and 192 mm respectively. G3 stations are all located in the high rainfall areas as discussed by Ngongondo et al. (2011). Convective, cyclonic and orographic rainfall processes influence extreme rainfall activities in this region. The G3 region face windward of south easterlies originating from the Indian Ocean. In addition, the region lies along paths tracked by tropical cyclones from the Indian Ocean. The highest recorded rainfall was observed at Zomba town (location of Zomba RTC and Chanco stations) when tropical cyclone “Edith” brought 509 mm of rainfall on 14 December 1946 (Drayton 1980).

4.3 Testing for heterogeneity (H)

The results for the heterogeneity test \( \left( {H_{n} ,\;n = 1,2,3} \right) \) based on 1,000 simulations for each of the three homogeneous regions are shown in Table 5. For the AMS1 series, all three regions passed the heterogeneity test which suggest that for the AMS1 series the three regions can be considered as homogeneous since all \( H_{n} < 1 \). For the AMS3, AMS5 and AMS7, certain stations with shorter record lengths (marked asterisk in Table 3) had to be removed first for the groups to pass the heterogeneity test. However, we did not observe obvious inconsistencies in those stations and therefore subsequent analyses did not exclude these.

4.4 Goodness-of-fit measure (Z) and derivation of the regional growth curves

Goodness of fit (Z) test results for candidate distributions in the three homogeneous regions are shown in Table 4. Acceptable distributions are all those satisfying the criteria \( Z \le \left| {1.64} \right| \) whereas the best distribution is the one satisfying the criteria \( \min Z_{\text{Crit}}^{\text{Dist}} \le \left| {1.64} \right| \) among the acceptable distributions. L-moment ratio diagrams showing the location of regional average L-Cs and L-Ck and their theoretical relationships with the different candidate distributions, are shown in Fig. 3a, b, c and d for AMS1, AMS3, AMS5 and AMS7, respectively.

Table 4 Goodness of fit test results for candidate distributions
Fig. 3
figure 3

L-moment ratio diagrams for the candidate distributions with regional average L-skewness and L-kurtosis. Filled triangle G1; filled circle G2; filled square G3

From Table 4, is can be seen that the GEV is acceptable in all regions, whereas the GPA is the least acceptable appearing in region G1 for AMS3 and AMS7 only. The best distributions, i.e. with \( \min Z_{\text{Crit}}^{\text{Dist}} \le \left| {1.64} \right| \) and \( \min \left| {\tau_{{4\left( {\text{sample}} \right)}} - \tau_{{4\left( {\text{Dist}} \right)}} } \right| \), were the GEV (G1 and G3 for AMS1, G3 for AMS5 and AMS7), PE3 (G1 and G3 for AMS3, G1 and G2 for AMS5 and AMS7) and GLO (G2 for AMS1 and AMS3 and G3 for AMS7). The L-moment ratio plot in Fig. 3 further confirms that these distributions were indeed closest to the regional weighted L-moments means and that the GEV is the best for AMS7 in G3. Hosking and Wallis (1997) recommended the use of four or five parameter distributions e.g. Wakeby or Kappa if the regional L-moment average lies above the GLO line as found for AMS1 and AMS3 in G2. In these cases, the Kappa distribution approximates the GLO. Hence, the choice of the GLO could be justified.

Table 5 shows the location, scale and shape parameters respectively \( \left( {\xi ,\;a\;{\text{and}}\;\kappa } \right) \), of the acceptable distributions as well as the five-parameter WAK distribution in each region. Table 6 shows the T-year regional quantile estimates, 90% error bounds and the RMSE values from Monte Carlo simulations.

Table 5 Regional parameter estimates for the selected distributions
Table 6 Regional quantiles, 90% error bounds and RMSE (%) values for the three regions

From Table 6, all series show that the G3 region’s quantile estimates were not the highest, despite the G3 region being located mostly in the high rainfall areas of southern Malawi highlands. G2 is composed of stations in the upper Shire river valley and along the Lake Malawi region. The G2 region also had the highest AMS1 although its regional average maximum rainfall AMS1 was lower than that of the G3 region.

RMSE values for the regional quantiles increased with return period (Table 6). For the AMS1, regions G1 and G3 had relatively lower RMSE values for the accepted GEV distribution. This is reasonable as the GEV distribution is deemed suitable for estimation of extremes with \( T \le 500 \) years (Norbiato et al. 2007). Further, the 5T rule suggests that rainfall estimates in G1, which had 102 station years, is reliable up to \( T = 500 \). In G3, which had 310 station years, reliability of rainfall estimates goes beyond T = 1,000 years. Therefore, all rainfall estimates with return periods larger than T = 100 years should be treated with caution. In region G2, the supposedly best fitted GLO distribution had higher RMSE values in the upper tail (results not included in Table 6) as compared to the GEV distribution. The 95% error bounds for the GLO were also higher than those of the GEV. On this basis, the GEV distribution is recommended for application in all the three regions for AMS1. The Wakeby or Kappa distribution, considered more robust to frequency distribution misspecification (Wallis et al. 2007), can be used as an additional source of information, especially in G2, whenever the GEV suggest unrealistic or doubtful estimates.

For AMS3, the accepted PE3 distribution had relatively lower RMSE values for G1 and G3 regions. This indicates more reliability of the quantiles even for return periods T ≥ 100 years. The acceptable GLO distribution in region G2, however, had high RMSE values in the upper tail, suggesting the unreliability of quantiles with return period T ≥ 100 years.

Similarly, in both AMS5 and AMS7, lower RMSE values at large return periods for the PE3 (regions G1 and G2) and GEV (G3 region) distributions suggest higher reliability of quantile estimates.

4.4.1 Comparison between regional and at-site based design values

Design rainfall estimates derived from regional quantiles were compared with those derived from fitting the acceptable distribution based on at-site L-moment ratios. Simulation results of the RMSE values were used as the comparison basis. Lower RMSE values give an indication of better accuracy.

RMSE values for regional based rainfall estimates were mostly lower than at-site based RMSE for the GEV distribution in all regions, in particular in the extreme upper tail where \( F \ge 0.99 \) or \( T \ge 100 \), as shown in selected AMS1 plots in Fig. 4a–f. In the lower tail, RMSE values of site based estimates and regional based estimates are similar. Despite the GLO distribution being accepted as the best for the G2 region, its RMSE values were higher than those of the GEV at all sites. This could be an indication that the GLO distribution was either misspecified as the best for region G2 or that some stations had erroneous data.

Fig. 4
figure 4

Regional and at-site based AMS1 RMSE values for various return periods at selected stations: a and b in G1; c and d in G2; e and f in G3

Finally, three stations not used neither in the cluster analysis nor in the derivation of the regional quantiles, were used to test the accuracy of the design rainfall estimates. All three stations passed the independence and stationarity tests with serial correlation coefficients (lag-1 and lag-5) and Mann–Kendall test statistic not significant at α = 0.05.

Alumenda station has an observed maximum AMS1 of 130 mm (mean 87.26 mm), whereas Toleza has a maximum AMS1 of 149.4 mm (mean 71.3 mm). The observed maximum AMS1 for Zomba plateau is 162 mm (mean 119 mm). Figure 5 shows the regional and at-site estimates of design rainfall at these three stations based on the GEV distribution. It is seen that on the one hand, the design values of AMS1 rainfall for different return periods are in approximate agreement between the two approaches, except for Zomba plateau where larger deviations are found for increasing return periods. On the other hand, the estimation uncertainty as measured by the 95% error bounds is much smaller for the regional based approach than that for the at-site estimates. From Fig. 4, regional based RMSE values for Alumenda ranged from 1.2% (T = 2) to a maximum of 28.0% (T = 1,000). These were lower than the at-site based RMSE, which ranged between 8.9% (T = 2) and 177% (T = 1,000). For T = 100, RMSE values were 11.8 and 49.3% for the regional and at-site based estimates, respectively. For Toleza Farm, regional RMSE values ranged between 1.1% to a maximum of 38.0% while at-site based RMSE values ranged between 3.6–76.6%. The RMSE values for the 100-year estimates were 12.4% for the regional based and 24.3% for the site based estimates. Zomba plateau had regional RMSE values from 0.89% (T = 2 years) to 23% (T = 1,000 years). The at-site RMSE’s were 17.0% (T = 2 years) to 69.0% (T = 1,000 years).

Fig. 5
figure 5

Design storms from at-site (shaded) and regional analysis for Alumenda in G1, Toleza Farm in G2 and Zomba Plateau with 95% error bounds as bars

Thus, it can be concluded that the regional based estimates have smaller uncertainty than the at-site based estimates, and that, based on the RMSE values, regional based rainfall estimates also have better accuracy than the at-site based estimates. However, these estimates are only reliable up to T = 100 after which very high RMSE values were noted.

4.4.2 Comparison with previous studies

Drayton (1980) analysed AMS1 using an at-site approach with the EV1 distribution. Table 7 provides the ranked AMS1 rainfall extremes in Malawi from that study based on data from 1895 to 1978.

Table 7 One-day rainfall extremes in Malawi from 1895 to 1978

As demonstrated in Table 7, most 1-day rainfall extremes in Malawi occur in the southern region. The earliest record of 1-day rainfall extreme was 315 mm recorded on 15 January 1895 at Lauderdale Estate in the Shire Highlands. The largest event was recorded at Nkhotakota in 1956 with 572.5 mm.

The largest recorded 1-day event to date in the southern region was 509 mm reported on 14.12.1946 at Zomba Town as a result of the combined effects of Cyclone “Edith” and orographic processes (Drayton 1980). Studies on AMS3, AMS5 and AMS7 for Malawi have not been documented.

Our results suggest that there are both similar and different results from those by Drayton (1980). For the common stations used, Table 8 shows the 20-year rainfall estimates from the two studies.

Table 8 Comparison of some regional and at-site AMS1 statistics calculated by this study with those from Drayton (1980)

The comparison in Table 8 suggests that most estimates are comparable. The differences can be attributed to the different periods used and also the methodologies. It appears that the maximum AMS1 for the stations reported in Table 7 did not occur between 1978 and 1980, the common period of the data used in these two studies.

5 Conclusions

RFA of AMS1, AMS3, AMS5 and AMS7 using 23 rainfall stations in Southern Malawi has been implemented based on the well known index (flood) and L-moments methods. All stations passed some minimum requirements for RFA and were considered i.i.d., i.e. they passed the independence and stationarity tests based on their autocorrelations, Mann–Kendall statistics and spatial correlation.

The 23 stations did not constitute one homogeneous region. The k-means cluster analysis and Ward’s classification suggested three homogeneous rainfall regions: G1 in the Lower Shire valley, G2 in the Lake Malawi and Upper Shire plains and G3 in the Southern Highlands. Although Monte Carlo simulations for AMS1 identified the GLO distribution as the best for region G2 and the GEV in G1 and G3 regions, further accuracy assessments suggested that the GEV distribution is the best model for AMS1 in all three regions. Accuracy for rainfall estimates for return period \( T > 100 \) was, however, low for AMS1. More station years, either from longer records or more stations in the regions, would be required for rainfall estimates above T = 100 years. In the AMS3, AMS5 and AMS7, most regions accepted the PE3 distribution, followed by the GEV and least the GLO distribution. Our results are in general agreement with Sarkar et al. (2009) who reported that most extreme flood events find their quantiles in the GEV, GLO and PE3 distributions.

The performance of the derived regional quantiles at validation sites was satisfactory and had smaller uncertainty as compared to at-site estimates. However, there is a need to develop a procedure for using at-site characteristics for estimating the mean or median rainfall at ungauged areas in the regions. This would enable the testing of the regional index rainfall quantiles that have been developed at ungauged sites. Depending on data availability, it would also be interesting to compare these results with those from a POT analysis.