Introduction

Absolutely, environmental gradients play a crucial role in shaping vegetation patterns. The complex interplay between abiotic factors and plants communities is evident across various ecosystems and scales Aljasmi et al.1. While strides have been made in understanding of these relationships, there is still a gap in comprehending microclimatic influences on plant abundance in some tropical and subtropical ecosystems Mehmood et al.2; Zeb et al.3. Bridging the gap between biotic, abiotic components is essential for a comprehensive understanding of ecosystem. The ecological vigor of plant species is often observed in their natural habitats, reflecting adaptation to specific environmental conditions. Distinct compositional groups vary along ecological gradients, showcasing the dynamic nature of the habitats. Fluctuations in species composition along ecological gradients have been longstanding focus of study Mumshad et al.4.

Major plant groups of an area are seen as co-evolved populations forming discernible units which could be explored using robust multivariate statistical analysis. Several statistical tools aid ecologists in analyzing large environmental and vegetation datasets, simplifying complexity and revealing patterns. While widely used, their application remains relatively rare in some of the areas, aiding in distinguishing patterns between species and their environments Dar et al.5.

Understanding of how site-specific factors influence vegetation diversity and abundance across different ecological habitats, multivariate analysis particularly cluster analysis (CA) and two way cluster analysis (TWCA) allow us to explore complex relationships by incorporating multiple predictors and response variables. The overarching aim is to contribute to a comprehension of the interconnected influence shaping the ecosystem structure and functioning Khan et al.6; Khan et al.7. Plant species are found in diverse range of environments forming different major plant groups driven by the different ecological factors Rahman et al.8.

The interaction between vegetation and environmental gradients is crucial. Studying of plant groups, considering factors like topography, soil, and status provides insights into the intricate relationships in shaping ecological communities Abbas et al.9. Comprehensive analysis at both species and major plant group levels is essential for understanding the dynamic interplay between flora and their surroundings Ullah et al.10. Phytosociology focuses on the analysis of major plant groups, their characteristics, classification and the impact of environmental factors on their properties and functions Akhlaq et al.11. Understanding vegetation distribution pattern and their relationship with environmental factors is important, considering both abiotic conditions and ecosystem processes. Various factors such as topography, soil properties and status play role in influencing plant species richness at different scales. This knowledge helps in comprehending the effects of natural and human induced disturbance on individual species and communities Ullah et al.10. The current multivariate analyses of the herbaceous vegetation were conducted with the aim to study the distribution of species in the face of prevailing environmental conditions.

Materials and methods

Study area

Kohat is a district in the Kohat Division of Khyber Pakhtunkhwa (KP) and is located in its southern part. The research area extends from 33.0676° N to 33.7405° N latitude and from 71.0663° E to 72.0084° E longitude. The total area of the district is about 2991 Km2 (Fig. 1). It is bounded on the north by district Peshawar, on the north-west by district Orakzai, on the south by district Karak and district Mianwali, to the west by district Hangu, to the east by River Indus and district Attock and to the north-east by district Nowshera Haseeb et al.12. District Kohat climatically falls in the sub-tropical region with slightly arid conditions. Topographically the district is composed of plains and low hills. Most of the area is non-cultivated, comprises of forested areas and rangelands. The cultivated area is mostly non-irrigated. The irrigated area is supplied with the water of rain fed dams. In most of the area the weather is hot from May to September while it is pleasant from October to February. Kohat as a diverse ecological region, harbouring a variety of ecological habitats, having a diverse species composition with different communities, the area direly needs to be studied to fill the scientific knowledge gap.

Figure 1
figure 1

Locational map of the study area with elevation (ArcGIS 10.8.2., https://www.esri.com/en-us/home).

Study design

A vegetation survey of the herbaceous layer was conducted during the summer season of 2022–2023 using the Quadrat method. The data was recorded from 40 different sites of the area (Table 1). The herbaceous plant species collected from study area were identified with the help of flora of Pakistan. To get precise image of vegetation of whole district a combination of systematic and random quadrat was used. Quadrats size 1 × 1 m was used for getting vegetational data of the plant species following the minimal area roles. A total of 10 quadrats were applied in each site. These sites were selected on the basis of topography, altitude, status, soil composition and species composition.

Table 1 Name, locality and habitat type of the studied forty study sites.

Soil analysis

A composite soil sample was collected from each site (forty soil samples) and provided to Agriculture Research Institute Tarnab, Peshawar for the physicochemical analysis for understanding its impact on vegetation structure.

Data analysis

MS excel 2017 was used for basic calculation like frequency, density, cover and relative values. Modern statistical packages PC-ORD version 5 and CANOCO version 5.1 were used to analysis the data. Importance value (IV) was obtained by adding the values of relative density (RD), relative cover (RC) and relative frequency (RF) and all species were sorted by IVs. Plant IV data were treated in CANOCO version 5.1 to measure the environmental gradients responsible for distribution of plants species and identification of plants communities of the area. In present study different multivariate methods like CA, TWCA and detrended correspondence analysis (DCA) etc. were used for identification of plant communities. Names were assigned to the plant communities according to the top three dominant plant species. DCA is a type of indirect gradient and canonical correspondence analysis (CCA) is a type direct gradient analysis to confirm the results of CA and TWCA.

IUCN policy statement

The collection of plant material complies with relevant institutional, national, and international guidelines and legislation.

Results

Floristic diversity

The present vegetation survey of the herbaceous vegetation was conducted during the summer season and was blessed with a rich floristic diversity of a total 216 herbaceous species recorded from forty different sites of the research area.

Classification of the summer herbaceous vegetation

Species area curve

Species area curve showed the adequacy of the number of sampling sites in the area. As the graph showing species increase with the increase in number of sampling sites. The results revealed that site 5 showed maximum number of species which were continued up to site 30. Moving further from site 30, decline in species number started and site 40 had the minimum number of species showing adequacy of sampling in the area (Fig. 2).

Figure 2
figure 2

Species area and compositional curves based on IV data and for all 216 species and 40 sampling sites.

Cluster analysis

The Cluster Analysis (CA) of PC-ORD v.5 classified all the 40 sites into six major plant groups (clusters) on the basis floristic similarity. The detail description of each group is given in (Fig. 3).

Figure 3
figure 3

Cluster dendrogram showing all the 40 sites classified into 6 cluster based on Sorenson measures.

Two-way cluster analysis

The two-way cluster analysis (TWCA) classified the vegetation data of 216 species recorded at forty different sites of the area mainly into six different major plant groups. The groups were named after the dominant species of each group based on IV (Fig. 4).

Figure 4
figure 4

Two-way cluster dendrogram generated through PC-ORD Version 5.1 based on Sorenson measures, showing the distribution of 216 summer herbaceous species in 40 sites and 6 plant groups.

Group—1 (Paspalum paspalodes, Alternanthera sessilis, Typha domingensis)

This group was established at the stream beds and wetland habitats where sufficient moisture is available. These sites were located in the plain areas and the altitude of these areas were ranging from 330 to 570 m. The soil were having with clay % particles in the range of (11), silt % (44–76), sand % (13–45), pH (7.9–8.3), Electrical conductivity dsm−1 (EC) (0.08–0.22), total soluble salts (TSS) (0.025–0.07), organic matter % (OM) (2–3.08), CaCO3% (6.5–9.25), Soil moisture content % (MC) (25.5–47.2), Nitrogen ppm (N) (0.1–154), Phosphorus ppm (P) (2.6–12.9), Potassium ppm (K) (68–240), This group includes 6 different sites of the area. A total of 72 species were included in this group. The dominant species of the group one with highest importance value (IV) were Paspalum paspalodes (193.7) followed by Alternanthera sessilis (121.3) and Typha domingensis (99.2). The co-dominant species of this group were Bolboschoenus affinis (74.2) Echinochloa crus-galli (70) Aster subulatus (68.3), Persicaria hydropiper (65.6), Saccharum spontaneum (63.7) and Apluda mutica (63.6) (Table 2).

Table 2 Importance value of the species in the research area, six major groups clustered with CA and TWCA classification analysis.
Group—2 (Cynodon dactylon, Parthenium hysterophorus, Brachiaria ramose)

Group 2 comprises of 9 sites. This group was established in the cultivated plain habitats which have mainly mesic types of condition. The altitude of these areas were ranging from 405 to 575 m. The soil were having with clay % particles in the range of (10–11), silt % (24–66), sand % (23–65), pH (8.1–8.6), EC dsm−1 (0.06–0.17), TSS % (0.019–0.05), OM % (1.03–3.1), CaCO3% (7.25–10), MC% (9.4–19.2), N ppm (0.051–0.155), P ppm (2.6–8), K ppm (84–562), Cynodon dactylon was the dominant species of this group having the IV of (305), followed by Parthenium hysterophorus (235.9) and Brachiaria ramosa (148). While the other prominent species of this group were Cyperus rotundus (142), Malvastrum coromandelianum (84.7), Echinochloa colona (83.5), Digiteria sanguinalis (77.4) and Achyranthus aspera (71) (Table 2).

Group—3 (Cynodon dactylon, Eragrostis minor, Cymbopogon jwarancusa)

This group includes 7 sites. This group was established mostly in the non-irrigated plain areas mostly which were having xeric conditions. The altitude of these areas were ranging from 315 to 560 m. The soil were having with clay % particles in the range of (10–11), silt % (14–40), sand % (49–75), pH (8–8.9), EC dsm−1 (0.05–0.23), TSS % (0.016–0.073), OM % (0.34–1.72), CaCO3% (6.5–9.25), MC% (8–9.9), N ppm (0.017–0.086), P ppm (2.4–7.4), K ppm (70–212). Cynodon dactylon was the dominant species of group 3 having the IV of 237 followed by Eragrostis minor (155) and Cymbopogon jwarancusa (121). Other co-dominant species of this group were Cyperus rotundus (106), Pennisetum orientale (104), Boerhavia procumbens (86.8), Dactyloctenium scindicum (75), Cenchrus ciliaris (74.3) and Solanum surattense (61) (Table 2).

Group—4 (Cymbopogon jwarancusa, Aristida adscensionis, Boerhavia procumbens)

This group was relatively small comprised of 4 sites. These sites were located at the rangeland areas and the altitude of these areas were ranging from 365 to 760 m. The soil were having with clay % particles in the range of (11), silt % (26–58), sand % (31–63), pH (8–8.4), EC dsm−1 (0.11–0.16), TSS % (0.035–0.051), OM % (0.34–2.76), CaCO3% (6.5–9.25), MC% (7.8–8.6), N ppm (0.017–0.138), P ppm (3.7–7.4), K ppm (392–510). Cymbopogon jwarancusa was the dominant species having the IV of 417 followed by Aristida adscensionis (98.8) and Boerhavia procumbens (62.9). other prominent species of this community were Cynodon dactylon (47.2), Enneapogon schimperanus (41.1), Cenchrus ciliaris (36), Lolium perenne (33.1) and Salvia aegyptiaca (31) (Table 2).

Group—5 (Cymbopogon jwarancusa, Aristida adscensionis, Pennisetum orientale)

This group was the largest group among all group. It includes 11 sites. These sites were located mostly at the hilly region of low altitude ranging from 460 to 670 m. The soil were having with clay % particles in the range of (11), silt % (22–52), sand % (37–67), pH (8.2–8.9), EC dsm−1 (0.05–0.11), TSS % (0.016–0.035), OM % (0.62–3.1), CaCO3% (6.75–9.75), MC% (7.8–9.9), N ppm (0.031–0.155), P ppm (2.4–8.2), K ppm (70–522). Cymbopogon jwarancusa was the dominant species having the IV 632 followed by Aristida adscensionis (383) and Pennisetum orientale (267). Other prominent members of this group were Chrysopogon aucheri (240), Tetrapogon villosus (180), Eulaliopsis binata (156), Heteropogon contortus (143) and Enneapogon schimperanus (111) (Table 2).

Group—6 (Heteropogon contortus, Bothrriochloa ischaemum, Chrysopogon serrulatus)

This was the smallest group comprised of 3 sites. This group was established at the high-altitude hills. The altitude of these areas were ranging from 895 to 1225 m. The soil were having with clay % particles in the range of (10–11), silt % (19–40), sand % (49–71), pH (8.1–8.9), EC dsm−1 (0.11–0.19), TSS % (0.035–0.061), OM % (0.48–2.07), CaCO3% (8.25–9.75), MC% (8.3–9.3), N ppm (0.024–0.103), P ppm (4.6–10.2), K ppm (104–198). The dominant species of this group was Heteropogon contortus having IV of (76.4) followed by Bothrriochloa ischaemum (68.6) and Chrysopogon serrulatus (59.6). Other prominent members of this group were Themeda anathera (54.2). Chrysopogon aucheri (52.6), Cymbopogon martini (45.2) and Cyperus niveus (43.9) (Table 2).

Ordination

Detrended correspondence analysis (DCA)

DCA ordination of sites

DCA ordination revealed about the pattern in complex data set. The gradient length for axis 1 was 7.62 with the Eigen value of 0.801, for axis 2 gradient length was 3.30 with Eigen value of 0.326, for axis 3 gradient length was 2.53 with Eigen value of 0.22 and for axis 4 the gradient length 2.75 with Eigen value of 0.159. The DCA diagram was used to analyze the ordination of different sites on the basis of species composition. The present DCA diagram of all the stations revealed the position of different station along the axis. Based on the floristic composition data the vegetation of the area was mainly grouped into six major groups. Group 1 comprises of S1, S2, S3 S4, S5, S7 these sites cluster under the influence of high moisture content, group two comprises of S6, S8, S9, S10, S11, S12, S15, S17, S18, group 3 comprises of S13, S14, S16, S19, S20, S22 these sites cluster under the influence of topography and low altitude, group 4 comprises of S21, S23, S24, S25 these were rangelands, group 5 comprises of S26, S27, S30, S32, S34, S35, S36, S37, S38, S39 and S40, these sites were clustered due to the hilly landscape with low altitude while group 6 comprises of S28, S31 and S33 which were cluster under the strong influence of high altitude and south aspect. Among all the station, S6, S28 and S33 located separately because of their different extremes of ecological conditions.

DCA confirmed the CA classification of the vegetation data. In the present DCA graph, group 1 locate separately at left side, group 2, group 3 and group 4 locate at the mid center of the graph while group 5 and group 6 locate at the right side (Table 3) (Fig. 5).

Table 3 Description of the four axes of the DCA analysis.
Figure 5
figure 5

DCA ordination plot of the 40 different sites in the study area.

DCA ordination of the species

All the 216 recorded species were analyzed through DCA analysis. In the graph the species which were lying closed to each other showed strong correlation with each other while the species which were located away and faraway from each other showed no correlation with each other.

The driving ecological factors for the clustering of vegetation into different groups were topography, altitude and soil moisture content. In both the DCA plots from left to right the soil moisture contents decreases while the altitude increases and the topography from left to right changes from plains to hilly landscape (Table 3) (Fig. 6).

Figure 6
figure 6

DCA ordination plot presenting the distribution of 216 herbaceous plant species of the area.

Canonical correspondence analysis (CCA)

CCA analysis of the species and environmental factors

To find out the influences of environmental variables on the species distribution, Canonical Correspondence Analysis (CCA) were used. Statistically, CCA analysis showed significant relation (p = 0.002) between environmental variables and species distribution. The first quadrant CCA (Biplot) revealed that higher number of species were clustered around the influence of available Moisture content (MC), higher concentration of silt, Total soluble salts (TSS), organic matter (OM), Nitrogen (N), Electrical conductivity and phosphorus. The species present in second quadrant of CCA biplot were clustered under the influences of Altitude, Aspect, pH and also by the type of habitat i.e. forested and protected habitat. The third quadrant of CCA biplot showed that these species were influenced by higher concentration of sand and potassium (K). The fourth quadrant of CCA biplot showed that these species were grouped in the cultivated plains of the area. The plain topography and the cultivation practices supports the growth of species present in quadrant 4 (Table 4) (Fig. 7).

Table 4 Summary of the CCA analysis.
Figure 7
figure 7

CCA biplot showing the distribution of the herbaceous plant species along the environmental factors.

CCA analysis of the major group and environmental factors

Our cluster analysis yielded 6 different groups based on floristic similarity. The CCA graph showed that group 2 was more related to the plain topography and cultivated status of the habitats. the high concentration of K, pH, and sand showed their influences on group 3, CaCO3, Aspect, Forested nature and Altitude showed its impact on the group 4, group 5 and group 6. While high soil moisture contents, silt and Phosphorus control the distribution of group 1 (Fig. 8).

Figure 8
figure 8

CCA plot showing the impact of different ecological factors on the yielded six major plant groups of the area. Number of species of a group were shown by the size of the circle.

Discussion

The aggregation of different plant species into various major groups is basically response to the existing environmental conditions Haq et al.13. The present sub-tropical region owes a variety of different ecological habitats. from plains to hills, from xeric to moist, from low to high altitude and from cultivated to forested areas, because of the availability of a variety of habitats the area supports the growth of many distinct plant groups. In the present study a total of 216 summer herbaceous plant species were recorded from 40 different selected ecological habitats. vegetation study of an area includes the classification, distribution and the relationships of the existing plant cover with the prevailing ecological condition Iqbal et al.14. The microclimatic conditions of the district vary from the low altitude plains with relatively high soil moisture contents up to the high-altitude hilly topography. The collected vegetation data from such highly diversified area (40 sites) were classified through multivariate statistical analysis via PC-ORD v.5 software. The whole data of the 40 sites and 216 herbaceous species were sorted in six major plant groups on the basis of there floristic similarities. These six major plants groups were named after the dominant species on the basis of IV in each group. The six different major plant groups of the area were Group 1. Paspalum paspalodes, Alternanthera sessilis, Typha domingensis, high soil moisture content and high silt concentration of these sites influence this plant group. Group 2. Cynodon dactylon, Parthenium hysterophorus, Brachiaria ramosa, this plant group was prevailing due to the cultivated status and plain topography of the sites with moderate soil moisture content. Group 3. Cynodon dactylon, Eragrostis minor, Cymbopogon jwarancusa, the species of this plant groups clustered under the influence of high sand concentration which in term holds low soil moisture contents. Group 4. Cymbopogon jwarancusa, Aristida adscensionis, Boerhavia procumbens, this plant group was found in the rangelands which have relatively high altitude. group 5. Cymbopogon jwarancusa, Aristida adscensionis, Pennisetum orientale,this plant group was established in the low altitude hills of the area. Group 6. Heteropogon contortus, Bothrriochloa ischaemum, Chrysopogon serrulatus. The plant life of the species of this group were influenced by the high altitude. These groups were formed due to there floristic similarities and were named after the dominant species of the groups. These major vegetation groups types exist in the area due to the specific site conditions (Environmental factors) Rahman et al.15. The grouping of plants species into identifiable plant groups is the consequences of the prevailing environmental conditions Giuponi et al.16.

In the sub-tropical regions, ecological studies, in particular vegetation analysis are important for the understanding of underlying relationship between the plant species and environmental factors Lolila et al.17. Mainly the environmental factors of an area included status, soil characteristics, topography and altitude Waheed et al.18. CANOCO version 5.1 was used to analyze the effect of different prevailing ecological conditions on the distribution of the plant species in the area. To find out the impact of ecological factors on the existing plants, the DCA (indirect gradient) and CCA (direct gradient) analysis were often used in many vegetational studies such as Ali et al.19.

The DCA graphs are interpretable in the multivariate analysis of the vegetation. It shows the distribution of the plant species in the research area Kobal et al.20. The present DCA diagrams of the stands and species showed their distribution in relation to topography, altitude, soil moisture content and status whether it was cultivated or forested. The present DCA graphs beautifully arranged the whole data into highly interpretable patterns. The species and stands which were located at the left sites of the graphs were having relatively high soil moisture contents with plain topography and low altitude. While the species and stands which were located at the right side of the DCA graph were having low moisture content with hilly topography with relatively high altitude.

In the present study the CCA analysis showed that soil moisture content, altitude, topography, status and the edaphic factors were the strong influencing factors which were acting as the driving environmental factors which checked the distribution of different plants species in different ecological habitats Majeed et al.21. The anthropogenic factors also played key role in the structuring of vegetation at different habitats Jamil et al.22. The present study also showed that the plain areas with high moisture contents and high silt particles supports the growth of more diverse plants groups as compared to the xeric areas with hilly nature and high sand particles Ali et al.23. Some of the species were only restricted to the high altitude’s habitats Mumshad et al.4. Plant species by origins restricted to specific environmental condition and be present in the particular habitats due to the presence of certain principal ecological components due to which a specific plant group is formed Rahman et al.15. The present CCA analysis showed the influences of different environmental factors on the clustering of different plants species into identifiable plant groups. The CCA diagram showed that the six major plants groups were strongly influenced by the existing biotic and abiotic conditions due to which these plants groups were formed. Our statement about the role of biotic and abiotic conditions in the aggregation of different plant species into specific group were supported by many other researches such as Hamdy et al.24; Wali et al.25; Zeb et al.3; Hussain et al.26 and Anwar et al.27.

DCA and CCA analysis are widely used ordination techniques in vegetation ecology and are frequently used for accurate determination of the distinct unit and significance of the relationship between the plant species and environmental factors. The present analysis revealed that, among environmental factors MC, topography, status (Forested or cultivated), altitude and soil texture has great influences on the distribution of plant species Mansoor et al.28, Khan et al.29. CCA analysis was employed to know the vegetation-environment inter-relationships Iqbal et al.14; Khan et al.30 and Ilyas et al.31. It provides a biplot that directly showed the influences of different environmental factors on the distribution of plant species Mehmood et al.32; Khan et al.33; Shaheen et al.34 and Zareen et al.35.

Conclusion

In the present study a total of 216 herbaceous species were recorded from forty different sites during summer season. Six major groups were recognized that were associated with particular environmental factors. Environmental factors include biotic, edaphic and topographic factors. DCA results further clarified the six major plant groups. CCA analysis confirmed the significance of the environmental factors which moulds the vegetation structure and distribution of the plant species in the area. The present study explored the vegetation pattern and vegetation-environment interrelationship which will be helpful to plan suitable measures for the conservation of the vegetation structure (Supplementary file 1).