Introduction

A diversified family of food crops known as amaranths is impressively adaptable to new environments despite a variety of biotic and abiotic obstacles1. Practically speaking, it is a pseudo-cereal because it is dicotyledonous and not a true cereal2. There are 87 species in the genus3. However, because of the paucity of systematic studies on Amaranths, the number of species is unclear, and the genus is regarded as challenging by systematics4.

Amaranths are now becoming more significant in both human and animal nourishment5. They have a balanced amino acid profile and high seed protein concentration6. Amaranth is high in iron, calcium, potassium, phosphorus, protein, and lysine. It also contains vitamins A and C7. The Food and Agriculture Organization of the United Nations views them as a critical crop for food and nutritional security because of their nutritional value (FAO)8. Amaranths are a great substitute for cereal for those with celiac disease due to their low gluten level9.

The complex variable of yield is influenced by many different variables, notably polygenes, the environment, and genetic variability10,11. Understanding the relationships between quantitatively inherited crop traits and their direct and indirect effects on yield is crucial for the success of breeding program selections12. Moreover, studies on the genotypic and phenotypic correlations among crop plant traits are useful for planning, assessing, and creating selection criteria for the targeted characters for selection in the breeding program13. Generally speaking, the strength of character association has a significant impact on the effectiveness of any crop development effort.

Path coefficient analysis is an efficient statistical technique specially designed to quantify the interrelationship of different components and their direct and indirect effects on yield. Partitioning of total correlation into direct and indirect effects by path coefficient analysis helps in making the selection more effective14. Keeping these facts under consideration, a planned effort was undertaken to evaluate different genotypes of amaranth under the agro-climatic conditions of Ethiopia. The selection of better genotypes for breeding programs depends on this information. There has not been a remarkable study conducted in Ethiopia to date that evaluates the relationships between yield components and plant characteristics and grain and leaf yield, as well as the direct and indirect effects of these relationships on grain and leaf yield in different amaranth genotypes. To obtain the yield data required for amaranth breeding and development, the objective of this study was to (1) quantify the magnitude of the correlation coefficient and path coefficients between traits of amaranth genotypes grown in Ethiopia. (2) Analyse the direct and indirect effects of quantitative features on grain and leaf yields. (3) Put out the selection criteria for boosting amaranth genotypes' grain and leaf yields.

Material and methods

Experimental Site

The experiment was conducted in 2020 and 2021 at the agricultural experimental site, Hawassa University. It is geographically located around 275 kms from Ethiopia's capital, Addis Ababa, in the Sidama Region. The experimental area is situated at an altitude of 1709 m above sea level, at latitude 7°2′ 54.7503′′ N and longitude 38°30′ 17.1608′′ E (Fig. 1). Clay loam was the dominant soil texture class in the experimental location, where the pH ranged from 6 to 6.5. The district's mean monthly low and high temperatures are, respectively, 14.10 °C and 27.9 °C. In all, throughout the two growing seasons, the experimental farm received 1379.16 mm of rain.

Figure 1
figure 1

Map of the study area. Maps were generated using Arcmap v 10.6 1.

Plant material

Amaranth was collected legitimately and under the proper national (Ethiopian Biodiversity Institute) and international guidelines. The Ethiopian Biodiversity Institute field staff assisted in collecting the samples that were used as vouchers. The specimens were accepted by the National Herbarium (ETH) at Addis Abeba University after they had been placed there, dried, tagged, pressed, and given a number. The specimens were identified by systematist Melaku Wondafrash Ayitaged at the Addis Abeba University National Herbarium.

A total of 120 amaranth genotypes were used in this study, with 34 genotypes coming from the Ethiopian Biodiversity Institute (EBI), 2 from the Melkasa Research Center, 15 from the Werere Research Center (Afar region), 8 from Sidama, and the remaining 61 genotypes being collected in the Southern Nations, Nationalities and Peoples Region, Oromia, and Tigray regions of Ethiopia in 2019 and characterized for various agro-morphological traits. One hundred eighteen members contain passport information, but the other two do not and are regarded as released varieties (Table 1).

Table 1 List and passport data of plant materials included in the study.

Experimental layout

A tractor cleared the experimental field, correctly plowed it, and harrowed it. A manual hoe was used to prepare the ridges. The experimental design was an alpha lattice. A layout with 2 replications, 16 blocks, and 15 plots per block for amaranths was conducted. Each unit plot was separated by a 0.60-m distance between plots, a one-meter distance between blocks, and a three-meter distance between replications, with a plot size of 1.80 m in length and 1.50 m in width for a 2.7 m2 area. Each season required a total space of 1345.2 m2 (38 m × 35.4 m). On April 15th, 2020, and 2021, the seed was sown at one location in each season, which is under ideal growing circumstances and during the agricultural season. One growing season on the experimental site in one year is considered the environment. Seeds of various genotypes (Table 1) were consistently sowed in two rows with a gap of 0.75 m between them. Seeds are quite tiny, with sizes ranging from 0.37 to 1.21 g per 1000 seed weight15, and they were planted in seedbeds and covered with powdered, finely diced cow farmyard manure after being combined with sand in a 1:4 ratio. At 14 and 22 days after sowing (DAS), thinning was done twice at a distance of 75 cm between rows and 30 cm between plants. According to Grubben and Van Sloten16 and Shukla, et al.17, the experiment followed standard cultural practices. Hand-hoeing was used to control weeds at 2-week intervals following germination and whenever necessary. A total of 12 plants were maintained in each plot.

Data collection

Data on agro-morphological traits were collected either on a plot-by-plot basis or from ten randomly selected plants per plot. A list of the studied 24 agro-morphological traits with their descriptions and sampling methods is indicated in Table 2. At various phonological phases, observations were conducted on a variety of morphological characteristics. Ten randomly selected plants in each plot had their phenotypic traits evaluated. The International Board for Plant Genetic Resources suggested utilizing amaranth descriptors to describe mature plants based on taxonomic keys18. Characters that were not on the list but were deemed crucial for the characterization were included. To accomplish the aforementioned objectives, a two-season experiment was carried out at the Agricultural Research Site of Hawassa University, Ethiopia.

Table 2 List of quantitative agro-morphological traits used along with their marker code, unit, basis of sampling, and description.

Statistical analysis

Diagnosis of multicollinearity

One of the requirements for accurate and trustworthy path coefficient estimation is the analysis of multicollinearity between any explanatory variables19. The key plant breeding phenomena of collinearity diagnostic boosts the effectiveness of indirect selection for genetic improvement when the target characteristics have low heritability20. When there is a moderate to severe multicollinearity, the computed coefficients may have utterly irrational (higher than unity) values, and the variances linked to estimators of path coefficients may inflate, generating incorrect estimates21, and the variances associated with estimators of path coefficients may become inflated and leads to unreliable estimates22. By examining the correlation matrix or by using the variance inflation factor (VIF), tolerance limit (TOL), and eigenvalues, multicollinearity can be identified. Two explanatory variables may be multicollinear if there are strong coefficients (near + 1) between them in the correlation matrix23. This possibility can be verified using the VIF, TOL, and eigenvalues. Using the VIF, TOL, and COLLIN option methods, the PROC REG was used to perform the VIF, TOL, and eigenvalues (SAS).

A number greater than 10 indicates that the related regression coefficients are underestimated due to multicollinearity24. The VIF quantifies how much the variance of an estimated regression coefficient is increased due to the effect of multicollinearity25. The TOL, which is the inverse of the VIF, indicates that there is severe collinearity in the database when it is larger than 0.125. According to Montgomery et al.25, the correlation matrices' maximum eigenvalue (max) can be divided by the subsequent smallest eigenvalue (min) to determine the condition number (CN) for each trait. Montgomery et al.25 determined that the degree of multicollinearity in the matrices was categorized as weak when the estimates of CN were less than 100, moderate when they were between 100 and 1000, and severe when they were greater than 1000. Either eigenvalues or eigenvectors can be used simultaneously to find the multicollinearity issue. Therefore, removing the traits responsible for boosting the variance of the regression coefficient is essential unless the trait is of importance for improvement. Traits with a low eigenvalue and a high matching eigenvector indicate a striking collinearity problem22. Therefore, removing the traits that cause multicollinearity from the study is a quicker and more efficient way to solve this issue and enables obtaining the most precise path coefficients23. As a result, the correlation matrices, VIF, TOL, CN, and Eigen system analysis of the correlation matrix were used to perform the multicollinearity test between the researched features. In total 24 traits were measured and based on the multicollinearity diagnosis tests. These multicollinearity diagnosis tests allowed us to exclude variables like TSW, GFP, BLBL, TLBL, SD, NN, PL, and LT from the analysis of correlation since they were so strongly linked to multicollinearity in the matrix. However, multicollinearity is associated with DM according to grain yield path analysis and GSFR related to leaf yield path analysis; these features were removed as well from the analysis. The remaining 16 variables (or around 66%) and 14 variables (58.33%) were considered for correlation and path, respectively, for the analysis due to their low multicollinearity in the current study.

Phenotypic and genotypic correlations

The collected data from the two years were combined and subjected to correlation and path coefficients analysis. Correlation analysis was performed for all possible combinations of traits after diagnosing the multicollinearity test. Phenotypic correlation, the observable correlation between two explanatory traits, which includes both genotypic and environmental effects, and genotypic correlation, the inherent association between two explanatory traits, were computed from the components of variance and co-variances as described by26. The coefficient of correlation was tested using ‘r’ tabulated value at n − 2 degrees of freedom, at 5, 1, and 0.1% probability level, where n is the number of genotypes.

$${\mathrm{rp}}_{\rm{XY}}=\frac{{\mathrm{PCOV}}_{(\mathrm{x },\mathrm{y})}}{\sqrt{{{\upsigma }^{2}}_{{\mathrm{P}}_{\rm{x}}}* {\upsigma }^{2}{\mathrm{P}}_{\rm{y}}}}$$
(1)
$${\mathrm{rg}}_{\rm{XY}}=\frac{{\mathrm{GCOV}}_{(\mathrm{x },\mathrm{y})}}{\sqrt{{{\upsigma }^{2}}_{{\mathrm{g}}_{\rm{x}}}* {\upsigma }^{2}{\mathrm{g}}_{\rm{y}}}}$$
(2)

where rpxy and rgxy are the phenotypic and genotypic correlation coefficient between traits x and y, Pcov(x,y) and Gcov(x,y) are phenotypic and genotypic covariance between traits x and y; σ2px and σ2py are phenotypic variances for traits x and y; σ2gx and σ2gy are genotypic variances for traits x and y, respectively.

Path-coefficient analysis

Using path coefficient analysis, which was proposed by Dewey and Lu27 and developed by Wright28, the correlation coefficient was further divided into direct and indirect effects28. It is possible to tell whether a cause-and-effect relationship between two explanatory traits is real and independent of other traits by using path analysis. LY and GY were chosen for the path-coefficient analysis as the result (dependent) variables and the other traits as the causal (predictor) traits. The LY was also considered as an independent trait in path coefficient analysis for GY and vice versa. The direct and indirect effects of the independent traits on LY and GY were estimated by the simultaneous solution of the following equations which may be represented by the following general formula as applied by Dewey and Lu27.

$$rij=Pij+\sum rik*Pkj$$
(3)

where is the mutual association between the independent trait (i) and dependent trait (j) as measured by the genotypic and phenotypic correlation coefficients; is components of direct effects of the independent trait (i) on the dependent trait (j) as measured by the genotypic and phenotypic path-coefficients; and \(\sum \mathrm{rik}*\mathrm{Pkj}\) is the summation of components of indirect effects of a given independent trait (i) on a given dependent trait (j) via all other independent traits (k). To determine values, square matrices of the correlation coefficients between independent traits in all possible pairs were inverted and then multiplied by the correlation coefficients between the independent and dependent traits.

Based on average data from the two years, path coefficient analysis was conducted using phenotypic and genotypic correlation matrices built up as X = Y − 1 * Z for LY and GY. The phenotypic and genotypic correlation coefficients of dependent traits (LY and GY) versus independent agro-morphological traits are represented in this matrix by the vector ′X. Vector Z is the direct effect of the route coefficients, and vector Y′ is the inversed value of the phenotypic and genotypic correlations for all conceivable combinations among the examined traits. The matrix inverse (MINVERSE) function of Microsoft Excel 2010 was used) compute the inverse of matrix Y′ after inverting the reciprocals and autocorrelations of the correlation coefficient matrix. Using the matrix multiplication (MMULT) function of Microsoft Excel 2010, the direct influence of path coefficients was estimated as the product of vector ′X and each row of vector Y′ inverse. As recommended, the value of the direct effect path coefficient was multiplied by the correlation coefficient in the matrix to assess the indirect effect of the path coefficient27. The direct effect product in the path analysis and the dependent trait coefficient in the correlation analysis via all predictor traits are added to create the coefficient of determination (R2). The contribution of the remaining unknown factor was measured as the residual factor (RF), which was calculated with the following formula;

$$\mathrm{RF}= \sqrt{1-{\mathrm{R}}^{2 }} ;\mathrm{ where }{\mathrm{R}}^{2 }=(\sum \mathrm{PIJ}*\mathrm{rij})$$
(4)

The value of RF reveals how effectively the causative factors explain the dependent factor's variability29. That is, if the RF value is low (for example, close to zero), the variability in the predictor characteristics fully accounts for the variance in the dependent trait, but a greater RF value suggests that other factors that have not been taken into account must be included in the study.

Results

Phenotypic and genotypic correlation of yield-related traits

Out of 91 relations, 71 yield-related trait relationships were significant at the phenotypic level, according to the linear pairwise correlations analysis (Table 3). The results revealed a very strong positive correlation between the LW and the LL (rp = 0.90), LA (rp = 0.77), DF (rp = 0.61), PHF (rp = 0.48), and PHM (rp = 0.45). The correlation coefficients of TILL (rp = 0.86), TISL (rp = 0.80), and GSFR (rp = 0.30) were also observed to be highly significant and positive with AIL. On the contrary, it revealed a very highly significant negative phenotypic correlation with PHM (rp = −0.56), BN (rp = −0.50), LN (rp = −0.35), PHF (rp = −0.54), DM (rp = −0.42), and DF (rp = −0.64). BN was significantly and positively correlated with PHM (rp = 0.74), LN (rp = 0.39), PHF (rp = 0.66), DM (rp = 0.70), DF (rp = 0.58), and DE (rp = 0.58), but very negatively correlated with TILL (rp = −0.48) and GSFR (rp = −0.36), as well as highly and negatively correlated with TISL (rp = −0.18). The results also demonstrated a significant negative relationship between LL (rp = −0.20 ), and DE (rp = −0.21), with the AIL. The LL had a very high positive correlation with LW (rp = 0.90), PHM (rp = 0.50), PHF (rp = 0.56), LA (rp = 0.76), DF (rp = 0.62), and GSFR (rp = 0.29), but a high negative correlation with DE (rp = −0.19). The AIL (rp = 0.86), TISL (rp = 0.78), and GSFR (rp = 0.44) were shown to have a very highly significant positive connection with the TILL. With PHM (rp = 0.78), BN (rp = 0.66), LL (rp = 0.56), DM (rp = 0.60), LA (rp = 0.32), DF (rp = 0.82), and DE (rp = 0.32), it was shown that the PHF had a very highly significant positive association with each of these variables. Strong positive associations with DM were seen for PHM (rp = 0.52), BN (rp = 0.70), LN (rp = 0.32), PHF (rp = 0.60), DF (rp = 0.60), and DE (rp = 0.60). The LA was identified to have a very highly significant negative association with DE (rp = −0.34), but a very highly significant positive connection with LW (rp = 0.77), LL (rp = 0.76), and DF (rp = 0.47). The DE was observed to have a very highly significant negative association with LA (rp = −0.34) but a very highly significant positive relationship with PHM (rp = 0.42), BN (rp = 0.579), PHF (rp = 0.32), and DM (rp = 0.36). DF was significantly positively correlated with BN (rp = 0.58), LL (rp = 0.62), PHF (rp = 0.82), LA (rp = 0.47), and DM (rp = 0.82). AIL (rp = 0.30), LL (rp = 0.29), and TILL (rp = 0.44) were found to have a very highly significant positive correlation with the GSFR, whereas BN (rp = 0.36), LN (rp = −0.31), and DM (rp = 0.48 were found to have a very highly significant negative correlation with the GSFR.

Table 3 Phenotypic (above diagonal) and genotypic (below diagonal) correlations of 16 agro-morphological traits in 120 amaranth genotypes grown at Hawassa University agricultural research site in 2020 and 2021 cropping seasons.

The analysis revealed that among the variables that contribute to yield, 60 associations out of a total of 91 relationships were significant at the genotypic level (Table 3). The findings showed that PHM (rg = 0.56), BN (rg = 0.17), LL (rg = 0.86), TILL (rg = 0.68), LN (rg = 0.73), and DF (rg = 0.23) all exhibited a very highly significant positive correlation with the LW. The opposite was true, however, as it showed a very strong negative genotypic relationship (rg = −0.35) with DE. While the AIL displayed a highly significant and negative link with DM (rg = −0.14), it had a very highly significant and positive correlation with TISL (rg = 0.38) and TILL (rg = 0.27) instead. The PHM showed very significant positive relationships with the following variables: BN (rg = 0.26), LL (rg = 0.58), TILL (rg = 0.75), LN (rg = 0.70), DF (rg = 0.22), and GSFR (rg = 0.19); however, the PHM also showed very high negative associations with DE (rg = -0.47). With PHM (rg = 0.26), LL (rg = 0.19), and DM (rg = 0.22), as well as with LN (rg = 0.17), and TISL (rg = 0.13), the BN showed a very strong positive association. The TISL had a very highly significant negative correlation with GSFR (rg = −0.25) but a very highly significant positive correlation with AIL (rg = 0.38) and TILL (rg = 0.22). Concerning LW (rg = 0.73), PHM (rg = 0.70), LL (rg = 0.71), DM (rg = 0.21), and DF (rg = 0.28), the LN exhibited very highly significant positive relationships. Conversely, it showed a very strong negative genotypic relationship with DE (rg = -0.55) and LA (rg = -0.24). The DM noted that BN (rg = 0.22), LL (rg = 0.20), LN (rg = 0.21), and DF (rg = 0.28) all had very highly significant positive correlations. The DF was considerably positively connected with LW (rg = 0.23), PHM (rg = 0.22), LL (rg = 0.29), TILL (rg = 0.21), LN (rg = 0.28), PHF (rg = 0.25), and DM (rg = 0.28), but strongly negatively correlated with DE (rg = −0.22). With LW (rg = −0.35), PHM (rg = −0.47), LL (rg = −0.31), TILL (rg = −0.55), LN (rg = −0.55), DF (rg = −0.22), and GSFR (rg = −0.20), the DE displayed extremely strong negative associations. While TISL (rg = −0.25) showed a very strong and negative association, the GSFR showed extremely substantial and positive correlations with PHM (rg = 0.19).

Yields' phenotypic and genotypic associations with other traits

In this study, LY and its yield-contributing characteristics were correlated in 15 ways, with 10 phenotypic associations and 12 genotypic relationships being significant (Table 3). The results showed that LY had positive phenotypic associations with LW (rp = 0.43), PHM (rp = 0.29), LL (rp = 0.40), PHF (rp = 0.37), LA (rp = 0.45), DF (rp = 0.41), and BN (rp = 0.23), all of which were highly significant. AIL (rp = −0.19), TISL (rp = −0.21), and TILL (rp = −0.19) all demonstrated a strong negative phenotypic connection with it as well. Additionally, LY had extremely strong positive genotypic correlations with PHM (rg = 0.25), LL (rg = 0.21), LN (rg = 0.18), TILL (rg = 0.17), LA (rg = 0.36), GY (rg = 0.20), and GSFR (rg = 0.18). Additionally, it showed a strong positive genotypic correlation with the BN, PHF, and DF (rg = 0.13, 0.14, and 0.11, respectively.

In the current study, GY and its yield-contributing characteristics had 17 significant correlations, of which 9 genotypic associations and 8 phenotypic relations were significant (Table 3). The findings revealed a very strong positive phenotypic correlation between GY and GSFR (rp = 0.92), AIL (rp = 0.38), TISL (rp = 0.38), and TILL (rp = 0.51). Additionally, it showed a substantial positive phenotypic connection with both LW and LL (rp = 0.18 and 0.21, respectively). Additionally, it had a very strong negative phenotypic correlation (rp = −0.24), (rp = −0.30), and (rp = −0.28) with BN, LN, and DM respectively. GSFR (rg = 0.95), TISL (rg = 0.28), PHM (rg = 0.21), and LY (rg = 0.21) likewise displayed a very highly significant positive genotypic correlation with the GY. Furthermore, it showed a highly significant positive genotypic association with LN (rg = 0.16), a substantial positive genotypic association with DF (rg = 0.12), and a significant positive genotypic association with TILL (rg = 0.12).

Phenotypic and genotypic path coefficients on leaf yield

Table 4 lists the estimations of the phenotypic effects, both direct and indirect, on LY in Amaranth genotypes. At the phenotypic level, the results showed that LA (0.3849; 47.93%) had the highest positive direct effect on LY, followed by AIL (0.4411; 34.283%), BN (0.3248; 30.17%), LW (0.1465; 17.894%), PHF (0.1420; 12.120%), and LN (0.0417; 7.135%). DM, DF, and GY were additional characteristics that had strong direct effects on LY. TILL (−0.3009; 26.735%), TISL (−0.1246; 12.059%), and PHM (−0.0976; 8.541%) all had detrimental direct effects on LY. Similarly, LL and DE directly have negative effects on LY. Except for AIL, BN, TISL, LN, DM, and DE, the examined characteristics had positive indirect effects on LY via LA. Likewise, GY had positive indirect effects on LY via LW, AIL, PHM, DM, LA, and DE. LW (0.2968; 36.248%), LL (0.2940; 35.758%), DF (0.1826; 14.206%), PHF (0.1230; 10.499%), PHM (0.0796; 6.960%), GY (0.0471; 7.717%), and TILL (0.0027; 0.243%) via LA all had the strongest positive indirect effects on LY. Through LA (0.2968; 36.248%), PHF (0.0681; 8.316%), BN (0.0397; 4.851%), DF (0.0554; 4.118%), TISL (0.0227; 2.777%), DE (0.0080; 0.976%), TILL (0.0044; 0.533%), GY (0.0023; 0.276%), and LN (0.0006; 0.072%), LW had positive indirect effects on LY. Likewise, the positive indirect effects of PHM were observed on LY via BN (0.2396; 20.959%), TILL (0.1317; 11.519%), PHF (0.1110; 9.713%), LW (0.0657; 5.746%), DF (0.0390; 3.415%), TISL (0.0381; 3.329%), and LN (0.0103; 0.903%).

Table 4 Estimated phenotypic direct (horizontal) and indirect (off-diagonal) effects of 14 traits on leaf yield of 120 amaranth genotypes in Hawassa University agricultural research site in 2020 and 2021 cropping seasons.

Table 5 shows the magnitudes of the direct and indirect genotypic effects of various traits on LY in Amaranth genotypes. LW (0.2225; 38.256%), PHF (0.1378; 73.869%), PHM (0.1277; 23.738%), GY (0.1264; 45.404%), LN (0.0969; 15.269%), TISL (0.0827; 11.643), and BN (0.0168; 8.513%) showed the next largest direct effects on LY after LA (0.3766; 78.608%). TILL (−0.0837; 13.441%), LL (−0.0672; 11.643%), AIL (−0.0760; 30.517%), and DM (−0.0166; 8.928%) on the other hand, had negative direct effects on LY. DE similarly showed adverse direct effects on LY. Despite having the largest positive direct effect on LY, LN, TISL, TILL, LL, LW, AIL, PHM, and DM had negative indirect effects through LA. BN (0.0351; 17.784%) had strong positive indirect effects on LY via LA, followed by DE (0.0352; 8.591%), GY (0.0218; 7.836%), DF (0.0078; 3.023%), and PHF (0.0066; 3.554%). Similar indirect effects via GY were seen for the following variables: DF (0.0157; 6.104%), PHM (0.0261; 4.858%), LN (0.0202; 3.184%), TILL (0.0110; 2.408%), PHF (0.0041; 2.215%), LW (0.0118; 1.527%), LA (0.0073; 2.408%), and LL (0.0069; 1.198%). But LN (−0.0898; 14.149%) had the most detrimental indirect effect on LY through LA.

Table 5 Estimated genotypic direct (horizontal) and indirect (off-diagonal) effects of 14 traits on leaf yield of 120 amaranth genotypes in Hawassa University agricultural research site in 2020 and 2021 cropping seasons.

Phenotypic and genotypic path coefficients on grain yield

Table 6 lists the assessed traits' phenotypic direct and indirect effects on GY. According to the findings, the variables that had the largest positive direct effects on GY were GSFR (0.9379; 79.845), BN (0.1825; 0.1825%), PHF (0.141; 0.1401%), TILL (0.1620; 17.694%), PHM (0.0678; 9.398%), AIL (0.661; 7.839%), and LA (0.0193; 0.0016%). The negative direct effects on GY were also exerted by DF (−0.1182; 16.302%), LW (−0.0789; 13.054%), LY (−0.0306; 10.146%), DE (−0.0452; 8.731%), LL (−0.0349; 5.591%), TISL (−0.0325; 5.053%), and LN (−0.0195; 3.464%). Despite the high positive direct effects of GSFR on GY, its indirect effects via LW, PHM, BN, LL, TISL, PHF, and, LY were negative. Along with the strongest positive direct effect on GY, the GSFR also had positive indirect effects via AIL, TILL, LN, LA, DF, and DE. BN positively affected GY both directly and indirectly through PHM, TISL, and PHF. The biggest positive indirect effects via BN were obtained by PHM (0.1347; 18.653%), PHF (0.11; 16.281%), DF (0.1056; 14.567%), LY (0.41; 13.822%), LN (0.07; 12.588%), LW (0.02; 3.694%), and LL (0.210; 3.358%). DE came in second (0.1056; 20.406%). TILL (0.4130; 45.127%), LA (0.2258; 45.042%), LL (0.2679; 42.887%), LW (0.2590; 42.855%), and TISL (0.2446; 37.999%) all contributed high and positive indirect effects via GSFR. However, BN had the largest negative indirect effects through GSFR, followed by LN, DE, PHF, PHM, and DF.

Table 6 Estimated phenotypic direct (horizontal) and indirect (off-diagonal) effects of 14 traits on grain yield of 120 amaranth genotypes in Hawassa University agricultural research site in 2020 and 2021 cropping seasons.

Table 7 lists the examined traits' genotypic effects on GY, both directly and indirectly. GSFR (0.9297; 92.706%) had the most positive direct effect on GY, followed by PHF (0.0215; 23.943%), PHM (0.0533; 14.498%), and LN (0.0358; 10.385%). On the contrary, the highest negative direct effect on GY was exerted by DF (−0.0957; 27.051%), followed by BN (−0.0206; 18.309%), DE (−0.0531; 15.039%), TISL (−0.0420; 12.860%), TILL (−0.0354; 11.320%), LL (−0.0260; 9.961) and LA (−0.0112; 9.125%). GSFR had a highly positive direct effect on GY, whereas it had negative indirect effects via DF, TILL, LL, LW, and LA. PHM's positive direct effects on GY were accompanied by positive indirect effects through LW, AIL, BN, LL, TISL, TILL, and DF. The PHF had a positive direct effect on GY, and through BN, LL, LA, DF, and DE, this trait also had negative indirect effects on GY. For DF (0.0055; 1.542%), LY (0.0030; 1.139%), LL (0.0021; 0.814%), BN (0.0011; 0.983%), PHM (0.0009; 0.233%), GSFR (0.0718; 0.072%), LA (0.0004; 0.309%), DE (0.0002; 0.063%), and LN (0.0001; 0.039%), the highest positive indirect effects via PHF were observed. The highest positive indirect effects via GSFR were recorded for LY (0.1661; 62.387%), DF (0.1889; 53.371%), LA (0.0654; 53.246%), PHM (0.1736; 47.216%), LN (0.1408; 40.816%), LW (0.1047; 36.799%), TILL (0.1081; 34.620%), PHF (0.0310; 34.414%), and LL (0.0742; 28.449%). On the contrary, the highest negative indirect effects via GSFR were observed in TISL, AIL, BN, and DE.

Table 7 Estimated genotypic direct (horizontal) and indirect (off-diagonal) effects of 14 traits on grain yield of 120 amaranth genotypes in Hawassa University agricultural research site in 2020 and 2021 cropping seasons.

Discussion

Understanding the relationships between characters is crucial for any crop improvement effort since it indirectly influences selection success. To find the component characters that can be used in selection to boost yield, a correlation analysis is conducted to evaluate the underlying correlations between distinct characters. The correlation coefficient measures the degree to which two variables are positively or negatively associated. In the current trial, from the two sets of correlation coefficients, 61(50.83%) of the 120 possible pairs of trait comparisons across traits showed that the genotypic correlation coefficient was higher than the corresponding phenotypic correlation coefficient, which showed that the association was largely due to genetic factors among various traits, and that enhanced genetic inherent association30,31,32, this aids in determining the attributes that will be used in the breeding scheme. Moreover, genetic correlations between two characters arise because of linkage, pleiotropic or developmentally induced functional relationships33. As a result, it is more important and may be used to build a successful selection system. However, the remaining pairs had higher phenotypic correlation coefficients corresponding to genotypic correlation coefficients suggesting that environmental factors influenced the inherent associations among the different traits under study.

The phenotypic correlation analysis indicated that the LW was associated significantly and positively with LL which also correlated significantly and positively with PHM, PHF, LA, and DF. Other research findings also showed that the LL had a significant positive association with LW and PHM, and significant positive associations with LA were reported at the phenotypic level34,35. The LL had a very high positive correlation with LW, PHM, PHF, LA, DF, and GSFR. Similar to LW and LA, Jangde et al.34 found a substantial positive association between leaf length and both. In the current finding of PHM, BN, LL, DM, LA, DF, and DE, it was shown that the PHF had a very highly significant positive association with each of these variables. Singh36, discovered comparable results in the previous study on amaranth genotypes for plant height exhibited positive significant phenotypic correlations with days to 50% flowering and number of branches. The strong and positive link between leaf width and length also raises the prospect of enhancing both characteristics at the same time. A similar finding was reported by Olusanya37. BN was significantly and positively correlated with LN, PHM, and PHF, which indicates that selecting genotypes with taller plant heights would result in a greater number of branches, which is necessary for the production of leaf yields. Akter et al.38, came to the same results. DF was significantly and positively correlated with BN, PHF, and PHM. Similar findings were reported by Singh36. Days to maturity showed a positive and highly significant correlation with plant height. Similar results were reported by Shrivastav et al.39. The time of flowering are important adaptive trait in grain amaranth15,40. It was observed that leaf number and leaf area correlated negatively, suggesting that the more leaves amaranth produced, the smaller those leaves were. A negative correlation between the number of leaves and leaf area might act as a challenge in the improvement of leaf yield in amaranths. Furthermore, this pattern suggests that the length of the leaf and the area of the leaf decrease as the number of leave increase. Breeding strategies should focus on increasing the area of each leaf or on producing more leaves in each main branch. Similar results were reported by Gerrano et al.41.

Genotypic correlation between traits is important for identifying the most and least, important characteristics to be considered for the success of selection in breeding42. At the genotypic level, genes governing two separate traits are correlated positively at the coupling phase of linkage and negatively at the repulsive phase of linkage43. The relationship between leaf length and leaf width is very strong. Similarly, most of the leaf traits showed high positive correlations with one another, indicating that these traits are crucial for improving and selecting amaranth yields. The results of the trait correlations in this study imply that selection for these features in amaranth genotypes will be efficient and successful. Similar findings on garden peas were reported by Sharma and Sharma44. With the following variables BN, LL, TILL, LN, DF, and GSFR, the PHM demonstrated highly significant positive relationships. Similar results were reported by Shrivastav et al.39, and inflorescence length and days to 50% flowering were positively and extremely significantly correlated with plant height, according to Yadav et al.45. A positive and significant correlation with lateral inflorescence length, leaf length, leaf width, number of branches per plant, and days to maturity was obtained46. Similar results were found by Varalakshmi and Pratap Reddy47.

For the leaf yield, the phenotypic correlation studies carried out for this investigation revealed a strong and positive association between economically important parameters such as LA, LW, and LL. The results show that any enhancement of these characteristics will increase the potential LY of amaranth. Similar trends were identified by Kumar48, in their analysis. Therefore, consideration for these traits should be given in amaranth breeding programs. LY had significant genotypic and phenotypic relationships with LW, PHM, LL, PHF, LA, DF, and BN. The taller genotypes in the current study produced more leaves, which resulted in higher leaf yields; hence PHM as well as PHF did forecast the genotypes' high leaf-yielding potential due to their significant association. However, the fact that PHM and PHF had a negative association with GY suggested that taller genotypes had poor grain yields. Therefore, selecting a genotype for higher leaf yield by focusing on the taller genotypes and shorter genotypes would produce good grain yields. As a result, the breeding strategy gave higher priority to characters that increased LW, PHM, LL, PHF, LA, DF, and BN, increasing the chances of obtaining a genotype with a high leaf yield. Similar results were reported in amaranth genotypes by Varalakshmi and Pratap Reddy47, who reported that plant height, leaf length, and leaf width were positively and significantly correlated with the leaf yield. Moreover, choosing genotypes with taller plant heights will result in a better number of branches, which is required for the production of leafy vegetables, according to the statistically significant positive association between plant height and the number of branches. A similar finding was reported by Olaniyi49 Vegetable amaranths are said to benefit from late flowering and maturation since farmers would have more time to harvest the leaves50. A single gene controls early flowering in amaranth, and the dominant allele determines early flowering51. For grain production, early flowering and maturity would be preferable. The positive and significant correlations between yield and yield components in amaranth were also reported by52,53.

The results showed that at the phenotypic level, GY with GSFR, AIL, TISL, and TILL have extremely high positive phenotypic correlations. Furthermore, it demonstrated a strong positive phenotypic relationship with both LW and LL. The results suggest that these traits might be simultaneously enhanced to raise grain yield either individually or in combination. Showemimo et al.54, reported comparable results. As a result, amaranth should be chosen for future breeding programs to ensure the maximum exploitation of grain amaranth, as the photosynthetic capacity of a plant with longer leaves is predicted to be higher than that of a plant with shorter leaves55, and traits like a high leaf area, leaf length, and plant height enable optimal crop output when water is accessible as an essential agricultural input in sufficient quantities56. Grain yield exhibited a highly significant and positive phenotypic association with inflorescence length (TISL, TILL, and AIL). Nyasulu et al.57, Kumar58, and Yadav et al.45, obtained consistent findings in amaranth species. As a result, characteristics may be prioritized during selection to produce genotypes with higher grain yield.

The genotypic relationship between GSFR, PHM, and LY with the GY was also very substantial and positive. It also revealed a highly significant positive genotypic association with LN, a substantial positive genotypic association with DF, and a significant positive genotypic link with TILL. These findings demonstrated that selection for any one of these traits that contribute to grain yield will result in increases in the other traits, thereby finally boosting the grain yield. To select genotypes with higher grain production, primary selection for characteristics like GSFR, PHM, and LY may be prioritized. There were similar trends for the association of GY with other traits (PHM, TILL, and DF) that may be prioritized45. Grain yield and leaf area, length, and width are strongly correlated with one another. Similar to grain yield, all leaf traits showed high positive correlations with one another, indicating that these features are important for selecting amaranth leaf and grain productivity. The findings of the trait correlations in this study imply that selection for GSFR, PHM, and LY will be efficient and beneficial in improving grain production in grain amaranth genotypes, as also found in garden peas by Sharma and Sharma44. In addition, the positive correlation between grain yields and days of flowering revealed that early flowering would be a viable choice for a greater grain yield in this scenario. These results appear to be in line with a general pattern that early blooming is linked to higher grain yields across species37,59. These findings appear inconsistent with a general trend for late flowering to be associated with high grain yield across species37.

The relationship between any two features is merely depicted by their correlation coefficients, which do not reveal any potential underlying causes. The degree of the relationship between the yield and yield components can be determined by path coefficient analysis, which also helps to clarify the cause-and-effect interactions between different characters. Path coefficient analysis was performed to segregate the correlation coefficient into the direct and indirect effects of various features on yield. The yield-contributing features for LY and GY's varied phenotypic and genotypic correlation coefficients were further divided into direct and indirect effects. According to the findings, both phenotypic and genotypic levels of LY were positively affected directly by the LW, BN, LN, PHM, LA, and GY. This suggests that these traits can be used to develop a selection index that is both maximally reliable and effective at improving the leaf yield of amaranth genotypes. These findings confirmed the results of Aycicek and Yildirim60, for PHF in wheat and amaranths genotypes48,61, for the number of leaves in amaranths genotypes, Islam62 for leaf area, and Jangde et al.34 for leaf width . LL, TILL, and DE, on the other hand, were found to have the greatest negative direct effects on LY, both at the phenotypic and genotypic levels, showing that direct selection of these traits did not improve LY in amaranth genotypes. Following the present findings, the LL had a negative direct influence on LY in amaranth genotypes35,48,62, However, since PHM, LL, and TILL had such strong indirect effects, mostly through LA, while DM and DE exhibited such strong indirect effects, primarily through PHF, it was anticipated that LY in amaranth genotypes may also rise through an indirect selection of these traits. While TISL had positive direct effects on LY at the genotypic level, AIL had positive direct effects on LY solely at the phenotypic level. Additionally estimated was the residual effect, which establishes the unaccounted variability of the dependent variables (LY and GY). The characteristics that were included in the phenotypic and genotype path coefficient analyses explained 87.12% and 87.79% of the total variance in LY, respectively, according to the residual effects of 0.1288 and 0.1221. These results suggest that the independent variables taken into account in this study effectively represented the diversity of LY in amaranth genotypes. Other factors (at phenotypic level 12.88% and genotypic level 12.21%) that caused variations in LY, but were not taken into account in the current investigation, did exist.

A path analysis' findings revealed the importance of GSFR, PHM, and PHF traits for direct selection since they had considerable positive direct effects on GY both at the phenotypic and genotypic levels. This suggests that GY improvement in amaranth genotypes might be obtained through selection based on these characteristics. The result was congruity with35,48 for the plant height in amaranth genotypes. On the other hand, LW, TISL, LL, and DF had negative direct effects at both phenotypic and genotypic levels indicating that direct selection based on these traits could not improve the GY in amaranth genotypes. The results were also in agreement with the findings of Shrivastav et al.39, who reported that DF had a negative direct effect on GY in amaranth genotypes. The direct selection of LW, TISL, LL, and DF at both phenotypic and genotypic levels would decrease the GY; however, their indirect selection through mainly PHM may improve the GY in amaranth genotypes. PHM had beneficial indirect effects on LW, LL, TISL, and DF in addition to its beneficial direct effects on GY. For the indirect selection of high seed-yielding genotypes in amaranths, the DF may be a trustworthy indicator. Alake and Otusanya63, obtained consistent findings in amaranths genotypes. The direct effect of an AIL, BN, TILL, LA, and DE on GY was negative only at the genotypic level while LN and LY had a positive direct effect on GY only at the genotypic level. Other research findings also showed that LN had a positive direct effect at the genotypic level on GY in amaranth genotypes64. While LY had a negative direct effect on GY only at the phenotypic level, AIL, BN, TILL, and LA had positive direct effects on GY only at the phenotypic level. On grain yield in amaranth genotypes, the same observation was made for TILL, BN, LA, and AIL65. Furthermore, LW, AIL, BN, TISL, TILL, LN, DF, LY, and GSFR might be indirectly selected, mostly through PHM, to improve the GY at the genotypic level. According to the residual values of 0.1005 and 0.1140, the traits included in the phenotypic and genotypic path coefficient analyses, respectively, explained 89.95% and 88.6% of the overall variation in GY. This suggests that additional traits may also be influencing GY.

When combined with path coefficients, correlation coefficients offer more accurate data that may be successfully predicted in crop development initiatives66. The findings indicated that selection based on the phenotypic expression of LW, BN, LN, PHF, LA, and GY could be successful for LY improvement in amaranth genotypes these traits had positive direct effects on LY along with a significant positive association with LY at both phenotypic and genotypic levels. On the other hand, LL demonstrated negative direct effects on LY both at the phenotypic and genotypic levels, even though LL, DF, and GSFR had significant positive phenotypic and genotypic associations with LY. The occurrence of significant positive phenotypic and genotypic associations along with inverse direct effects between LY and these traits could be due to the high positive counterbalancing potential of the phenotypic and genotypic indirect effects exerted by these traits via mainly LN. The genotypic direct effect of DM on LY was negative despite the non-significant positive genotypic association between the two traits, indicating that the positive genotypic association between these traits was caused by the high and positive neutralizing genotypic indirect effect of this via chiefly by LA. Despite AIL's positive phenotypic direct effect on LY, its negative phenotypic correlation with LY might be because of AIL's significant negative phenotypic indirect effect, which largely acts through LA.

The GSFR had significant positive phenotypic and genotypic relationships with GY as well as positive direct impacts on GY, suggesting that selection for genotypes of amaranth based on the phenotypic expression of these traits may be successful in improving GY. Wei et al.67 obtained consistent findings in rice, while Knott and Gebeyehou68, reported equivalent results in bread wheat. Likewise, GSFR, PHF, and LY exhibited significant positive genotypic associations with GY as well as positive genotypic direct effects, revealing that genotypic selection based on the phenotypic expression of these traits would be effective for enhancing GY in amaranth genotypes. Although PHM and PHF had positive phenotypic and genotypic direct effects on GY, their correlations with GY were negative and insignificant on both a phenotypic and genotypic level. This might be because PHM and PHF's indirect actions, which primarily affect BN, LL, and DF, have significant and detrimental neutralizing potential. The phenotypic and genotypic direct effects of LW, LL, and TISL were negative but their associations with GY were positive at genotypic levels. This was due to the high and positive neutralizing potential of the genotypic indirect effects exerted by both traits via mainly PHM, LN, LA, and LY. The phenotypic (DE) and genotypic (DF and DE) direct effects on GY and their respective associations with GY were negative and insignificant indicating that the GY improvement based on the selection of these traits could not be effective in amaranth genotypes.

Conclusions

Information from correlation may be augmented with the use of route analysis to select features that significantly increase the grain and leaf yield of amaranth genotypes and hence enhance selection efficiency in breeding. The studies of phenotypic and genotypic correlation and path coefficients revealed that LA was the variable that was most crucial for the indirect selection of LY since it exhibited a strong positive direct influence on LY as well as a substantial positive correlation with LY. Furthermore, it was suggested that LY in amaranth genotypes may also rise by an indirect selection of PHM, LL, and TILL, which exhibited such potent indirect influences, mostly through LA, and DM and DE, which showed such strong indirect effects, primarily through PHF. Therefore, this study pointed out that traits that showed significant positive indirect impacts via LA should also be taken into account as indirect selection criteria for LY improvement in amaranth genotypes. Similarly, the studies of correlation and path coefficients demonstrated strong direct impacts of GSFR on GY, and noteworthy positive correlations of both traits with GY were seen at the phenotypic and genotypic levels. Additionally, PHF strongly influenced GY at the genotypic level and had a substantial positive correlation with GY. At both the phenotypic and genotypic levels, the GSFR also had a significant indirect favorable impact on GY, mostly through DF and LY. This showed that selection mostly focused on DF, LY, and GSFR would ultimately enhance the GY in amaranth genotypes. However, because DE has detrimental direct impacts on LY and GY at the genetic level, breeding progress in these genotypes might be negatively impacted.