International Journal of Statistics and Applications
p-ISSN: 2168-5193 e-ISSN: 2168-5215
2020; 10(6): 150-159
doi:10.5923/j.statistics.20201006.02
Received: Nov. 20, 2020; Accepted: Dec. 11, 2020; Published: Dec. 15, 2020

Siraj Osman Omer1, Abdel Wahab Hassan Abdalla2, Narendra Pratap Singh3, Hemant Kumar3, Murari Singh4
1Experimental Design and Analysis Unit, Agricultural Research Corporation (ARC), Wad Medani, Sudan
2Department of Agronomy, Faculty of Agriculture, University of Khartoum, Sudan
3Indian Institute of Pulses Research (IIPR), India
4International Center for Agricultural Research in the Dry Areas (ICARDA), Amman, Jordan
Correspondence to: Siraj Osman Omer, Experimental Design and Analysis Unit, Agricultural Research Corporation (ARC), Wad Medani, Sudan.
| Email: | ![]() |
Copyright © 2020 The Author(s). Published by Scientific & Academic Publishing.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

Mixed models are suited to describe the parameterization needed to estimate variance components due to genotypes, the environment and genotype × environment interaction over several locations and years. In Bayesian approach, incorporating the prior information of variance component from multi environment trials on the genotypic parameters available from previous similar trials has potential for adding value to the crop breeding program and genetic variability. The objective of this study was to obtain Bayesian estimates of variance components, heritability in broad-sense and genetic advance due to selection for seed yield of chickpea. Chickpea yield (kg/ha) on twelve genotypes data were collected from a series of multi-year multi-location trials conducted in randomized complete block designs in Indian environments. An MCMC estimator is implemented in the WinBUGS and R software for Bayesian posterior. The differences in variance component estimates obtained by two approaches, the classical approach using restricted maximum likelihood method and the Bayesian approach, were investigated. Bayesian estimate of heritability for seed yield on the plot-basis was different from that on the mean-basis, as may be expected. For seed yield, the Bayesian estimates of heritability were 9% on plot basis and 52% on mean basis, and the genetic advance due to selection was 7% using half-t prior. and were 13% on plot-basis and 58% on mean-basis, and the genetic advance due to selection was 8% using half-normal prior, which is higher in comparison to the frequentist approach.
Keywords: Bayesian analysis, Variance Components, Heritability, Genetic Advance, MCMC
Cite this paper: Siraj Osman Omer, Abdel Wahab Hassan Abdalla, Narendra Pratap Singh, Hemant Kumar, Murari Singh, Bayesian Estimation of Variance Components, Heritability and Genetic Advance from Multi-Year and Location Chickpea Trials in Indian Environments, International Journal of Statistics and Applications, Vol. 10 No. 6, 2020, pp. 150-159. doi: 10.5923/j.statistics.20201006.02.
is modeled as follows:![]() | (1) |
is the yield response of the genotype i in the location j, year (or season) t and block k;
= grand mean,
is the effect of the genotype i,
is the effect of the location j.
is the effect of the year t and
is the effect of block k within location (j) and year (t). This model is useful for multiple experiments trials conducted over locations and repeated in different time; with associated subscripts,
is the effect interaction between locations and years,
is the effect interaction between genotypes and locations,
is the effect interaction between genotypes and years,
is the effect interaction between genotypes, locations and years.
is the residual error from the plot for
, and assumed to be normally distributed with homogeneous variance
It is assumed that
and
are normally and independently distributed, with means zero and variances
and
respectively. The values of vector indices are
and
where
and
are number of locations, years, genotypes and blocks respectively. Bayesian estimation of variance component estimation will be based on the linear mixed model in equation (1). Bayesian approach therefore uses wide- adaptation rather than specific- adaptation where one pools the GY and the GLY interaction components to estimate temporal stability of genotypes [26].
where
The next level of the Bayesian hierarchy includes prior distributions for location, year, and location parameters (i.e., means)
and their variances. In REML model all priors distribution were implied as normal distribution with means zero and variances defined to condition the desired level of information sharing among levels of the factor [28]. Independent prior distributions were assigned for the parameters used. These are specified as follows: for block effect =
, effect of location =
, effect of year =
, effect of location and year interaction =
, effect of genotypes =
, effect of genotypes and location interaction =
, effect of genotypes and year interaction =
, and effect of GLS =
. The location factor was argued to be with random effects when the main interest of the analysis lies in the estimation of variance components for locations that are representative of the relevant production area within the target region [29].
and
and experimental error (environmental) variance by
.In this model the environment E is partitioned as L + S + L x S): Broad sense heritability on a mean-basis
where
where
is genotypes variance,
is estimate of location by genotypes interaction variance,
is estimate of year by genotypes interaction variance,
is estimate of year x location x genotypes interaction variance,
is experimental error variance [33],
where the phenotypic variance, 
Gelman reviewed several options for non-informative priors for scale parameters in hierarchical models and suggested the use of uniform,
and half-normal families of distributions [34]. Crossa et al., used inverse-gamma distribution as a prior for variance components [29]. Based on normal distribution of the trait, the genetic gain due to selection of model,
, at selection intensity
is given by
where
The truncation point
in the standard normal distribution is given by the equation
where
is the trial mean. When
= 0.20,
= 1.4 [35].
of the SDC
, defined as the inverse of its variance. The a priori distribution for the SDC
, may be denoted as the positively-truncated-normal: 
,
,
,
,
,
,
and
.2) P1: the priors for the standard deviation components
and
follow Uniform (0, 1000), 3) P2: the priors for the standard deviation components
follow Half-t distribution
= (Half-t (0, 4,3). Here,
is non-centrality parameter and
is the degree of freedom of the t-distribution. The values of
and
are set at 5 and 2 respectively.4) P3: the priors for the standard deviation components
follow Half-normal distribution
= Half- normal (0.001, 0.01)I(0,),
is precision parameter,
given as inverse of variance.Winbugs and R codes have been presented in a Bayesian framework in appendixes presented in the paper. The example files contents of the WinBUGS and R codes are given in Appendices A1 and A2. The number of iterations was set at 50,000, the number of chains was set at three, and the last 5,000 simulated values of the parameters were taken for evaluating the posterior distributions.
|
|
|
or posterior expected value of the error variance given the plot-wise response data under the assumed linear model and half-normal prior (P0) slightly lower than the frequentists approach, and experimental error variance using half-t prior (P2) is slightly higher than the frequentists approach or classical least-square estimate. Bayesian estimate of heritability in compare to frequentist approach based on mean basis were (0.52 vs. 0.56) for P2 and (0.57 vs0.56) for P0. Bayesian estimate of heritability in compare to frequentist approach based on plot basis were (0.09 vs. 0.05) for P2 and (0.14 vs0.05) for P0. Bayesian estimate of genetic advance in compare to frequentist approach based on mean-basis were (7.2 vs 6.2) for P2 and (7.9 vs. 6.2) for P0. Half- normal distribution ((P0) given lowest CV% was 10.6 in comparing with a half-t prior and classical approach were (15.1) and (15.44) respectively. The CV% estimate based on mean value under frequentist and Bayesian approach was quite different, thus, indicating reliable numerical approximation through the number of simulations runs used. In other words, coefficients of variation for each prior are different to each other, indicating a major part of phenotypic variations belonging to genotypic variation (Table 2). The Bayesian estimate of experimental error variance based on mean value using P0 is slightly lower than that under the Frequentist approach were (51310 vs. 114469), in percentage (31% vs. 69%). The Bayesian estimate of the environment (location × variety interaction) based on mean value were (54560 vs.7951) or in percentage (87% vs. 13%) using P0 and were (57380 vs.7951) or in percentage (88% vs. 12%) using P2 are higher than that under the frequentist approach. The Bayesian estimate of the environment (year ×location interaction) based on mean value were (206700 vs.467174) or in percentage (31% vs. 69%) using P0 and were (330400 vs. 467174) or in percentage (41% vs. 59%) using P2 are higher than that under the frequentist approach. Bayesian estimate of variance components of all parameters include (year × location interaction), (location × year × variety interaction) and the plot errors are very smallest in comparison to the frequentist approach. While Bayesian estimate of variance components of all parameters include (genotypes, year × genotype interaction, location × variety interaction based on plot are very higher in comparison to the frequentist approach. Bayesian estimates of heritability and genetic advance under two approaches are more efficient; however, Bayesian approach provided confidence interval. Bayesian heritability and genetic advance estimates have been found to be useful in indicating the relative values of selection based on phenotypic expression of different characters. The Monte Carole error in all the parameters of Table 3 and Table 4 are small, indicating reliable numerical approximation through the number of simulations runs used. The distribution of variance components and heritability from Bayesian approach are skewed reflecting a remarkable difference between their means and their variances in both models. It was found that the posterior mean (i.e., the Bayesian estimates) was higher than the corresponding frequentist estimates for heritability and genetic advance. For all the estimates, the posterior standard deviations in the Bayesian approach were smaller than the corresponding standard errors in the frequentist approach.![]() | Table 5. Predicted values of the genotypes under classical model and Bayesian approach for chickpea for seed yield from the trials in 18 environments (2006 – 2008), at (Delhi, Sriganganagar, Kanpur, Faizabad, Sehore and Junagarh) in India based on half-t-prior |
| [1] | Cuevas, J., Crossa, J., Montesinos-López, O.A. and Burgueño, J. (2017). Bayesian genomic prediction with genotype environment interaction kernel models. G3: Genes, Genomes, Genetics, 7(1)43-53. |
| [2] | Crossa, J., Perez-Elizalde, S., Jarquin. D., Cotes. M.J., Viele. K., Liu, G. and Cornelius, P.L. (2011). Bayesian estimation of the additive main effects and multiplicative interaction model. Crop Science, 51(4): 1458-1469. |
| [3] | Vargas, M., Combs, E., Alvarado, G., Atlin, G., Mathews, K, and Crossa, j. (2013). META: A suite of SAS programs to analyze multi-environment breeding trials. Agronomy Journal 105(1): 11-19. |
| [4] | Omer, S.O., Abdalla, A.W.H., Mohammed, M.H. and Singh, M. (2015). Bayesian estimation of genotype-by-environment interaction in sorghum variety trials. Communications in Biometry and Crop Science 10: 82–95. |
| [5] | Shahriari, Z., Heidari, B., Dadkhodaie, A. (2018). Dissection of genotype × environment interactions for mucilage and seed yield in Plantago species: Application of AMMI and GGE biplot analyses. PLoS ONE 13(5): e0196095. https://doi.org/10.1371/journal.pone.0196095. |
| [6] | Ceccarelli, S. (2015). Efficiency of Plant Breeding. Crop Science, 55(1):87-97. doi:10.2135/cropsci2014.02.0158. |
| [7] | Dror, H.A. and Steinberg, D.M. (2008). Sequential experimental designs for generalized linear models. Journal of the American Statistical Association, 103(481): 288-298. |
| [8] | Omer, S. O., Abdalla, M. S., Alzain, I. N. and Dafaalla, A. (2017). Bayesian credible intervals for maize grain yields of the maintenance varieties evaluated in Sudan. International Journal of Applied Sciences and Biotechnology, 5(3): 390-396. https://doi.org/10.3126/ijasbt.v5i3.18303. |
| [9] | Orellana, M.A. (2012). Bayesian prediction of crop performance modeling genotype by environment interaction with heterogeneous variances. Graduate Theses and Dissertations, 12740. |
| [10] | Stock, K.F., Distl, O., and Hoeschele, I. (2007). Influence of priors in Bayesian estimation of genetic parameters for multivariate threshold models using Gibbs sampling. Genetics Selection Evolution. 39(2): 123–137. |
| [11] | Singh M., Al-Yassin A. and Omer, S.O. (2015) Bayesian estimation of genotypes means, precision and genetic gain due to selection from routinely used barley trials. Crop Science 55: 501–513. |
| [12] | Jiang, Z. and Skorupski, W. (2018). A Bayesian approach to estimating variance components within a multivariate generalizability theory framework. Behavior Research 50: 2193–2214. |
| [13] | Gelman, A., Brooks, S., Jones, G., Meng, X.L. (2011). Handbook of Markov chain Monte Carlo: methods and applications. Chapman and Hall/CRC, New York. |
| [14] | Bardsley, J.M. and Cui, T. (2019). A Metropolis-Hastings-within-Gibbs sampler for nonlinear hierarchical-Bayesian inverse problems. In: Wood D., de Gier J., Praeger C., Tao T. (eds) 2017 MATRIX Annals. MATRIX Book Series, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-030-04161-8_1. |
| [15] | Searle, S. R., Casella, G. and McCulloch, C. E. (2006). Variance Components. Wiley, NewYork. |
| [16] | Luo, M.F., Boettcher, P.J., Schaeffer, L.R. and Dekkers, J.C. (2001). Bayesian inference for categorical traits with an application to variance component estimation. Journal of Dairy Science, 84(3): 694-704. Doi: 10.3168/jds.S0022-0302(01)74524-9. |
| [17] | Da Silva, C.P., de Oliveira, L.A., Nuvunga, J.J., Pamplona, A.K.A. and Balestre, M. (2019). Heterogeneity of variances in the Bayesian AMMI model for multi environment trial studies. Crop Science, 59(6): 2455-2472. Doi: 10.2135/cropsci2018.10.0641. |
| [18] | Sun, L., Hsu, J.S.J., Guttman, I. and leonard. T. (1996). Bayesian methods for variance component model. Journal of the American Statistical Association 91(434): 743-752. |
| [19] | Burgueño, J., Crossa, J., Cotes, J. M., Vicente, F. S. and Das, B. (2011). Prediction Assessment of Linear Mixed Models for Multienvironment Trials. Crop Science, 51(3): 944. doi: 10.2135/cropsci2010.07.0403. |
| [20] | Schmidt, P., Hartung, J., Rath, J. and Piepho, H.P. (2019). Estimating Broad-Sense Heritability with Unbalanced Data from Agricultural Cultivar Trials. Crop Science, 59(2): 525-536. |
| [21] | Gauch, H.G. (2006). Statistical analysis of yield trials by AMMI and GGE. Crop Science, 46(4): 1488–1500. |
| [22] | Aguate, F., Crossa, J. and Balzarini, M. (2019). Effect of missing values on variance component estimates in multi environment trials. Crop Science, 59(2): 508-51. Doi: 10.2135/cropsci2018.03.0209. |
| [23] | Zorić, M., Terzić, S., Sikora, V., Brdar-Jokanović, M. and Vassilev, D. (2016). Effect of environmental variables on performance of Jerusalem artichoke (Helianthus tuberosus L.) cultivars in a long term trial: A statistical approach. Euphytica, 213(1): 23. doi: 10.1007/s10681-016-1819-7. |
| [24] | Omer, S. O., Slafab, E. H. and Rathore, A. (2015). Bayesian analysis for genotype x environment interactions and the GGE-biplot assessment: Evaluation of balanced classifications with missing values. International Journal of Applied Sciences and Biotechnology, 3(2): 210-217. https://doi.org/10.3126/ijasbt.v3i2.11908. |
| [25] | Meyer, K. (2009). Factor-analytic models for genotype × environment type problems and structured covariance matrices. Genet Sel Evol 41, 21. https://doi.org/10.1186/1297-9686-41-21. |
| [26] | Lee, C. and Wang, C.D. (2001). Bayesian inference on variance components using Gibbs sampling with various priors. Asian-Australasian Journal of Animal Science, 14(8): 1051-1056. |
| [27] | Satagopan, J. M., Olson, S. H., & Elston, R. C. (2015). Statistical interactions and Bayes estimation of log odds in case-control studies. Statistical Methods in Medical Research, 26(2): 1021–1038. doi: 10.1177/0962280214567140. |
| [28] | Edwards, J.W. and Jannink, J.L. (2006). Bayesian modeling of heterogeneous error and genotype and environment interaction variances. Crop Science, 46(2): 820–833. |
| [29] | Crossa, J., Burguen˜o, J., Vargas, M. (2010). Statistical models for studying and understanding genotype 9 environment interaction in an era of climate change and increased genetic information. In: Reynolds M (ed) Climate change and crop production, CABI Climate Change Series. CIMMYT, Mexico, pp 263–283. |
| [30] | Srinivasan, M.R. and Ponnuswamy, K.N. (1993) Estimation of variance components based on triallel mating design. Theoretical and Applied Genetics 85: 593–597. |
| [31] | Wiggins, B., Wiggins, S., Cunicelli, M., Smallwood, C., Allen, F., West, D. and Pantalone, V. (2019). Genetic gain for soybean seed protein, oil, and yield in a recombinant inbred line population. The Journal of the American Oil Chemists' Society 96(1): 43-50. |
| [32] | Bekele, A. and Rao, T.N. (2014). Estimates of heritability, genetic advance and correlation study for yield and it is attributes in maize (Zea mays L.). Journal of Plant Sciences, 2(1): 1-4. |
| [33] | Omer, S.O., Abdalla, A. H., Ceccarelli, S., Grando, S. and Singh, M. (2014) Bayesian estimation of heritability and genetic gain for subsets of genotypes evaluated in a larger set of genotypes in a block design. European Journal of Experimental Biology, 04(03): 566-575. ISSN 2248-921. |
| [34] | Gelman, A. (2006). Prior distributions for variance parameters in hierarchical models. Bayesian Analysis 1: 515–533. |
| [35] | Singh M., Grando S., Ceccarelli S. (2006) Measure of repeatability of genotypes by location Interaction using data from barely trails in Northern Syria. Experimental Agriculture 42: 189–198. |