Publication | Open Access
Accurate Prediction of Genetic Values for Complex Traits by Whole-Genome Resequencing
433
Citations
17
References
2010
Year
GeneticsGenetic EpidemiologyLinkage AnalysisGenomicsGenomic SelectionWhole-genome ResequencingGenomic PredictionGenome-wide Association StudyGenetic AnalysisGenetic ValuesGenotype-phenotype AssociationMolecular EcologyComputational GenomicsEqual High DensityIndividual SnpsBiostatisticsPublic HealthPersonal GenomicsBayesian Nonlinear ModelComplex TraitsStatistical GeneticsGenetic VariationPopulation GeneticsEvolutionary BiologyPopulation GenomicsMedicine
Whole‑genome resequencing has rapidly advanced, making $1,000 human genome sequencing feasible, and its application to plant, animal breeding, and human disease risk prediction requires nonlinear methods beyond BLUP, with results potentially extendable to larger genomes and populations. The study aims to use whole‑genome sequencing data to predict individuals’ genetic values for complex traits and assess prediction accuracy. The authors simulated population history and genomic architecture under a Wright‑Fisher infinite‑sites model and applied a Bayesian nonlinear genomic‑selection approach that assumes only a few SNPs have nonzero effects to predict genetic values. Using whole‑genome sequence data increased prediction accuracy by over 40% versus dense 30K SNP chips, added 2.5–3.7% when causal mutations were included, and maintained accuracy even when training and evaluation data were 10 generations apart.
Whole-genome resequencing technology has improved rapidly during recent years and is expected to improve further such that the sequencing of an entire human genome sequence for $1000 is within reach. Our main aim here is to use whole-genome sequence data for the prediction of genetic values of individuals for complex traits and to explore the accuracy of such predictions. This is relevant for the fields of plant and animal breeding and, in human genetics, for the prediction of an individual's risk for complex diseases. Here, population history and genomic architectures were simulated under the Wright-Fisher population and infinite-sites mutation model, and prediction of genetic value was by the genomic selection approach, where a Bayesian nonlinear model was used to predict the effects of individual SNPs. The Bayesian model assumed a priori that only few SNPs are causative, i.e., have an effect different from zero. When using whole-genome sequence data, accuracies of prediction of genetic value were >40% increased relative to the use of dense approximately 30K SNP chips. At equal high density, the inclusion of the causative mutations yielded an extra increase of accuracy of 2.5-3.7%. Predictions of genetic value remained accurate even when the training and evaluation data were 10 generations apart. Best linear unbiased prediction (BLUP) of SNP effects does not take full advantage of the genome sequence data, and nonlinear predictions, such as the Bayesian method used here, are needed to achieve maximum accuracy. On the basis of theoretical work, the results could be extended to more realistic genome and population sizes.
| Year | Citations | |
|---|---|---|
Page 1
Page 1