Publication | Open Access
Inferring population structure in biobank-scale genomic data
46
Citations
30
References
2022
Year
GeneticsGenetic EpidemiologyGenomicsGenotype-phenotype AssociationMolecular EcologyHuman VariationComputational GenomicsBiostatisticsPublic HealthStatisticsPersonal GenomicsPopulationGenetic VariantsHaplotype DeterminationUk BiobankStatistical GeneticsPopulation StructureGenetic VariationPopulation Structure InferencePopulation GeneticsPopulation GenomicsMedicine
Inferring the structure of human populations from genetic variation data is a key task in population and medical genomic studies. Although a number of methods for population structure inference have been proposed, current methods are impractical to run on biobank-scale genomic datasets containing millions of individuals and genetic variants. We introduce SCOPE, a method for population structure inference that is orders of magnitude faster than existing methods while achieving comparable accuracy. SCOPE infers population structure in about a day on a dataset containing one million individuals and variants as well as on the UK Biobank dataset containing 488,363 individuals and 569,346 variants. Furthermore, SCOPE can leverage allele frequencies from previous studies to improve the interpretability of population structure estimates.
| Year | Citations | |
|---|---|---|
Page 1
Page 1