Publication | Closed Access
Bayesian multi-population haplotype inference via a hierarchical dirichlet process mixture
36
Citations
14
References
2006
Year
Unknown Venue
Bayesian StatisticBayesian MethodologyGeneticsInfinite MixturesGenetic EpidemiologyHaplotype InferenceBayesian InferenceMolecular EcologyMixture AnalysisBiostatisticsBayesian MethodsPublic HealthStatisticsBayesian Hierarchical ModelingPopulationHaplotype DeterminationStatistical GeneticsPopulation GeneticsMixture DistributionStatistical InferenceMedicineApproximate Bayesian Computation
Uncovering the haplotypes of single nucleotide polymorphisms and their population demography is essential for many biological and medical applications. Methods for haplotype inference developed thus far---including methods based on coalescence, finite and infinite mixtures, and maximal parsimony---ignore the underlying population structure in the genotype data. As noted by Pritchard (2001), different populations can share certain portion of their genetic ancestors, as well as have their own genetic components through migration and diversification. In this paper, we address the problem of multi-population haplotype inference. We capture cross-population structure using a nonparametric Bayesian prior known as the hierarchical Dirichlet process (HDP) (Teh et al., 2006), conjoining this prior with a recently developed Bayesian methodology for haplotype phasing known as DP-Haplotyper (Xing et al., 2004). We also develop an efficient sampling algorithm for the HDP based on a two-level nested Pólya urn scheme. We show that our model outperforms extant algorithms on both simulated and real biological data.
| Year | Citations | |
|---|---|---|
2001 | 7.5K | |
1973 | 4.7K | |
2006 | 3.5K | |
2001 | 3K | |
1995 | 2K | |
2005 | 1.4K | |
1973 | 1.3K | |
2001 | 1.2K | |
2001 | 1.1K | |
1999 | 987 |
Page 1
Page 1