Publication | Open Access

Estimating Species Trees from Unrooted Gene Trees

Citations: 277

References: 46

Year: 2011

TLDR

The study develops NJst, a distance-based method for inferring unrooted species trees from unrooted gene trees. NJst builds a neighbor‑joining tree from a distance matrix in which the distance between two species is their average internode distance across gene trees. Under the coalescent model, NJst is statistically consistent when gene trees are known or estimated correctly. In simulations it recovers species tree topologies about as well as STAR, while the Bayesian method BEST outperforms both. Unlike BEST and STAR, NJst can work from unrooted gene trees without an outgroup, and it can handle missing data.

Abstract

In this study, we develop a distance method for inferring unrooted species trees from a collection of unrooted gene trees. The species tree is estimated by the neighbor joining (NJ) tree built from a distance matrix in which the distance between two species is defined as the average number of internodes between two species across gene trees, that is, average gene-tree internode distance. The distance method is named NJst to distinguish it from the original NJ method. Under the coalescent model, we show that if gene trees are known or estimated correctly, the NJst method is statistically consistent in estimating unrooted species trees. The simulation results suggest that NJst and STAR (another coalescence-based method for inferring species trees) perform almost equally well in estimating topologies of species trees, whereas the Bayesian coalescence-based method, BEST, outperforms both NJst and STAR. Unlike BEST and STAR, the NJst method can take unrooted gene trees to infer species trees without using an outgroup. In addition, the NJst method can handle missing data and is thus useful in phylogenomic studies in which data sets often contain missing loci for some individuals.
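
The two steps described in the abstract (average gene-tree internode distances, then neighbor joining) can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several things here are assumptions made for illustration: gene trees are given as undirected edge lists with hypothetical internal node names (`u`, `v`, `w`), each species has one sequence per gene tree, "number of internodes" is read as the number of internal nodes on the path between two leaves, and missing data is handled by averaging each pair only over the gene trees that contain both species. The function names are also hypothetical, and only the topology is returned, not branch lengths.

```python
from collections import deque
from fractions import Fraction
from itertools import combinations


def internode_distances(edges):
    """Map each leaf pair to the number of internal nodes on the path joining them."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    leaves = [n for n, nbrs in adj.items() if len(nbrs) == 1]
    dist = {}
    for src in leaves:
        # Breadth-first search gives the edge count from src to every node.
        depth = {src: 0}
        queue = deque([src])
        while queue:
            node = queue.popleft()
            for nxt in adj[node]:
                if nxt not in depth:
                    depth[nxt] = depth[node] + 1
                    queue.append(nxt)
        for dst in leaves:
            if dst != src:
                # A path with k edges passes through k - 1 internal nodes.
                dist[frozenset((src, dst))] = depth[dst] - 1
    return dist


def average_internode_distances(gene_trees):
    """Average each pair's distance over the gene trees containing both species."""
    totals, counts = {}, {}
    for edges in gene_trees:
        for pair, d in internode_distances(edges).items():
            totals[pair] = totals.get(pair, 0) + d
            counts[pair] = counts.get(pair, 0) + 1
    # Exact fractions keep ties in the joining step deterministic.
    return {pair: Fraction(totals[pair], counts[pair]) for pair in totals}


def neighbor_joining_topology(dist):
    """Standard neighbor joining on a pairwise distance dict; returns an
    unrooted topology as nested tuples. Assumes every pair has a distance."""
    D = dict(dist)
    nodes = sorted({x for pair in D for x in pair})
    while len(nodes) > 3:
        n = len(nodes)
        r = {i: sum(D[frozenset((i, k))] for k in nodes if k != i) for i in nodes}
        # Join the pair minimizing Q(i, j) = (n - 2) d(i, j) - r(i) - r(j).
        i, j = min(combinations(nodes, 2),
                   key=lambda p: (n - 2) * D[frozenset(p)] - r[p[0]] - r[p[1]])
        new = (i, j)
        for k in nodes:
            if k not in (i, j):
                D[frozenset((new, k))] = (
                    D[frozenset((i, k))] + D[frozenset((j, k))] - D[frozenset((i, j))]
                ) / 2
        nodes = [k for k in nodes if k not in (i, j)] + [new]
    return tuple(nodes)


# Toy input: four species, four unrooted gene trees, one of them missing D.
gene_trees = [
    [("A", "u"), ("B", "u"), ("u", "v"), ("C", "v"), ("D", "v")],  # AB|CD
    [("A", "u"), ("B", "u"), ("u", "v"), ("C", "v"), ("D", "v")],  # AB|CD
    [("A", "u"), ("C", "u"), ("u", "v"), ("B", "v"), ("D", "v")],  # AC|BD
    [("A", "w"), ("B", "w"), ("C", "w")],                          # D missing
]
dist = average_internode_distances(gene_trees)
tree = neighbor_joining_topology(dist)
print(tree)
```

On this toy input the majority split AB|CD is recovered: A and B end up grouped in the returned topology. No rooting or outgroup is needed at any point, since both the distances and the joining step are defined on unrooted trees. A pair of species that never co-occurs in any gene tree would have no distance and would need separate handling, which this sketch does not attempt.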
