Publication | Open Access
Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
2.7K
Citations
52
References
2014
Year
Large SamplesComparative GenomicsGeneticsHaploid Genotype AlignmentsGenomicsSequence AlignmentAccurate ReconstructionsMicrobial EvolutionHigh Throughput SequencingPhylogenetic AnalysisMolecular EcologySequence AnalysisHorizontal Sequence TransferBioinformaticsMicrobial SystematicsNatural SciencesEvolutionary BiologyRapid Phylogenetic AnalysisMicrobiologyMedicine
New sequencing technologies enable large bacterial whole‑genome alignments for evolutionary and outbreak studies, but growing datasets contain multiple horizontal transfer mechanisms that substantially alter chromosomes. The study aims to develop a rapid, flexible method that accounts for recombination in reconstructing recent bacterial isolate diversification. Gubbins is an iterative algorithm that scans alignments for recombination hotspots, builds a maximum‑likelihood phylogeny from non‑recombinant sites, and is freely available in Python/C for Linux and Mac OS X. Simulations show Gubbins produces highly accurate phylogenies and converges in a few hours on hundreds of genomes.
The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates' recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X.
| Year | Citations | |
|---|---|---|
2010 | 15.1K | |
2010 | 4K | |
1997 | 3.9K | |
1981 | 2.9K | |
1993 | 1.8K | |
2010 | 1.6K | |
2006 | 1.4K | |
2010 | 1.1K | |
2011 | 903 | |
1989 | 876 |
Page 1
Page 1