Publication | Open Access
Introducing EzAAI: a pipeline for high throughput calculations of prokaryotic average amino acid identity
Citations: 468 | References: 19 | Year: 2021
Topics: Genetics, Molecular Biology, AAI Calculation, Genomics, Sequence Alignment, Bioinformatics Database, High Throughput Sequencing, Protein Synthesis, Phylogenetics, Molecular Ecology, Computational Genomics, Proteomics, High Throughput Calculations, Biochemistry, EzAAI Tool, Sequence Analysis, Protein Modeling, Protein Structure Prediction, Functional Genomics, Bioinformatics, Protein Bioinformatics, Protein Biosynthesis, Biology, Natural Sciences, Accurate AAI Calculation, Computational Biology, Protein Engineering, Systems Biology, Medicine
The average amino acid identity (AAI) is an index of pairwise genomic relatedness, and multiple studies have proposed its application in prokaryotic taxonomy and related disciplines. AAI resolves taxonomic structure above the species rank better than average nucleotide identity (ANI), which is a standard criterion for species delineation. However, an efficient and easy-to-use computational tool for AAI calculation in large-scale taxonomic studies is not yet available. Here, we introduce a bioinformatic pipeline, named EzAAI, which allows rapid and accurate AAI calculation for prokaryotic sequences. EzAAI is based on the MMseqs2 program and computes AAI values almost identical to those generated by the standard BLAST algorithm, while running significantly faster. The pipeline also provides a hierarchical clustering function for creating dendrograms, an essential part of any taxonomic study. EzAAI is available for download as a standalone Java program at http://leb.snu.ac.kr/ezaai.
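To make the AAI index concrete, the following is a minimal Python sketch of one common way to define it: the mean percent identity over reciprocal best hits between the proteins of two genomes. It is not EzAAI's implementation. The names `Hit`, `best_hits`, and `aai` are hypothetical, the hit tuples stand in for parsed output of a protein search tool such as MMseqs2 or BLAST, and choosing best hits by identity alone (with no coverage or identity thresholds) is a simplifying assumption.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical record: (query protein ID, target protein ID, percent identity).
Hit = Tuple[str, str, float]


def best_hits(hits: Iterable[Hit]) -> Dict[str, Tuple[str, float]]:
    """Keep the highest-identity target for each query protein."""
    best: Dict[str, Tuple[str, float]] = {}
    for query, target, identity in hits:
        if query not in best or identity > best[query][1]:
            best[query] = (target, identity)
    return best


def aai(a_vs_b: Iterable[Hit], b_vs_a: Iterable[Hit]) -> float:
    """Mean percent identity over reciprocal best hits between genomes A and B.

    Assumption: real pipelines usually also filter hits by alignment
    coverage and minimum identity; those filters are omitted here.
    """
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    identities: List[float] = []
    for a_protein, (b_protein, forward_identity) in forward.items():
        # A pair counts only if each protein is the other's best hit.
        if b_protein in reverse and reverse[b_protein][0] == a_protein:
            reverse_identity = reverse[b_protein][1]
            identities.append((forward_identity + reverse_identity) / 2)
    if not identities:
        raise ValueError("no reciprocal best hits between the two genomes")
    return sum(identities) / len(identities)
```

Pairwise values computed this way can be assembled into a matrix and converted to distances (for example, 100 minus AAI) as input to a hierarchical clustering routine for building a dendrogram.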