Publication | Open Access
antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters
1.9K
Citations
21
References
2015
Year
Biosynthetic Gene ClustersMolecular BiologyMicrobial Secondary MetabolismGenomicsBioinformatics DatabaseHigh Throughput SequencingClusterfinder AlgorithmGene ClustersAntimicrobial Drug DiscoveryBiochemistryKnowledge DiscoveryComprehensive ResourceBioinformaticsProtein BioinformaticsNatural SciencesNext-generation SequencingComputational BiologyGenome MiningMicrobiologySystems BiologyMedicineDrug Discovery
Microbial secondary metabolism supplies antibiotics, chemotherapeutics, insecticides, and other high‑value chemicals, and genome mining of their biosynthetic gene clusters—first enabled by the 2011 antiSMASH web server and stand‑alone tool—has become a key strategy for discovering novel compounds. This paper introduces antiSMASH 3.0, a substantially upgraded version of the tool. Version 3.0 incorporates the ClusterFinder probabilistic algorithm for detecting unknown clusters, a dereplication‑enhanced ClusterBlast module matching against 1,172 known products, pattern‑based enzyme active‑site annotation with EC numbers, refined polyketide reduction‑state predictions, and an XML output format for offline editing in Geneious.
Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software.
| Year | Citations | |
|---|---|---|
2009 | 21.8K | |
2012 | 20.1K | |
2014 | 14.1K | |
2013 | 6.4K | |
2009 | 6.2K | |
2011 | 2K | |
2014 | 964 | |
2013 | 804 | |
2010 | 768 | |
2014 | 493 |
Page 1
Page 1