Publication | Open Access
A scalable assembly-free variable selection algorithm for biomarker discovery from metagenomes
13
Citations
19
References
2016
Year
We present a set of sequence clustering ("binning") modules and their application to biomarker (e.g., genomes of pathogenic organisms) discovery from large synthetic and real metagenomics datasets. Initially designed for the "assembly-free" analysis of individual metagenomic samples, we demonstrate their extension to setups involving multiple samples via the usage of the "alignment-free" d2S statistic to relate clusters across samples, and illustrate how the clustering modules can otherwise be leveraged for de novo "pre-assembly" tasks by segregating sequences into biologically meaningful partitions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1