Publication | Closed Access
Indexing Graphs for Path Queries with Applications in Genome Research
319
Citations
18
References
2014
Year
Burrows-wheeler TransformGeneticsGeneric ApproachGenomicsSequence AlignmentBioinformatics DatabasePhylogeneticsMolecular EcologyData ScienceComputational GenomicsSystems BiologySequence AnalysisCanonical Sequence RepresentationBioinformaticsFunctional GenomicsData IndexingBiologyLong-read SequencingGraph TheoryPath QueriesNatural SciencesComputational BiologyReference GenomeIndexing TechniqueMedicineSequence Assembly
We propose a generic approach to replace the canonical sequence representation of genomes with graph representations, and study several applications of such extensions. We extend the Burrows-Wheeler transform (BWT) of strings to acyclic directed labeled graphs, to support path queries as an extension to substring searching. We develop, apply, and tailor this technique to a) read alignment on an extended BWT index of a graph representing pan-genome, i.e., reference genome and known variants of it; and b) split-read alignment on an extended BWT index of a splicing graph. Other possible applications include probe/primer design, alignments to assembly graphs, and alignments to phylogenetic tree of partial-order graphs. We report several experiments on the feasibility and applicability of the approach. Especially on highly-polymorphic genome regions our pan-genome index is making a significant improvement in alignment accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1