Publication | Open Access
Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing
67
Citations
51
References
2018
Year
Single-molecule full-length complementary DNA (cDNA) sequencing can aid genome annotation by revealing transcript structure and alternative splice forms, yet current annotation pipelines do not incorporate such information. Here we present long-read annotation (LoReAn) software, an automated annotation pipeline utilizing short- and long-read cDNA sequencing, protein evidence, and ab initio prediction to generate accurate genome annotations. Based on annotations of two fungal genomes (<i>Verticillium dahliae</i> and <i>Plicaturopsis crispa</i>) and two plant genomes (Arabidopsis [<i>Arabidopsis thaliana</i>] and <i>Oryza sativa</i>), we show that LoReAn outperforms popular annotation pipelines by integrating single-molecule cDNA-sequencing data generated from either the Pacific Biosciences or MinION sequencing platforms, correctly predicting gene structure, and capturing genes missed by other annotation pipelines.
| Year | Citations | |
|---|---|---|
Page 1
Page 1