Publication | Open Access
Comparison of three variant callers for human whole genome sequencing
86
Citations
22
References
2018
Year
NeurogenomicsGeneticsGenetic EpidemiologyGenomicsVariant CallersHigh Throughput SequencingGenetic MedicineGene Panel SequencingClinical GeneticsBiostatisticsPublic HealthMolecular DiagnosticsVariant InterpretationStatistical GeneticsGenetic VariationDeepvariant ToolSequencingFunctional GenomicsBioinformaticsGenomic MedicineWgs DataNext-generation SequencingGenome SequencingMedicineSequence Assembly
Testing of patients with genetics-related disorders is in progress of shifting from single gene assays to gene panel sequencing, whole-exome sequencing (WES) and whole-genome sequencing (WGS). Since WGS is unquestionably becoming a new foundation for molecular analyses, we decided to compare three currently used tools for variant calling of human whole genome sequencing data. We tested DeepVariant, a new TensorFlow machine learning-based variant caller, and compared this tool to GATK 4.0 and SpeedSeq, using 30×, 15× and 10× WGS data of the well-known NA12878 DNA reference sample. According to our comparison, the performance on SNV calling was almost similar in 30× data, with all three variant callers reaching F-Scores (i.e. harmonic mean of recall and precision) equal to 0.98. In contrast, DeepVariant was more precise in indel calling than GATK and SpeedSeq, as demonstrated by F-Scores of 0.94, 0.90 and 0.84, respectively. We conclude that the DeepVariant tool has great potential and usefulness for analysis of WGS data in medical genetics.
| Year | Citations | |
|---|---|---|
Page 1
Page 1