Publication | Open Access
High coverage whole genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios
259
Citations
54
References
2021
Year
Unknown Venue
Illumina Novaseq 6000GeneticsGenomicsGenomes Project CohortHigh Throughput SequencingExpanded 1000Computational GenomicsStatistical ComputingBiostatisticsPublic HealthMolecular DiagnosticsSystems BiologyGenomes ProjectStatistical GeneticsGenetic VariationPopulation GeneticsSequencingBioinformaticsWhole Genome SequencingNext-generation SequencingGenome SequencingPopulation GenomicsMedicineSequence Assembly
SUMMARY The 1000 Genomes Project (1kGP) is the largest fully open resource of whole genome sequencing (WGS) data consented for public distribution of raw sequence data without access or use restrictions. The final release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low coverage WGS. Here, we present a new, high coverage 3,202-sample WGS 1kGP resource, sequenced to a targeted depth of 30X using the Illumina NovaSeq 6000 system, which now includes 602 complete trios. We performed SNV/INDEL calling against the GRCh38 reference using GATK’s HaplotypeCaller, and generated a comprehensive set of SVs by integrating multiple analytic methods through a sophisticated machine learning model. We make all the data generated as part of this project publicly available and we envision it to become the new de facto public resource for the worldwide genomics and genetics community.
| Year | Citations | |
|---|---|---|
Page 1
Page 1