Publication | Open Access
CADD: predicting the deleteriousness of variants throughout the human genome
3.7K
Citations
60
References
2018
Year
GeneticsGenetic EpidemiologyGenomicsSelective PressureGenome-wide Association StudyGenetic AnalysisMolecular EcologyComputational GenomicsBiostatisticsHuman GenomeAnnotation-dependent DepletionPublic HealthPersonal GenomicsVariant InterpretationStatistical GeneticsPopulation GeneticsBioinformaticsFunctional GenomicsAllelic VariantEvolutionary BiologyComputational BiologyCadd ScoresSystems BiologyMedicine
Combined Annotation‑Dependent Depletion (CADD) is a widely used, integrative score that evaluates variant deleteriousness across the human genome by combining over 60 genomic features and training a machine‑learning model on simulated de novo versus fixed variants. This review updates CADD to version 1.4, extending support to the GRCh38 genome build. CADD scores, software, and documentation are provided via a public website and API, facilitating integration into other tools. The updated website offers simplified variant lookup, expanded documentation, and enhanced integration mechanisms for incorporating CADD scores into external applications.
Combined Annotation-Dependent Depletion (CADD) is a widely used measure of variant deleteriousness that can effectively prioritize causal variants in genetic analyses, particularly highly penetrant contributors to severe Mendelian disorders. CADD is an integrative annotation built from more than 60 genomic features, and can score human single nucleotide variants and short insertion and deletions anywhere in the reference assembly. CADD uses a machine learning model trained on a binary distinction between simulated de novo variants and variants that have arisen and become fixed in human populations since the split between humans and chimpanzees; the former are free of selective pressure and may thus include both neutral and deleterious alleles, while the latter are overwhelmingly neutral (or, at most, weakly deleterious) by virtue of having survived millions of years of purifying selection. Here we review the latest updates to CADD, including the most recent version, 1.4, which supports the human genome build GRCh38. We also present updates to our website that include simplified variant lookup, extended documentation, an Application Program Interface and improved mechanisms for integrating CADD scores into other tools or applications. CADD scores, software and documentation are available at https://cadd.gs.washington.edu.
| Year | Citations | |
|---|---|---|
Page 1
Page 1