A general framework for estimating the relative pathogenicity of human genetic variants.

TitleA general framework for estimating the relative pathogenicity of human genetic variants.
Publication TypeJournal Article
Year of Publication2014
AuthorsKircher, M, Witten, DM, Jain, P, O'Roak, BJ, Cooper, GM, Shendure, J
JournalNat Genet
Volume46
Issue3
Pagination310-5
Date Published2014 Mar
ISSN1546-1718
KeywordsGenetic Variation, Genome, Human, Genome-Wide Association Study, Humans, INDEL Mutation, Models, Genetic, Molecular Sequence Annotation, Polymorphism, Single Nucleotide, Selection, Genetic, Support Vector Machine
Abstract

Current methods for annotating and interpreting human genetic variation tend to exploit a single information type (for example, conservation) and/or are restricted in scope (for example, to missense changes). Here we describe Combined Annotation-Dependent Depletion (CADD), a method for objectively integrating many diverse annotations into a single measure (C score) for each variant. We implement CADD as a support vector machine trained to differentiate 14.7 million high-frequency human-derived alleles from 14.7 million simulated variants. We precompute C scores for all 8.6 billion possible human single-nucleotide variants and enable scoring of short insertions-deletions. C scores correlate with allelic diversity, annotations of functionality, pathogenicity, disease severity, experimentally measured regulatory effects and complex trait associations, and they highly rank known pathogenic variants within individual genomes. The ability of CADD to prioritize functional, deleterious and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current single-annotation method.

DOI10.1038/ng.2892
Alternate JournalNat. Genet.
PubMed ID24487276
PubMed Central IDPMC3992975
Grant ListU54 HG006493 / HG / NHGRI NIH HHS / United States
U54HG006493 / HG / NHGRI NIH HHS / United States
DP1HG007811 / DP / NCCDPHP CDC HHS / United States
DP5 OD009145 / OD / NIH HHS / United States
UM1 HG006493 / HG / NHGRI NIH HHS / United States
DP1 HG007811 / HG / NHGRI NIH HHS / United States
DP5OD009145 / OD / NIH HHS / United States