A structural variation reference for medical and population genetics.

TitleA structural variation reference for medical and population genetics.
Publication TypeJournal Article
Year of Publication2020
AuthorsCollins, RL, Brand, H, Karczewski, KJ, Zhao, X, Alföldi, J, Francioli, LC, Khera, AV, Lowther, C, Gauthier, LD, Wang, H, Watts, NA, Solomonson, M, O'Donnell-Luria, A, Baumann, A, Munshi, R, Walker, M, Whelan, CW, Huang, Y, Brookings, T, Sharpe, T, Stone, MR, Valkanas, E, Fu, J, Tiao, G, Laricchia, KM, Ruano-Rubio, V, Stevens, C, Gupta, N, Cusick, C, Margolin, L, Taylor, KD, Lin, HJ, Rich, SS, Post, WS, Chen, Y-DIda, Rotter, JI, Nusbaum, C, Philippakis, A, Lander, E, Gabriel, S, Neale, BM, Kathiresan, S, Daly, MJ, Banks, E, MacArthur, DG, Talkowski, ME
Corporate AuthorsGenome Aggregation Database Production Team, Genome Aggregation Database Consortium
JournalNature
Volume581
Issue7809
Pagination444-451
Date Published2020 05
ISSN1476-4687
KeywordsContinental Population Groups, Disease, Female, Genetic Testing, Genetic Variation, Genetics, Medical, Genetics, Population, Genome, Human, Genotyping Techniques, Humans, Male, Middle Aged, Mutation, Polymorphism, Single Nucleotide, Reference Standards, Selection, Genetic, Whole Genome Sequencing
Abstract

Structural variants (SVs) rearrange large segments of DNA and can have profound consequences in evolution and human disease. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) have become integral in the interpretation of single-nucleotide variants (SNVs). However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings. This SV resource is freely distributed via the gnomAD browser and will have broad utility in population genetics, disease-association studies, and diagnostic screening.

DOI10.1038/s41586-020-2287-8
Alternate JournalNature
PubMed ID32461652
PubMed Central IDPMC7334194
Grant ListHHSN268201500003C / HL / NHLBI NIH HHS / United States
R03 HD099547 / HD / NICHD NIH HHS / United States
N01HC95169 / HL / NHLBI NIH HHS / United States
R01 HD091797 / HD / NICHD NIH HHS / United States
N01HC95162 / HL / NHLBI NIH HHS / United States
N01HC95168 / HL / NHLBI NIH HHS / United States
N01HC95167 / HL / NHLBI NIH HHS / United States
R01 HL071205 / HL / NHLBI NIH HHS / United States
N01HC95166 / HL / NHLBI NIH HHS / United States
R01 HD096326 / HD / NICHD NIH HHS / United States
R01 HL071258 / HL / NHLBI NIH HHS / United States
K08 HG010155 / HG / NHGRI NIH HHS / United States
P50 HD028138 / HD / NICHD NIH HHS / United States
UL1 RR033176 / RR / NCRR NIH HHS / United States
R01 MH115957 / MH / NIMH NIH HHS / United States
N01HC95160 / HL / NHLBI NIH HHS / United States
R01 HL071251 / HL / NHLBI NIH HHS / United States
R01 HL071259 / HL / NHLBI NIH HHS / United States
N01HC95163 / HL / NHLBI NIH HHS / United States
K99 DE026824 / DE / NIDCR NIH HHS / United States
UM1 HG008895 / HG / NHGRI NIH HHS / United States
KL2 TR000107 / TR / NCATS NIH HHS / United States
CS/14/2/30841 / BH / British Heart Foundation / United Kingdom
R01 HL071250 / HL / NHLBI NIH HHS / United States
R00 DE026824 / DE / NIDCR NIH HHS / United States
T32 HG002295 / HG / NHGRI NIH HHS / United States
N01HC95164 / HL / NHLBI NIH HHS / United States
P30 DK063491 / DK / NIDDK NIH HHS / United States
R01 HL071051 / HL / NHLBI NIH HHS / United States
P01 GM061354 / GM / NIGMS NIH HHS / United States
N01HC95165 / HL / NHLBI NIH HHS / United States
N01HC95159 / HL / NHLBI NIH HHS / United States
MC_UP_1102/20 / MR / Medical Research Council / United Kingdom
R01 HD081256 / HD / NICHD NIH HHS / United States
N01HC95161 / HL / NHLBI NIH HHS / United States
U01 MH105669 / MH / NIMH NIH HHS / United States
UL1 TR001420 / TR / NCATS NIH HHS / United States
R01 MH111776 / MH / NIMH NIH HHS / United States
HHSN268201500003I / HL / NHLBI NIH HHS / United States
UL1 TR000040 / TR / NCATS NIH HHS / United States
P01 HD068250 / HD / NICHD NIH HHS / United States