Gene-based segregation method for identifying rare variants in family-based sequencing studies.

TitleGene-based segregation method for identifying rare variants in family-based sequencing studies.
Publication TypeJournal Article
Year of Publication2017
AuthorsQiao, D, Lange, C, Laird, NM, Won, S, Hersh, CP, Morrow, J, Hobbs, BD, Lutz, SM, Ruczinski, I, Beaty, TH, Silverman, EK, Cho, MH
JournalGenet Epidemiol
Volume41
Issue4
Pagination309-319
Date Published2017 05
ISSN1098-2272
KeywordsAge of Onset, Boston, Computer Simulation, Databases, Genetic, Family, Genetic Variation, Genome, Human, Humans, Models, Genetic, Penetrance, Pulmonary Disease, Chronic Obstructive, Reference Standards, Sequence Analysis, DNA
Abstract

Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).

DOI10.1002/gepi.22037
Alternate JournalGenet. Epidemiol.
PubMed ID28191685
PubMed Central IDPMC5397337
Grant ListR01 HL113264 / HL / NHLBI NIH HHS / United States
R01 HL089897 / HL / NHLBI NIH HHS / United States
K01 HL125858 / HL / NHLBI NIH HHS / United States
T32 ES007142 / ES / NIEHS NIH HHS / United States
U01 HL089897 / HL / NHLBI NIH HHS / United States
R01 HL075478 / HL / NHLBI NIH HHS / United States
R01 HL089856 / HL / NHLBI NIH HHS / United States
U01 HL089856 / HL / NHLBI NIH HHS / United States
R01 HG008976 / HG / NHGRI NIH HHS / United States
P01 HL105339 / HL / NHLBI NIH HHS / United States
K01 HL129039 / HL / NHLBI NIH HHS / United States
UM1 HG006493 / HG / NHGRI NIH HHS / United States
R01 HL084323 / HL / NHLBI NIH HHS / United States
P01 HL114501 / HL / NHLBI NIH HHS / United States
T32 HL007427 / HL / NHLBI NIH HHS / United States