Title | Gene-based segregation method for identifying rare variants in family-based sequencing studies. |
Publication Type | Journal Article |
Year of Publication | 2017 |
Authors | Qiao, D, Lange, C, Laird, NM, Won, S, Hersh, CP, Morrow, J, Hobbs, BD, Lutz, SM, Ruczinski, I, Beaty, TH, Silverman, EK, Cho, MH |
Journal | Genet Epidemiol |
Volume | 41 |
Issue | 4 |
Pagination | 309-319 |
Date Published | 2017 05 |
ISSN | 1098-2272 |
Keywords | Age of Onset, Boston, Computer Simulation, Databases, Genetic, Family, Genetic Variation, Genome, Human, Humans, Models, Genetic, Penetrance, Pulmonary Disease, Chronic Obstructive, Reference Standards, Sequence Analysis, DNA |
Abstract | Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/). |
DOI | 10.1002/gepi.22037 |
Alternate Journal | Genet. Epidemiol. |
PubMed ID | 28191685 |
PubMed Central ID | PMC5397337 |
Grant List | R01 HL113264 / HL / NHLBI NIH HHS / United States R01 HL089897 / HL / NHLBI NIH HHS / United States K01 HL125858 / HL / NHLBI NIH HHS / United States T32 ES007142 / ES / NIEHS NIH HHS / United States U01 HL089897 / HL / NHLBI NIH HHS / United States R01 HL075478 / HL / NHLBI NIH HHS / United States R01 HL089856 / HL / NHLBI NIH HHS / United States U01 HL089856 / HL / NHLBI NIH HHS / United States R01 HG008976 / HG / NHGRI NIH HHS / United States P01 HL105339 / HL / NHLBI NIH HHS / United States K01 HL129039 / HL / NHLBI NIH HHS / United States UM1 HG006493 / HG / NHGRI NIH HHS / United States R01 HL084323 / HL / NHLBI NIH HHS / United States P01 HL114501 / HL / NHLBI NIH HHS / United States T32 HL007427 / HL / NHLBI NIH HHS / United States |