Title | Generation of sequence-based data for pedigree-segregating Mendelian or Complex traits. |
Publication Type | Journal Article |
Year of Publication | 2015 |
Authors | Li, B, Wang, GT, Leal, SM |
Journal | Bioinformatics |
Volume | 31 |
Issue | 22 |
Pagination | 3706-8 |
Date Published | 2015 Nov 15 |
ISSN | 1367-4811 |
Keywords | Base Sequence, Chromosome Segregation, Humans, Pedigree, Quantitative Trait, Heritable, Sequence Analysis, Software |
Abstract | MOTIVATION: There is great interest in analyzing next generation sequence data that has been generated for pedigrees. However, unlike for population-based data there are only a limited number of rare variant methods to analyze pedigree data. One limitation is the ability to evaluate type I and II errors for family-based methods, due to lack of software that can simulate realistic sequence data for pedigrees. SUMMARY: We developed RarePedSim (Rare-variant Pedigree-based Simulator), a program to simulate region/gene-level genotype and phenotype data for complex and Mendelian traits for any given pedigree structure. Using a genetic model, sequence variant data can be generated either conditionally or unconditionally on pedigree members' qualitative or quantitative phenotypes. Additionally, qualitative or quantitative traits can be generated conditional on variant data. Sequence data can either be simulated using realistic population demographic models or obtained from sequence-based studies. Variant sites can be annotated with positions, allele frequencies and functionality. For rare variants, RarePedSim is the only program that can efficiently generate both genotypes and phenotypes, regardless of pedigree structure. Data generated by RarePedSim are in standard Linkage file (.ped) and Variant Call (.vcf) formats, ready to be used for a variety of purposes, including evaluation of type I error and power, for association methods including mixed models and linkage analysis methods. AVAILABILITY AND IMPLEMENTATION: bioinformatics.org/simped/rare CONTACT: sleal@bcm.edu. |
DOI | 10.1093/bioinformatics/btv412 |
Alternate Journal | Bioinformatics |
PubMed ID | 26177964 |
PubMed Central ID | PMC4757949 |
Grant List | DC011651 / DC / NIDCD NIH HHS / United States R01 DC011651 / DC / NIDCD NIH HHS / United States HG006493 / HG / NHGRI NIH HHS / United States R01 DC003594 / DC / NIDCD NIH HHS / United States UM1 HG006493 / HG / NHGRI NIH HHS / United States DC003594 / DC / NIDCD NIH HHS / United States |