Generation of sequence-based data for pedigree-segregating Mendelian or Complex traits.

TitleGeneration of sequence-based data for pedigree-segregating Mendelian or Complex traits.
Publication TypeJournal Article
Year of Publication2015
AuthorsLi, B, Wang, GT, Leal, SM
JournalBioinformatics
Volume31
Issue22
Pagination3706-8
Date Published2015 Nov 15
ISSN1367-4811
KeywordsBase Sequence, Chromosome Segregation, Humans, Pedigree, Quantitative Trait, Heritable, Sequence Analysis, Software
Abstract

MOTIVATION: There is great interest in analyzing next generation sequence data that has been generated for pedigrees. However, unlike for population-based data there are only a limited number of rare variant methods to analyze pedigree data. One limitation is the ability to evaluate type I and II errors for family-based methods, due to lack of software that can simulate realistic sequence data for pedigrees.

SUMMARY: We developed RarePedSim (Rare-variant Pedigree-based Simulator), a program to simulate region/gene-level genotype and phenotype data for complex and Mendelian traits for any given pedigree structure. Using a genetic model, sequence variant data can be generated either conditionally or unconditionally on pedigree members' qualitative or quantitative phenotypes. Additionally, qualitative or quantitative traits can be generated conditional on variant data. Sequence data can either be simulated using realistic population demographic models or obtained from sequence-based studies. Variant sites can be annotated with positions, allele frequencies and functionality. For rare variants, RarePedSim is the only program that can efficiently generate both genotypes and phenotypes, regardless of pedigree structure. Data generated by RarePedSim are in standard Linkage file (.ped) and Variant Call (.vcf) formats, ready to be used for a variety of purposes, including evaluation of type I error and power, for association methods including mixed models and linkage analysis methods.

AVAILABILITY AND IMPLEMENTATION: bioinformatics.org/simped/rare

CONTACT: sleal@bcm.edu.

DOI10.1093/bioinformatics/btv412
Alternate JournalBioinformatics
PubMed ID26177964
PubMed Central IDPMC4757949
Grant ListDC011651 / DC / NIDCD NIH HHS / United States
R01 DC011651 / DC / NIDCD NIH HHS / United States
HG006493 / HG / NHGRI NIH HHS / United States
R01 DC003594 / DC / NIDCD NIH HHS / United States
UM1 HG006493 / HG / NHGRI NIH HHS / United States
DC003594 / DC / NIDCD NIH HHS / United States