Utilizing graph theory to select the largest set of unrelated individuals for genetic analysis.

TitleUtilizing graph theory to select the largest set of unrelated individuals for genetic analysis.
Publication TypeJournal Article
Year of Publication2013
AuthorsStaples, J, Nickerson, DA, Below, JE
JournalGenet Epidemiol
Volume37
Issue2
Pagination136-41
Date Published2013 Feb
ISSN1098-2272
KeywordsAlgorithms, Genome-Wide Association Study, HapMap Project, Humans, Models, Theoretical
Abstract

Many statistical analyses of genetic data rely on the assumption of independence among samples. Consequently, relatedness is either modeled in the analysis or samples are removed to "clean" the data of any pairwise relatedness above a tolerated threshold. Current methods do not maximize the number of unrelated individuals retained for further analysis, and this is a needless loss of resources. We report a novel application of graph theory that identifies the maximum set of unrelated samples in any dataset given a user-defined threshold of relatedness as well as all networks of related samples. We have implemented this method into an open source program called Pedigree Reconstruction and Identification of a Maximum Unrelated Set, PRIMUS. We show that PRIMUS outperforms the three existing methods, allowing researchers to retain up to 50% more unrelated samples. A unique strength of PRIMUS is its ability to weight the maximum clique selection using additional criteria (e.g. affected status and data missingness). PRIMUS is a permanent solution to identifying the maximum number of unrelated samples for a genetic analysis.

DOI10.1002/gepi.21684
Alternate JournalGenet. Epidemiol.
PubMed ID22996348
PubMed Central IDPMC3770842
Grant ListU54 HG006493 / HG / NHGRI NIH HHS / United States
UC2 HL102926 / HL / NHLBI NIH HHS / United States
RC2 HL102926 / HL / NHLBI NIH HHS / United States
HG006493 / HG / NHGRI NIH HHS / United States
T32 HG000035 / HG / NHGRI NIH HHS / United States
HL102926 / HL / NHLBI NIH HHS / United States
T32 HG00035 / HG / NHGRI NIH HHS / United States
UM1 HG006493 / HG / NHGRI NIH HHS / United States