Skip Navigation


Bioinformatics Advance Access originally published online on September 25, 2007
Bioinformatics 2007 23(22):3048-3055; doi:10.1093/bioinformatics/btm435
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
23/22/3048    most recent
btm435v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kirkpatrick, B.
Right arrow Articles by Halperin, E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kirkpatrick, B.
Right arrow Articles by Halperin, E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling

Bonnie Kirkpatrick 1,*, Carlos Santos Armendariz 2, Richard M. Karp 1,3 and Eran Halperin 3

1Department of Electrical Engineering and Computer Sciences, UC Berkeley, CA, 2Computer Science Department, Universidad Rey Juan Carlos, Madrid, Spain and 3International Computer Science Institute, Berkeley, CA, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: The search for genetic variants that are linked to complex diseases such as cancer, Parkinson's;, or Alzheimer's; disease, may lead to better treatments. Since haplotypes can serve as proxies for hidden variants, one method of finding the linked variants is to look for case-control associations between the haplotypes and disease. Finding these associations requires a high-quality estimation of the haplotype frequencies in the population. To this end, we present, HAPLOPOOL, a method of estimating haplotype frequencies from blocks of consecutive SNPs.

Results: HAPLOPOOL leverages the efficiency of DNA pools and estimates the population haplotype frequencies from pools of disjoint sets, each containing two or three unrelated individuals. We study the trade-off between pooling efficiency and accuracy of haplotype frequency estimates. For a fixed genotyping budget, HAPLOPOOL performs favorably on pools of two individuals as compared with a state-of-the-art non-pooled phasing method, PHASE. Of independent interest, HAPLOPOOL can be used to phase non-pooled genotype data with an accuracy approaching that of PHASE.

We compared our algorithm to three programs that estimate haplotype frequencies from pooled data. HAPLOPOOL is an order of magnitude more efficient (at least six times faster), and considerably more accurate than previous methods. In contrast to previous methods, HAPLOPOOL performs well with missing data, genotyping errors and long haplotype blocks (of between 5 and 25 SNPs).

Availability: The HAPLOPOOL software is available at: http://haplopool.icsi.berkeley.edu/haplopool/

Contact: bbkirk{at}eecs.berkeley.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Keith Crandall


Received on June 7, 2007; revised on August 3, 2007; accepted on August 16, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
J. E. Craig, A. W. Hewitt, A. E. McMellon, A. K. Henders, L. Ma, L. Wallace, S. Sharma, K. P. Burdon, P. M. Visscher, G. W. Montgomery, et al.
Rapid inexpensive genome-wide association using pooled whole blood
Genome Res., November 1, 2009; 19(11): 2075 - 2080.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Y. C. Kuk, H. Zhang, and Y. Yang
Computationally feasible estimation of haplotype frequencies from pooled DNA with and without Hardy-Weinberg equilibrium
Bioinformatics, February 1, 2009; 25(3): 379 - 386.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Zhang, H.-C. Yang, and Y. Yang
PoooL: an efficient method for estimating haplotype frequencies from large DNA pools
Bioinformatics, September 1, 2008; 24(17): 1942 - 1948.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.