Skip Navigation



Bioinformatics Advance Access published online on November 4, 2008

Bioinformatics, doi:10.1093/bioinformatics/btn572
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
25/1/68    most recent
btn572v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Climer, S.
Right arrow Articles by Zhang, W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Climer, S.
Right arrow Articles by Zhang, W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2008). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

How Frugal is Mother Nature with Haplotypes?

Sharlee Climer 1,2,3, Gerold Jäger 1, Alan R. Templeton 4 and Weixiong Zhang 1,5,*

1Department of Computer Science,2Department of Cardiology, 3Division of Biostatistics, 4Department of Biology, 5Department of Genetics, Washington University, St. Louis, MO, USA

*To whom correspondence should be addressed. Prof. Weixiong Zhang, E-mail: weixiong.zhang{at}wustl.edu


   Abstract

Motivation: Inference of haplotypes from genotype data is crucial and challenging for many vitally important studies. The first, and most critical step, is the ascertainment of a biologically sound model to be optimized. Many models that have been proposed rely partially or entirely on reducing the number of unique haplotypes in the solution.

Results: This paper examines the parsimony of haplotypes using known haplotypes as well as genotypes from the HapMap project. Our study reveals that there are relatively few unique haplotypes, but not always the least possible, for the data sets with known solutions. Furthermore, we show that there are frequently very large numbers of parsimonious solutions, and the number increases exponentially with increasing cardinality. Moreover, these solutions are quite varied, most of which are not consistent with the true solutions. These results quantify the limitations of the Pure Parsimony model and demonstrate the imperative need to consider additional properties for haplotype inference models. At a higher level, and with broad applicability, this paper illustrates the power of combinatorial methods to tease out imperfections in a given biological model.

Associate Editor: Dr. Alex Bateman


Received on July 29, 2008; revised on October 26, 2008; accepted on October 30, 2008

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.