Bioinformatics Advance Access published online on February 24, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti352
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100080, China
* To whom correspondence should be addressed.
Motivation: Haplotype reconstruction based on aligned SNP fragments is to infer a pair of haplotypes from localized polymorphism data gathered through short genome fragment assembly. An important computational model of this problem is the Minimum Error Correction (MEC) model, which has been suggested in several literatures. The model retrieves a pair of haplotypes by correcting minimum number of SNPs in given genome fragments coming from an individual's DNA. Results: In the first part of this paper, an exact algorithm for the MEC model is presented. Due to the NP-hardness of the MEC model, we also design a heuristic method based on genetic algorithm (GA). The designed GA is intended to solve large size problems and has very good performance. The strength and weakness of the MEC model are shown through experimental results on real data and simulation data. In the second part of this paper, to overcome the weakness of the MEC model, a new computational model is proposed, which simultaneously employs genotype information of an individual in the process of SNP correction, then is called MEC with genotype information (shortly, MEC/GI). Computational results on extensive data sets show the new model has much higher accuracy of haplotype reconstruction than the pure MEC model.
Received November 23, 2004
Revised January 31, 2005
Accepted February 22, 2005
Article
Haplotype reconstruction from SNP fragments by Minimum Error Correction
2 Beijing Materials Institute, Beijing 101149, China
Rui-Sheng Wang, E-mail: wangrsh{at}amss.ac.cn
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
V. Bansal and V. Bafna HapCUT: an efficient and accurate algorithm for the haplotype assembly problem Bioinformatics, August 15, 2008; 24(16): i153 - i159. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Xie, J. Wang, and J. Chen A model of higher accuracy for the individual haplotyping problem based on weighted SNP fragments and genotype with errors Bioinformatics, July 1, 2008; 24(13): i105 - i113. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al. Machine learning in bioinformatics Brief Bioinform, March 1, 2006; 7(1): 86 - 112. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Li, W. Zhou, X.-S. Zhang, and L. Chen A parsimonious tree-grow method for haplotype inference Bioinformatics, September 1, 2005; 21(17): 3475 - 3481. [Abstract] [Full Text] [PDF] |
||||

