Bioinformatics Advance Access published online on October 25, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti732
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Electrical Engineering and Computer Science Department, Case Western Reserve University, Cleveland, OH 44106, USA
* To whom correspondence should be addressed.
Motivation: With the availability of large-scale, high-density single-nucleotide polymorphism (SNP) markers and information on haplotype structures and frequencies, a great challenge is how to take advantage of haplotype information in the association mapping of complex diseases in case-control studies. Results: We present a novel approach for association mapping based on directly mining haplotypes (i.e., phased genotype pairs) produced from case-control data or case-parent data via a density-based clustering algorithm, which can be applied to whole-genome screens as well as candidate-gene studies in small genomic regions. The method directly explores the sharing of haplotype segments in affected individuals that are rarely present in normal individuals. The measure of sharing between two haplotypes is defined by a new similarity metric that combines the length of the shared segments and the number of common alleles around any marker position of the haplotypes, which is robust against recent mutations/genotype errors and recombination events. The effectiveness of the approach is demonstrated by using both simulated datasets and real datasets. The results show that the algorithm is accurate for different population models and for different disease models, even for genes with small effects, and it outperforms some recently developed methods. Availability: The software, HapMiner, is available on the authors' website at http://vorlon.case.edu/~jxl175/HapMiner.html.
Received August 17, 2005
Revised October 4, 2005
Accepted October 19, 2005
Article
Haplotype-based linkage disequilibrium mapping via direct data mining
2 Department of Computer Science and Engineering, University of California, Riverside, CA 92521, USA; Center for Advanced Study, Tsinghua University, Beijing, China; Shanghai Center for Bioinformatics Technology, Shanghai, China
Jing Li, E-mail: jingli{at}eecs.case.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Li, K. Wang, S. F. A. Grant, H. Hakonarson, and C. Li ATOM: a powerful gene-based association test by combining optimally weighted markers Bioinformatics, February 15, 2009; 25(4): 497 - 503. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-Y. Su, D. J. Balding, and L. J.M. Coin Disease association tests by inferring ancestral haplotypes using a hidden markov model Bioinformatics, April 1, 2008; 24(7): 972 - 978. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Montana Statistical methods in genetics. Brief Bioinform, September 1, 2006; 7(3): 297 - 308. [Abstract] [Full Text] [PDF] |
||||

