Bioinformatics Advance Access published online on June 29, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl341
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Statistics, The University of Chicago
* To whom correspondence should be addressed.
Motivation: Preliminary results on the data produced using the Affymetrix large-scale genotyping platforms (Matsuzaki et al., 2004) show that it is necessary to construct improved genotype calling algorithms (Rabbee and Speed, 2006). There is evidence that some of the existing algorithms (e.g., Di et al., 2005) lead to an increased error rate in heterozygous genotypes, and a disproportionately large rate of heterozygotes with missing genotypes. Non-random errors and missing data can lead to an increase in the number of false discoveries in genetic association studies. Therefore, the factors that need to be evaluated in assessing the performance of an algorithm are the missing data (call) and error rates, but also the heterozygous proportions in missing data and errors. Results: We introduce a novel genotype calling algorithm (GEL) for the Affymetrix GeneChip arrays. The algorithm uses likelihood calculations that are based on distributions inferred from the observed data. A key ingredient in accurate genotype calling is weighting the information that comes from each probe quartet according to the quality/reliability of the data in the quartet, and prior information on the performance of the quartet. Availability: The GEL software is implemented in R and is available by request from the corresponding author at nicolae@galton.uchicago.edu.
Received April 22, 2006
Revised June 16, 2006
Accepted June 20, 2006
Article
GEL: a novel genotype calling algorithm using empirical likelihood
Dan L. Nicolae 1 *,
Xiaolin Wu 2,
Kazuaki Miyake 2,
and
Nancy J. Cox 2
2 Department of Medicine, The University of Chicago
Dan L. Nicolae, E-mail: nicolae{at}galton.uchicago.edu
![]()
Abstract
Associate Editor: Alex Bateman
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Wan, K. Sun, Q. Ding, Y. Cui, M. Li, Y. Wen, R. C. Elston, M. Qian, and W. J Fu Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation Nucleic Acids Res., September 1, 2009; 37(17): e117 - e117. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lin, G. C. Tseng, S. Y. Cheong, L. J. H. Bean, S. L. Sherman, and E. Feingold Smarter clustering methods for SNP genotype calling Bioinformatics, December 1, 2008; 24(23): 2665 - 2671. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Hayes, A. Pluzhnikov, K. Miyake, Y. Sun, M. C.Y. Ng, C. A. Roe, J. E. Below, R. I. Nicolae, A. Konkashbaev, G. I. Bell, et al. Identification of Type 2 Diabetes Genes in Mexican Americans Through Genome-Wide Association Studies Diabetes, December 1, 2007; 56(12): 3033 - 3044. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xiao, M. R. Segal, Y.H. Yang, and R.-F. Yeh A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays Bioinformatics, June 15, 2007; 23(12): 1459 - 1467. [Abstract] [Full Text] [PDF] |
||||


