Bioinformatics Advance Access originally published online on September 10, 2007
Bioinformatics 2007 23(20):2741-2746; doi:10.1093/bioinformatics/btm443
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A genotype calling algorithm for the Illumina BeadArray platform


1Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN and 2Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate genotype calling algorithm for the Illumina BeadArray genotyping platforms. As the technology moves towards assaying millions of genetic polymorphisms simultaneously, there is a need for an integrated and easy-to-use software for calling genotypes.
Results: We have introduced a model-based genotype calling algorithm which does not rely on having prior training data or require computationally intensive procedures. The algorithm can assign genotypes to hybridization data from thousands of individuals simultaneously and pools information across multiple individuals to improve the calling. The method can accommodate variations in hybridization intensities which result in dramatic shifts of the position of the genotype clouds by identifying the optimal coordinates to initialize the algorithm. By incorporating the process of perturbation analysis, we can obtain a quality metric measuring the stability of the assigned genotype calls. We show that this quality metric can be used to identify SNPs with low call rates and accuracy.
Availability: The C++ executable for the algorithm described here is available by request from the authors.
Contact: teo{at}well.ox.ac.uk or tgc{at}well.ox.ac.uk
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
Associate Editor: Martin Bishop
Received on July 31, 2007; revised on August 20, 2007; accepted on August 20, 2007
This article has been cited by other articles:
![]() |
Y. Y. Teo Exploratory data analysis in large-scale genetic studies Biostat., October 14, 2009; (2009) kxp038v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Johnson Single-Nucleotide Polymorphism Bioinformatics: A Comprehensive Review of Resources Circ Cardiovasc Genet, October 1, 2009; 2(5): 530 - 536. [Full Text] [PDF] |
||||
![]() |
A. Lynch, M. Dunning, M. Iddawela, N. Barbosa-Morais, and M. Ritchie Considerations for the processing and analysis of GoldenGate-based two-colour Illumina platforms Statistical Methods in Medical Research, October 1, 2009; 18(5): 437 - 452. [Abstract] [PDF] |
||||
![]() |
Y. Y. Teo, A. E. Fry, K. Bhattacharya, K. S. Small, D. P. Kwiatkowski, and T. G. Clark Genome-wide comparisons of variation in linkage disequilibrium Genome Res., October 1, 2009; 19(10): 1849 - 1860. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Ritchie, B. S. Carvalho, K. N. Hetrick, S. Tavare, and R. A. Irizarry R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips Bioinformatics, October 1, 2009; 25(19): 2621 - 2623. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Panoutsopoulou and E. Zeggini Finding common susceptibility variants for complex disease: past, present and future Brief Funct Genomic Proteomic, September 1, 2009; 8(5): 345 - 352. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Sun, F. A. Wright, Z. Tang, S. H. Nordgard, P. V. Loo, T. Yu, V. N. Kristensen, and C. M. Perou Integrated study of copy number states and genotype calls using high-density SNP arrays Nucleic Acids Res., September 1, 2009; 37(16): 5365 - 5377. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Bayjanov, M. Wels, M. Starrenburg, J. E. T. van Hylckama Vlieg, R. J. Siezen, and D. Molenaar PanCGH: a genotype-calling algorithm for pangenome CGH data Bioinformatics, February 1, 2009; 25(3): 309 - 314. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lin, G. C. Tseng, S. Y. Cheong, L. J. H. Bean, S. L. Sherman, and E. Feingold Smarter clustering methods for SNP genotype calling Bioinformatics, December 1, 2008; 24(23): 2665 - 2671. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Giannoulatou, C. Yau, S. Colella, J. Ragoussis, and C. C. Holmes GenoSNP: a variational Bayes within-sample SNP genotyping algorithm that does not require a reference population Bioinformatics, October 1, 2008; 24(19): 2209 - 2214. [Abstract] [Full Text] [PDF] |
||||






