Skip Navigation



Bioinformatics Advance Access published online on October 24, 2006

Bioinformatics, doi:10.1093/bioinformatics/btl536
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrowOA All Versions of this Article:
23/1/57    most recent
btl536v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Hua, J.
Right arrow Articles by Stephan, D. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hua, J.
Right arrow Articles by Stephan, D. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 The Author(s)
Received July 28, 2006
Revised September 25, 2006
Accepted October 12, 2006

Article

SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays

Jianping Hua 1, David W. Craig 2, Marcel Brun 1, Jennifer Webster 2, Victoria Zismann 2, Waibhav Tembe 1, Keta Joshipura 2, Matthew J. Huentelman 2, Edward R. Dougherty 3, and Dietrich A. Stephan 2 *

1 Computation Biology Division, Translational Genomics Research Institute, Phoenix, 445 N 5th St., Phoenix, AZ
2 Neurogenomics Division, Translational Genomics Research Institute, Phoenix, 445 N 5th St., Phoenix, AZ
3 Computation Biology Division, Translational Genomics Research Institute, Phoenix, 445 N 5th St., Phoenix, AZ; Department of Electrical & Computer Engineering, Texas A&M University, College Station, TX

* To whom correspondence should be addressed.
Dietrich A. Stephan, E-mail: dstephan{at}tgen.org


   Abstract

Motivation: The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype.

Results: We have developed a novel SNP genotype calling program, SNiPer-High Density (SNiPer-HD), for highly accurate genotype calling across hundreds of thousands of SNPs. The program employs an expectation-maximization (EM) algorithm with parameters based on a training sample set. The algorithm choice allows for highly accurate genotyping for most SNPs. Also, we introduce a quality control metric for each assayed SNP, such that poor-behaving SNPs can be filtered using a metric correlating to genotype class separation in the calling algorithm. SNiPer-HD is superior to the standard dynamic modelling algorithm and is complementary and non-redundant to other algorithms, such as BRLMM. Implementing multiple algorithms together may provide highly accurate genotyping calls, without inflation of false positives due to systematically miss-called SNPs. A reliable and accurate set of SNP genotypes for increasingly dense panels will eliminate some false association signals and false negative signals, allowing for rapid identification of disease susceptibility loci for complex traits.

Availability: SNiPer-HD is available at TGen's website: http://www.tgen.org/neurogenomics/data.


Associate Editor: Keith A Crandall
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.