Bioinformatics Advance Access originally published online on November 4, 2008
Bioinformatics 2008 24(24):2887-2893; doi:10.1093/bioinformatics/btn571
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Cross-hybridization modeling on Affymetrix exon arrays
1Department of Statistics, 2Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA, 3Department of Internal Medicine and Department of Biomedical Engineering, University of Iowa, Iowa City, IA and 4Department of Health Research and Policy, Stanford University, Stanford, CA, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Microarray designs have become increasingly probe-rich, enabling targeting of specific features, such as individual exons or single nucleotide polymorphisms. These arrays have the potential to achieve quantitative high-throughput estimates of transcript abundances, but currently these estimates are affected by biases due to cross-hybridization, in which probes hybridize to off-target transcripts.
Results: To study cross-hybridization, we map Affymetrix exon array probes to a set of annotated mRNA transcripts, allowing a small number of mismatches or insertion/deletions between the two sequences. Based on a systematic study of the degree to which probes with a given match type to a transcript are affected by cross-hybridization, we developed a strategy to correct for cross-hybridization biases of gene-level expression estimates. Comparison with Solexa ultra high-throughput sequencing data demonstrates that correction for cross-hybridization leads to a significant improve-ment of gene expression estimates.
Availability: We provide mappings between human and mouse exon array probes and off-target transcripts and provide software extending the GeneBASE program for generating gene-level expression estimates including the cross-hybridization correction http://biogibbs.stanford.edu/~kkapur/GeneBase/.
Contact: whwong{at}stanford.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on July 15, 2008; revised on October 3, 2008; accepted on October 30, 2008
This article has been cited by other articles:
![]() |
L. Wan, K. Sun, Q. Ding, Y. Cui, M. Li, Y. Wen, R. C. Elston, M. Qian, and W. J Fu Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation Nucleic Acids Res., September 1, 2009; 37(17): e117 - e117. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Jiang and W. H. Wong Statistical inferences for isoform expression in RNA-Seq Bioinformatics, April 15, 2009; 25(8): 1026 - 1032. [Abstract] [Full Text] [PDF] |
||||

