Skip Navigation


Bioinformatics Advance Access originally published online on October 5, 2007
Bioinformatics 2007 23(21):2897-2902; doi:10.1093/bioinformatics/btm478
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
23/21/2897    most recent
btm478v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Talloen, W.
Right arrow Articles by Göhlmann, H. W.H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Talloen, W.
Right arrow Articles by Göhlmann, H. W.H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data

Willem Talloen 1,*,{dagger}, Djork-Arné Clevert 2,3,{dagger}, Sepp Hochreiter 2, Dhammika Amaratunga 4, Luc Bijnens 1, Stefan Kass 1 and Hinrich W.H. Göhlmann 1

1Johnson & Johnson Pharmaceutical Research & Development, a division of Janssen Pharmaceutica n.v., Beerse, Belgium, 2Institute of Bioinformatics, Johannes Kepler Universität Linz 4040 Linz, Austria, 3Department of Nephrology and Internal Intensive Care, Charité University Medicine, Berlin, Germany and 4Johnson & Johnson Pharmaceutical Research & Development, Raritan, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: DNA microarray technology typically generates many measurements of which only a relatively small subset is informative for the interpretation of the experiment. To avoid false positive results, it is therefore critical to select the informative genes from the large noisy data before the actual analysis. Most currently available filtering techniques are supervised and therefore suffer from a potential risk of overfitting. The unsupervised filtering techniques, on the other hand, are either not very efficient or too stringent as they may mix up signal with noise. We propose to use the multiple probes measuring the same target mRNA as repeated measures to quantify the signal-to-noise ratio of that specific probe set. A Bayesian factor analysis with specifically chosen prior settings, which models this probe level information, is providing an objective feature filtering technique, named informative/non-informative calls (I/NI calls).

Results: Based on 30 real-life data sets (including various human, rat, mice and Arabidopsis studies) and a spiked-in data set, it is shown that I/NI calls is highly effective, with exclusion rates ranging from 70% to 99%. Consequently, it offers a critical solution to the curse of high-dimensionality in the analysis of microarray data.

Availability: This filtering approach is publicly available as a function implemented in the R package FARMS (www.bioinf.jku.at/software/farms/farms.html).

Contact: wtalloen{at}prdbe.jnj.com

Supplementary information: Supplementary data are available at Bioinformatics online.

{dagger}The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.

Associate Editor: David Rocke


Received on April 13, 2007; revised on September 7, 2007; accepted on September 18, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.