Skip Navigation


Bioinformatics Advance Access originally published online on September 17, 2007
Bioinformatics 2007 23(24):3406-3408; doi:10.1093/bioinformatics/btm469
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/24/3406    most recent
btm469v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sanges, R.
Right arrow Articles by Calogero, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sanges, R.
Right arrow Articles by Calogero, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

oneChannelGUI: a graphical interface to Bioconductor tools, designed for life scientists who are not familiar with R language

Remo Sanges 1,{dagger}, Francesca Cordero 2,{dagger} and Raffaele A. Calogero 3,*

1CBM (Cluster in Biomedicine) S.c.r.l., AREA Science Park, Basovizza SS 14, km 163,5 34012 Trieste, 2Department of Informatics, University of Torino, Corso Svizzera 185, 10149 Torino and 3Department of Clinical and Biological Sciences, University of Torino, Regione Gonzole 10, 10043 Orbassano, Torino, Italy

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 

Summary: OneChannelGUI is an add-on Bioconductor package providing a new set of functions extending the capability of the affylmGUI package. This library provides a graphical interface (GUI) for Bioconductor libraries to be used for quality control, normalization, filtering, statistical validation and data mining for single channel microarrays. Affymetrix 3' expression (IVT) arrays as well as the new whole transcript expression arrays, i.e. gene/exon 1.0 ST, are actually implemented. oneChannelGUI is available for most platforms on which R runs, i.e. Windows and Unix-like machines.

Availability: http://www.bioconductor.org/packages/2.0/bioc/html/oneChannelGUI.html

Contact: raffaele.calogero{at}unito.it

Supplementary information: http://www.bioinformatica.unito.it/oneChannelGUI/


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
The Bioconductor project (Gentleman et al., 2004) is an enormous repository of academic software for the analysis of genomic data and is centered around the analysis of microarray data. The software packages, available in Bioconductor, have sophisticated command-line driven interfaces tuned to users from mathematical or computing backgrounds. The command-line approach is very powerful since it gives complete control of the statistical tool in use. However, learning its use can be very frustrating for users without consolidated programming experience. oneChannelGUI is designed specifically for life scientists who are not familiar with R language but do wish to capitalize on the vast analysis opportunities of Bioconductor.

oneChannelGUI offers a comprehensive microarray analysis for Affymetrix (www.affymetrix.com) Affymetrix 3' (IVT) expression arrays as well as for the new generation of whole transcript arrays: human/mouse/rat exon 1.0 ST and human gene 1.0 ST arrays. It extends the capability of affylmGUI (Wettenhall et al., 2006). affylmGUI was designed as an interface to the affy, gcrma (Wu et al., 2004), affyPLM and limma (Smyth, 2004) package. oneChannelGUI inherits the core affylmGUI functionalities and permits a wider range of analysis allowing biologists to choose among different criteria and algorithms in order to analyze their data. The actual design of the package adds the following functionalities to the core affylmGUI:

  • Data set loading
    • Affymetrix exon 1.0 ST gene/exon level expression data can be loaded as well as single channel array expression data derived from GEO (www.ncbi.nih.gov/geo) or from tab delimited files containing only expression data.

  • Quality controls
    • The visualization of source of variation present in a microarray study can be visualized by principal component analysis (PCA) and hierarchical clustering.

  • Study design
    • New functionalities are added to the core affylmGUI to investigate the statistical quality of a microarray study.

  • Probe sets summary and normalization
    • Expresso function (affy package) is implemented.
    • A module that allows quantile/loess/qspline normalizations is available in case no normalized data are loaded as tab delimited files.

  • Filtering
    • Modules for filtering microarray data set on the basis of signal and annotation features are added to the core affylmGUI.

  • Statistical analysis
    • SAM (Tusher et al., 2001) and rank product (Breitling et al., 2004) algorithms for two-class unpaired samples are implemented, as well as an interface to maSigPro time-course analysis package (Conesa et al., 2006).

  • Classification
    • Basic classification analyses can be performed using the nearest shrunken centroids (Tibshirani et al., 2002) and the penalized discriminant analysis (Hastie, 1994) algorithms.

  • Biological interpretation
    • A graphical interface to GOstats (Falcon and Gentleman, 2007) was added to the core affylmGUI functionalities.
    • IPA template files (www.ingenuity.com) can be generated starting from tables of differentially expressed probe sets.

  • Exon analysis
    • Modules for basic alternative splicing detection and filtering are implemented.


    2 SOFTWARE OVERVIEW
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
2.1 Data loading
oneChannelGUI inherits all affylmGUI modules and functions to analyze Affymetrix IVT arrays. In addition expression flat data files from GEO (i.e. Series Matrix Files) can be loaded as well as tab delimited files containing expression data only, e.g. derived from ArrayExpress processed flat files.

The new Exon 1.0 ST arrays can be also loaded and manipulated in oneChannelGUI. It is important to point out that, due to high memory requirements for the generation of summarized expression values, the loading of even a few exon 1.0 ST .CEL files is unfeasible on 32 bit Windows systems. However, oneChannelGUI offers also the possibility to load Affymetrix exon 1.0 ST expression data using the gene/exon level expression data exported, as tab delimited files, from the Affymetrix Expression Console (www.affymetrix.com/support/technical/software_downloads.affx). Gene/exon level expression data can also be calculated from .CEL files using a oneChannelGUI module that run and upload results produced by Affymetrix APT tools (www.affymetrix.com/support/developer/powertools/index.affx), which embeds a set of cross-platform command line programs generating the summarized expression values for individual or collections of Affymetrix arrays. DABG [detection above background; (Affymetrix-Technical-Note)] P-values calculation is also implemented.

2.2 Quality controls
Probe level QC can be done when .CEL files are loaded, using the affylmGUI QC functions. In addiction sample clustering, to evaluate the homogeneity of the experimental groups, can be done by PCA and hierarchical clustering at probe set level.

2.3 Study design
Graphical implementations of the size and sizepower Bioconductor libraries allow the user to determine how many samples are needed to achieve a specified power for a test of whether a gene is differentially expressed or, in reverse, to determine the power of a given sample size.

2.4 Probe sets summary and normalization
The expresso function, which allows the integration of different methods for background correction, normalization, probe specific correction and summary value computation, was added to the affylmGUI inherited probe set summary methods (Irizarry et al., 2003; Wu et al., 2004).

2.5 Filtering
A central problem in microarray data analysis is the high dimensionality of gene expression space, which prohibits a comprehensive statistical analysis without focusing on particular aspects of the joint distribution of the gene expression levels. Possible strategies are to perform data-driven non-specific filtering of genes (von Heydebreck et al., 2004) before the actual statistical analysis or to filter, making use of biologically relevant a priori knowledge. For this purpose IQR/Intensity-based filters (von Heydebreck et al., 2004) as well as filtering based on probe set/Entrez gene annotations methods are implemented. For exon arrays, a filter based on DABG P-values is also available.

2.6 Statistical analysis
SAM (Tusher et al., 2001) and rank product (Breitling et al., 2004) algorithms for two-class unpaired samples are implemented in addition to affylmGUI modules for linear modeling analysis, which are fully inherited. An interface to maSigPro package (Conesa et al., 2006) is also available to allow time-course analyses.

2.7 Classification
A graphical interface in oneChannelGUI provides a link to the pamr and pdmclass packages designed to carry out sample classification from gene expression data, respectively, by the method of nearest shrunken centroids (Tibshirani et al., 2002) and by penalized discriminant methods.

2.8 Biological interpretation
An important point downstream from the identification of a set of differential expressed genes is the definition of the presence of possible relations between each other, e.g. belonging to the same biological pathway. For this reason we have implemented a graphical interface to the GOstats package (Gentleman, 2004), which has extensive facilities for testing the association of gene ontology (GO) terms to genes in a gene list. Graphical modules are also implemented to allow the selection and annotation of differentially expressed probe sets associated to a specific enriched GO term.

Furthermore, a routine to build template files to be loaded on IPA software (www.ingenuity.com) starting from limma/SAM/RankProd outputs has also been implemented.

2.9 Exon analysis
Modules to calculate Splice Index (Affymetrix-Technical-Note) and Microarray Detection of Alternative Splicing [MiDAS; (Affymetrix-White-Paper)] P-value are implemented. A filter based on MiDAS P-values is also available for the selection of putative alternative splicing events.


    3 CONCLUSION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
oneChannelGUI is a microarray analysis tool intended to enable the interested life scientists to analyze microarray experiments based on the consolidated 3' Affymetrix expression arrays as well as the new whole transcript expression arrays. oneChannelGUI is also a didactical tool since it could be used to introduce young life scientists to the use and interpretation of microarray data. For this purpose various data sets and exercises are available at the oneChannelGUI web site.


    4 PERSPECTIVES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
We will expand the exon level analysis tools and add new interfaces to allow life scientists to gain maximum advantage from the vast analysis opportunities offered by Bioconductor packages.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
This work was supported by an Oncogenomics grant from Italian Association for Cancer Research; the Italian Ministero dell’Università e della Ricerca; the University of Torino; the Regione Piemonte: bando regionale sulla ricerca scientifica applicata 2004.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: David Rocke

{dagger}The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors. Back

Received on May 26, 2007; revised on August 27, 2007; accepted on September 10, 2007

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 SOFTWARE OVERVIEW
 3 CONCLUSION
 4 PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Affymetrix-Technical-Note, Affymetrix Technical Note: Identifying and Validating Alternative Splicing Events. Affymetrix-White-Paper Alternative Transcript Analysis Methods for Exon Arrays.

    Breitling R, et al. Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett (2004) 573:83–92.[CrossRef][Web of Science][Medline]

    Conesa A, et al. maSigPro: a method to identify significantly differential expression profiles in time-course microarray experiments. Bioinformatics (2006) 22:1096–1102.[Abstract/Free Full Text]

    Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics (2007) 23:257–258.[Abstract/Free Full Text]

    Gentleman R. Compstat 2004 – Proceedings in Computational Statistics (2004) Heidelberg, Germany: Physica Verlag.

    Gentleman RC, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol (2004) 5:R80.[CrossRef][Medline]

    Hastie T, et al. Flexible discriminant analysis by optimal scoring. J. Am. Stat. Assoc (1994) 89:1255–1270.[CrossRef][Web of Science]

    Irizarry RA, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics (2003) 4:249–264.[Abstract]

    Smyth G. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat. Appl. Genet. Mol. Biol (2004) 3. article 3. Available at http://www.bepress.com/sagmb/vol3/iss1/art3.

    Tibshirani R, et al. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. USA (2002) 99:6567–6572.[Abstract/Free Full Text]

    Tusher VG, et al. Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl Acad. Sci. USA (2001) 98:5116–5121.[Abstract/Free Full Text]

    von Heydebreck A, et al. Differential expression with the Bioconductor project. In: Bioconductor Project Working Papers (2004) Working Paper 7.

    Wettenhall JM, et al. affylmGUI: a graphical user interface for linear modeling of single channel microarray data. Bioinformatics (2006) 22:897–899.[Abstract/Free Full Text]

    Wu, et al. A model based background adjustment for oligonucleotide expression arrays. J. Am. Stat. Assoc (2004) 99:909–917.[CrossRef][Web of Science]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/24/3406    most recent
btm469v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sanges, R.
Right arrow Articles by Calogero, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sanges, R.
Right arrow Articles by Calogero, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?