Skip Navigation


Bioinformatics Advance Access originally published online on January 29, 2009
Bioinformatics 2009 25(6):832-833; doi:10.1093/bioinformatics/btp059
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
25/6/832    most recent
btp059v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Wu, X.
Right arrow Articles by Watson, M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, X.
Right arrow Articles by Watson, M
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2009 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

CORNA: testing gene lists for regulation by microRNAs

X. Wu * and M Watson

Bioinformatics Group, Institute for Animal Health, Compton, RG20 7NN, UK

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 

Motivation: With the increasing use of post-genomics techniques to examine a wide variety of biological systems in laboratories throughout the world, scientists are often presented with lists of genes that they must make sense of. A consistently challenging problem is that of defining co-regulated genes within those gene lists. In recent years, microRNAs have emerged as a mechanism for regulating several cellular processes. In this article, we report on how gene lists and microRNA targets data may be integrated to test for significant associations between gene lists and microRNAs.

Results: We discuss CORNA, a package written in R and released under the GNU GPL, which allows users to test gene lists for significant microRNA–target associations using one of three separate statistical tests, to link microRNA targets to functional annotation and to visualize quantitative data associated with those data.

Availability: CORNA is available as an R package from http://corna.sf.net

Contact: xikun.wu{at}bbsrc.ac.uk


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 
Experiments involving post-genomics technologies such as microarrays, proteomics and systems biology often present scientists with gene lists that they must attempt to make sense of. Several software packages exist that allow scientists to assign functional annotation to gene lists, and to assign statistical significance to those associations. These include tools for associating genes with biological ontologies (e.g. Falcon and Gentleman, 2007) and with biological pathways (e.g. Salomonis et al., 2007).

A particular challenge is that of assessing which genes in a given gene list are co-regulated. miRBase (Griffiths-Jones et al., 2006), a database of all known microRNAs, has been created and there have been several published software tools that attempt to predict the targets of microRNAs (Brennecke et al., 2005). An excel-based tool (Creighton et al., 2008) has been produced for linking microarray data to microRNA targets information.

Here we describe CORNA, a package for R that allows scientists to analyse gene lists in the context of microRNA–target predictions. Methods exist to test for significant microRNA–target relationships in gene lists, and to test for significant associations between microRNAs and pathways and GO terms. The software is flexible and can read data from public databases or from a scientists own data files. CORNA is released as open-source under the GNU GPL.


    2 FLOW OF INFORMATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 
Central to the flow of information through CORNA is the gene list from which the user may test for significant microRNA–target associations. The user may also start with a microRNA, find genes that are associated with that microRNA and then test that gene list for significant associations with KEGG pathways or GO terms. The user may also plot quantitative data associated with the targets of a particular microRNA.

2.1 Inputs
CORNA exclusively uses R vectors and data frames. CORNA includes functions for reading microRNA–target data directly from miRBase and microRNA.org (Betel et al., 2008). There are also helper functions to read gene and GO term data using biomaRt (Durinck et al., 2005); microarray data directly from GEO (Barrett et al., 2008); and pathway data directly from KEGG (Kanehisa et al., 2004).

2.2 Methods
CORNA employs three complementary statistical methods for enrichments analysis of relationships within lists of genes. These are the HyperGeometric test, Fisher's exact test and the {chi}2-test.

2.3 Outputs
If the user tests a gene list for significant microRNA associations, then the output is an R data frame with one row per microRNA, the observed and expected frequencies from sample and population, and the range of user-selected P-values.

Where the user begins with a particular microRNA, the targets information is used to create a gene list and that gene list is tested for enrichment of pathways and GO terms.

There is also a range of plotting functions for plotting quantitative data associated with microRNA targets.


    3 EXAMPLE ANALYSIS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 
3.1 Using CORNA to test for enrichment of microRNA–target relationships in a gene list
The list in this example, tsam, consists of 1000 ensembl transcript ids; 940 of these were chosen at random, then 30 predicted targets for two microRNAs were added. The example assumes that the file ‘arch.v5.txt.mus_musculus.zip’ has been downloaded from miRBase targets.

Formula

The only two microRNAs with a significant adjusted P-values are those used to bias the transcript list. The user may work with genes simply by converting the transcript list to microRNA–gene relationships using the BioMart2df.fun and corna.map.fun functions.

3.2 Using CORNA to test for KEGG pathways associated with a microRNA list
The microRNA used in this example is "mmu-mir-155", and we use the predicted targets from miRBase to test for enrichment of KEGG pathways.

Formula

We first convert the microRNA–transcript relationship to a microRNA–gene relationship using the BioMart2df.fun and corna.map.fun functions. We then find those genes predicted to be targets of mmu-mir-155. The next stage is to use the KEGG2df.fun function to obtain links between genes and pathways from KEGG for Mus musculus. Finally, we set the sample to be only those genes targeted by mmu-mir-155 that have a pathway link, and perform hypergeometric and Fisher's exact tests for the KEGG pathways involved. The top five pathways can be seen in Table 1.


View this table:
[in this window]
[in a new window]

 
Table 1. Top five significant pathways from CORNA for mmu-mir-155

 

    4 SUMMARY
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 
With increasing use of large-scale post-genomics techniques, scientists are often presented with lists of genes. MicroRNAs have emerged as an important regulator of gene function. In this article, we have shown that CORNA can be used to test for significant associations between genes, microRNAs, pathways and GO terms. CORNA can also be used to plot quantitative data associated with microRNA targets. CORNA is flexible and can read data from public databases or from a user's own files. CORNA has been tested on both Microsoft Windows and Red Hat Linux. CORNA is released under the GNU GPL and is available from http://corna.sf.net.

Funding: BBSRC (core strategic grant of the Institute for Animal Health).

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Ivo Hofacker

Received on November 19, 2008; revised on January 6, 2009; accepted on January 25, 2009

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FLOW OF INFORMATION
 3 EXAMPLE ANALYSIS
 4 SUMMARY
 REFERENCES
 

    Barrett T, et al. NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res. (2008) 37:D5–D15.[CrossRef][Web of Science][Medline]

    Betel D, et al. The microRNA.org resource: targets and expression. Nucleic Acids Res. (2008) 36:D149–D153.[Abstract/Free Full Text]

    Brennecke J, et al. Principles of microRNA-target recognition. PLoS Biol. (2005) 3:e85.[CrossRef][Medline]

    Creighton CJ, et al. A bioinformatics tool for linking gene expression profiling results with public databases of microRNA target predictions. RNA (2008) 14:2290–2296.[Abstract/Free Full Text]

    Durinck S, et al. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics (2005) 21:3439–3440.[Abstract/Free Full Text]

    Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics (2007) 23:257–258.[Abstract/Free Full Text]

    Griffiths-Jones S, et al. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. (2006) 34:D140–D144.[Abstract/Free Full Text]

    Kanehisa M, et al. The KEGG resource for deciphering the genome. Nucleic Acids Res. (2004) 32:D277–D280.[Abstract/Free Full Text]

    Salomonis N, et al. GenMAPP 2: new features and resources for pathway analysis. BMC Bioinformatics (2007) 8:217.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
A. V. Antonov, S. Dietmann, P. Wong, D. Lutter, and H. W. Mewes
GeneSet2miRNA: finding the signature of cooperative miRNA activities in the gene lists
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W323 - W328.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
25/6/832    most recent
btp059v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Wu, X.
Right arrow Articles by Watson, M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, X.
Right arrow Articles by Watson, M
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?