Skip Navigation


Bioinformatics Advance Access originally published online on September 3, 2004
Bioinformatics 2005 21(3):399-401; doi:10.1093/bioinformatics/bti002
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/3/399    most recent
bti002v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tang, X.
Right arrow Articles by Dickerson, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tang, X.
Right arrow Articles by Dickerson, J. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics vol. 21 issue 3 © Oxford University Press 2005; all rights reserved.

BarleyExpress: a web-based submission tool for enriched microarray database annotations

Xiaoyun Tang 1, Lishuang Shen 2 and Julie A. Dickerson 1,*

1 Electrical and Computer Engineering Department, Iowa State University Ames, IA 50011-3060, USA
2 Virtual Reality Applications Center, Iowa State University Ames, IA, 50011-2274, USA

*To whom correspondence should be addressed.


    Abstract
 TOP
 Abstract
 INTRODUCTION
 DESCRIPTION
 DISCUSSION
 REFERENCES
 

Summary: BarleyExpress is a web-based microarray experiment data submission tool for BarleyBase, a public data resource of Affymetrix GeneChip® data for plants. BarleyExpress uses the Plant Ontology vocabularies and enhances the MIAME guidelines to standardize the annotation of microarray gene expression experiments. In addition, BarleyExpress provides explicit support for factorial experiment design and template loading methods to ease the submission process for large experiments.

Availability: http://barleybase.org

Contact: julied{at}iastate.edu

Supplementary information: BarleyExpress Users Manual.


    INTRODUCTION
 TOP
 Abstract
 INTRODUCTION
 DESCRIPTION
 DISCUSSION
 REFERENCES
 
BarleyBase serves as a public repository for raw and normalized expression data from Barley1 and other plant GeneChips. The Barley1 GeneChip is a community-designed, Affymetrix 22K probe array (Close et al., 2004) which is a model for plants without a fully sequenced genome. To make valid comparisons, the experiment annotations must be standardized using community-defined vocabularies. BarleyExpress is a web-based data submission tool that enriches the MIAME (Brazma et al., 2001) guidelines by promoting best practices in experiment design and extending the experimental definitions to the plant community. BarleyExpress also enforces usage of the growth stage, anatomy and trait-controlled vocabularies from the Plant Ontology to annotate microarray gene expression experiment metadata (The Plant OntologyTM Consortium, 2002). This leads the experimental data in BarleyBase into a biological context and enables valuable insights into biological processes. Exemplar data in BarleyBase are annotated with Gene Ontology information as it becomes available (Gene Ontology Consortium, 2000).

There are several prominent public databases that provide access to microarray data. These include general repositories, such as the Gene Expression Omnibus (GEO) (Edgar et al., 2002) the Stanford Microarray Database (Gollub et al., 2003) and ArrayExpress (Brazma et al., 2003). These repositories are intended to act as central data distribution hubs for download and local processing, and not as online analysis tools for related datasets. Currently, only ArrayExpress has a validated MIAME-compliant, web-based submission tool, MIAMExpress (EMBL-EBI, 2004). GEO is currently implementing such a system. None of these systems requires ontology information to specify key biological information on tissue type and development stages.


    DESCRIPTION
 TOP
 Abstract
 INTRODUCTION
 DESCRIPTION
 DISCUSSION
 REFERENCES
 
BarleyBase stores four basic types of information: GeneChip and/or microarray structure data, experimental and labeling protocols, actual measured gene expression data and annotations, and analysis results in both MAS 5.0 (Affymetrix Inc., 2001) and Robust Multichip Average (RMA) (Gautier et al., 2004) normalized formats. BarleyBase shares links with external bioinformatics databases, including Gramene (Ware et al., 2002) GrainGenes (Matthews et al., 2003) PlantGDB (Dong et al., 2004) and TAIR (Rhee et al., 2003) for annotation information integration and linking to sequenced plants such as rice and Arabidopsis. Links between databases make the use of controlled vocabularies from the Gene and Plant Ontologies critical.

Design of BarleyExpress
BarleyExpress follows the MIAME guidelines in organizing microarray data on an experimental basis. BarleyExpress was initially modeled on MIAMExpress which implemented the MIAME guidelines (EMBL-EBI, 2004). It is also customized to meet the needs of users in the plant community as described below.

  1. BarleyExpress captures MIAME-compliant experiment annotation information required for unambiguous interpretation of microarray experiment results. In BarleyExpress, the MIAME guidelines were customized for Barley and other cereal crops when appropriate plant-based definitions did not exist. For example, some requirements that are specific for mammals were not considered when designing BarleyBase, but descriptions relating to plants were added instead such as ‘pathogen isolates’ as an experimental factor. The help files were also changed to reflect plant terminology.
  2. The goal of the Plant Ontology is to produce structured controlled vocabularies to describe plant features, such as phenotypes, traits, developmental stages and anatomical parts (The Plant OntologyTM Consortium, 2002). A structured ontology enables accurate queries and comparative analysis across different databases. BarleyExpress uses the Growth Stage Ontology and Plant Anatomy Ontology for barley (from Gramene) and Arabidopsis thaliana (from TAIR), respectively. These terms are stored in the fields, ‘growth stage’ and ‘organism part’ for each hybridization. Users may also upload pictures of the biomaterial used in the experiments from the ‘Sample preparation’ page. This visual information aids in interpreting experimental results.
  3. Information on experimental factors and factor levels is explicitly required by BarleyExpress. This is not required in the MIAME standards, but it is essential for effective organization of the microarray experiment results. Currently, BarleyExpress assumes that a factorial design will be used (Trochim, 2000). Users specify the experimental factors and factor levels and then the entire experiment design is laid out in a factorial tabular format. Each treatment is the combination of different experimental factors and factor levels. Unused combinations can be deleted.
  4. Secure and flexible account and data access management allows data owners to protect their data before publication and enable dispersed collaboration. It also allows reviewers to anonymously access datasets referenced by a manuscript to verify the conclusions. The submitter can specify the accessibility to data of an experiment as public, private or group accessible. Registered users can create their own groups, add selected users to the groups for data access and analysis.

Implementation and usage
BarleyExpress is a step-by-step submission wizard that leads users through the process. After all experimental information is input and reviewed, user finalizes the submission. The BarleyBase curator processes the submitted data and makes them available for online access. A unique accession number is assigned to the experiment after curation.

The submission process has three parts: experiment information, data upload, and hybridization or sample information. The first part inputs general experiment information and specifies the experimental factors and factor levels. Annotations for the ‘Experiment’ package of MIAME standards are recorded and enhanced with experiment factorial design information. Users can control the order of experimental factors, so that the experiment hybridization data will be organized and presented appropriately (Fig. 1).



View larger version (24K):
[in this window]
[in a new window]
 
Fig. 1 BarleyExpress experiment submission process.

 
Next, users upload raw microarray data through a Java upload applet. Affymetrix CEL, DAT and EXP files are required; CEL and DAT files store raw expression and image data and EXP store information on parameters of the hybridization, washing, staining and scanning protocols.

In the third section, the individual data files are assigned to the hybridizations. Sample preparation information is input for each hybridization. Hybridizations sharing the same sample information are treated as technical replicates of the sample. Experiment protocols can be submitted if the protocols do not exist in BarleyBase yet. Template functions help to avoid repetition in data entry. For example, experimental samples may only differ slightly in a few fields, such as in experiment factor levels. BarleyExpress lets the user choose a previously submitted entry as a template and copy values of the old entry into a new entry. The user can edit the new entry. Template data entry exists for experiment design, sample preparation and all the five types of protocols submissions. Users may stop at any step of the submission process and resume at a later time.


    DISCUSSION
 TOP
 Abstract
 INTRODUCTION
 DESCRIPTION
 DISCUSSION
 REFERENCES
 
BarleyExpress makes supporting biologist users its main consideration. BarleyExpress enhances the submission process by capturing all applicable MIAME-compliant information and applying factorial experiment design specifications. The use of controlled vocabularies aids in accurate comparisons between experimental conditions. BarleyExpress is under active development and several enhancements, such as adding Plant Ontologies for a wide variety of plant species, are planned.


    Acknowledgments
 
The authors would like to thank our biological collaborators Roger P. Wise and Rico A. Caldo for their efforts in improving the BarleyExpress project. The BarleyBase projected is funded by the USDA-NRIprogram (grant no. 2002-03582) and USDA-CSREES North American Barley Genome Project. BarleyBase is hosted at Iowa State University Virtual Reality Applications Center.

Received on May 13, 2004; revised on August 4, 2004; accepted on August 23, 2004

    REFERENCES
 TOP
 Abstract
 INTRODUCTION
 DESCRIPTION
 DISCUSSION
 REFERENCES
 

    Affymetrix Inc. Statistical Algorithms Reference Guide, (2001) , Santa Clara, CA .

    Brazma, A., Hingamp, P., Quackenbush, J., Sherlock, G., Spellman, P., Stoeckert, C., Aach, J., Ansorge, W., Ball, C.A., Causton, H.C., et al. (2001) Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat. Genet., 29, , pp. 365–371[CrossRef][Web of Science][Medline].

    Brazma, A., Parkinson, H., Sarkans, U., Shojatalab, M., Vilo, J., Abeygunawardena, N., Holloway, E., Kapushesky, M., Kemmeren, P., Lara, G.G., et al. (2003) ArrayExpress—a public repository for microarray gene expression data at the EBI. Nucleic Acids Res., 31, 68–71[Abstract/Free Full Text].

    Close, T.J., Wanamaker, S.I., Caldo, R.A., Turner, S.M., Ashlock, D.A., Dickerson, J.A., Wing, R.A., Muehlbauer, G.J., Kleinhofs, A., Wise, R.P. (2004) A new resource for cereal genomics: 22K barley GeneChip comes of age. Plant Physiol., 134, 960–968[Abstract/Free Full Text].

    Dong, Q., Schlueter, S.D., Brendel, V. (2004) PlantGDB, plant genome database and analysis tools. Nucleic Acids Res., 32, D354–D359[Abstract/Free Full Text].

    Edgar, R., Domrachev, M., Lash, A.E. (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res., 30, 207–210[Abstract/Free Full Text].

    EMBL-EBI. (2004) MIAMExpress at the EBI.

    Gautier, L., Cope, L., Bolstad, B.M., Irizarry, R.A. (2004) Affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics, 20, 307–315[Abstract/Free Full Text].

    Gene Ontology Consortium. (2000) Gene Ontology: tool for the unification of biology. Nat. Genet., 25, 25–29[CrossRef][Web of Science][Medline].

    Gollub, J., Ball, C.A., Binkely, G., Demeter, J., Finkelstein, D.B., Hebert, J.M., Hernandez-Boussard, T., Jin, H., Kaloper, M., Matese, J.C. (2003) The Stanford Microarray Database: data access and quality assessment tools. Nucleic Acids Res., 31, 94–96[Abstract/Free Full Text].

    Matthews, D.E., Carollo, V.L., Lazo, G.R., Anderson, O.D. (2003) GrainGenes, the genome database for small-grain crops. Nucleic Acids Res., 31, 183–186[Abstract/Free Full Text].

    Rhee, S.Y., Beavis, W., Berardini, T.Z., Chen, G., Dixon, D., Doyle, A., Garcia-Hernandez, M., Huala, E., Lander, G., Montoya, M., et al. (2003) The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community. Nucleic Acids Res., 31, 224–228[Abstract/Free Full Text].

    The Plant OntologyTM Consortium. (2002) The Plant OntologyTM Consortium and Plant Ontologies. Comp. Funct. Genomics, 3, 137–142[CrossRef].

    Trochim, W.M. (2000) The Research Methods Knowledge Base. 2nd edn (version current as of August 02, 2000).

    Ware, D., Jaiswal, P., Ni, J., Pan, X., Chang, K., Clark, K., Teytelman, L., Schmidt, S., Zhao, W., Cartinhour, S., et al. (2002) Gramene: a resource for comparative grass genomics. Nucleic Acids Res., 30, 103–105[Abstract/Free Full Text].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/3/399    most recent
bti002v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tang, X.
Right arrow Articles by Dickerson, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tang, X.
Right arrow Articles by Dickerson, J. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?